Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

SB7790 switch support #6

Open
kramanella opened this issue Jun 3, 2020 · 7 comments
Open

SB7790 switch support #6

kramanella opened this issue Jun 3, 2020 · 7 comments

Comments

@kramanella
Copy link

kramanella commented Jun 3, 2020

ibswinfo supports Mellanox SB7790 unmanaged switches running firmware 11.1100.0072 or greater with 1 exception. The '-T' flag is unsupported. All other info and vitals are captured.
Thanks!
Mark

Sample output:
`...

temperature (C) | 50
max temp (C) | 56
QSFP#01 (C) | 0
QSFP#02 (C) | 0
QSFP#03 (C) | 0
QSFP#04 (C) | 0
QSFP#05 (C) | 0
QSFP#06 (C) | 0
QSFP#07 (C) | 0
QSFP#08 (C) | 0
QSFP#09 (C) | 0
QSFP#10 (C) | 0
QSFP#11 (C) | 0
QSFP#12 (C) | 0
QSFP#13 (C) | 0
QSFP#14 (C) | 0
QSFP#15 (C) | 0
QSFP#16 (C) | 0
QSFP#17 (C) | 0
QSFP#18 (C) | 0
QSFP#19 (C) | 0
QSFP#20 (C) | 0
QSFP#21 (C) | 0
QSFP#22 (C) | 0
QSFP#23 (C) | 0
QSFP#24 (C) | 0
QSFP#25 (C) | 0
QSFP#26 (C) | 0
QSFP#27 (C) | 0
QSFP#28 (C) | 0
QSFP#29 (C) | 0
QSFP#30 (C) | 0
QSFP#31 (C) | 0
QSFP#32 (C) | 0
QSFP#33 (C) | 0
QSFP#34 (C) | 0
QSFP#35 (C) | 0
QSFP#36 (C) | 0

...`

@kcgthb
Copy link
Member

kcgthb commented Jun 5, 2020

Hi @kramanella

Ah, interesting!

Would you mind sending me the output of:

# ibswinfo.sh -d <device_id> -o inventory | egrep '^part_number|version'

as well as:

# mlxreg -d <device_id> --reg_name MTMP --get  --indexes "sensor_index=0x1"

And of course, you're positive that there are cables plugged in those ports, right?

@kramanella
Copy link
Author

kramanella commented Jun 6, 2020 via email

kcgthb added a commit that referenced this issue Jun 6, 2020
@kcgthb
Copy link
Member

kcgthb commented Jun 6, 2020

Thanks for the output!

It looks like the registers are correctly showing the temperature, so I'm not 100% sure why the script shows 0.

Could you please try the version from the SB7790branch at https://github.com/stanford-rc/ibswinfo/blob/SB7790/ibswinfo.sh and see if that fixes the issue?

@kramanella
Copy link
Author

kramanella commented Jun 6, 2020 via email

@kcgthb
Copy link
Member

kcgthb commented Jun 10, 2020

Ah sorry, I didn't ask for the right index before.
Could you please run these 2 commands instead?

# mlxreg -d <device_id> --reg_name MTMP --get --indexes "sensor_index=0x39"
# mlxreg -d <device_id> --reg_name MTMP --get --indexes "sensor_index=0x40

@kramanella
Copy link
Author

kramanella commented Jun 11, 2020 via email

@kcgthb
Copy link
Member

kcgthb commented Jun 11, 2020

Thank you!

So that's the problem: temperature is 0x00000000 for sensor 0x40 (which is the first port of the switch). The fact that 0x39 doesn't exist confirms that the indexes are not shifted or anything.

Not much can be done about that unfortunately, that would be a firmware limitation on that model. :|

I added a note in the README to mention that limitation, thanks a lot for reporting it!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants