During the last couple off days I needed to update the firmware from the Nvidia Tesla T4 card in our servers. When following the installation steps provided by HPE I ran into some issues, so I decided to create a step by step guide on how to update the firmware.
- Download the latest firmware from your vendor
- Upload the RPM file to /usr/local/bin using Winscp or your favorite tool
- Connect using SSH to the host
- Browse to
cd /usr/local/bin
- unpack the RPM file using the following command:
rpm –ivh ./Tesla_T4_90.04.96.00.01-1-0.x86_64.rpm
The RPM file name can be different when upgrading a newer version or other Nvidia card. - Go to the folder where the RPM file is extracted for now this is the Tesla_T4_90.04.96.00.01 folder:
cd /usr/local/bin/Tesla_T4_90.04.96.00.01/
- Change the permissions of the file
chmod +x Tesla_T4_90.04.96.00.01.scexe
- Make sure all nvidia kernel modules are removed
init 3
rmmod nvidia - When you get the following error :
ERROR: Module nvidia is in use
run the following command:service xcp-rrdd-gpumon stop
and then run:rmmod nvidia
- Now we can upgrade the firmware using the following command:
./Tesla_T4_90.04.96.00.01.scexe -f
The SCEXE file name can be different when upgrading a newer version or other Nvidia card.
Choose -i if you would control the upgrade for every card in the host.
- Browse to
- When all the cards are upgraded you need to reboot the host and continue to the next host.
Good luck with upgrading, as you can see it’s easy.