Re: [PATCH v2] hwmon: Driver for temperature sensors on SATA drives

From: Gabriel C
Date: Sun Jan 12 2020 - 06:18:28 EST


On Sat, Jan 11, 2020 at 21:24, Guenter Roeck <linux@xxxxxxxxxxxx> wrote:
>
> On 1/8/20 7:33 AM, Guenter Roeck wrote:
> > On Tue, Jan 07, 2020 at 08:12:06PM -0500, Martin K. Petersen wrote:
> >>
> >> Guenter,
> >>
> >>> Any idea how I might be able to reproduce this ? So far I have been
> >>> unsuccessful.
> >>>
> >>> Building drivetemp into the kernel, with ahci and everything SCSI
> >>> built as module, doesn't trigger the crash for me. This is with the
> >>> drivetemp patch (v3) as well as commit d188b0675b ("scsi: core: Add
> >>> sysfs attributes for VPD pages 0h and 89h") applied on top of v5.4.7.
> >>
> >> This is with 5.5-rc1. I'll try another kernel.
> >>
> >> My repro is:
> >>
> >> # modprobe drivetemp
> >> # modprobe <any SCSI driver, including ahci>
> >>
> > No luck on my side. Can you provide a traceback ? Maybe we can use it
> > to find out what is happening.
> >
>
> I tried again, this time with v5.5-rc5. Loading and unloading ahci and
> drivetemp in any order does not cause any problems for me.
>
> At this point I don't know what else I could test. I went ahead and
> applied the drivetemp patch to hwmon-next. Maybe we'll get some additional
> test feedback this way.

I've tested Linus' git tree as of right now plus hwmon-next, and I cannot
make it crash.
The driver seems to work fine here, and the temperature readings are very
accurate on all HDDs in that box (8 x Seagate IronWolf 2 TB, ST2000VN004).

What I've noticed, however, is that the nvme temperature low/high values
for the "Sensor X" entries are strange here.
I'm not sure whether this is a v5.5 issue or a hwmon-next one right now;
I haven't booted a vanilla v5.5-rc5 yet.

Both NVMe drives are Samsung SSD 960 EVO 250GB.

They look like this:

nvme-pci-1300
Adapter: PCI adapter
Composite: +27.9°C (low = -273.1°C, high = +76.8°C)
(crit = +78.8°C)
Sensor 1: +27.9°C (low = -273.1°C, high = +65261.8°C)
Sensor 2: +29.9°C (low = -273.1°C, high = +65261.8°C)

nvme-pci-6100
Adapter: PCI adapter
Composite: +23.9°C (low = -273.1°C, high = +76.8°C)
(crit = +78.8°C)
Sensor 1: +23.9°C (low = -273.1°C, high = +65261.8°C)
Sensor 2: +25.9°C (low = -273.1°C, high = +65261.8°C)
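
For what it's worth, the odd limits line up with the arithmetic below. NVMe
reports temperatures in Kelvin, and a raw value of 0 K or 0xFFFF K converted
to Celsius gives exactly the -273.1 / +65261.8 figures shown above (after
truncation to one decimal). This is only a sketch of the conversion; the
helper name is made up for illustration, and the guess that the drives simply
return 0 and 0xFFFF for the per-sensor thresholds is an assumption I have not
verified against the driver.

```python
# Sketch only: Kelvin -> millidegree Celsius conversion.
# Assumption: the thresholds are 16-bit raw Kelvin values. The function
# name is hypothetical and is not taken from the kernel source.

def kelvin_to_millicelsius(kelvin: int) -> int:
    """Convert whole Kelvin to millidegrees Celsius (0 degC = 273.15 K)."""
    return kelvin * 1000 - 273150


if __name__ == "__main__":
    # 0x0000 and 0xFFFF are the two extremes of a 16-bit field.
    for raw in (0x0000, 0xFFFF):
        millicelsius = kelvin_to_millicelsius(raw)
        print(f"raw {raw:#06x} K -> {millicelsius / 1000:+.2f} C")
```

If that reading is right, the low/high values for "Sensor 1" and "Sensor 2"
are placeholder extremes rather than real thresholds, while the Composite
limits (+76.8 / +78.8) look like genuine drive-provided values.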

Best Regards,

Gabriel C.