Monitor thermals for Jafner.net #61

Closed
opened 2022-08-11 09:51:18 -07:00 by Jafner · 8 comments
Jafner commented 2022-08-11 09:51:18 -07:00 (Migrated from gitlab.jafner.net)

We've had some unexplained crashes recently. I hypothesize a thermal issue. Let's spin up some thermal monitoring.

Sensor Component
k10temp-pci-00c3 CPU Temp
acpitz-acpi-0 Virtual network interface

Arch Wiki: https://wiki.archlinux.org/title/lm_sensors

Using: https://github.com/ncabatoff/sensor-exporter
With: https://github.com/mindprince/nvidia_gpu_prometheus_exporter

We've had some unexplained crashes recently. I hypothesize a thermal issue. Let's spin up some thermal monitoring. | Sensor | Component | |:------:|:---------:| | `k10temp-pci-00c3` | CPU Temp | | `acpitz-acpi-0` | Virtual network interface | Arch Wiki: https://wiki.archlinux.org/title/lm_sensors Using: https://github.com/ncabatoff/sensor-exporter With: https://github.com/mindprince/nvidia_gpu_prometheus_exporter
Jafner commented 2022-08-11 09:51:18 -07:00 (Migrated from gitlab.jafner.net)

assigned to @Jafner

assigned to @Jafner
Jafner commented 2022-08-11 09:51:24 -07:00 (Migrated from gitlab.jafner.net)

changed the description

changed the description
Jafner commented 2022-08-11 09:51:56 -07:00 (Migrated from gitlab.jafner.net)

mentioned in commit be462d1515

mentioned in commit be462d1515219070a03f84cf641cabe30c8db01f
Jafner commented 2022-08-11 10:05:36 -07:00 (Migrated from gitlab.jafner.net)

It turns out hwmon readouts are included in prometheus node_exporter. Should have thought about that.

The Nvidia one is still valid though.

It turns out hwmon readouts are included in prometheus node_exporter. Should have thought about that. The Nvidia one is still valid though.
Jafner commented 2022-08-11 10:05:53 -07:00 (Migrated from gitlab.jafner.net)

mentioned in commit 8d4d639940

mentioned in commit 8d4d639940ca49d754d9759f6263b79e17d297a3
Jafner commented 2022-08-11 10:07:45 -07:00 (Migrated from gitlab.jafner.net)

mentioned in commit 469a5224cc

mentioned in commit 469a5224cc4bdbd5e776f5660ad0058c1d0c286f
Jafner commented 2022-08-11 10:09:34 -07:00 (Migrated from gitlab.jafner.net)

mentioned in commit 58245269c4

mentioned in commit 58245269c488d3328e2fd325f6e404efe7dc5e7e
Jafner commented 2022-10-23 17:08:35 -07:00 (Migrated from gitlab.jafner.net)

Now monitoring everything provided by the motherboard:

http://grafana.jafner.net/goto/eydFtVN4k?orgId=1

Now monitoring everything provided by the motherboard: http://grafana.jafner.net/goto/eydFtVN4k?orgId=1
Sign in to join this conversation.
No Label
1 Participants
Notifications
Due Date
No due date set.
Dependencies

No dependencies set.

Reference: Jafner/homelab#61
No description provided.