[pve-devel] applied: [PATCH pve-kernel] revert 2 changes in thermal driver causing an early kernel Oops.
Thomas Lamprecht
t.lamprecht at proxmox.com
Fri Apr 5 14:02:52 CEST 2024
Am 05/04/2024 um 11:27 schrieb Stoiko Ivanov:
> The second patch, that is reverted (first):
> `thermal: trip: Drop lockdep assertion from thermal_zone_trip_id()`
> only touches code introduced by the first patch.
> The first patch causes the following Oops (reproduced on an old
> HP DL380 G8):
> ```
> [ 2.960519] ACPI: button: Power Button [PWRF]
> [ 2.963126] BUG: kernel NULL pointer dereference, address: 000000000000000c
> [ 2.965667] #PF: supervisor read access in kernel mode
> [ 2.966954] #PF: error_code(0x0000) - not-present page
> [ 2.966954] PGD 0 P4D 0
> [ 2.966954] Oops: 0000 [#1] PREEMPT SMP PTI
> [ 2.966954] CPU: 0 PID: 1 Comm: swapper/0 Tainted: G I 6.5.13-4-pve #1
> [ 2.966954] Hardware name: HP ProLiant DL380p Gen8, BIOS P70 05/24/2019
> [ 2.966954] RIP: 0010:step_wise_throttle+0x48/0x360
> [ 2.966954] Code: 04 25 28 00 00 00 48 89 45 d0 31 c0 48 63 c6 48 8d 14 40 48 8b 87 50 03 00 00 4c 8d 24 90 e8 cf d0 ff ff c6 45 bf 00 89 45 b4 <41> 8b 04 24 41 39 85 78 03 00 00 0f 8d a9 02 00 00 0f 1f 44 00 00
> [ 2.966954] RSP: 0000:ffff9e2b8014bae8 EFLAGS: 00010246
> [ 2.966954] RAX: 0000000000000002 RBX: 0000000000000001 RCX: 0000000000000000
> [ 2.966954] RDX: 0000000000000000 RSI: 0000000000000000 RDI: 0000000000000000
> [ 2.966954] RBP: ffff9e2b8014bb40 R08: 0000000000000000 R09: 0000000000000000
> [ 2.966954] R10: 0000000000000000 R11: 0000000000000000 R12: 000000000000000c
> [ 2.966954] R13: ffff8c7ac421d000 R14: 0000000000000001 R15: 0000000000000000
> [ 2.966954] FS: 0000000000000000(0000) GS:ffff8c7def600000(0000) knlGS:0000000000000000
> [ 2.966954] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
> [ 2.966954] CR2: 000000000000000c CR3: 0000000513a34001 CR4: 00000000000606f0
> [ 2.966954] Call Trace:
> [ 2.966954] <TASK>
> ```
>
> the relevant mainline kernels (6.6.15), corresponding to the
> Ubuntu-patchset (which mixes changes from 6.6.15, with ones from
> 6.1.76) [0] - also boot happily - so I strongly assume that the
> changes depend on one of the many commits introduced in linux-upstream
> between v6.5.1 and v6.6.1.
> As it looks like a refactoring (upon which later changes are based),
> and not a bug-fix in itself - simply dropping it seems sensible.
>
> Signed-off-by: Stoiko Ivanov <s.ivanov at proxmox.com>
> ---
> ...rip-Drop-lockdep-assertion-from-ther.patch | 24 ++
> ...ore-Store-trip-pointer-in-struct-the.patch | 343 ++++++++++++++++++
> 2 files changed, 367 insertions(+)
> create mode 100644 patches/kernel/0014-Revert-thermal-trip-Drop-lockdep-assertion-from-ther.patch
> create mode 100644 patches/kernel/0015-Revert-thermal-core-Store-trip-pointer-in-struct-the.patch
>
>
applied, thanks!
More information about the pve-devel
mailing list