bugproxy
2017-07-18 09:29:51 UTC
Public bug reported:
== Comment: #0 - PAVAMAN SUBRAMANIYAM <***@in.ibm.com> - 2017-05-22 05:12:38 ==
---Problem Description---
HMI TFMR HDEC parity error is throwing Severe Machine check interrupt
---uname output---
Linux zz376p1 4.10.0-21-generic #23~16.04.1-Ubuntu SMP Tue May 2 12:54:57 UTC 2017 ppc64le ppc64le ppc64le GNU/Linux
Machine Type = P9
---System Hang---
The system hangs indefinitely and we have to reboot the system to recover back.
---Debugger---
A debugger is not configured
Immediately after injecting the above error, we get Severe Machine check interrupt [[Not recovered]
Contact Information = ***@in.ibm.com
Stack trace output:
no
Oops output:
[ 288.655336] Severe Machine check interrupt [[Not recovered]
[ 288.655339] Severe Machine check interrupt [[Not recovered]
[ 288.655342] Severe Machine check interrupt [[Not recovered]
[ 288.655345] Severe Machine check interrupt [[Not recovered]
[ 288.655348] Initiator: CPU
[ 288.655349] Initiator: CPU
[ 288.655352] Error type: Real address [Load/Store (foreign)]
[ 288.655354] Initiator: CPU
[ 288.655357] Effective address: 333035342dfe3030
[ 288.655360] Error type: Real address [Load/Store (foreign)]
[ 288.655366] Error type: Real address [Load/Store (foreign)]
[ 288.655369] Effective address: 333035342e013030
[ 288.655371] Effective address: 333035342e073030
[ 288.655418] opal: Reboot type 1 not supported
[ 288.655420] opal: Reboot type 1 not supported
[ 288.655422] opal: Reboot type 1 not supported
[ 288.655423] Kernel panic - not syncing: PowerNV Unrecovered Machine Check
[ 288.655430] CPU: 1 PID: 0 Comm: swapper/1 Tainted: G M 4.10.0-21-generic #23~16.04.1-Ubuntu
[ 288.655433] Call Trace:
[ 288.655450] Sending IPI to other CPUs
[ 288.656767] Initiator: CPU
[ 288.656834] Error type: Real address [Load/Store (foreign)]
[ 288.656945] Effective address: 333035342e043030
[ 288.657060] opal: Reboot type 1 not supported
[ 298.655034] ERROR: 3 cpu(s) not responding
[ 298.655183] Activate system reset (dumprestart) to stop other cpu(s)
System Dump Info:
The system is not configured to capture a system dump.
*Additional Instructions for ***@in.ibm.com:
-Attach sysctl -a output output to the bug.
== Comment: #3 - MAHESH J. SALGAONKAR <***@in.ibm.com> - 2017-06-29 03:23:30 ==
(In reply to comment #2)
** Affects: ubuntu-power-systems
Importance: Undecided
Status: New
** Affects: linux (Ubuntu)
Importance: Undecided
Assignee: Ubuntu on IBM Power Systems Bug Triage (ubuntu-power-triage)
Status: New
** Tags: architecture-ppc64le bugnameltc-154870 severity-critical targetmilestone-inin16043
** Tags added: architecture-ppc64le bugnameltc-154870 severity-critical
targetmilestone-inin16043
** Changed in: ubuntu
Assignee: (unassigned) => Ubuntu on IBM Power Systems Bug Triage (ubuntu-power-triage)
** Package changed: ubuntu => kernel-package (Ubuntu)
--
You received this bug notification because you are a member of Ubuntu
Bugs, which is subscribed to Ubuntu.
https://bugs.launchpad.net/bugs/1704972
Title:
[LTCTest][Opal][FW910] HMI TFMR HDEC parity error is throwing Severe
Machine check interrupt
To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu-power-systems/+bug/1704972/+subscriptions
== Comment: #0 - PAVAMAN SUBRAMANIYAM <***@in.ibm.com> - 2017-05-22 05:12:38 ==
---Problem Description---
HMI TFMR HDEC parity error is throwing Severe Machine check interrupt
---uname output---
Linux zz376p1 4.10.0-21-generic #23~16.04.1-Ubuntu SMP Tue May 2 12:54:57 UTC 2017 ppc64le ppc64le ppc64le GNU/Linux
Machine Type = P9
---System Hang---
The system hangs indefinitely and we have to reboot the system to recover back.
---Debugger---
A debugger is not configured
Immediately after injecting the above error, we get Severe Machine check interrupt [[Not recovered]
Contact Information = ***@in.ibm.com
Stack trace output:
no
Oops output:
[ 288.655336] Severe Machine check interrupt [[Not recovered]
[ 288.655339] Severe Machine check interrupt [[Not recovered]
[ 288.655342] Severe Machine check interrupt [[Not recovered]
[ 288.655345] Severe Machine check interrupt [[Not recovered]
[ 288.655348] Initiator: CPU
[ 288.655349] Initiator: CPU
[ 288.655352] Error type: Real address [Load/Store (foreign)]
[ 288.655354] Initiator: CPU
[ 288.655357] Effective address: 333035342dfe3030
[ 288.655360] Error type: Real address [Load/Store (foreign)]
[ 288.655366] Error type: Real address [Load/Store (foreign)]
[ 288.655369] Effective address: 333035342e013030
[ 288.655371] Effective address: 333035342e073030
[ 288.655418] opal: Reboot type 1 not supported
[ 288.655420] opal: Reboot type 1 not supported
[ 288.655422] opal: Reboot type 1 not supported
[ 288.655423] Kernel panic - not syncing: PowerNV Unrecovered Machine Check
[ 288.655430] CPU: 1 PID: 0 Comm: swapper/1 Tainted: G M 4.10.0-21-generic #23~16.04.1-Ubuntu
[ 288.655433] Call Trace:
[ 288.655450] Sending IPI to other CPUs
[ 288.656767] Initiator: CPU
[ 288.656834] Error type: Real address [Load/Store (foreign)]
[ 288.656945] Effective address: 333035342e043030
[ 288.657060] opal: Reboot type 1 not supported
[ 298.655034] ERROR: 3 cpu(s) not responding
[ 298.655183] Activate system reset (dumprestart) to stop other cpu(s)
System Dump Info:
The system is not configured to capture a system dump.
*Additional Instructions for ***@in.ibm.com:
-Attach sysctl -a output output to the bug.
== Comment: #3 - MAHESH J. SALGAONKAR <***@in.ibm.com> - 2017-06-29 03:23:30 ==
(In reply to comment #2)
We need upstream commit
https://git.kernel.org/powerpc/c/be5c5e843c4afa1c8397cb740b6032 that fixes
this issue.
Hi Breno,
We will be needing this upstream commit to be included in Ubuntu 16.04.3
Did this patch make into Ubuntu 16.04.3 ?https://git.kernel.org/powerpc/c/be5c5e843c4afa1c8397cb740b6032 that fixes
this issue.
Hi Breno,
We will be needing this upstream commit to be included in Ubuntu 16.04.3
** Affects: ubuntu-power-systems
Importance: Undecided
Status: New
** Affects: linux (Ubuntu)
Importance: Undecided
Assignee: Ubuntu on IBM Power Systems Bug Triage (ubuntu-power-triage)
Status: New
** Tags: architecture-ppc64le bugnameltc-154870 severity-critical targetmilestone-inin16043
** Tags added: architecture-ppc64le bugnameltc-154870 severity-critical
targetmilestone-inin16043
** Changed in: ubuntu
Assignee: (unassigned) => Ubuntu on IBM Power Systems Bug Triage (ubuntu-power-triage)
** Package changed: ubuntu => kernel-package (Ubuntu)
--
You received this bug notification because you are a member of Ubuntu
Bugs, which is subscribed to Ubuntu.
https://bugs.launchpad.net/bugs/1704972
Title:
[LTCTest][Opal][FW910] HMI TFMR HDEC parity error is throwing Severe
Machine check interrupt
To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu-power-systems/+bug/1704972/+subscriptions
--
ubuntu-bugs mailing list
ubuntu-***@lists.ubuntu.com
https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs
ubuntu-bugs mailing list
ubuntu-***@lists.ubuntu.com
https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs