My system crashes after installing nvidia gtx 1650 super

Hello,
I have a Ryzen 7 1700 on an MSI B450 board and a GTX 1650 Super. Before I installed this GPU, the system had a Radeon card and was stable. I wonder whether I have a configuration problem or whether the hardware is the cause of the constant crashes.
Reading previous posts, I generated a bug report right after the last crash but I have no idea what to look for. Some guidance would be greatly appreciated.
I’ll try to attach the nvidia-bug-report.log
I couldn’t attach a file here, so I am sharing it on Google Drive:

https://drive.google.com/file/d/1RZtcDqEtqJEiTFeCFSOWQ4GInEKOHR0s/view?usp=sharing

While looking for the word “error” in the log report I found:

Nov 29 18:18:58 usermanjaro kernel: nvidia-gpu 0000:26:00.3: i2c timeout error e0000000

[    0.470843] mce: [Hardware Error]: Machine check events logged
[    0.470844] mce: [Hardware Error]: CPU 1: Machine Check: 0 Bank 5: bea0000000000108
[    0.470854] fbcon: Taking over console
[    0.470855] mce: [Hardware Error]: TSC 0 ADDR 1ffff9aedd98c MISC d012000100000000 SYND 4d000000 IPID 500b000000000 
[    0.470861] mce: [Hardware Error]: PROCESSOR 2:800f11 TIME 1638227933 SOCKET 0 APIC 2 microcode 8001138
[    0.470879] registered taskstats version 1
[    4.484317] ucsi_ccg: probe of 0-0008 failed with error -110
[    4.554192] mousedev: PS/2 mouse device common for all mice
[    4.629944] Bluetooth: hci0: MSFT filter_enable is already on
[    4.632865] ACPI: \: failed to evaluate _DSM (0x1001)
[    4.632871] ACPI: \: failed to evaluate _DSM (0x1001)
[    4.632950] NET: Registered protocol family 38
[    4.838674] ACPI: \: failed to evaluate _DSM (0x1001)
[    4.838679] ACPI: \: failed to evaluate _DSM (0x1001)

Those are the lines that I found in the nvidia bug report log. But since I don’t know what I am looking for, I might have missed something.
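For anyone else unsure what to search for, one possible starting point is to filter the report for a few kernel message patterns rather than only the word “error”. This is a minimal sketch, not an official NVIDIA or Manjaro tool: the function name `scan_report` and the pattern list are assumptions chosen for illustration, and the sample lines are copied from the excerpt above.

```shell
#!/bin/sh
# Hypothetical helper: print lines that look like hardware or driver trouble.
# The pattern list is an assumption and is not exhaustive.
scan_report() {
    # -E: extended regex, -i: ignore case, -n: prefix matches with line numbers
    grep -Ein 'hardware error|machine check|NVRM: Xid|timeout error|failed with error' "$1"
}

# Demo on a few lines copied from the log excerpt in this post.
sample=$(mktemp)
cat > "$sample" <<'EOF'
[    0.470843] mce: [Hardware Error]: Machine check events logged
[    0.470854] fbcon: Taking over console
[    4.484317] ucsi_ccg: probe of 0-0008 failed with error -110
[    4.554192] mousedev: PS/2 mouse device common for all mice
EOF
scan_report "$sample"
rm -f "$sample"
```

On the real report the call would be `scan_report nvidia-bug-report.log`, assuming the file is in the current directory and is not compressed.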

Can someone direct me to where the problem is? Thanks.

Please edit your topic title to actually reflect what the problem is. Please don’t use run-on, broken phrases. Help us help you.

Please see:

Sounds like a hardware error:

Thank you Yochanan, I modified the title, I hope that is more self-explanatory now.
Yes, I am guessing that the problem is hardware. I just don’t know where, I suspect the video card.
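The status value on the “Bank 5” line of the log can be partly decoded by hand, which may help when judging how serious the machine check was. The sketch below reads only the architectural flag bits 63 to 57 of the x86 machine check status register; the helper name `decode_mce_status` is hypothetical, it assumes a 16-digit hex value without a `0x` prefix, and it does not interpret vendor-specific bits or the error code in the low bits.

```shell
#!/bin/sh
# Hypothetical helper: decode flag bits 63..57 of an x86 MCi_STATUS value.
# Assumes a 16-digit hex string without a 0x prefix.
decode_mce_status() {
    # Bits 63..48 are the first four hex digits, so only those are converted.
    top=$(( 0x$(printf '%s' "$1" | cut -c1-4) ))
    # Each entry is <bit position within the top 16 bits>:<flag name>.
    for flag in 15:VAL 14:OVERFLOW 13:UC 12:EN 11:MISCV 10:ADDRV 9:PCC; do
        bit=${flag%%:*}
        name=${flag#*:}
        echo "$name=$(( (top >> bit) & 1 ))"
    done
}

# Status value copied from the "Bank 5" line in the log excerpt.
decode_mce_status bea0000000000108
```

In this output, `UC=1` marks an uncorrected error and `PCC=1` marks processor context as corrupt. The flags alone do not say which component is at fault.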

No, you didn’t.


Tip: When pasting terminal output on Discourse forums, one can either…

  • Use the Preformatted text </> toolbar button.

  • Add three backticks ` above and below the text (Markdown):

    ```
    type or paste code here
    ```

  • Use HTML:

    <pre><code>
    type or paste code here
    </code></pre>

Please edit your post accordingly.

Sorry about that, I put the three backticks in the quoted text.

No, you didn’t. If you had, the formatting of your post would look a lot nicer. Notice nothing changed?

Hint: A single quote is not the same as a backtick `.

You still have not edited your topic title yet you tell me that you have. In most circles, that’s considered a lie and is not acceptable. My patience is wearing thin. Without useful information presented clearly, no one is going to be able to help.

Since you just installed new hardware, either it’s the cause or you may have bumped something in the process of installing it. It’s happened to me before. I would reseat the CPU, RAM and GPU as well as double-check all cable connections.

Yochanan,

I am new to forums, I usually don’t ask for help. I am trying to figure out how this works.
English is not my native language, so I missed the difference between a backtick and a single quote. Sorry for that.
Thank you for bearing with me and for your time. I thought I had changed the topic title before, but somehow it didn’t show.
I saw you edited my post and now it has all the backticks in the right places, thank you.

Means what? Does the whole machine freeze and you have to do a hard reset?

Can you provide the output of these 4 commands?

inxi -Fxxa
mhwd -li
mhwd -l
ls -la /etc/X11/xorg.conf.d/

And if your kernel is still at 5.13, as stated in your profile, you should update because 5.13 is EOL. :grinning:
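A quick way to check which kernel series is actually running is to reduce the output of `uname -r` to its major.minor part. This is only a sketch: `kernel_series` is a hypothetical helper name, the release string in the comment is a made-up example of the format, and the EOL series is simply the one mentioned in this post.

```shell
#!/bin/sh
# Hypothetical helper: reduce a kernel release string such as
# "5.13.19-2-MANJARO" (example format only) to its series, e.g. "5.13".
kernel_series() {
    printf '%s\n' "$1" | cut -d. -f1,2
}

# Series reported as EOL in this thread; adjust as needed.
eol_series="5.13"
running=$(kernel_series "$(uname -r)")
if [ "$running" = "$eol_series" ]; then
    echo "Running kernel series $running is the one reported as EOL."
else
    echo "Running kernel series is $running."
fi
```

On Manjaro, `mhwd-kernel -li` lists the installed kernels, which helps when picking a newer series to switch to.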

No worries. Thank you for editing the title. It actually wasn’t me who edited your post, another moderator snuck in and did it. :wink:

I removed and installed the card again. It’s been a day without crashes, crossing my fingers. I’ll mark it as solved. Thank you, Yochanan.

This topic was automatically closed 2 days after the last reply. New replies are no longer allowed.