So, it's probably one of motherboard, cpu, or PSU. At a stretch, it could be the GPU.
I have another spare GPU I could swap in. I have a near-identical CPU that I could swap in (it's in use, but I can temporarily borrow it).
PSU and mobo are trickier.
So, I'll have to try the easy ones first. Swap the GPU and see if windows still hard crashes like that, then the cpu, then start working on the others.
If you'd like to help me get back online (and gay cats, of course), donations would help. I'm kinda broke and not having a working computer is not going to help.
19 hours in the different slot, no crashes. Very strange.
So, theories:
1. that slot was just bad/dirty. Possible, I guess? The other GPU worked fine in that slot, though.
2. The GPU might be running at 8x PCIe instead of 16x PCIe. Maybe that pushes it over some timing/temperature threshold and makes it not crash?
over an hour with the GPU back in the Crashy Slot and no crashes.
huh. Maybe it was temperature based?
My GPU isn't getting THAT hot, my fans aren't even maxing out.
GPU temp hit a max of 65C with a hot spot of 76C.
Those aren't out of range for a GPU under load, and they're not trending upward at all, it's stable.
@foone Could it be voltage?
@mos_8502 if it is, it's generating the wrong voltages in the motherboard, because I did replace the PSU
@foone I was thinking about the voltage settings in the BIOS/UEFI menu, but that's also possible. Bad regulators suck.
@foone @mos_8502 maybe there's a sensitive timing issue going on. Maybe it's doing its cycles just a bit sooner than expected, or there's a bit of lag on the address/data lines. At that speed I can't imagine what could go imperceptibly wrong enough to cause trouble.
I suppose one way to experiment would be to manually set the latency timings to be a bit more generous than the SPD implies, but at that point it's already a sign of a larger fault looming if it started cropping up now...