js checked logs and found this (after a restart)
Apr 27 00:09:20 Tower kernel: pcieport 0000:00:1c.6: AER: Multiple Corrected error message received from 0000:04:00.0
Apr 27 00:09:20 Tower kernel: nvidia 0000:04:00.0: PCIe Bus Error: severity=Corrected, type=Physical Layer, (Receiver ID)
Apr 27 00:09:20 Tower kernel: nvidia 0000:04:00.0: device [10de:21c4] error status/mask=00000001/0000a000
Apr 27 00:09:20 Tower kernel: nvidia 0000:04:00.0: [ 0] RxErr
Apr 27 00:09:21 Tower kernel: pcieport 0000:00:1c.6: AER: Corrected error message received from 0000:04:00.0
Apr 27 00:09:21 Tower kernel: nvidia 0000:04:00.0: PCIe Bus Error: severity=Corrected, type=Physical Layer, (Receiver ID)
Apr 27 00:09:21 Tower kernel: nvidia 0000:04:00.0: device [10de:21c4] error status/mask=00000001/0000a000
Apr 27 00:09:21 Tower kernel: nvidia 0000:04:00.0: [ 0] RxErr (First)
Apr 27 00:09:27 Tower kernel: pcieport 0000:00:1c.6: AER: Multiple Corrected error message received from 0000:04:00.0
Apr 27 00:09:27 Tower kernel: nvidia 0000:04:00.0: PCIe Bus Error: severity=Corrected, type=Physical Layer, (Receiver ID)
Apr 27 00:09:27 Tower kernel: nvidia 0000:04:00.0: device [10de:21c4] error status/mask=00000001/0000a000
Apr 27 00:09:27 Tower kernel: nvidia 0000:04:00.0: [ 0] RxErr (First)