Commit e0d11bd

kentrussell authored and alexdeucher committed
drm/amdgpu: Send RMA CPER at bad page loading

Some older builds weren't sending RMA CPERs when the bad page threshold was exceeded. Newer builds have resolved this, but there could be systems out there with bad page numbers higher than the threshold, that haven't sent out an RMA CPER. To be thorough and safe, send an RMA CPER when we load the table, if the threshold is met or exceeded, instead of waiting for the next UE to trigger the CPER.

Signed-off-by: Kent Russell <kent.russell@amd.com>
Reviewed-by: Tao Zhou <tao.zhou1@amd.com>
Signed-off-by: Alex Deucher <alexander.deucher@amd.com>

1 parent 91544c4 commit e0d11bd

1 file changed

Lines changed: 4 additions & 0 deletions

drivers/gpu/drm/amd/amdgpu/amdgpu_ras_eeprom.c

@@ -1712,6 +1712,10 @@ int amdgpu_ras_eeprom_check(struct amdgpu_ras_eeprom_control *control)
 			dev_warn(adev->dev, "RAS records:%u exceeds 90%% of threshold:%d",
 					control->ras_num_bad_pages,
 					ras->bad_page_cnt_threshold);
+		if (amdgpu_bad_page_threshold != 0 &&
+			control->ras_num_bad_pages >= ras->bad_page_cnt_threshold)
+			amdgpu_dpm_send_rma_reason(adev);
+
 	} else if (hdr->header == RAS_TABLE_HDR_BAD &&
 		   amdgpu_bad_page_threshold != 0) {
 		if (hdr->version >= RAS_TABLE_VER_V2_1) {
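
The condition added in the hunk above can be read in isolation as a small predicate. The following is a minimal standalone sketch, not driver code: `struct ras_state_sketch` and `should_send_rma_at_load()` are hypothetical names introduced only for illustration, and the fields simply mirror the three values the new `if` statement reads.

```c
#include <stdbool.h>
#include <stdint.h>

/*
 * Hypothetical stand-in for the driver state read by the new check.
 * None of these names exist in the amdgpu driver.
 */
struct ras_state_sketch {
	uint32_t num_bad_pages;      /* mirrors control->ras_num_bad_pages */
	uint32_t bad_page_threshold; /* mirrors ras->bad_page_cnt_threshold */
	int threshold_param;         /* mirrors amdgpu_bad_page_threshold */
};

/*
 * Returns true when the load-time check would send an RMA CPER:
 * the module parameter is non-zero (the hunk skips the check when it
 * is 0) and the recorded bad page count has met or exceeded the
 * threshold.
 */
static bool should_send_rma_at_load(const struct ras_state_sketch *s)
{
	return s->threshold_param != 0 &&
	       s->num_bad_pages >= s->bad_page_threshold;
}
```

Note that the comparison is `>=`, matching the commit message's "met or exceeded": a table loaded with exactly the threshold number of bad pages would also trigger the CPER under this reading.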
