Environment:
Commvault
IBM x3650 M3
Windows Server 2012
Issue:
A backup server that had been running normally for many years suddenly began crashing during backup operations.
The crash only occurred when backing up large amounts of data. Backups with small amounts of data completed normally, and the server could remain powered on overnight without issues if no backup jobs were running.
After a crash, the system usually showed a blue screen and began collecting error information, but it rebooted before the collection made any progress.
Additional observations:
- No memory dump files were generated.
- Event Viewer only recorded an unexpected shutdown with no useful details.
- IMM logs showed no abnormalities.
- RAID management reported that the disks and RAID controller were healthy.
- The built-in memory diagnostic test passed without errors.
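Because the server rebooted before a dump was written, one thing worth checking is how Windows is configured to react to a crash. The sketch below is only an illustration, not something the original troubleshooting included. It reads the `AutoReboot` and `CrashDumpEnabled` values under the standard `CrashControl` registry key and reports what they mean. The function names `review_crash_settings` and `read_crash_control` are hypothetical, and the registry read only works on Windows.

```python
import sys

# Standard Windows location for crash behaviour settings.
CRASH_CONTROL_KEY = r"SYSTEM\CurrentControlSet\Control\CrashControl"

# Meaning of the CrashDumpEnabled values.
DUMP_TYPES = {
    0: "none",
    1: "complete memory dump",
    2: "kernel memory dump",
    3: "small memory dump",
    7: "automatic memory dump",
}


def review_crash_settings(values):
    """Return human-readable findings for a dict of CrashControl values."""
    findings = []
    if values.get("AutoReboot") == 1:
        findings.append("AutoReboot=1: the system restarts automatically after a crash")
    dump = values.get("CrashDumpEnabled")
    if dump is None:
        findings.append("CrashDumpEnabled is not set")
    elif dump == 0:
        findings.append("CrashDumpEnabled=0: no dump file is written")
    else:
        findings.append("Dump type: " + DUMP_TYPES.get(dump, "unknown (%d)" % dump))
    return findings


def read_crash_control():
    """Read the live values from the registry (Windows only)."""
    import winreg  # imported here so the module still loads on other systems

    values = {}
    with winreg.OpenKey(winreg.HKEY_LOCAL_MACHINE, CRASH_CONTROL_KEY) as key:
        for name in ("AutoReboot", "CrashDumpEnabled"):
            try:
                values[name], _type = winreg.QueryValueEx(key, name)
            except FileNotFoundError:
                pass  # value not present; leave it out of the result
    return values


if __name__ == "__main__":
    if sys.platform == "win32":
        for line in review_crash_settings(read_crash_control()):
            print(line)
    else:
        print("Registry check skipped: not running on Windows")
```

Setting `AutoReboot` to 0 would keep the blue screen visible long enough to note the stop code. Even with dumps enabled, a hardware fault can still prevent the dump from being written.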
Troubleshooting Process:
- Disabled antivirus → No effect
- Reduced "Maximum number of parallel data transfer operations" in Commvault → No effect
- Shut down the server, disconnected power, then restarted → No effect
- Removed six memory modules, wiped them clean, and reinstalled only the minimum two → No effect
With no other options left, all memory modules were reinstalled and the backup was tried again.
Unexpectedly, the system then ran normally and the backup jobs completed successfully.
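Since the crash only appeared under large data transfers, a fix like this is hard to confirm without waiting for the next big backup. The following is a minimal sketch of one way to generate sustained write and read load outside Commvault and check that the data survives the round trip. It is an assumption about how such a check could be done, not part of the original troubleshooting. The function name `write_and_verify` and the file name `load.bin` are hypothetical, and the sizes would need to be much larger to approach a real backup.

```python
import hashlib
import os
import tempfile


def write_and_verify(path, total_bytes, chunk_bytes=4 * 1024 * 1024):
    """Write random data to path, read it back, and compare SHA-256 digests."""
    written = hashlib.sha256()
    remaining = total_bytes
    with open(path, "wb") as f:
        while remaining > 0:
            chunk = os.urandom(min(chunk_bytes, remaining))
            written.update(chunk)
            f.write(chunk)
            remaining -= len(chunk)
        f.flush()
        os.fsync(f.fileno())  # push the data to disk before reading back

    read_back = hashlib.sha256()
    with open(path, "rb") as f:
        # Read in the same chunk size until an empty read signals end of file.
        for chunk in iter(lambda: f.read(chunk_bytes), b""):
            read_back.update(chunk)

    return written.hexdigest() == read_back.hexdigest()


if __name__ == "__main__":
    with tempfile.TemporaryDirectory() as tmp:
        # 16 MiB keeps the demo quick; raise this for a real load test.
        ok = write_and_verify(os.path.join(tmp, "load.bin"), 16 * 1024 * 1024)
        print("digests match" if ok else "DIGEST MISMATCH")
```

The read-back may be served from the operating system's file cache rather than the disk, so this exercises memory as well as storage. A crash or a digest mismatch while it runs would point toward hardware rather than the backup software.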