System Stress Tests

June 1, 2016
sysadmin hpc raid

I recently received a question about SGI’s pandora after someone found my run-pandora.sh script in my hpc-admin-scripts repo. They were looking for a way to test a server with a fair bit of memory in a short amount of time. They’d tried Memtest86 and found it to be incredibly slow when running single-threaded or proved too unstable when running on all cores. When they found my repo, they figured it’d be worth asking about pandora in the hopes it would be appropriate for their needs. While I believe pandora is an SGI proprietary tool, I was able to direct them to some more generic alternatives. Since this info may be valuable to others, I figured I’d go ahead and capture it here (and add to this list if I remember any others I’ve used).

I hadn’t seen stress-ng before, but it sounds interesting based on the brief blog post.

Note: In my experience, stress tests like these will not necessarily crash a server-class machine due to the redundancy provided by technology like ECC, RAID, etc. I would typically run tools like these and monitor system logs for any signs of flakey hardware, issue scrub commands for RAID arrays, and especially check for any corrected memory errors via EDAC or MCEs.

3dprinting12 alabama7 amalgam4 android2 apple21 auto6 blog24 cat-diary5 cats18 chicago3 college18 comparch5 cooking2 define30 film19 gaming37 georgia2 halloween2 hosting13 hpc11 hugo3 humor35 huntsville5 illinois37 ios2 ireland3 kids5 meme5 monte-sano2 music23 photography35 politics2 programming11 pumpkins2 raid6 rants19 reading25 research11 snow6 sysadmin18 tales-of-the-weird14 tech54 tennessee2 travel11 video8 work29