dasharo-issues icon indicating copy to clipboard operation
dasharo-issues copied to clipboard

48GB SODIMMs get hot during MemTest86+

Open mkopec opened this issue 1 year ago • 3 comments

Component

Dasharo firmware

Device

NovaCustom V54 14th Gen, NovaCustom V56 14th Gen

Dasharo version

v0.9.1-rc8

Dasharo Tools Suite version

n/a

Test case ID

n/a

Brief summary

The memory sticks reach up to 120 degrees Celsius during memtest86+

How reproducible

100%

How to reproduce

  • install 48GB NovaCustom Goodram module in V540TNx / V560TNx
  • boot Memtest86+ v10.7 from USB stick
  • Observe temperatures reported bym Memtest86+ during testing

Expected behavior

Memory stick stay within the operating temperature range, e.g. 0-85 deg C

Actual behavior

Temperatures reach up to 120 degrees

Screenshots

No response

Additional context

  • Seems to affect 48GB modules mostly
  • Doesn't appear in normal usage

Solutions you've tried

Workaround - set the fans to run at 100% during testing

mkopec avatar Nov 06 '24 11:11 mkopec

On v54 with cooling mode enabled for whole test duration peak temp was 102 deg C

SebastianCzapla avatar Nov 07 '24 10:11 SebastianCzapla

GOODRAM allows operation temperatures of up to 85 °C, anything above could cause damage.

Please use a cooling stand to reduce temperature during memtes86.

@mkopec @SebastianCzapla Please confirm that this issue is limited to memtest86.

wessel-novacustom avatar Nov 08 '24 07:11 wessel-novacustom

@wessel-novacustom We were not able to provide definite answer, we could not read the sticks temperature in Linux. However, besides the temperature, I have not experienced loss of stability or crashes when using 48GB stick.

SebastianCzapla avatar Nov 21 '24 09:11 SebastianCzapla

Tested on Dasharo v1.0.0-rc5, V540TND: test stopped after 87 C.

MemTest86 V11.4 Free

matmacieje avatar Aug 27 '25 14:08 matmacieje

on v1.0.0-rc5 V560TNE the temperature exceeds 85 C quite early in the testing.

wiktormowinski avatar Aug 28 '25 11:08 wiktormowinski

Some pointers from generic Hardware troubleshooting... 1 - Get second opinion with other modules. Temperature can vary wildly between different vendors and models, and some are actually bad, like using decorative heatsinks: https://old.reddit.com/r/sffpc/comments/1470w7u/hot_ddr5_temperatures_probably_best_to_avoid/ So single out that the issue is the module. If it can be tested with propietary BIOS even better so that it discard Firmware side issue. Heck, there are tools to read SPD parameters like how much Voltage does the module wants, in case that it ask more than standard 1.1V for some reason. Never have seen it happen, but may be worth to check. 2 - In case that MemTest86 is reading temperatures wrong for whatever reason, and due to Linux lack of Hardware tools, you may need to use Windows, produce some heavy memory load (Like with TestMem5), and check if temperature measurements are consistent with MemTest86. Note that lot of Software I tend to use for free may require commercial licensing for your usage. 3 - Being stable means nothing because DDR5 can throttle and you lose memory bandwidth/performance. This should also be possible to benchmark, albeit I can't think on the tools (Ironically I recall seeing this done on Linux). https://www.youtube.com/watch?v=vCmlRVH893M You may not be able to reproduce his results since he uses a completely different platform with more telemetry features.

zirblazer avatar Aug 28 '25 13:08 zirblazer