Usable Memory Issue


Hey guys,

A couple of months ago I built a dual-CPU system with a Supermicro X10 motherboard (the exact model is the X10DAL-i). Ever since I built the system, only half of the memory has been usable; I quickly accepted that and moved on. Recently, as I've become more experienced with unRaid, I realized that NUMA node 0 only has about 2 GB of memory, so the slots on CPU 1 are where the issue exists.

Here is what I have checked so far. When I moved modules between slots, the RAM on CPU 0 was not usable, while all of the RAM on CPU 1 was. Windows 10 also sees only about half of the memory (49 of 96 GB) when the RAM is installed in the correct slots. I have 6 DIMMs in 6 of the 8 slots, and all of them work when installed in the CPU 1 slots. All of the memory has always been detected in the BIOS. I swapped the CPUs between sockets and the issue was still present with the CPU 0 DIMMs. I also checked for bent pins and did not see any. I have not had any stability issues over the last two months.

I have spent hours today searching and trying to figure out this problem, to no avail. At this point I feel like either the RAM is being reserved for some reason or my motherboard is faulty. Below are the results of the diagnostic commands I used to narrow this down. Any ideas are appreciated. Thank you.

 

# numactl --hardware
available: 2 nodes (0-1)
node 0 cpus: 0 1 2 3 4 5 6 7 8 9 10 11 12 13 28 29 30 31 32 33 34 35 36 37 38 39 40 41
node 0 size: 1869 MB
node 0 free: 10 MB
node 1 cpus: 14 15 16 17 18 19 20 21 22 23 24 25 26 27 42 43 44 45 46 47 48 49 50 51 52 53 54 55
node 1 size: 48378 MB
node 1 free: 9856 MB
node distances:
node   0   1
  0:  10  21
  1:  21  10
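
To make the imbalance easier to see, here is a minimal Python sketch that pulls the per-node sizes out of the `numactl --hardware` text above and adds them up. The function name `node_sizes_mb` is just something made up for illustration, and the regex assumes the `node N size: X MB` line format shown in this output. The two nodes add up to 50247 MB, roughly 49 GiB, which lines up with what Windows 10 reports.

```python
import re

# Size/free lines copied from the `numactl --hardware` output above.
NUMACTL_OUTPUT = """\
available: 2 nodes (0-1)
node 0 size: 1869 MB
node 0 free: 10 MB
node 1 size: 48378 MB
node 1 free: 9856 MB
"""


def node_sizes_mb(text):
    """Return {node_id: size_in_MB} parsed from `numactl --hardware` text.

    Hypothetical helper: it assumes the "node N size: X MB" line format
    seen in this post.
    """
    pattern = re.compile(r"^node (\d+) size: (\d+) MB$", re.MULTILINE)
    return {int(node): int(size) for node, size in pattern.findall(text)}


sizes = node_sizes_mb(NUMACTL_OUTPUT)
total_mb = sum(sizes.values())

# Show each node's share of the memory the OS can actually see.
for node, size in sorted(sizes.items()):
    print(f"node {node}: {size} MB ({size / total_mb:.1%} of OS-visible memory)")
print(f"total visible: {total_mb} MB (~{total_mb / 1024:.1f} GiB)")
```

Almost all of the visible memory sits on node 1, so whatever is missing is missing from node 0.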


 

# dmidecode 3.2
Getting SMBIOS data from sysfs.
SMBIOS 3.0 present.

Handle 0x0026, DMI type 17, 40 bytes
Memory Device
    Array Handle: 0x0025
    Error Information Handle: Not Provided
    Total Width: 72 bits
    Data Width: 64 bits
    Size: 16384 MB
    Form Factor: DIMM
    Set: None
    Locator: P1-DIMMA1
    Bank Locator: P0_Node0_Channel0_Dimm0
    Type: DDR4
    Type Detail: Synchronous
    Speed: 2133 MT/s
    Manufacturer: Samsung
    Serial Number: 40152481
    Asset Tag: P1-DIMMA1_AssetTag (date:15/09)
    Part Number: M393A2G40DB0-CPB   
    Rank: 2
    Configured Memory Speed: 2133 MT/s
    Minimum Voltage: Unknown
    Maximum Voltage: Unknown
    Configured Voltage: Unknown

Handle 0x0027, DMI type 17, 40 bytes
Memory Device
    Array Handle: 0x0025
    Error Information Handle: Not Provided
    Total Width: 72 bits
    Data Width: 64 bits
    Size: 16384 MB
    Form Factor: DIMM
    Set: None
    Locator: P1-DIMMB1
    Bank Locator: P0_Node0_Channel1_Dimm0
    Type: DDR4
    Type Detail: Synchronous
    Speed: 2133 MT/s
    Manufacturer: Samsung
    Serial Number: 401524A7
    Asset Tag: P1-DIMMB1_AssetTag (date:15/09)
    Part Number: M393A2G40DB0-CPB   
    Rank: 2
    Configured Memory Speed: 2133 MT/s
    Minimum Voltage: Unknown
    Maximum Voltage: Unknown
    Configured Voltage: Unknown

Handle 0x0029, DMI type 17, 40 bytes
Memory Device
    Array Handle: 0x0028
    Error Information Handle: Not Provided
    Total Width: 72 bits
    Data Width: 64 bits
    Size: 16384 MB
    Form Factor: DIMM
    Set: None
    Locator: P1-DIMMC1
    Bank Locator: P0_Node0_Channel2_Dimm0
    Type: DDR4
    Type Detail: Synchronous
    Speed: 2133 MT/s
    Manufacturer: Samsung
    Serial Number: 41755E2E
    Asset Tag: P1-DIMMC1_AssetTag (date:15/41)
    Part Number: M393A2G40DB0-CPB   
    Rank: 2
    Configured Memory Speed: 2133 MT/s
    Minimum Voltage: Unknown
    Maximum Voltage: Unknown
    Configured Voltage: Unknown

Handle 0x002A, DMI type 17, 40 bytes
Memory Device
    Array Handle: 0x0028
    Error Information Handle: Not Provided
    Total Width: Unknown
    Data Width: Unknown
    Size: No Module Installed
    Form Factor: DIMM
    Set: None
    Locator: P1-DIMMD1
    Bank Locator: P0_Node0_Channel3_Dimm0
    Type: DDR4
    Type Detail: Synchronous
    Speed: Unknown
    Manufacturer: NO DIMM
    Serial Number: NO DIMM
    Asset Tag: NO DIMM
    Part Number: NO DIMM
    Rank: Unknown
    Configured Memory Speed: Unknown
    Minimum Voltage: Unknown
    Maximum Voltage: Unknown
    Configured Voltage: Unknown

Handle 0x002C, DMI type 17, 40 bytes
Memory Device
    Array Handle: 0x002B
    Error Information Handle: Not Provided
    Total Width: 72 bits
    Data Width: 64 bits
    Size: 16384 MB
    Form Factor: DIMM
    Set: None
    Locator: P2-DIMME1
    Bank Locator: P1_Node1_Channel0_Dimm0
    Type: DDR4
    Type Detail: Synchronous
    Speed: 2133 MT/s
    Manufacturer: Samsung
    Serial Number: 3138105A
    Asset Tag: P2-DIMME1_AssetTag (date:16/01)
    Part Number: M393A2G40DB0-CPB   
    Rank: 2
    Configured Memory Speed: 2133 MT/s
    Minimum Voltage: Unknown
    Maximum Voltage: Unknown
    Configured Voltage: Unknown

Handle 0x002D, DMI type 17, 40 bytes
Memory Device
    Array Handle: 0x002B
    Error Information Handle: Not Provided
    Total Width: 72 bits
    Data Width: 64 bits
    Size: 16384 MB
    Form Factor: DIMM
    Set: None
    Locator: P2-DIMMF1
    Bank Locator: P1_Node1_Channel1_Dimm0
    Type: DDR4
    Type Detail: Synchronous
    Speed: 2133 MT/s
    Manufacturer: Samsung
    Serial Number: 31380E6E
    Asset Tag: P2-DIMMF1_AssetTag (date:16/01)
    Part Number: M393A2G40DB0-CPB   
    Rank: 2
    Configured Memory Speed: 2133 MT/s
    Minimum Voltage: Unknown
    Maximum Voltage: Unknown
    Configured Voltage: Unknown

Handle 0x002F, DMI type 17, 40 bytes
Memory Device
    Array Handle: 0x002E
    Error Information Handle: Not Provided
    Total Width: 72 bits
    Data Width: 64 bits
    Size: 16384 MB
    Form Factor: DIMM
    Set: None
    Locator: P2-DIMMG1
    Bank Locator: P1_Node1_Channel2_Dimm0
    Type: DDR4
    Type Detail: Synchronous
    Speed: 2133 MT/s
    Manufacturer: Samsung
    Serial Number: 4175B7C4
    Asset Tag: P2-DIMMG1_AssetTag (date:15/41)
    Part Number: M393A2G40DB0-CPB   
    Rank: 2
    Configured Memory Speed: 2133 MT/s
    Minimum Voltage: Unknown
    Maximum Voltage: Unknown
    Configured Voltage: Unknown

Handle 0x0030, DMI type 17, 40 bytes
Memory Device
    Array Handle: 0x002E
    Error Information Handle: Not Provided
    Total Width: Unknown
    Data Width: Unknown
    Size: No Module Installed
    Form Factor: DIMM
    Set: None
    Locator: P2-DIMMH1
    Bank Locator: P1_Node1_Channel3_Dimm0
    Type: DDR4
    Type Detail: Synchronous
    Speed: Unknown
    Manufacturer: NO DIMM
    Serial Number: NO DIMM
    Asset Tag: NO DIMM
    Part Number: NO DIMM
    Rank: Unknown
    Configured Memory Speed: Unknown
    Minimum Voltage: Unknown
    Maximum Voltage: Unknown
    Configured Voltage: Unknown
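
For comparison with the NUMA numbers, here is a rough Python sketch that totals the installed DIMM sizes per socket from the dmidecode records above. The sample string is abbreviated to just the `Size` and `Locator` fields, the helper name `installed_mb_per_socket` is made up for illustration, and the parsing assumes sizes are printed in MB and records are separated by blank lines, as they are in this dmidecode 3.2 output (other versions may format sizes differently).

```python
import re
from collections import defaultdict

# Abbreviated from the dmidecode output above: only the Size and Locator
# fields of each "Memory Device" record are kept.
DMIDECODE_SAMPLE = """\
Memory Device
    Size: 16384 MB
    Locator: P1-DIMMA1

Memory Device
    Size: 16384 MB
    Locator: P1-DIMMB1

Memory Device
    Size: 16384 MB
    Locator: P1-DIMMC1

Memory Device
    Size: No Module Installed
    Locator: P1-DIMMD1

Memory Device
    Size: 16384 MB
    Locator: P2-DIMME1

Memory Device
    Size: 16384 MB
    Locator: P2-DIMMF1

Memory Device
    Size: 16384 MB
    Locator: P2-DIMMG1

Memory Device
    Size: No Module Installed
    Locator: P2-DIMMH1
"""


def installed_mb_per_socket(text):
    """Sum populated DIMM sizes (in MB) per socket prefix (P1, P2, ...).

    Hypothetical helper: empty slots ("No Module Installed") are skipped
    because their Size field has no numeric value.
    """
    totals = defaultdict(int)
    for record in text.split("\n\n"):
        size = re.search(r"^\s*Size: (\d+) MB$", record, re.MULTILINE)
        locator = re.search(r"^\s*Locator: (P\d+)-", record, re.MULTILINE)
        if size and locator:
            totals[locator.group(1)] += int(size.group(1))
    return dict(totals)


per_socket = installed_mb_per_socket(DMIDECODE_SAMPLE)
for socket, mb in sorted(per_socket.items()):
    print(f"{socket}: {mb} MB installed")
print(f"total installed: {sum(per_socket.values())} MB")
```

SMBIOS reports 49152 MB behind each socket (98304 MB, or 96 GB, in total), and the Bank Locator fields label the P1 slots as Node0. So the firmware sees all six modules; the shortfall only shows up in what the OS is given on node 0.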

# numastat qemu

Per-node process memory usage (in MBs) for PID 31873 (qemu-system-x86)
                           Node 0          Node 1           Total
                  --------------- --------------- ---------------
Huge                         0.00            0.00            0.00
Heap                         0.02            0.04            0.05
Stack                        0.01            0.02            0.04
Private                     63.94        20674.24        20738.18
----------------  --------------- --------------- ---------------
Total                       63.97        20674.30        20738.27

 

syslog.txt
