We have executed the validation plan described previously on two ASUS machines. These meet the generic Gen 2 hardware specifications as specified earlier in this thread. Some abbreviated specs:
- 2x AMD EPYC 7313 (3,00 GHz, 16-Core, 128 MB)
- 512 GB (16x 32GB) ECC Reg ATP DDR4 3200 RAM
- 32TB (5x 6,4TB) NVMe Kioxia SSD 3D-NAND TLC U.3 (Kioxia CM6-V)
- Swiss price: 21’595.75 CHF
Validation results:
-
Low Level
- Stress tests (using
stress-ng
) - increase confidence in the hardware configuration and these specific machine instances - Passed - System benchmarks (using
sysbench
) - Gauge performance against known Gen 1 node performance - Passed- About 2x performance increase for cpu and memory.
- Disk performance ranges from equally good to better for the majority of tests
- SEV-SNP capability - Verified BIOS and kernel support working in tandem - Pending
- Stress tests (using
-
High Level
- Method: deploy machines into subnets and ensure subnet metrics do not deviate negatively by a meaningful threshold
- Low usage subnet deployment. Scalability benchmarks (system baseline and large memory)
- All metrics nominal - Passed
- All metrics nominal - Passed
- High usage subnet deployment
- All metrics nominal, except
- Individual node checkpointing performance discrepancy of <3-6%.
This has no impact on subnet performance, but we’re still keeping an eye on it.
- Individual node checkpointing performance discrepancy of <3-6%.
- Passed
- All metrics nominal, except
We have updated the node provider hardware wiki to include this ASUS server configuration. An example ASUS quote and bill of materials (BOM) is available for interested community members.