TLDR: We are seeking community feedback on GPU types and usage for the upcoming Gen 3 node specifications for Dfinity. Current nodes lack GPU support, and the open question is whether to include GPUs in the Gen 3 specifications or only in specific sub-nets. We would like the community's insights on how GPUs would be used.
- Status Quo:
- Current Gen 2 nodes in Dfinity’s network have 500 GB of RAM and substantial processing power. They parallelize work by running many single-threaded WASM canisters side by side, each with a 4 GB RAM limit. These nodes have no GPU support, however, and GPUs could significantly improve execution parallelization.
- However, GPUs are not designed to be divided among many small, independent tasks the way CPU cores are. For instance, a task requiring only 1 GB of VRAM could monopolize an entire GPU with 80 GB of VRAM, leaving the other 79 GB unused.
- Integrating GPUs into Dfinity’s nodes in any form would nonetheless be a major step beyond the hardware capabilities of the Gen 2 nodes. The question now is whether to integrate GPUs into the new Gen 3 specifications (and if so, with what VRAM and compute requirements) or to develop dedicated GPU-focused sub-nets.
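To make the figures in the status quo concrete, here is a minimal arithmetic sketch. It assumes a GPU is handed to one task exclusively, as in the 1 GB on 80 GB example, and that every canister uses its full 4 GB limit. The function names are hypothetical and only illustrate the calculation.

```python
def exclusive_gpu_utilization(task_vram_gb: float, gpu_vram_gb: float) -> float:
    """Fraction of VRAM in use when a single task holds the whole GPU."""
    if task_vram_gb > gpu_vram_gb:
        raise ValueError("task does not fit on this GPU")
    return task_vram_gb / gpu_vram_gb


def max_full_size_canisters(node_ram_gb: int, canister_limit_gb: int) -> int:
    """Upper bound on canisters per node if each one uses its full RAM limit."""
    return node_ram_gb // canister_limit_gb


# Figures taken from the post: 1 GB task on an 80 GB GPU, 500 GB node, 4 GB canisters.
utilization = exclusive_gpu_utilization(1, 80)
print(f"{utilization:.2%} of VRAM in use")   # 1.25% of VRAM in use
print(max_full_size_canisters(500, 4))       # 125
```

Under these assumptions, node RAM can be split across up to 125 full-size canisters, while an exclusively allocated GPU would sit at about 1% memory utilization for a small task.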
- What We Are Asking the Community:
- We are seeking community input to determine which type of GPU integration is preferred. Key questions include:
- How would GPUs be utilized in various projects?
- What level of compute power is needed?
- How long would the GPU be in use for typical tasks?
- What are the VRAM requirements for these applications?
This approach aims to align the development of Gen 3 nodes with the practical needs and technological aspirations of the Dfinity community.
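One way to make answers to the four questions above comparable is to record them in a common structure. The sketch below is only an illustration: the `GpuUsageResponse` type, its field names, the choice of units, and the sample values are all assumptions, not an official format or real survey data.

```python
from dataclasses import dataclass


@dataclass
class GpuUsageResponse:
    """One hypothetical community answer to the four questions."""
    use_case: str           # how the GPU would be utilized
    compute_tflops: float   # level of compute power needed
    task_minutes: float     # how long the GPU is in use for a typical task
    vram_gb: float          # VRAM requirement


def summarize(responses):
    """Return the peak requirements a GPU specification would have to cover."""
    if not responses:
        raise ValueError("no responses to summarize")
    return {
        "peak_vram_gb": max(r.vram_gb for r in responses),
        "peak_compute_tflops": max(r.compute_tflops for r in responses),
        "longest_task_minutes": max(r.task_minutes for r in responses),
    }


# Placeholder values for illustration only, not real survey data.
responses = [
    GpuUsageResponse("image classification inference", 10.0, 0.5, 4.0),
    GpuUsageResponse("language model fine-tuning", 100.0, 240.0, 48.0),
]
summary = summarize(responses)
print(summary)
```

Taking the maximum of each field is a deliberately simple choice. A real analysis might instead look at the distribution of answers to decide between GPUs in every Gen 3 node and dedicated GPU sub-nets.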