I am working on the next version of ICGPT, a playground for on-chain Llama2 models, and I am looking into improving performance through horizontal scaling, with a load balancer in front of multiple LLM instances.
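To illustrate the kind of setup I mean, here is a minimal round-robin sketch in Rust. It is simplified and self-contained: the backend names are just placeholders standing in for the LLM canisters, and the real dispatch path is of course more involved.

```rust
/// Minimal round-robin load-balancer sketch (illustrative only;
/// the backends stand in for the LLM canisters behind the balancer).
struct LoadBalancer {
    backends: Vec<String>, // placeholder IDs for the LLM instances
    next: usize,           // index of the next backend to use
}

impl LoadBalancer {
    fn new(backends: Vec<String>) -> Self {
        assert!(!backends.is_empty(), "need at least one backend");
        Self { backends, next: 0 }
    }

    /// Return the next backend in round-robin order.
    fn pick(&mut self) -> &str {
        let i = self.next;
        self.next = (self.next + 1) % self.backends.len();
        &self.backends[i]
    }
}

fn main() {
    // Hypothetical backend IDs, for illustration only.
    let mut lb = LoadBalancer::new(vec![
        "llm-0".to_string(),
        "llm-1".to_string(),
        "llm-2".to_string(),
    ]);
    // Six requests cycle evenly over the three backends.
    for req in 0..6 {
        println!("request {} -> {}", req, lb.pick());
    }
}
```

Naively I would expect a dispatch like this to scale roughly linearly with the number of backends, which is why the measurements below surprised me.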
I was surprised by the non-linear scaling I saw with the experimental load balancer and would love to get a full understanding of why this happens. Would I be able to join one of the upcoming WG meetings to ask some questions?