Hi, this is the developer from DFinance, we launched our testnet today, it all went smoothly until a while ago, users talking about loading too slow, I tried to call our backend canister with dfx, but I got 504 error, is this because of too much traffic? I thought IC has load balance?
Weāve also launched a whitelist event which involved mutiple people to login to our website icpunks.com and had exact problem. It looks like that we did a stress test
As far as we can tell this has to do with the fact that subnet pjljw handles by far the most queries, combined with an issue we discovered recently, where we only cache the compiled Wasm when executing updates, not queries. Meaning that before an update is executed on a canister, each query will compile the Wasm from scratch. Because of the load, some queries will be really slow, while other queries will time out before they even get a chance to execute.
Weāre working on a fix, but it may take a while to deploy. In the meantime, the more canisters on pjljw handle at least one update, the less contention among queries, so this should become better over time.
Make sure to run an update query on each of your canisters. (o:
More seriously though, I donāt think thereās anything you can do. Iāll try to figure out if I can find a way to run a replicated query (i.e. run a query via an ingress message) on all canisters on the subnet, to prime the cache. Actually, now that I think of it, anyone could do it.
We noticed the issue happened again. The root cause has been identified and the fix is on the way. Apologies for the inconvenience and thanks for your patience.