Uploading many assets crashes subnets

ToniqLabs uploads GBs and GBs of assets to the IC every week. When we upload, there is a good chance that the subnet crashes, or is at least severely impacted by the data upload. Here is a screenshot of the Internet Computer Network Status page from when we started to upload some assets. Notice that the finalization rate goes down, cycles used goes way up, and state increases over time. The problem is that the upload is often so taxing on a subnet that it just dies (errors out), and we have to pull the state to figure out where the upload left off, then restart the upload.

This is a problem for us right now. Moonwalkers took a LONG time to upload, partly because they were massive, and partly because the subnet kept failing on us.

I’d love some insight into why this is happening and what can be done to fix it.

Thanks!!

7 Likes

Thanks for letting folks know. I will ping the team to make sure they see this.

1 Like

The subnet is not crashing; it is merely rate limiting updates. Because of the many memory writes, the orthogonal persistence implementation accumulates lots of “heap deltas” (modified heap pages) and, in order to avoid running out of RAM and crashing, stops handling any more transactions until the next checkpoint.

Here is a chart of the heap delta:

The gap from just about an hour ago is likely because your canister ran out of cycles. The probable reason for that is the large ingress messages and the high number of instructions:

5 Likes

Finalization rate drops during the execution of these large ingress messages and recovers when the subnet is no longer accepting any for a while:

This is merely because it takes a couple of seconds to execute each of those large ingress messages:

This affects the latency of any other transactions executing at the same time (an execution round takes 2 seconds instead of under 1) and the interval between checkpoints (these happen every 500 rounds by default, so those 500 rounds are stretched over a longer period: roughly 17 minutes at 2 seconds per round, instead of under 9), but not much else.

5 Likes

See above; this is merely an effect of executing large ingress messages. Not entirely sure why uploading a couple of MB of images requires 4B instructions, but that’s what I’m seeing. (These metrics cover the whole subnet, not just your canister(s), but there was very little other traffic happening on that subnet before your uploads, so I’ll just assume here that what I’m seeing is due to your uploads.)

Also an effect of the large ingress messages and large number of instructions. Are you doing image handling in your canister? I.e. rebuilding an image out of tiles? That may account for the CPU usage.

This is merely because of the uploads. Your canister is holding more and more images. I’m seeing a gradual increase from 1.25 GB to 1.5 GB over the first half of the 3 hour interval covered by the charts above (uploads, I suppose), then a step increase to 3 GB.

The errors are ingress messages timing out. Ingress messages have a maximum TTL of 5 or 6 minutes, and because the subnet stops handling transactions for up to 5 minutes (and the messages themselves take a couple of seconds each to execute), I’m seeing 5 ingress messages expiring right around the time (and every time) transaction execution resumes.

There should be no need for that. I understand that transactions hanging for 5 minutes and timing out is annoying as heck, but apart from the timed out transactions, all others should reliably execute and succeed (unless, of course, the canister itself fails them). But regardless, all updates that return success will have resulted in the write going through; all updates returning errors (timeout or trap) will have resulted in no changes to the state. There should be no need to do anything beyond retrying failed updates. If you find that’s not the case, then it’s definitely an issue that requires investigation.

I hope my comments above covered the first part (what’s happening). Regarding the second (what can be done to fix it), a couple of things can be done.

For one, you can rate limit your uploads, so the subnet doesn’t run out of heap delta so early in the 500-round checkpoint interval. This is merely a workaround and not something you, as a canister developer, should have to deal with. We need to figure out better ways of handling huge heap deltas. I know we’re already working on offloading them to disk instead of keeping them in RAM. Maybe it would also be feasible to have a dynamic checkpointing interval instead of a fixed one, as long as it’s deterministic (e.g. “every 500 rounds or whenever the heap delta reaches the configured limit”); IIUC this is also connected to the DKG interval, so I’ll ask the Consensus team about it.

Second, regarding the cycle burn rate, you can try figuring out why your canister needs 4B instructions to handle what is presumably 2 MB worth of images. It won’t change anything about the heap delta and the resulting suspension of transaction execution (with the huge latency and expired ingress messages that come with it), but it should make uploading assets cheaper.
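
If it helps to narrow that down: newer versions of motoko-base ship an ExperimentalInternetComputer.countInstructions helper that can be wrapped around the suspect code path to see where the instructions go. A rough sketch, assuming a recent enough moc/base and a hypothetical handleUpload standing in for the real upload logic:

import IC "mo:base/ExperimentalInternetComputer";
import Debug "mo:base/Debug";
import Nat64 "mo:base/Nat64";

actor Uploader {
  // Stand-in for the real decoding/storage logic.
  func handleUpload(data : Blob) {
    ignore data.size();
  };

  public func uploadChunk(data : Blob) : async () {
    // countInstructions runs the closure and reports the instructions it used.
    let used = IC.countInstructions(func() { handleUpload(data) });
    Debug.print("handleUpload used " # Nat64.toText(used) # " instructions");
  };
};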

Third, you can keep better track of the status of each ingress message you send. If it fails for any reason, retry it. If it succeeds, you’re all done. There should be no need to pull the state to figure out what went through and what didn’t.
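
For the keeping-track-and-retrying part, one canister-side pattern that makes blind retries safe is to key chunks by index, so re-sending a chunk that already landed is harmless. A minimal sketch with made-up names (uploadChunk, hasChunk), not ToniqLabs’ actual interface, and with upgrade/stable-state handling omitted:

import HashMap "mo:base/HashMap";
import Hash "mo:base/Hash";
import Nat "mo:base/Nat";
import Option "mo:base/Option";

actor AssetStore {
  // Chunks keyed by index; retrying an index that already succeeded just
  // overwrites it with the same bytes.
  let chunks = HashMap.HashMap<Nat, Blob>(0, Nat.equal, Hash.hash);

  public func uploadChunk(index : Nat, data : Blob) : async () {
    chunks.put(index, data);
  };

  // The uploader can confirm which chunks actually landed before retrying,
  // instead of pulling and inspecting the whole state.
  public query func hasChunk(index : Nat) : async Bool {
    Option.isSome(chunks.get(index))
  };
};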

Happy to answer any other questions you may have.

6 Likes

If I understand correctly, a canister that changes the state of a single boolean value from true to false could take 5 minutes to do so if an ingesting canister on the same subnet is concurrently causing heap deltas to be created.

Perhaps rate limiting should not be left in the hands of the developer if one dapp is able to impact the performance of an entire subnet. It is very simple to mount a denial-of-service attack simply by doing a large upload if your canister is resident on the same subnet.

Is there a performance monitoring canister on a subnet, or a system variable that can be queried, that indicates what the performance looks like on the subnet? Similar to a ping? It would be nice to warn users to expect delays. One could always measure the nanoseconds between calls in a loop, but that may exacerbate the problem?
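
(For what it’s worth, the crudest version of the “ping” idea is just a trivial update method that a client times end to end; the following is a sketch, not an existing system facility. Calling it sparingly, say once before starting a batch, keeps it from adding meaningfully to the load.)

import Time "mo:base/Time";

actor Ping {
  // An update call goes through consensus, so the round-trip time measured by
  // the client is a rough signal of how responsive the subnet currently is.
  public func ping() : async Int {
    Time.now()
  };
};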

It may be true that “the subnet did not crash”, but a user’s or dapp owner’s perspective may be quite different and less nuanced.

5 Likes

100% correct @northman. We actually put out a fix for this in this recent release (see the item “Execution: Introduce per canister heap delta limit to share the heap delta capacity in a fairer manner between canisters.”). That change basically adds rate limiting of the generated heap delta on a per-canister basis, so if a single canister does many writes it should no longer prevent the whole subnet from making progress; instead, just that canister may be prevented from running for several rounds.

4 Likes

Also, there’s ongoing work to use mmap-ed memory for heap deltas, so the OS can easily offload that memory to disk as necessary and the maximum heap delta can be dictated by available disk rather than available RAM (so there will be virtually no maximum heap delta limit anymore). This is just another of the IC’s growing pains.

3 Likes

@bob11, any chance you could share the relevant canister code (and what kind of traffic the canister was handling at the time)? The Motoko team want to understand why the canister was touching so much memory (in order to find a general solution to the underlying problem, whatever it turns out to be).

2 Likes

Did a little digging - not sure this is the code, but if the code uses patterns like this:

then that could potentially touch a quadratic amount of memory, dirtying lots of pages even if not much data survives/remains live. Depends on the size of the data of course.

(Array.append typically copies its arguments and allocates a new array. Do that in a loop and you touch lots of memory, most of which will be GC’ed but still expands the Wasm memory footprint.)
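
For readers following along, the shape being described is roughly the following (a hypothetical sketch; the names and element types are guesses based on the snippet discussed further down, not the actual ToniqLabs code):

import Array "mo:base/Array";
import Iter "mo:base/Iter";

actor RegistrySketch {
  stable var _registryState : [(Nat, Text)] = [];

  public func addEntries(ext : [(Nat, Text)]) : async () {
    // Every Array.append copies the whole accumulated array, so the implicit
    // loop of Iter.iterate ends up touching a quadratic amount of memory.
    Iter.iterate(ext.vals(), func(entry : (Nat, Text), _i : Nat) {
      _registryState := Array.append(_registryState, [entry]);
    });
  };
};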

2 Likes

Nice find!

If it is indeed caused by code like this (seems quite possible), then the easiest solution is to move away from any use of Array.append, in any context.

It’s always a worse version of doing .append on a Buffer, which does not suffer from the same time and space issues as Array.append.
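
For instance, the same accumulation can be done by adding elements to a Buffer and converting to an immutable array once at the end (again just a sketch, with assumed names and types):

import Buffer "mo:base/Buffer";
import Iter "mo:base/Iter";

actor RegistryWithBuffer {
  stable var _registryState : [(Nat, Text)] = [];

  public func addEntries(ext : [(Nat, Text)]) : async () {
    // The Buffer grows in amortised O(1) per element, so only one final
    // array is allocated instead of one copy per appended entry.
    let buf = Buffer.Buffer<(Nat, Text)>(_registryState.size() + ext.size());
    for (entry in _registryState.vals()) { buf.add(entry) };
    for (entry in ext.vals()) { buf.add(entry) };
    _registryState := Iter.toArray(buf.vals());
  };
};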

Here’s an issue for more discussion: Deprecate Array.append, and aim to remove it. · Issue #296 · dfinity/motoko-base · GitHub

2 Likes

Nice write-up - definitely need to look at optimizing our code then. We use Array.append a lot though - what is the best alternative?

It looks like @matthewhammer is in the process of drafting some guidance on this here (Deprecate Array.append, and aim to remove it. · Issue #296 · dfinity/motoko-base · GitHub).

In the particular example above, you should be able to avoid the quadratic behaviour by constructing a suitable array from _registry.entries and appending it just once.

Something like:

let entries = Iter.toArray(_registry.entries());
let extra = Array.map(entries, func((k, v)) { ... });
_registryState := Array.append(_registryState, extra);

But there are many ways to skin this cat.

(It’s not really the append itself that is the problem - that’s O(n) - but the fact that you are calling append repeatedly in the implicit loop of Iter.iterate, which makes the loop O(n^2).)

2 Likes