Good catch Ulan! Yes, this is why @skilesare is seeing the problem he reported above. We should really try to get the local replica to emulate the mainnet as closely as possible.
That feature would alleviate a massive pain in the developer experience, so I'm super enthusiastic to see it implemented.
Is this still in the pipeline? Any news or updates on this?
Thanks, it is great to hear that the feature will be useful! We are implementing it and hoping to have an initial prototype around the end of May or June.
Just wanted to check back in regarding the progress on the Deterministic Time Slicing feature.
I've been load testing insertion into several data structures recently (Buffer, HashMap, RBTree), all of which seem to start hitting the message instruction limit on canisters much earlier than their heap capacity, regardless of their insertion runtimes (i.e. O(1) vs. O(log(n))).
I'd be interested in further testing how these data structures perform as they scale within canisters, and DTS would be key in doing so.
Just curious @ulan or @dieter.sommer - is some variant of Deterministic Time Slicing enabled for the bitcoin integration project, or are all update types of transactions/operations able to be completed over a single round of consensus?
The implementation of DTS is close to completion. We hope to merge the main remaining changes in 1-2 weeks. After that we need to do stress testing to ensure that we didn't miss anything. Once that's done, I'll update this thread and give a more concrete timeline for shipping.
(Buffer, HashMap, RBTree) all of which seem to start hitting the message instruction limit on canisters much earlier than their heap capacity, regardless of their insertion runtimes
That is expected for `HashMap` because it occasionally copies the entire backing store in O(n) time, but the result for `RBTree` is surprising because it should have a worst-case runtime of O(m * log(n)), where `m` is the number of operations and `n` is the number of elements. As long as `m` is not large for each message (e.g. doesn't exceed 10^6), I would expect it not to hit the instruction limit.
is some variant of Deterministic Time Slicing enabled for the bitcoin integration project, or are all update types of transactions/operations able to be completed over a single round of consensus?
AFAIK @ielashi has carefully designed the operations to not exceed a single round.
At the moment, the Bitcoin API is implemented in the replica, which means we're not subject to the same cycles limitation as Wasm canisters. We've added some measures so we don't spend too much time on execution within the round; however, these measures are rather crude.
We're now in the process of migrating the Bitcoin API into a Wasm canister, where we'll be subject to the same cycles limitation as all other canisters. Some computations, like block ingestion, require a lot of cycles, even more than what DTS will initially support, and for those we'll be implementing a simple slicing logic within the canister to stay within the cycles limit.
Great to hear, thanks for the update!
How many rounds of consensus and cycles do you see some of these operations taking?
Also, would love to hear a bit more about the strategies/techniques behind what this simple slicing logic looks like - is it implemented on the replica level or directly in the API (in a way that is currently accessible to IC developers)?
I have seen some large blocks take up to 385B instructions to be ingested. With a 5B instruction limit, that translates to ~77 execution rounds. A large block can have tens of thousands of inputs/outputs, which translates to tens of thousands of insert/remove operations on our `StableBTreeMap` data structures. These numbers are using the code that we have in the replica as-is. I expect there would be some low-hanging optimizations that would help reduce the required number of instructions.
Also, would love to hear a bit more about the strategies/techniques behind what this simple slicing logic looks like - is it implemented on the replica level or directly in the API (in a way that is currently accessible to IC developers)?
We'll be adding the slicing logic in the canister itself. It hasn't been written yet, but it'll be highly specific to the case of block ingestion, so it's unfortunately not something that can be packaged up and used elsewhere. I'll share pointers to the slicing code as soon as it's available.
On the API level, you may have noticed we added pagination to our `get_utxos` endpoint, which was one way we were able to stay within the instruction limit for these requests.
Quick update: we are aiming to enable deterministic time slicing on master in the week of September 19th and to roll it out to the mainnet in the subsequent week.
Do you mind sharing the GitHub PR and/or the “Bless Replica Version” NNS proposal when they become available? This is quite an important feature. Thanks!
Do you mind sharing the GitHub PR and/or the “Bless Replica Version” NNS proposal when they become available?
Absolutely! I'll share the PR and the proposal. We are currently fixing two blockers that may delay the launch a bit.
We are currently fixing two blockers that may delay the launch a bit.
One of these blockers turned out to be more tricky than we thought. We are still working on it.
Are any of the blockers something that the community can give input on/or help with in an advisory capacity?
Thanks for the offer! It is more of an implementation issue about how to ensure the correctness of DTS state in certain cases. We have an idea for how to solve it and are currently implementing it.
Never give up, never surrender! You're doing such excellent work.
We fixed the blocker and will be incrementally rolling out DTS in the following weeks.
The version that enables DTS for `install_code` messages will be deployed on `k44fs` [proposal to elect a replica binary, proposal to update `k44fs`].
Awesome news!
@ulan Is it possible to set limits on DTS (i.e. the rounds of consensus it takes to process a message) for specific APIs as a cycle-drain/DDoS prevention mechanism? For example, I'd like to say that this API can span a maximum of 5 rounds of consensus before failing/trapping (instead of using up all 50 rounds' worth of cycles).
@Severin any ETA on when DTS might be included in dfx (for local testing purposes)? Also, will we be able to profile how many rounds of consensus it takes to process a specific update call?
We plan a new 0.12.0 beta release next week. I'll try to get it in there. EDIT: Managed to update the replica without any problems (PR). The DTS-enabled replica will be included in the next beta release!
No special tooling that I'm aware of. @ulan any chance you know about something?
@icme: We are rolling out DTS slowly to make sure we don't introduce regressions.
The plan is:
- Roll out DTS for `install_code` messages, keeping the instruction limit the same but slicing execution into multiple rounds (because `install_code` messages already have large limits).
- Roll out DTS for update calls and responses, increasing the instruction limit by 2x.
If that goes well, we will increase the limits more. The final values of the limits are to be defined.
Note that we are not planning to support DTS for queries and heartbeats because they are expected to be short-running. If a heartbeat needs to do a long-running computation, it can make a self-call, which will be an update message and will run with DTS.
Is it possible to set limits on DTS (i.e. the rounds of consensus it takes to process a message) for specific APIs as a cycle-drain/DDOS prevention mechanism?
Right now it is not possible, but we are discussing a canister settings parameter that would allow the controller to set a lower limit that would apply to all methods of the canister. Would that work in your use case?
Supporting individual limits per Wasm methods seems difficult due to the performance and memory overhead because of the need to parse and store this information for each canister.
No special tooling that I'm aware of. @ulan any chance you know about something?
The `ic0.performance_counter()` would be a way to infer that. Dividing that number by the slice size in instructions would give the number of rounds. The slice size is defined in the replica and is currently set to 2B (it may change, though, if we find that other values work better).
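As a back-of-the-envelope helper, that division is just a ceiling divide. A small sketch, reusing the figures mentioned in this thread (the 2B slice size is the current value quoted above and may change):

```rust
// Estimate execution rounds from an instruction count and a slice size
// (ceiling division). The slice size is replica-defined and may change.
fn estimated_rounds(instructions: u64, slice_size: u64) -> u64 {
    (instructions + slice_size - 1) / slice_size
}

fn main() {
    // The 385B-instruction block mentioned earlier, at 5B per round:
    println!("{}", estimated_rounds(385_000_000_000, 5_000_000_000)); // 77
    // A 3B-instruction message with the current 2B-instruction slices:
    println!("{}", estimated_rounds(3_000_000_000, 2_000_000_000)); // 2
}
```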
For the most part, a global canister setting to set a lower limit should be fine:
Some use cases off the top of my head for why I'd want something like this:

- To run a daily sync or job that performs some sort of MapReduce across much of the data in the canister. I'd like for this job to be able to span multiple rounds of consensus, but maybe I don't want just any update call to be able to span multiple rounds of consensus.
- Based on the size of the data structure and/or any GC required (i.e. in Motoko), some calls that access or modify large data structures (i.e. insert/delete in a BTree) will periodically need to span multiple rounds of consensus if the operation is inefficient (i.e. scan + filter) or the GC is invoked. For these APIs, I'd like to give them permission to span multiple rounds.
However, for a simple API that sets a more primitive parameter or performs a simple computation, there's no reason for it to span multiple rounds. Maybe this suggests I'm failing to check the size limits of my input parameters, or that I have an expensive operation in my code.
I'm not convinced that the per-API limits can't just be solved by the developer testing and putting in these checks themselves, but the lower overall canister limit would definitely be useful.
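Those developer-side checks could look like a per-method instruction budget consulted inside hot loops. The sketch below simulates the counter with a plain variable; in a real canister the value would come from `ic0.performance_counter()`, and the budget and costs here are made-up numbers:

```rust
// Sketch of a per-method instruction budget. `spent` stands in for
// ic0.performance_counter(); a real canister would read the counter
// periodically and trap (or return an error) once the budget is exceeded.
fn run_with_budget(work_items: u64, cost_per_item: u64, budget: u64) -> Result<u64, String> {
    let mut spent: u64 = 0; // simulated performance counter
    let mut done: u64 = 0;
    for _ in 0..work_items {
        if spent + cost_per_item > budget {
            return Err(format!("budget exceeded after {} items", done));
        }
        spent += cost_per_item;
        done += 1;
    }
    Ok(done)
}

fn main() {
    // A cheap call fits in the budget; an expensive one is rejected early.
    assert_eq!(run_with_budget(100, 1_000, 1_000_000), Ok(100));
    assert!(run_with_budget(10_000, 1_000, 1_000_000).is_err());
    println!("ok");
}
```

Checking the counter only every N iterations would keep the overhead of the guard itself negligible.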