SNS upgrade canisters with outstanding callbacks

infu · November 29, 2023, 3:56pm

If a proposal is made to upgrade an SNS dapp canister, will it work when there are outstanding callbacks?
Outside of SNS, to upgrade one of our canisters we have to first stop it, then upgrade it, then start it. Outwise it errors “Can’t upgrade because of outstanding callbacks”

It uses timers and makes a lot of calls, so the chances there won’t be any outstanding callbacks are near zero.
We are going to test that out if someone doesn’t have an answer already.
There is also this in Dfinity’s roadmap

Safe canister upgrades: Currently, canisters cannot be upgraded safely without stopping them to ensure there are no outstanding calls. By introducing named callbacks, canisters can be upgraded without stopping, ensuring that canisters can always be upgraded

Not sure if that’s already done and implemented in Motoko, nor how to name callbacks.
The code causing this looks like this:

Timer.setRecurring( #second 5, func () : async () {
 // make cross canister calls
});

domwoe · November 29, 2023, 4:27pm

I’ll forward it to the team, but I don’t think it will work.

This hasn’t been implemented yet (on platform level).

infu · November 29, 2023, 4:34pm

Thanks. In this case, I think creating a shell canister that installs our canister (as an actor class) with a custom script. This way we still upgrade one canister.
I believe actor classes don’t have candid meta and won’t show in the dashboard, that’s a minus.

infu · November 29, 2023, 6:12pm

Ok I’ve made this playground: https://m7sm4-2iaaa-aaaab-qabra-cai.raw.ic0.app/?tag=44537010
Looks like Motoko has everything that’s needed for this workaround.
‘Bucket’ is the sub canister.
‘Map’ is the shell.
These lines will stop, upgrade, start:

    await ic.stop_canister({canister_id = Principal.fromActor(sa)});
    subactor := ?(await B(#upgrade(sa))()); 
    await ic.start_canister({canister_id = Principal.fromActor(sa)});

How to try:

Deploy ‘Map’.
call get_subactor_principal
Open that canister in the dashboard.internetcomputer.org
Call get_version
Modify the version in ‘Bucket’
Upgrade ‘Map’
Call get_version again in a few seconds. It should be different.

The sub actor has candid meta, which is great.

hpeebles · November 29, 2023, 10:45pm

Whenever a proposal to upgrade an SNS controlled canister is executed, the canister being upgraded is first stopped, then upgraded, then started.

This ensures that all outstanding callbacks are processed before the canister is upgraded.

See the code here - https://github.com/dfinity/ic/blob/087190165198a9dc7b3c9cf80e0812c0c03964c6/rs/sns/governance/src/governance.rs#L2495

When a canister is instructed to stop, it will enter the ‘Stopping’ status which means that it will no longer start processing new requests but will continue to process callbacks until they are all complete.

lara · November 30, 2023, 10:55am

Agree with what has been said here already.
One thing to maybe explicitly point out: this means indeed if the dapp canister has some callbacks that are never returned, then it cannot be upgraded (as it never reaches the “stopped” state after the “stopping” state.

There is a section here that describes a few of those risks in the context of generic proposals (so calls that the SNS governance canister would make). I think many of them also apply for dapp canisters, so maybe it would be interesting to consdier them?
Also let us know if we missed something in this section!

icme · January 2, 2024, 3:16am

Is there any plan to work on the named callbacks feature for the IC in 2024, as well as a timeout with each call + with timeout functionality?

Right now this can be used to stage an attack against a third party service, rendering it upgradeable. Therefore, safely communicating with a canister you do not own requires two one-shot (fire and forget) calls, and collaboration from both parties.

Given the latency of inter-canister communication, named callbacks would be much more efficient (just requires one call instead of two).

free · January 3, 2024, 5:25pm

I cannot speak to named callbacks (although I do believe there is no active work going on in that area right now).

But we are putting together a proposal for best-effort messaging with built in timeouts. We should have more information and will be asking for community feedback soon (as in weeks).

infu · January 3, 2024, 7:16pm

Architecture like this will prevent that. If a 3rd party canister attacks a service, it will only take down its own adapter canister and nothing else. Slows down communication, but has other benefits - the two parties involved can customise it and agree on it.

Topic		Replies	Views
Issue with SNS Upgrade: "Stop Canister Request Timed Out" SNS Project Governance	5	106	January 28, 2025
Prioritize safe instantaneous canister upgrades Roadmap	13	2381	April 19, 2022
Can you upgrade a canister without stopping it? Yes, eh, no, eh … maybe Programs & Applications	2	865	March 20, 2022
Proposal to Upgrade the Oldest Swap Canisters SNS Project Governance nns , sns	5	204	November 6, 2024
Issue: Asset Canister Can't Be Upgraded via SNS proposal Developers Discussing	11	523	April 4, 2024

SNS upgrade canisters with outstanding callbacks

Related topics