Stopped canisters and recurring timers (discussion)

ggreif · March 9, 2023, 9:42am

As you probably know, timers in Motoko give only a few guarantees (same applies to global replica timers, so we inherit this):

they expire at (or after) the specified delay
recurrent timers will stick to a regular expiration “grid”

I’d like to describe and discuss an issue with recurring timers in the default mechanism that might be slightly surprising to you.

When a recurring timer is added, the expiration grid is being defined as the current (round) time and the multiples of the specified delay (until upgrade). This is done to eliminate “wander”, and takes care that expected expiration times are equidistant. There will still be perceived “jitter” (variation after the expected expiration time) due to the underlying replica’s coarse temporal resolution.

One consequence of this is that if you set up a recurring timer with zero delay, all expirations coincide, so we treat these as one-off timers.
The other consequence is that you should not specify recurring timers with a small delay (less or comparable to the round time of the subnet) as the timer may lag the “grid” infinitely and never catch up.

Then there is the question about stopped canisters with recurring timers. Stopping also causes the suspension of global timer callbacks to the canister, and thus introducing bad jitter. When the canister is started, the default mechanism will try to process the jobs of all missed expirations according to the grid and thus exhibit a heartbeat-like pattern until the timer has caught up.

If this sounds like a behaviour negating the purpose of timers, you are probably right. The quick sequence of expirations causes unnecessary cycle burn and can probably not be amortised with useful work.

So what I am proposing is to skip missed grid points and catch up instantly by always setting the next scheduled expiration into the future. This would also fix the “too short” delay problem, but potentially break “counting” jobs that rely on every grid point spawning a fresh job.

What do you think?

ZhenyaUsenko · March 9, 2023, 9:56am

For some reason I thought current implementation already skips timers for stopped canisters. I would definitely prefer skipping over processing all missed timers on startup

skilesare · March 9, 2023, 10:47am

I actually thought that all timers were abandoned on upgrade and had to be rehydrates.

Do non-recurring timers persist after upgrade?

icme · March 9, 2023, 10:58am

2 different behaviors with recurring timers.

upgrading a canister kills the timer
stopping and restarting a canister with the timer leads to the catch-up behavior @ggreif described above.

I agree that a timer shouldn’t try to recoup lost execution on restart and should just pick up where it last left off.

ggreif · March 9, 2023, 11:12am

My message was about stop_canister and start_canister messages to the management canister. Not about upgrades per se. (Timers — whatever ilk — won’t survive an upgrade, you have to set new ones.)

skilesare · March 9, 2023, 12:11pm

Ahhh…got it! Makes sense.

ggreif · April 3, 2023, 11:58am

I’ve put up the review for quite some time, feat: when re-adding recurrent timers, skip over past expirations by ggreif · Pull Request #3871 · dfinity/motoko · GitHub.
It is basically a one-liner change and I expect it being merged soon. Most probably it will end up in moc 0.8.7.

(It is released now, and dfx 0.14-beta1 has picked it up.)

Iceypee · April 11, 2023, 4:26pm

How about pre-upgrades. If I wanted to wait x seconds/blocks to allow all my functions await calls to settle, can I throw a ten second timer in the preupgrade that executes all the upgrades after 10 seconds?

ggreif · April 11, 2023, 7:33pm

When control returns from the pre_upgrade hook, the IC will assume that all relevant data is written to stable memory and will thus activate the new binary. All information in the regular heap (where your new timers live) will be erased. So I doubt your idea will work, moreover you’ll expose yourself to data loss.

Please check with the canister lifecycle state machine that is compiled into the Motoko canisters, as it contains the necessary logic to only stop the canister when all outstanding messaging has come to an end.

cryptoschindler · April 27, 2023, 8:27am

Can we add some documentation around this behaviour to the docs here

ggreif · April 27, 2023, 4:11pm

The link you mentioned was about CDK (yes, it applies too there). Then there is Timers Library Limitations, which could be a nice place to explain some of this.

Topic		Replies	Views
Motoko Timer gets cancelled on canister upgrade Language Support Motoko	8	649	June 19, 2023
Timer API redundancy is causing excessive burn rate Developers	10	240	December 4, 2023
Reason/Possibility for timer being canceled by replica? Language Support	5	542	September 29, 2023
Timers and Base Library: Weekly Motoko Updates Language Support Motoko	4	1197	January 19, 2023
Will an upgrade timer Developers rust	9	88	November 9, 2024

Stopped canisters and recurring timers (discussion)

Related topics