Announcing optimized SHA2 for Motoko

timo · June 26, 2023, 9:44am

We have published an optimized Motoko implementation of all SHA2 functions: https://mops.one/sha2

The package provides sha256, sha224, sha512, sha384, sha512-256, sha512-224.

The possible input types are Blob, [Nat8] and Iter<Nat8>.
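
As a rough illustration of the three input types, a one-shot call could look like the sketch below. The module path mo:sha2/Sha256, the function names fromBlob, fromArray and fromIter, and the #sha256 algorithm tag are assumptions based on the package documentation, so check the README at https://mops.one/sha2 for the exact API of the version you install.

```motoko
// Sketch only: module path and function names are assumed from the package docs.
import Sha256 "mo:sha2/Sha256";

let msg : Blob = "abc";                      // text literal typed as a Blob
let bytes : [Nat8] = [0x61, 0x62, 0x63];     // the same three bytes as an array

// Each call takes the algorithm tag first and returns the hash as a Blob.
let fromBlobHash = Sha256.fromBlob(#sha256, msg);
let fromArrayHash = Sha256.fromArray(#sha256, bytes);
let fromIterHash = Sha256.fromIter(#sha256, msg.vals());

// All three inputs encode the same message, so the hashes must agree.
assert fromBlobHash == fromArrayHash;
assert fromArrayHash == fromIterHash;
```

Since Blob is the input type the benchmark figures are quoted for, it is probably the one to prefer when the data is already available as a Blob.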

The most important performance metric is the number of Wasm instructions used per block (aka chunk) of the input message. For sha256 this has been reduced by a factor of at least 2.5 compared to other existing implementations. For long messages of type Blob, sha256 now uses 295 instructions per byte hashed.

When hashing small messages the per-message overhead is important as well. This can be measured, for example, by comparing the instructions needed to hash the empty message. In this metric we see a 5.2x reduction for sha256.

Finally, the amount of garbage created by heap allocations can be important. We measured that previous implementations of sha256 created at least 6x as much garbage as the size of the input message. This is down to 1.5x now.

Benchmarking details can be found in the README at https://mops.one/sha2

If you have requests to improve the API then please post here or in this OpenChat group.

skilesare · June 26, 2023, 12:31pm

Would it be possible to add an incremental mode that lets you process chunks across rounds? I’m contemplating needing to hash a very large file, or even an entire directory of files to get a hash.

timo · June 26, 2023, 1:28pm

Yes, it has that. It has the typical interface with a Digest class: you do multiple writes to the Digest and then ask for the sum at the end. Each write can take a Blob, a [Nat8] or an Iter<Nat8>, and you can mix types across multiple writes.
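
A minimal sketch of that write-then-sum pattern follows. The class name Digest comes from the post above; the module path and the method names writeBlob, writeArray and sum are assumptions based on the package documentation and may differ between versions.

```motoko
// Sketch only: method names are assumed, not confirmed by this thread.
import Sha256 "mo:sha2/Sha256";

let digest = Sha256.Digest(#sha256);

// Feed the message in pieces, mixing input types across writes.
let firstChunk : Blob = "ab";
let secondChunk : [Nat8] = [0x63];           // the byte for "c"
digest.writeBlob(firstChunk);
digest.writeArray(secondChunk);

// Ask for the hash once all chunks have been written.
let chunkedHash : Blob = digest.sum();

// Hashing the concatenated message in one shot should give the same result.
let whole : Blob = "abc";
assert chunkedHash == Sha256.fromBlob(#sha256, whole);
```

For hashing a large file across several calls, the Digest object would have to be kept alive between the writes, for example in an actor field. Whether that fits a given canister design is not covered by this sketch.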

timo · September 17, 2023, 7:10pm

The library has been updated for use with moc 0.9.8 by taking advantage of the NatX conversions between adjacent X (e.g. Nat32 ↔ Nat16 ↔ Nat8).

This brought a decrease in instructions of 3% across all functions, Sha256 and Sha512.

Most notably, the conversions made it worthwhile to store state and message data in Nat16 words instead of Nat32 words. This then allowed us, for Sha256 at least, to eliminate all heap allocations. Indeed, heap allocation (i.e. garbage creation) is now independent of the message size. We can hash from input type Blob of arbitrary length with a constant heap allocation of 1,008 bytes (for instantiating a class). This change then allowed a further reduction in instructions for Sha256 of 4%.

The new version is 0.0.4. Compared to 0.0.2 we see a total decrease in instructions per byte of 7% for Sha256 and 20% for the empty message.

See the pull request “use mops for all motoko benchmarks” by chenyan-dfinity (Pull Request #87, dfinity/canister-profiling on GitHub) for benchmarking results. In particular, “certified map” improves by 12.5%. This makes sense because Merkle trees hash short messages, so the improvement is expected to fall between the ~7% seen per byte (i.e. for long messages) and the 20% for the empty message.

timo · April 29, 2025, 10:16am

We have optimized SHA2 further and cut the number of instructions roughly in half. This was made possible by the Blob random access that was introduced in moc 0.14.8. Having random access available allowed various improvements that show in particular for large messages.

The new release is 0.1.3. The combined improvements since 0.1.1, that is, since we started using Blob random access, add up to about a 50% reduction in instructions. The heap allocations for sha512 have also been largely eliminated, as was already the case for sha256. You can see benchmarks here: Mops • Motoko Package Manager

The instructions per byte are now 164 for sha256 and 131 for sha512.

skilesare · April 29, 2025, 11:20am

Amazing work!

It looks like you get index-level access. I don’t want to be greedy, but don’t you think a range syntax and optimization would help as well?

I.e. blob[start,end] => Iter or Array

(I’m guessing the second would allocate some memory.) This is probably an under-the-covers-of-Motoko question, but I’m curious what the most efficient loop over a blob would be and whether a lower-level implementation would be significantly better. I guess the first question is one we could benchmark.

timo · April 30, 2025, 8:53am

Yes, I expect further improvements from future extensions in the compiler/language.

To put the current state in perspective, according to canister profiling we are now at 2x the instructions used by the Rust version of sha256. That is 164 instructions per byte for Motoko vs 82 for Rust. I expect that we can close the gap further.

For short messages the Motoko implementation is already faster and that is also why the certified map application performs better in Motoko than in Rust.

I followed up with another release, 0.1.4, which utilised the explodeNat32/64 functions introduced with moc 0.14.9. This speeds up the final sum calculation which results in an improvement in the order of 7% (sha256) and 14% (sha512) for short messages.

JohnNixon6972 · April 30, 2025, 2:30pm

Hey, I just wanted to try out this library in my custom mops package (which has some utilities)… When I tried to publish my updated package, it just fails the tests.

I have just imported the package; I didn’t even use it.

timo · May 2, 2025, 9:23am

The minimum moc version required for sha2 0.1.3 is moc 0.14.8.
And for sha2 0.1.4 it is moc 0.14.9.
Those minimal requirements are specified in the mops.toml of the sha2 package in the [toolchain] section, for example:

[toolchain]
moc = "0.14.8"

You seem to be running an older version of moc. If you are using dfx then you probably still have an older version because dfx may not have caught up with the latest moc release yet. In that case you have to manually install a newer moc.

You can try to put

[toolchain]
moc = "0.14.9"

in the mops.toml of your own package. Then run

mops install

again.

ZenVoich · May 2, 2025, 1:02pm

You can set the minimum required version of moc in the [requirements] section, and mops will show a warning if the user’s moc version is lower than the required one:

[requirements]
moc = "0.14.9"

timo · May 2, 2025, 1:06pm

Thanks for pointing it out. I confused the sections. Unfortunately, sha2 0.1.3 and 0.1.4 were published without the [requirements] section.
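
Putting the two posts above together, a package that needs at least moc 0.14.9 could carry both sections in its mops.toml. This is only a sketch, assuming the two sections can sit side by side in one file: [toolchain] selects the moc that mops uses when working on this project, while [requirements] declares the minimum moc that users of the package are checked against.

```toml
# moc version used for this project itself
[toolchain]
moc = "0.14.9"

# minimum moc version expected from users of the package;
# mops warns if their moc is older
[requirements]
moc = "0.14.9"
```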
