How does canister storage get implemented

Astrapolis-peasant · September 7, 2022, 4:14pm

Hello ICP folks,
Since the launch of ICP, I have always been curious about how persistent canister memory is implemented. Since most blockchain use kv DB(level DB, rocks DB) to serialize and deserialize tree nodes, you can’t use straightforward data structures in smart contracts. But ICP supports almost all data structures in a native fashion in memory. So my biggest question is how does canister’s persistent storage gets implemented since I am not sensing any persistent RAM in hardware configuration. Does ICP use Memory Map for mirroring memory in SSD, if so, what data structures it uses (mpt?)

Astrapolis-peasant · September 7, 2022, 4:15pm

It would be a great help if someone could post the state implementation source code

roman-kashitsyn · September 7, 2022, 5:51pm

I wrote a blogpost about the orthogonal persistence feature responsible for canisters state [Blog post] IC internals: orthogonal persistence
There are some code pointers at the end of the article if you’re brave

Astrapolis-peasant · September 8, 2022, 5:04am

thank you roman! That’s super helpful!

Astrapolis-peasant · September 8, 2022, 3:02pm

Hi Roman, one more question, how does dirty memory pages further impact state_root_hash calculation and state_change_hash, could you elaborate on it a little bit. Bascially, how the state_root_hash is derived from the canisters memory pages

roman-kashitsyn · September 8, 2022, 7:15pm

Conceptually, the system periodically hashes the entire state of a subnet, including the memory of all canisters, by constructing an on-disk representation of the state, slicing the files into chunks, and building a shallow Merkle tree out of this structure (state as an artifact). I guess the root hash of that tree is what you mean by state_root_hash.

Of course, re-hashing the whole state is very expensive, so the system uses the information about dirty pages to find chunks that need re-hashing.

I’m not sure what you mean by state_change_hash though.

Topic		Replies	Views
ICP.Lab Storage & Scalability Summaries Developers	16	3702	December 6, 2023
Can the IC be used as a block storage replacement? Programs & Applications	5	587	June 4, 2021
Is Canister Application data stored on the IC blockchain or is it just held in memory? Developers	1	448	May 20, 2022
So the IC can't store files well either? General	16	2572	August 8, 2022
StableBTreeMap in Canisters Rust	17	1450	August 3, 2023

How does canister storage get implemented

Related Topics