New ‘Change Subnet Membership’ proposals were submitted yesterday. Reviews to come…
Executed yesterday
New ‘Change Subnet Membership’ proposals were submitted yesterday. Reviews to come…
Since the above discussion I’ve been meaning to take a closer look at certain node providers. Here’s my first attempt.
Sybiling nodes… Does anyone else find this concerning? →
SYBILing nodes! Exploiting IC Network… Community Attention Required!
New ‘Change Subnet Membership’ proposals have been submitted. Reviews to come…
The CO.DELTA team have been discussing the number of offline nodes across subnets. There are many. I plan to periodically update this post, providing timestamps and the state of affairs at that point in time. The aim is to increase visibility, auditing, and to stimulate discussion.
Degraded Nodes Summary at 30/04/2025 07:51:41 UTC
Down Nodes Summary at 30/04/2025 07:51:42 UTC
- 3hhby
- po5od-oz2a3-3mjm5-sv4ss-exwfn-mlbr2-5vf6v-2f52u-uoqh7-yicbb-mae DOWN - 4zbus
- oiso5-hxkkc-elqlo-iyoqg-7oux4-5mscl-zkvoh-b2432-ohrir-b5fcm-gae DOWN - cv73p
- 2mmpk-6kvr3-3syvq-twao3-g3fxc-gz3n2-o3tgf-34irb-idruv-cogfq-sqe DOWN - k44fs
- xmg5b-vziyw-6bzii-yfbmt-zll6k-v5bd5-m3ds6-7vdyy-wmnkd-lznvp-jqe DOWN - lspz2
- lxebw-azsxd-nejr3-ye7ii-vdyjv-kppip-trprn-pdn7o-daluo-5sqtw-eae DOWN - pzp6e
- 3beeq-bvtv4-mxeff-2ypcm-jnw74-envji-bflpz-sipbb-e6gnt-kfptx-5qe DOWN - tdb26
- k2ovz-ptdmz-hzmkh-ntfqn-evhux-ktxz4-26jg6-qjjsa-rpgal-wkdjo-kqe DOWN - x33ed
- r7few-pljgn-iynmr-iprtj-p66dg-qpc5m-2tx4m-245oc-6dzgk-pu2wy-dae DOWN
Isn’t there a remuneration reduction that has taken effect for node providers who have nodes that are offline in subnets? Perhaps @sat or @bjoernek can remind us how that works and if it is applicable. My concern with your plan @Lorimer is that it may not add value when there is already a penalty in place. It may only serve to further demotivate node providers.
As part of performance-based node rewards it was suggested
- Unregistered nodes should receive no rewards.
- Performance of unassigned nodes would be extrapolated from the performance of assigned nodes of a node provider. There is the expectation, that every node provider has at least one assigned node.
For more details, please see here.
It appears that Motion proposal 135054 was Adopted. This was NNS approval to implement the Performance Based Node Rewards. @bjoernek can you confirm that it has already been implemented?
Please correct me if I’m wrong, but this proposal 135054 means that when a node provider has a node that is down in a subnet, it is subject to a remuneration reduction formula and that formula is a penalty that applies to all of their undeployed nodes as well. In other words, nodes in subnets that are not performing well already subject the node provider to a significant financial loss. Is this an accurate summary?
Correct, the according motion proposal has been approved by the NNS. The implementation is work in progress and the feature is not yet live.
Yes, this is a correct summary. I would only add that penalties only apply for longer down times. A failure rate of up to 10% would not result in any reduction.
Thanks @wpb, I understand your concern. However I do think visibility, tracking, and prompts for discussion and reflection are valuable. Even with measures and mechanisms in place, it’s important to check-up on how well they’re working and whether course correction may be needed. As a decentralised network, I think bringing this to the forum regularly is useful.
I’ve added some further details to the structure of that post, based on some feedback from @quint, and also incorporating additional links that I include in my SM reviews. I’ve also provided a summary by node provider showing what percentage of their assigned nodes are degraded or offline.
I plan to create a new post each month, and periodically update that post throughout the month, relying on discord’s change tracking to provide time travel for the reader (over the course of that month). I’ll post one for April in a sec using a post format that I’ll intend not to change (or only change minimally if needed).
Thanks for clarifying @bjoernek. Hopefully this will incentivise NPs to take a more active role in subnet management (raising proposals to swap out their offline/degraded nodes). Of course, if they do this proactively, the measure is no longer a good indicator of their contribution to network resources.
If a node provider has 10 nodes, but only ever has 1 assigned, and the assigned node is always online, but the unassigned nodes are always offline or just degraded - should they really be rewarded as though all nodes are online?
April: Summary at 2025-04-30 22:12
See this post and following discussion for context.
Degraded or Down Nodes
- 3hhby: 1 (4 more would stall the subnet)
- 4zbus: 1 (4 more would stall the subnet)
- bkfrj: 1 (4 more would stall the subnet)
- cv73p: 1 (4 more would stall the subnet)
- k44fs: 1 (4 more would stall the subnet)
- lspz2: 1 (4 more would stall the subnet)
- pzp6e: 2 (10 more would stall the subnet)
- tdb26: 2 (12 more would stall the subnet)
- uzr34: 1 (11 more would stall the subnet)
- w4asl: 1 (4 more would stall the subnet)
- x33ed: 2 (10 more would stall the subnet)
Summary Grouped by Node Provider
- Lukas Helebrandt: 6 out of 6 assigned nodes are degraded or offline (100%)
- 3yok3-yvswm-l4ior-bjh62-fsi73-vfqp3-77wji-coglq-l2bjw-rxrph-uqe DOWN
- afu64-x2met-w2vxb-4lzus-z4fhf-divej-q46gf-wvpek-dj67h-fptsd-aae DOWN
- b44r4-u77ay-myhhv-d6d75-jusik-b7ry2-g6ms5-6okxm-tumyx-rjm4g-4ae DOWN
- izmdg-yz3l4-jp5vr-rva2j-5btpc-bzgt5-7snt4-wwy2g-jaq7v-rzpsv-oae DOWN
- lilkb-wf7hu-iv6gp-gbbv6-f6bpj-amtp5-v2jyi-3s7jw-6dysi-yzm4g-hae DOWN
- mae7q-ggnbj-xb52f-hvest-dgqoc-whvsq-yui5d-5cys2-zeold-l5ydx-oae DOWN
- 3yok3-yvswm-l4ior-bjh62-fsi73-vfqp3-77wji-coglq-l2bjw-rxrph-uqe DOWN
- Bohatyrov Volodymyr: 1 out of 6 assigned nodes are degraded or offline (16.7%)
- Maksym Ishchenko: 1 out of 6 assigned nodes are degraded or offline (16.7%)
- Artem Horodyskyi: 1 out of 7 assigned nodes are degraded or offline (14.3%)
- Vladyslav Popov: 1 out of 7 assigned nodes are degraded or offline (14.3%)
- 87m Neuron, LLC: 1 out of 18 assigned nodes are degraded or offline (5.6%)
- Starbase: 1 out of 19 assigned nodes are degraded or offline (5.3%)
- Iancu Aurel: 1 out of 26 assigned nodes are degraded or offline (3.8%)
- Pindar Technology Limited: 1 out of 27 assigned nodes are degraded or offline (3.7%)
As discussed, performance based node rewards will incentives nodes to stay online. If I understand correctly, currently the only potential penalty is that node might be rejected in SM proposal.
May: Summary at 2025-05-15 18:19 UTC
See this post and following discussion for context.
Degraded or Down Nodes
- 2fq7c: 1 (4 more would stall the subnet)
- k44fs: 1 (4 more would stall the subnet)
- lspz2: 1 (4 more would stall the subnet)
- pzp6e: 1 (11 more would stall the subnet)
Summary Grouped by Node Provider
- Bohatyrov Volodymyr: 1 out of 6 assigned nodes are degraded or offline (16.7%)
- Starbase: 2 out of 19 assigned nodes are degraded or offline (10.5%)
- 87m Neuron, LLC: 1 out of 18 assigned nodes are degraded or offline (5.6%)
With the planned addition of further subnets, the ratio of subnet assigned nodes will increase, meaning a significant portion of each node provider’s nodes will need to be online (also resulting in the collection of performance metrics of these nodes).
If we really encounter a situation where most of a particular node provider’s nodes are continuously offline, this issue can be raised with the provider — and in the worst case, could result in the offboarding of those nodes.
Subnet: e66qm-3cydn-nkf4i-ml4rb-4ro6o-srm5s-x5hwq-hnprz-3meqp-s7vks-5qe - ICP Dashboard
All w4rem nodes were also temporarily offline a second ago when I took the May snapshot. However they appear to be back online now.
Looks like this might be related to GuestOS deployments. e66qm’s one executed about 15 minutes ago.
Update: e66qm back up. Now all nodes in 6pbhf are down. The GuestOS deployment for that subnet was executed 20 minutes ago.
@sat I’ve only just started taking regular snapshots across all subnets. Is it normal to see this sort of downtime during GuestOS deployments?
Cross-posting as this relates to, and addresses some of my questions above.
I also note that in my post above I conflate the proposal execution timestamp with the subnet deployment point in time. The proposal execution just updates the registry (as far as I understand). The subnet itself is later responsible for pulling that config from the registry and updating itself (hence the delay between proposal execution and the subnet going down briefly).