Technical Working Group DeAI

jeshli · July 17, 2024, 11:56pm

Stable memory has a 400GB capacity per canister

zensh · July 18, 2024, 12:02am

Wasm heap memory, per canister: 4GiB
Wasm stable memory, per canister: 400GiB

It’s possible to upload a single 400GB file using ic-oss, and it uses concurrent upload technology with ReadableStream. Maybe the upload speed will be faster.

github.com

ldclabs/ic-oss/blob/main/src/ic_oss/src/bucket.rs#L373


      
          let has_hash = hash.is_some();
          let mut frames = Box::pin(FramedRead::new(stream, ChunksCodec::new(CHUNK_SIZE)));
          let (tx, mut rx) = mpsc::channel::<Result<Progress, String>>(self.concurrency as usize);
          let output = Arc::new(RwLock::new(UploadFileChunksResult {
              id,
              filled: 0,
              uploaded_chunks: exclude_chunks.clone(),
              error: None,
          }));
          
          let uploading_loop = async {
              let mut index = 0;
              let mut hasher = Sha3_256::new();
              let semaphore = Arc::new(Semaphore::new(self.concurrency as usize));
          
              loop {
                  let access_token = self.access_token.clone();
                  let tx1 = tx.clone();
                  let output = output.clone();
                  let permit = semaphore
                      .clone()

gip · July 18, 2024, 2:24am

I guess the question is how fast can the software get access to the memory. For the heap it’s RAM so fast. Is stable loaded on demand from disk?

zensh · July 18, 2024, 3:22am

Running an AI model still requires loading the model file from stable memory to heap memory, so it is still subject to the current 4GB limit of WASM32.

KinicDevContributor · July 18, 2024, 6:00am

Our use case is deploying a vector database which index size is over 70GiB

KinicDevContributor · July 18, 2024, 6:03am

I’m using tokio’s stream_iter for parallelization, but it’s possible that your implementation is faster.
If you have the opportunity to upload tens of GiB, let’s compare!

patnorris · July 26, 2024, 7:10am

Hi everyone, thank you for yesterday’s call (2024.07.25). This is the generated summary (short version, please find the long version here:
DFINITY is expanding its AI team by hiring an AI engineer specializing in GPU optimizations and deep learning libraries to work alongside the current team member, Islam, who has extensive experience in the Internet Computer (IC) and AI. The DeAI group discussed networking to find suitable candidates and expressed enthusiasm for the new role. The team is transitioning with the help of Ulan, planning milestones for GPU and CPU optimizations on IC, and working on various demos such as face recognition and GPT. Members shared interests in decentralized AI and gaming applications. Tim presented Hypercerts from Protocol Labs, which use semi-fungible certificates to incentivize and validate impact in projects like public goods and environmental remediation. The group discussed future collaborations, emphasizing decentralized AI development and community-driven efforts. Action items include sharing the job opening, continuing demo development, and exploring Hypercerts’ applications in AI projects.

Special thanks to Tim for sharing Hypercerts with us💪

Links shared during the call:

DeAI manifesto: Manifesto for Decentralized AI (DeAI) - Google Docs
DeAI docs: AI Documentation - Google Docs
DeAI repo for PRs: DeAIWorkingGroupInternetComputer/Projects at main · DeAIWorkingGroupInternetComputer/DeAIWorkingGroupInternetComputer · GitHub
ICP game: https://woglz-oqaaa-aaaal-act2q-cai.icp0.io/
See PDF with info on Hypercerts in this week’s folder: DeAIWorkingGroupInternetComputer/WorkingGroupMeetings/2024.07.25 at main · DeAIWorkingGroupInternetComputer/DeAIWorkingGroupInternetComputer · GitHub and more links around what Tim shared below:
https://capitalinstitute.org/wp-content/uploads/2015/04/2015-Regenerative-Capitalism-4-20-15-final.pdf
Leucaena leucocephala - Wikipedia
Greenpill Brasil | Giveth

patnorris · August 2, 2024, 10:38am

Hi everyone, thank you for yesterday’s call (2024.08.01) and special thanks to Krish for sharing his work. This is the generated summary (short version, please find the long version here:

In the latest DeAI call, the success of the recent Web3 AI hackathon was highlighted, which saw high-quality submissions focused on AI development using web technologies. Winning projects, such as DeDa, a decentralized data marketplace, Clanopedia and ICMLPL, which integrates the Flashlight deep learning library, will continue as developer grantees. The community is encouraged to participate in weekly calls for knowledge exchange, utilize the decentralized AI channel on Discord, and explore support programs like the Olympus accelerator for further growth and investment opportunities.

Links shared during the call:

patnorris · August 8, 2024, 6:12pm

Hi everyone, thank you for today’s call (2024.08.08). Special thanks to @icarus for sharing his latest research on viable GPU hardware options for the IC! This is the generated summary (short version, please find the long version here):

During today’s DeAI Working Group call for the Internet Computer, the primary focus was on discussing hardware options for optimizing AI workloads. The group explored various GPUs and AI accelerator cards, considering factors such as cost, power efficiency, and suitability for both inference and training. The discussion highlighted the potential of Tens Torrent’s wormhole cards, which offer a balance of performance and affordability, and emphasized the importance of an open-source software stack for flexibility and problem-solving. There was also mention of ongoing efforts to integrate GPU functionalities into the IC’s host functions and the potential need for customized hardware solutions to support AI-driven tasks effectively.

Links shared during the call:

Wormhole
https://www.esperanto.ai/
DeAI manifesto to contribute to: Manifesto for Decentralized AI (DeAI) - Google Docs
DeVinci alpha release: https://x6occ-biaaa-aaaai-acqzq-cai.icp0.io/

laska189345938458347 · August 10, 2024, 11:32pm

Hello! How i can join to next call?

patnorris · August 12, 2024, 10:21am

Hi there, we meet each Thursday in the ICP Developer Community Discord. We use the voice channel for the call:

Looking forward to having you

patnorris · August 15, 2024, 8:50pm

Hi everyone, thank you for today’s call (2024.08.15). This is the generated summary (short version, please find the long version here):

During the DeAI Working Group call, a presentation was given on Clanopedia, a decentralized, verifiable data source built during the Web3 AI hackathon, focusing on creating a collective knowledge base using a vector database architecture stored on the Internet Computer Protocol. The discussion then shifted to documentation updates, with an extension of the deadline for contributions, and the team explored ongoing work on AI transformers for specific use cases, emphasizing the importance of community collaboration. The call also featured an in-depth discussion on AI regulation, particularly the European Union’s AI Act, its implications for high-risk AI applications, and the evolving landscape of global AI governance. The session wrapped up with reflections on the need for future workshops, code walkthroughs, and regular updates on AI regulations, highlighting the importance of staying informed on legal developments as AI technology advances.

links shared during the call:

DeAI docs to contribute to: AI Documentation - Google Docs
DeAI manifesto to contribute to: Manifesto for Decentralized AI (DeAI) - Google Docs
DeAI repo for project additions (as PR) and existing projects (incl. Vector dbs): DeAIWorkingGroupInternetComputer/Projects at main · DeAIWorkingGroupInternetComputer/DeAIWorkingGroupInternetComputer · GitHub
Vector db by Kinic DAO: GitHub - ClankPan/ic-vectune at develop/kinic_db_instance

Please find Tim’s slides on AI regulation in the repo’s folder for today’s meeting too: DeAIWorkingGroupInternetComputer/WorkingGroupMeetings/2024.08.15/AI Law Regulation Compliance - ICP DeA WG Briefing 1_1.pdf at main · DeAIWorkingGroupInternetComputer/DeAIWorkingGroupInternetComputer · GitHub

special thanks to @tinybird for the demo on Clanopedia and Tim for sharing his expertise on AI regulations💪

patnorris · August 22, 2024, 6:26pm

Hi everyone, thank you for today’s call (2024.08.22). This is the generated summary (short version, please find the long version here):
The DeAI Working Group for the Internet Computer discussed several key topics, including the one-year anniversary of running LLMs on the ICP, promotional activities, and the potential for organizing an AI symposium. The meeting also reviewed various AI projects on the Internet Computer, covering different programming languages and frameworks, such as TypeScript, C++, Python, and Rust, and discussed potential integrations with Awesome ICP and decentralized AI documentation. The group emphasized the importance of sharing and curating resources, training data, and ensuring comprehensive and up-to-date documentation for AI on ICP. Future discussions will focus on expanding these resources and fostering collaboration.

Links shared during the call:

DeAI event in Palo Alto: Palo Alto AI x Web3 Summit to Debut at Stanford University This October - Decrypt
Updated outline of AI Documentation: AI Documentation - Google Docs
DeAI manifesto to contribute to: Manifesto for Decentralized AI (DeAI) - Google Docs
DeAI repo: DeAIWorkingGroupInternetComputer/Projects at main · DeAIWorkingGroupInternetComputer/DeAIWorkingGroupInternetComputer · GitHub
DeAI projects on Awesome ICP: GitHub - dfinity/awesome-internet-computer: A curated list of awesome projects and resources relating to the Internet Computer Protocol
Training data related article: [2407.14933] Consent in Crisis: The Rapid Decline of the AI Data Commons
Open dataset: Dolma | Ai2

JaMarco · August 27, 2024, 4:27am

JaMarco · August 27, 2024, 8:58pm

x.com

Alexander Long

@AlexanderJLong

So crazy to me that 6 months ago literally everyone serious AI researcher I talked to told me stuff like this wasn't feasible. Decentralized Training is gonna work - we're still at DDP stage and there's a long way to go for giant model runs, but no doubt where the vector of… x.com/i/web/status/1… x.com/NousResearch/s…

Nous Research @NousResearch

What if you could use all the computing power in the world to train a shared, open source AI model? Preliminary report: https://t.co/b1XgJylsnV Nous Research is proud to release a preliminary report on DisTrO (Distributed Training Over-the-Internet) a family of… https://t.co/FUXbjd794g https://t.co/h2gQJ4m7lB

7:06 PM - 26 Aug 2024 260 22

patnorris · September 5, 2024, 5:51pm

Hi everyone, thank you for today’s call (2024.09.05). This is the generated summary (short version, please find the long version here):

In today’s DeAI Working Group call for the Internet Computer, discussions centered on AI development with a focus on large language models (LLMs), vector databases, and decentralized AI applications. Key insights included the development of personal AI systems using secure, tamper-proof vector databases on the blockchain, offering users control over their own models and data, avoiding centralized data collection risks. The group emphasized optimizing LLMs like LLaMA for specific tasks such as email search and educational tools. They also explored the potential for a decentralized marketplace for AI models, enabling secure monetization of AI contributions on the ICP platform, with a long-term vision similar to Hugging Face, enhanced by blockchain features.

Links shared during the call:

JaMarco · September 24, 2024, 3:34am

patnorris · September 26, 2024, 5:27pm

Hi everyone, thank you for today’s call (2024.09.26). This is the generated summary (short version, please find the long version here ):

Today’s DeAI Working Group call for the Internet Computer focused on potential Delta V integrations, new participant introductions, and plans to publish a DeAI manifesto on ICP with an emphasis on accessibility, self-sovereignty, and AI safety. The group also explored on-chain vector databases, LLM scaling challenges, and standards for LLM inference. Future discussions will address cost efficiency, instruction limits, and the environmental impact of decentralized AI.

Links shared during the call:

patnorris · October 3, 2024, 7:27pm

Hi everyone, thank you for today’s call (2024.10.03). This is the generated summary (short version, please find the long version here ):

In today’s DeAI Working Group call, participants discussed the challenges of implementing shared stable storage for canisters to avoid re-uploading large AI models across subnets. The conversation also focused on the serialization of tensors, exploring the need for standardized formats such as ONNX or MLIR for data sharing between canisters. The group also touched on the future potential of using GPU subnets for AI computation, dealing with issues like determinism and instruction limits, and the possibility of creating decentralized systems to rent out and protect training data for AI models.

Links shared during the call:

Serialization - ONNX 1.18.0 documentation
MLIR Bytecode Format - MLIR
Tensor Operator Set Architecture (TOSA) Dialect - MLIR
GPU comparison: x.com
GPU benchmark: GPU Benchmarks Hierarchy 2024 - Graphics Card Rankings | Tom's Hardware

ktimam · October 11, 2024, 12:30pm

Anyone faced similar issue while working on deep learning or have a sample for dividing work into multiple messages?

Topic		Replies	Views
Come hear about the state of the ART on ZKML. *ICP is the global orchestration layer for DeAI Showcase	19	289	June 27, 2025
Announcing Technical Working Groups Developers	38	26074	July 25, 2024
DeAI Marketing Site and Campaign Survey: Q2 2024 Developers	0	120	April 15, 2024
Technical Working Group: Scalability & Performance Developers Discussing , community-consideration	173	9984	June 24, 2025
Working with Decentralized LLM Developers	2	521	January 14, 2024

Technical Working Group DeAI

Links shared during the call:

Links shared during the call:

Links shared during the call:

Links shared during the call:

Related topics