Technical Working Group DeAI

Samer · May 21, 2024, 9:42pm

Someone linked this critique of KANs on the MLST discord

zensh · May 24, 2024, 2:15pm

Great topic! I took some time to read through it and learned a lot about DeAI.

We developed a DeAI canister based on Huggingface’s Candle framework and tried running a 0.5B parameter, 1.2G data LLM.

It can run in a local cluster with adjusted instruction limits, and here are the results:

dfx canister call bwwuq-byaaa-aaaan-qmk4q-cai update_chat '(record {prompt="Nice to chat with you, Please introduce yourself."})'
(
  variant {
    Ok = record {
      instructions = 1_128_679_236_950 : nat64;
      tokens = 43 : nat32;
      message = "\nHello! I am Panda Oracle, a giant panda with human intelligence. I am here to help you with any questions or concerns you may have. How can I assist you today?\n";
    }
  },
)

Source code: ic-panda/src/ic_panda_ai at main · ldclabs/ic-panda · GitHub

jeshli · May 24, 2024, 2:56pm

Thank you for sharing. I have also been experimenting with candle and believe it to be an excellent crate for streamlining ML on the IC.

icpp · May 25, 2024, 10:17pm

Awesome accomplishment!!

Can you explain more whay you mean with ‘adjusted instructions limit’ ?

Also, are you doing query or update calls ?

zensh · May 26, 2024, 1:39am

Tool to build local IC replica and dfx:

I’ve tested it, and the query call works:

github.com

ldclabs/ic-panda/blob/main/src/ic_panda_ai/src/api_query.rs#L40


      
              take: Option<u32>,
              _access_token: Option<ByteBuf>,
          ) -> Vec<FileInfo> {
              let max_prev = store::state::with(|s| s.file_id).saturating_add(1);
              let prev = prev.unwrap_or(max_prev).min(max_prev);
              let take = take.unwrap_or(10).min(100);
              store::fs::list_files(prev, take)
          }
          
          #[ic_cdk::query]
          async fn chat(args: types::ChatInput) -> Result<types::ChatOutput, String> {
              let msg = json!([{
                  "role": "system",
                  "content": "You are a giant panda with human intelligence, the best friend and assistant to humans, named \"Panda Oracle\", born on the Internet Computer (ICP).",
              }, {
                  "role": "user",
                  "content": args.prompt,
              }]);
          
              let seed = args.seed.unwrap_or_else(|| {
                  u64::from_be_bytes(sha3_256(ic_cdk::id().as_slice())[..8].try_into().unwrap())

However, changing the instruction limit for query calls is a bit tricky. You need to modify the max_query_call_graph_instructions parameter in the ic.json5 file generated during runtime.

icpp · May 26, 2024, 10:17pm

Thank you for those details!

TusharGuptaMm · May 27, 2024, 7:13am

Our team @ RuBaRu is taking baby steps towards developing OnChain AI services. These innovations aim to enhance user experience and bring greater transparency to content distribution and discovery. Additionally, our AI will analyse and tag rich media, such as images and videos, with content types. I believe we have a long road ahead, but it’s a solid start towards keeping the ecosystem fully OnChain. We welcome your ideas and feedback as we advance in the SocialFi space.

Read more: RuBaRu - A Fully On-Chain Regenerative Creators-Consumers Economy - #7 by TusharGuptaMm

berestovskyy · May 29, 2024, 4:27pm

Hey folks,
The WebAssembly SIMD is available on the mainnet and the latest dfx:

To install the latest dfx with SIMD support: dfxvm install 0.20.2-beta.0
The Rust SIMD example, which compares naive, optimized, auto-vectorized, and SIMD intrinsic matrix multiplication: examples/rust/simd at master · dfinity/examples · GitHub
A bit of SIMD documentation on the IC: https://internetcomputer.org/docs/current/developer-docs/smart-contracts/advanced-features/simd/

Let me know if you have any questions,
and enjoy!

TusharGuptaMm · May 29, 2024, 4:30pm

This is amazing! Will give it a try!

icarus · May 29, 2024, 6:20pm

That is welcome and much anticipated great news!
Thanks for sharing it here, I will copy it to the DeAI WG Discord channel and start spreading the word!

Many thanks to you and other Dfinity team members who have worked on the SIMD support and pushed it along @berestovskyy

cryptoschindler · May 30, 2024, 1:10pm

Dear WG members,

please add any missing resources to this page

zensh · May 30, 2024, 1:59pm

A pull request to add ICPanda AI canister: chore: add ICPanda AI to AI docs by zensh · Pull Request #3004 · dfinity/portal · GitHub

patnorris · June 5, 2024, 8:21am

Hi everyone, this Thursday we’ll have our joined call with the DFINITY growth team We’ll discuss how to best collaborate, so please bring all your ideas As usual, it’s on Thursday at 6pm CET and in the ICP Developer Discord. This is a link to the event: Discord
Looking forward to the call!

patnorris · June 6, 2024, 7:22pm

Hi everyone, thank you for today’s call and the great discussion! Special thanks to @jennifertran for putting it together This is the summary of the call, looking forward to collaborating on the next steps and ideas we had
Long summary here: DeAIWorkingGroupInternetComputer/WorkingGroupMeetings/2024.06.06 at main · DeAIWorkingGroupInternetComputer/DeAIWorkingGroupInternetComputer · GitHub

Short summary:
The DeAI Working Group call for the Internet Computer focused on improving AI project marketing and documentation and was a joined session with DFINITY team members across growth, marketing and adoption. Participants discussed the need for better marketing strategies, comprehensive AI documentation, and practical use cases to demonstrate ICP’s benefits. Collaborative efforts were emphasized, including creating a decentralized AI manifesto, collecting success stories, and exploring monthly public updates. The call concluded with actionable steps to support ecosystem teams, enhance marketing resources, and ensure the successful promotion of decentralized AI on the Internet Computer.

catpirate3 · June 13, 2024, 5:30pm

Are the meetings being recorded? or just the summary?

patnorris · June 13, 2024, 6:31pm

Hi there, the meetings are not recorded, but we have all summaries so far in here: DeAIWorkingGroupInternetComputer/WorkingGroupMeetings at main · DeAIWorkingGroupInternetComputer/DeAIWorkingGroupInternetComputer · GitHub

patnorris · June 13, 2024, 6:33pm

Hi everyone, thank you for today’s call! Special thanks to @gip for sharing insights on his exciting work This is the short summary (long summary here: DeAIWorkingGroupInternetComputer/WorkingGroupMeetings/2024.06.13 at main · DeAIWorkingGroupInternetComputer/DeAIWorkingGroupInternetComputer · GitHub):
In today’s call of the DeAI Working Group for the Internet Computer, several key initiatives were discussed. The team plans to develop a survey to gather information from ecosystem teams building AI projects, aiming to create a tailored marketing strategy. The importance of tracking usage statistics for decentralized AI applications was emphasized to demonstrate activity and traction to potential partners. Updates were provided on the scheduling of DeAI live streams and organizing a DeAI panel for the Chain Fusion Day event at EthCC. A proposal to draft a decentralized AI manifesto was made to outline principles and goals. The group considered creating a chart to rate projects on their actual use of AI technology, enhancing transparency. Jessie presented a draft outline for new documentation pages to improve decentralized AI documentation. Metrics for tracking developer engagement were discussed.
Giles shared his challenges in fine-tuning a large language model (LLM) on the Internet Computer, highlighting the approach of dividing the model into smaller segments. Technical issues like efficient data loading across canisters and handling large data files were discussed, with suggestions to use separate endpoints to load weights. The discussion emphasized the importance of data privacy in AI applications and the potential of the Internet Computer in collaborative AI development. It was noted that specialized tasks might not require extremely large models, with more compact models being sufficient for many use cases. The group touched on current capabilities of the IC in running LLMs and the need for efficiency improvements. Discussions included optimizing Rust code using SIMD, creating a reusable library in Rust, and managing large models by splitting them into canisters. The challenges and potential of utilizing GPUs for AI on the Internet Computer were briefly discussed. The call concluded with a reminder of the upcoming scalability and performance working group meeting, encouraging participants to share topics and initiatives for future sessions.

catpirate3 · June 13, 2024, 6:42pm

Please do record the future discussions @jennifertran, thanks.

icpp · June 16, 2024, 4:13pm

@patnorris ,
I like to bring awareness to the project of @ktimam , who is porting Flashlight to the IC. This is a C++ machine learning library, and it includes a port of openblas, fftw & arrayfire. A truly amazing effort.

We adapted iccp-pro to support static libraries and C++17, and he is now able to build the wasm that should be able to run on the IC, but he is running again into the dreaded maximum on allowed globals.

He already entered a request to get it increased, here, but I like to also give this issue visibility through communication from our DeAI working group with DFINITY scaling team.

patnorris · June 17, 2024, 5:08pm

Thank you @icpp and sounds exciting @ktimam!

Cool, happy to talk more about this in the group and see how we can support communication and progress on this

And @ktimam if you’re interested in discussing your work with or maybe giving a demo to the DeAI group, please let me know and we’ll arrange happy to learn more

Topic		Replies	Views
AI and machine learning on the IC? Developers	114	10157	June 20, 2024
DeAI.chat – Decentralized AI chat on the Internet Computer Showcase DeAI	0	131	February 25, 2025
Generative AI related Application development on IC Programs & Applications	2	680	July 20, 2023
Llama2.c LLM running in a canister! Programs & Applications	61	4898	July 1, 2024
Working with Decentralized LLM Developers	2	527	January 14, 2024

Technical Working Group DeAI

Related topics