Introducing the LLM Canister: Deploy AI agents with a few lines of code

I noticed that as well. It happens a non-negligible number of times. I think this is a problem with the implementation of the sampler and not with the model itself. Looking into it.
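
For a rough idea of where such bugs tend to hide, here's a minimal top-p (nucleus) sampling sketch in TypeScript (purely illustrative, not our actual sampler): subtle mistakes, like sorting in the wrong order or drawing against the full probability mass instead of the truncated one, are enough to pull far-tail tokens into the output, which reads as gibberish.

function sampleTopP(probs: number[], topP: number): number {
  // Rank token ids by probability, highest first. Sorting ascending by
  // mistake would keep exactly the wrong (far-tail) tokens.
  const ranked = probs.map((p, id) => ({ id, p })).sort((a, b) => b.p - a.p);

  // Keep the smallest prefix whose cumulative mass reaches topP.
  const kept: { id: number; p: number }[] = [];
  let mass = 0;
  for (const t of ranked) {
    kept.push(t);
    mass += t.p;
    if (mass >= topP) break;
  }

  // Draw from the truncated distribution, scaled to its actual mass.
  // Drawing against 1.0 instead of `mass` skews picks toward the
  // lowest-probability kept token.
  let r = Math.random() * mass;
  for (const t of kept) {
    r -= t.p;
    if (r <= 0) return t.id;
  }
  return kept[kept.length - 1].id;
}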

Apart from the times when it returns gibberish, how did you find the model itself?

2 Likes

I think it works quite well for generating tweets based on a persona. Soon, we will start using them to analyze prediction markets, which will be a bit more challenging, since they'll need to estimate probabilities from the available data and compete with one another.
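
For illustration, here's a rough sketch of what such a probability probe could look like on the canister side (the prompt, parsing, and method name are hypothetical, via the @dfinity/llm TypeScript bindings):

import { IDL, update } from "azle";
import * as llm from "@dfinity/llm";

export default class {
  // Hypothetical: ask the model for a probability for a prediction-market
  // question and clamp the parsed reply into [0, 1].
  @update([IDL.Text], IDL.Float64)
  async estimateProbability(question: string): Promise<number> {
    const reply = await llm.prompt(
      llm.Model.Llama3_1_8B,
      `What is the probability (between 0 and 1) that the following happens: ${question}? Reply with a single decimal number only.`,
    );
    const p = parseFloat(reply.trim());
    // Fall back to 0.5 if the model's reply isn't a parseable number.
    return Number.isFinite(p) ? Math.min(Math.max(p, 0), 1) : 0.5;
  }
}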

1 Like

If a closed model is not possible, it would be nice to have DeepSeek working.

2 Likes

@Mar I've made some changes, and you shouldn't be seeing gibberish anymore, or at least it should be far less likely now. We'll have a more permanent fix for that problem very soon.

@marcio I’ve tried DeepSeek extensively, and it’s impressive. We’ll explore supporting one of their distilled versions.

4 Likes

Thanks for building this @ielashi!
We’ve just launched the “LlamaBot” within OpenChat which allows users to easily send prompts to the LLM canister!

9 Likes

How were you able to build a bot for OpenChat with it, @ielashi?

I built it; you can find the source code here: open-chat-bots/rs/offchain/examples/llama at main · open-chat-labs/open-chat-bots · GitHub

9 Likes

Quick update: we released a new example of an agent, just to showcase what it's like to build an agent that specializes in a specific task. In this case, the task is to look up ICP prices.

A Rust and a Motoko implementation are provided in the examples folder here.
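
The pattern is roughly the following; here's a simplified TypeScript sketch of the same idea (not the released Rust/Motoko code; the price lookup is stubbed, and a real agent would fetch it from a live source):

import { IDL, update } from "azle";
import * as llm from "@dfinity/llm";

// Hypothetical stub: a real agent would fetch the live price (for example
// from an exchange rate service); a constant stands in here.
async function lookupIcpPriceUsd(): Promise<number> {
  return 5.0;
}

export default class {
  @update([IDL.Text], IDL.Text)
  async ask(question: string): Promise<string> {
    // Fetch the data this agent specializes in, then hand it to the model
    // as context so the answer is grounded in that value.
    const price = await lookupIcpPriceUsd();
    return await llm.prompt(
      llm.Model.Llama3_1_8B,
      `The current ICP price is ${price} USD. Using only that fact, answer: ${question}`,
    );
  }
}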

6 Likes

Indeed, we need to use better models for this, as the difference in probability estimates is too large (compared to 4o, for instance).

Hey everyone,

I’d like to get some feedback here on what you think should be the highest priority for the LLM canister. There will be continuous work on all the topics below, but I’d like to gauge which one is the most important to you at this stage:

  • Support for more models (still centralized, managed by DFINITY)
  • Decentralizing the AI workers (models will no longer be managed centrally by DFINITY)
  • API improvements (e.g., support for tools, more tokens per request, images)

I'd also love to hear any comments you have regarding your choice.

2 Likes

I think supporting more models is the best choice in the near term. Then we can spend more time on decentralization, which also buys time for open-source models to catch up.

Thanks everyone for voting, I appreciate your input. I've now closed the poll.

As I mentioned, we’ll be making progress across all three fronts, but this poll will definitely help with the prioritization.

Regarding bigger models: can you share which models you'd like to have access to? To allow for decentralization later on, we can only consider open-weight models.


Regarding decentralizing the AI workers: I’ll start a separate thread about that, as it’s a bigger topic.


Regarding API improvements: We just released a TypeScript library that integrates seamlessly with Azle, so developers don't need to learn Motoko or Rust to build AI agents on the IC. Check out the quickstart example, which can be used as a template for building agents in TypeScript (cc @tiago89).

Prompting (single message)

import { IDL, update } from "azle";
import * as llm from "@dfinity/llm";

export default class {
  // Single-turn prompt: send one message to the LLM canister and
  // return the model's reply as text.
  @update([IDL.Text], IDL.Text)
  async prompt(prompt: string): Promise<string> {
    return await llm.prompt(llm.Model.Llama3_1_8B, prompt);
  }
}

Chatting (multiple messages)

import { IDL, update } from "azle";
import { chat_message as ChatMessageIDL } from "azle/canisters/llm/idl";
import * as llm from "@dfinity/llm";

export default class {
  // Multi-turn chat: take the full message history (system, user, and
  // assistant messages) and return the model's next reply.
  @update([IDL.Vec(ChatMessageIDL)], IDL.Text)
  async chat(messages: llm.ChatMessage[]): Promise<string> {
    return await llm.chat(llm.Model.Llama3_1_8B, messages);
  }
}
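
For reference, constructing the message history for chat could look like this (a sketch; the exact ChatMessage and Role definitions are assumptions on my part, so check the @dfinity/llm package for the real types):

// Assumed shape: role/content pairs with a Role enum in the package.
const messages: llm.ChatMessage[] = [
  { role: llm.Role.System, content: "You are a concise assistant." },
  { role: llm.Role.User, content: "What is the Internet Computer?" },
];
const reply = await llm.chat(llm.Model.Llama3_1_8B, messages);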
5 Likes

Thanks for this. Here is what I am using it for.

6 Likes

Very nice, I like the UX :slight_smile: I created an Anima and had a short conversation with it. Thanks for sharing. I recommend demoing this to the DeAI Technical Working Group so that more people from the community can also see it and provide feedback.

2 Likes

I second that! :slight_smile: cc @patnorris

1 Like

That’d be awesome, yes! Would you be open to sharing your work in one of the upcoming DeAI calls, @swift?

Sure :+1:. Just let me know where/when I can join.

1 Like

Hi @swift, the DeAI calls are every Thursday at 6pm CET in the ICP Discord voice channel. This link should take you to this week’s event on Discord (and show the time in your timezone): ICP

We usually have an agenda for the call (and for the next few calls), but we leave a few minutes for new members to introduce themselves and what they’re working on. So you could give a quick intro on one of the next calls, and if you’re open to it, it’d be great to also have a dedicated session to showcase ANIMA during one of the calls in a few weeks, so there’s enough time to properly present it and for discussion and questions (maybe one of the first Thursdays in May?). How does that sound to you?

1 Like

@patnorris Sounds fun, I’m interested.

1 Like

First, let me join everyone in congratulating you.

A useful LLM integration.

Three questions:

  • How large could the token output be in the future? The current 200-token limit is very restrictive.
  • How large is the token input limit now, and do you have any idea what it will be in the future?
  • Do you have plans to incorporate Llama 4? Any news on that front?

Thanks!

Joseph