So technically, I can deploy a self-hosted off-chain worker that runs my LLM. Then I deploy two canisters: one is the on-chain LLM canister, the other is my chatbot canister. Then I can follow the same flow as you:
- The chatbot canister calls the LLM canister
- The LLM canister ingresses the message and queues it
- The off-chain worker polls the queued messages from the LLM canister
- The worker processes them with the local LLM (or whatever backend)
- The off-chain worker calls the LLM canister to mark the message as processed and insert the newly generated response
- The LLM canister calls the chatbot canister to return the response.
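To make sure I understand the flow, here is a minimal sketch of the worker side. Everything here is hypothetical: `poll_pending` and `submit_response` are stand-ins for the actual canister update calls (which would go through an agent over HTTPS, not local function calls), and the canister is simulated in memory just to show the queue/poll/respond cycle:

```python
# Hypothetical in-memory stand-in for the LLM canister's message queue.
# In a real deployment these would be update calls made through an agent
# (e.g. ic-agent), not local method calls.
class LlmCanisterStub:
    def __init__(self):
        self.queue = []       # messages ingressed by the chatbot canister
        self.responses = {}   # msg_id -> generated text

    def ingress(self, msg_id, prompt):
        self.queue.append((msg_id, prompt))

    def poll_pending(self, limit=10):
        # The off-chain worker pulls up to `limit` queued messages.
        batch, self.queue = self.queue[:limit], self.queue[limit:]
        return batch

    def submit_response(self, msg_id, text):
        # The worker marks the message processed and stores the reply;
        # the canister would then call back into the chatbot canister.
        self.responses[msg_id] = text


def run_local_llm(prompt):
    # Placeholder for the self-hosted model.
    return f"echo: {prompt}"


def worker_tick(canister):
    # One polling iteration of the off-chain worker.
    for msg_id, prompt in canister.poll_pending():
        canister.submit_response(msg_id, run_local_llm(prompt))


canister = LlmCanisterStub()
canister.ingress(1, "hello")
canister.ingress(2, "world")
worker_tick(canister)
print(canister.responses)  # {1: 'echo: hello', 2: 'echo: world'}
```

In practice the worker would run `worker_tick` in a loop with a sleep or backoff between polls, and would need to handle retries so a crashed worker doesn't leave messages stuck in the queue.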
Am I right? If I do this, what should I be concerned about? I saw that you have been working on improvements such as extending the input/output limits. Can you share some of the difficulties and obstacles you have encountered?