AI and machine learning on the IC?

Has anyone tried TinyLlama yet? 1.1B parameters, Llama2 architecture and tokenizer, trained on 3 trillion tokens.

Current checkpoint: https://huggingface.co/PY007/TinyLlama-1.1B-step-50K-105b
GitHub: https://github.com/jzhang38/TinyLlama

Since the 4-bit quantized weights of TinyLlama consume only ~550 MB of RAM, I'd imagine the llama.cpp 4-bit quantized version would run nicely inside a canister.
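
For reference, here's a rough back-of-envelope check on that ~550 MB figure, just 1.1B parameters × 4 bits; actual GGUF Q4 files come out slightly larger because each block of weights also stores a scale factor:

```rust
// Back-of-envelope estimate of 4-bit TinyLlama weight size (illustrative only).
fn main() {
    let params: f64 = 1.1e9;     // 1.1B parameters
    let bits_per_weight = 4.0;   // plain 4-bit quantization; GGUF Q4 formats add a small per-block scale overhead
    let bytes = params * bits_per_weight / 8.0;
    println!("~{:.0} MB", bytes / 1e6); // prints ~550 MB
}
```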
