Hello there,
We are in the last leg of our grant (an LLM marketplace) and are exploring tiny language models to fine-tune for a niche. Below are the ones we are interested in:
SmolLM2-360M-Instruct (360M parameters)
Qwen2.5-Coder-0.5B-Instruct (0.5B parameters)
DistilGPT2 (82M parameters)
I know Qwen has already been worked on in the community, but is there any research on the other two? Or would you recommend another tiny LM to explore?
cc: @YashBit
Thank you!