Hi @jeshli, it's me again. Does rust-connect-py-ai-to-ic have a tokenizer? Otherwise the input and output are tensors, not readable text.
Yes. It’s in one of the branches.
Oh, I found it, but I noticed that tokenize_text returns vec int32 while model_inference takes vec int64. If these two datatypes don't match, how do I use tokenizer_backend? Happy New Year, btw.
You should be able to into() the type.
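For example (a minimal sketch; tokenize_text and model_inference follow the interface you described, and the token values are made up):

```rust
// Bridging the two canister methods: tokenize_text returns Vec<i32>,
// while model_inference expects Vec<i64>.
fn to_model_input(token_ids: Vec<i32>) -> Vec<i64> {
    // i32 widens to i64 losslessly, so into() works per element.
    token_ids.into_iter().map(|id| id.into()).collect()
}

fn main() {
    let tokens: Vec<i32> = vec![15496, 11, 995]; // e.g. the output of tokenize_text
    let input: Vec<i64> = to_model_input(tokens); // ready for model_inference
    assert_eq!(input, vec![15496i64, 11, 995]);
}
```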
But for reference, we just open-sourced new resources for implementing models and tokenizers in a more straightforward and cleaner way.
It is being extended to work directly with Hugging Face's transformers library, to which we are actively contributing, making it easier for anyone to deploy any model to the Internet Computer.
Wow, amazing project! I'd like to ask you about a Transformer-based time-series prediction model that we trained and tried to upload to the project backend, but setup_model reported an error. Can this project not run Transformer models?
Error: Failed while trying to deploy canisters.
Caused by: Failed while trying to install all canisters.
Caused by: Failed to install wasm module to canister 'predict_demo_backend'.
Caused by: Failed during wasm installation call
Caused by: The replica returned a rejection error: reject code CanisterError, reject message Error from Canister bkyz2-fmaaa-aaaaa-qaaaq-cai: Canister called ic0.trap with message: Panicked at 'not yet implemented', /home/ydr/.cargo/git/checkouts/tract-01a3138c7a4b58a3/2a2914a/core/src/ops/change_axes.rs:856:17.
Consider gracefully handling failures from this canister or altering the canister to handle exceptions. See documentation: http://internetcomputer.org/docs/current/references/execution-errors#trapped-explicitly, error code None
I'd be guessing, but perhaps tract-onnx does not have that operation implemented. Have you confirmed that you can run your model with tract-onnx outside of the canister?
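A standalone smoke test could look roughly like this (a sketch only; the model path and the (1, 64, 8) input shape are placeholders for your actual time-series model):

```rust
use tract_onnx::prelude::*;

fn main() -> TractResult<()> {
    // Load and optimize the ONNX model the same way the canister would.
    let model = tract_onnx::onnx()
        .model_for_path("model.onnx")? // placeholder path
        .with_input_fact(0, f32::fact([1, 64, 8]).into())? // placeholder shape
        .into_optimized()?
        .into_runnable()?;

    // Run a dummy input through it; if an op is unimplemented,
    // tract should fail here with "not yet implemented", as in your canister.
    let input: Tensor = tract_ndarray::Array3::<f32>::zeros((1, 64, 8)).into();
    let result = model.run(tvec!(input.into()))?;
    println!("{:?}", result[0]);
    Ok(())
}
```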
Candle also has ONNX support, and I suspect it has more active development. I can look into making a cleaner version of what you are using (rust-connect-py-ai-to-ic) with Candle instead of tract, but it will take some time.
Other than that, I would rewrite your time-series model in Rust; see the Candle sketch below.
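If you want to try Candle's ONNX path standalone first, it would look roughly like this (an untested sketch; candle-onnx implements only a subset of ONNX ops, so it may or may not cover your model, and the path and shape are placeholders):

```rust
use std::collections::HashMap;
use candle_core::{DType, Device, Tensor};

fn main() -> anyhow::Result<()> {
    // Parse the ONNX graph; candle-onnx evaluates it op by op.
    let model = candle_onnx::read_file("model.onnx")?; // placeholder path
    let graph = model.graph.as_ref().expect("model has no graph");

    // Feed a dummy tensor under the graph's declared input name.
    let input = Tensor::zeros((1, 64, 8), DType::F32, &Device::Cpu)?; // placeholder shape
    let mut inputs = HashMap::new();
    inputs.insert(graph.input[0].name.clone(), input);

    let outputs = candle_onnx::simple_eval(&model, inputs)?;
    for (name, tensor) in outputs {
        println!("{name}: {:?}", tensor.shape());
    }
    Ok(())
}
```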
I created a Rust project locally as you suggested and tried to run the model with tract-onnx, but it reported the same error. I'll try Candle.
I briefly tried Candle, but it needs a device (Device::Cpu or a GPU) when loading models. Is Candle supported inside a canister?
The repo I shared has an example that deploys a GPT-2 model running with Candle.
Yes, you need to use Device::Cpu, as you'll see in the example code, because the IC does not have GPUs … yet.
You would need to either write your models in Rust or wait for me to develop (or develop yourself) a Candle-ONNX wrapper integration that works the way rust-connect-py-ai-to-ic does.
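For a feel of the canister-side pattern, here is a toy sketch (not the actual GPT-2 example; the forward endpoint name and the doubling "model" are stand-ins, just to show Candle on Device::Cpu inside a canister method):

```rust
use candle_core::{Device, Tensor};
use ic_cdk::query;

// Toy endpoint: a real canister would run model weights here;
// this just doubles the input to show Candle on Device::Cpu.
#[query]
fn forward(input: Vec<f32>) -> Vec<f32> {
    let n = input.len();
    let x = Tensor::from_vec(input, (1, n), &Device::Cpu).unwrap();
    let y = x.affine(2.0, 0.0).unwrap(); // stand-in for real inference
    y.flatten_all().unwrap().to_vec1::<f32>().unwrap()
}
```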
Oh, I found it! I will study this code carefully. Thank you so much for so many days of patient answers and replies; looking forward to your new project!