Argument error in the rust-connect-py-ai-to-ic project

ydr990318 · November 21, 2024, 10:32am

Hey guys, I’m running rust-connect-py-ai-to-ic’s GPT2 example and I’m getting an error when I run GPT_kv.py. Anybody can tell me how to fix it? Thanks a lot! Here are error messages below: onnxruntime.capi.onnxruntime_pybind11_state.InvalidArgument: [ONNXRuntimeError] : 2 : INVALID_ARGUMENT : Invalid rank for input: past_key_values_input Got: 6 Expected: 5 Please fix either the inputs/outputs or the model.

jennifertran · November 22, 2024, 12:14am

Hey @jeshli Would you be able to help?

ydr990318 · November 22, 2024, 12:05pm

Oh! @jennifertran, haven’t seen you for a while since the last time I posted! I’ve been plagued by this problem these days. Thanks for @ing~

jennifertran · November 22, 2024, 7:05pm

I’ve been away on vacation and then was busy on a few other focus areas!

Anyways, Jeshli can help! Also, if you have a link to the public repo, I can take a look as well. It seems that the original rust-connect-py-ai-to-ic project link is no longer open-sourced.

ydr990318 · November 24, 2024, 2:00pm

Hey, @jeshli , I pulled down the code repository and followed the instructions to install the environment, but I keep getting errors when I run to the script GPT_kv.py, it seems that there is a problem with the input/output model, one dimension is missing. I’ll appreciate it if you could give me some help!~

jeshli · November 24, 2024, 11:30pm

Hi. I’m not entirely sure what is the cause of the error. Someone else had an issue from converting op-set 11 to 14 in order to make it work but then it broke in tract-onnx. You can review this thread Discord in which I provide a link to the onnx model. That will resolve the issue you are facing.

ydr990318 · November 25, 2024, 3:56am

Thanks, @jeshli , I just went to the Discord and download the onnx file, but the out come was the same

ydr990318 · November 25, 2024, 4:34am

@jeshli , here are the full error messages: torch.Size([1, 4, 50257]) Initial Logits: torch.Size([1, 4, 50257]) Initial Past Key Values Shapes: [torch.Size([1, 12, 4, 64]), torch.Size([1, 12, 4, 64]), torch.Size([1, 12, 4, 64]), torch.Size([1, 12, 4, 64]), torch.Size([1, 12, 4, 64]), torch.Size([1, 12, 4, 64]), torch.Size([1, 12, 4, 64]), torch.Size([1, 12, 4, 64]), torch.Size([1, 12, 4, 64]), torch.Size([1, 12, 4, 64]), torch.Size([1, 12, 4, 64]), torch.Size([1, 12, 4, 64])] torch.Size([1, 3, 50257]) Logits: torch.Size([1, 3, 50257]) Past Key Values: 12 pkv: 2 torch.Size([1, 12, 7, 64]) torch.Size([1, 12, 7, 64]) pkv: 2 torch.Size([1, 12, 7, 64]) torch.Size([1, 12, 7, 64]) pkv: 2 torch.Size([1, 12, 7, 64]) torch.Size([1, 12, 7, 64]) pkv: 2 torch.Size([1, 12, 7, 64]) torch.Size([1, 12, 7, 64]) pkv: 2 torch.Size([1, 12, 7, 64]) torch.Size([1, 12, 7, 64]) pkv: 2 torch.Size([1, 12, 7, 64]) torch.Size([1, 12, 7, 64]) pkv: 2 torch.Size([1, 12, 7, 64]) torch.Size([1, 12, 7, 64]) pkv: 2 torch.Size([1, 12, 7, 64]) torch.Size([1, 12, 7, 64]) pkv: 2 torch.Size([1, 12, 7, 64]) torch.Size([1, 12, 7, 64]) pkv: 2 torch.Size([1, 12, 7, 64]) torch.Size([1, 12, 7, 64]) pkv: 2 torch.Size([1, 12, 7, 64]) torch.Size([1, 12, 7, 64]) pkv: 2 torch.Size([1, 12, 7, 64]) torch.Size([1, 12, 7, 64]) torch.Size([1, 7]) torch.Size([1, 3]) torch.Size([1, 12, 4, 64]) Output [198, 40, 1842, 284, 2342] torch.Size([1, 3, 50257]) torch.Size([1, 12]) torch.Size([24, 1, 12, 4, 64]) Second pass: output_id: tensor([[2342]]) new_input_ids: torch.Size([1, 3]) extended_attention_mask: torch.Size([1, 7]) past_key_values: torch.Size([24, 1, 12, 4, 64]) past_key_values_2: torch.Size([24, 1, 12, 7, 64]) torch.Size([1, 5]) /home/ydr/projects/rust-connect-py-ai-to-ic/python/GPT2_kv.py:93: TracerWarning: Converting a tensor to a Python boolean might cause the trace to be incorrect. We can't record the data flow of Python values, so this value will be treated as a constant in the future. This means that the trace might not generalize to other inputs! if torch.all(past_key_values == 0): /home/ydr/anaconda3/lib/python3.11/site-packages/transformers/modeling_attn_mask_utils.py:116: TracerWarning: Converting a tensor to a Python boolean might cause the trace to be incorrect. We can't record the data flow of Python values, so this value will be treated as a constant in the future. This means that the trace might not generalize to other inputs! if (input_shape[-1] > 1 or self.sliding_window is not None) and self.is_causal: /home/ydr/anaconda3/lib/python3.11/site-packages/transformers/modeling_attn_mask_utils.py:164: TracerWarning: Converting a tensor to a Python boolean might cause the trace to be incorrect. We can't record the data flow of Python values, so this value will be treated as a constant in the future. This means that the trace might not generalize to other inputs! if past_key_values_length > 0: [[6332 30 198]] (1, 3) (1, 7) (24, 1, 12, 4, 64) Output ID: [[198]] Past Key Values: (24, 1, 12, 7, 64) [[2061 318 534 4004 6332 30 198]] (1, 7) (1, 7) (12, 2, 1, 12, 1, 64) Traceback (most recent call last): File "/home/ydr/projects/rust-connect-py-ai-to-ic/python/GPT2_kv.py", line 261, in <module> outputs = ort_session.run(None, onnx_inputs) ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ File "/home/ydr/anaconda3/lib/python3.11/site-packages/onnxruntime/capi/onnxruntime_inference_collection.py", line 266, in run return self._sess.run(output_names, input_feed, run_options) ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ onnxruntime.capi.onnxruntime_pybind11_state.InvalidArgument: [ONNXRuntimeError] : 2 : INVALID_ARGUMENT : Invalid rank for input: past_key_values_input Got: 6 Expected: 5 Please fix either the inputs/outputs or the model.

ydr990318 · November 25, 2024, 7:29am

Oh, I asked the chatGPT, and it gave me some advice, and it suggests that the possible cause of this error is the Input Shape of past_key_values,

jeshli · November 25, 2024, 2:33pm

When I provided the onnx file you no longer need to run anything in python. The python files merely download, tweak, and save the GPT2 model to onnx formatting. The onnx file is then uploaded to a deployed canister which runs the tract framework (rust based) and provides a working implementation of GPT2 on chain.

ydr990318 · November 25, 2024, 3:08pm

I see… I followed the instructions to step 4 which is to run the GPT_kv.py script here, do you mean I just skip this step and upload the onnx file?

jeshli · November 25, 2024, 3:22pm

Correct. The GPT_kv.py script is run order to generate the onnx file.

ydr990318 · November 25, 2024, 3:26pm

Oh~ I really appreciate your answer! Your project is really very groundbreaking!

Topic		Replies	Views
Some errors about rust-connect-py-ai-to-ic Developers	2	24	December 3, 2024
Decoding of instructions between the rust language and the GPT2 model Developers	0	10	December 3, 2024
Rust Periodic Timers Rust	1	157	April 23, 2024
Candid 0.9.3 ArgumentDecoder error on update guard Developers rust	6	411	August 4, 2023
Rust: time not implemented on this platform Developers	4	2556	August 11, 2021

Argument error in the rust-connect-py-ai-to-ic project

Related topics