Introducing Cipher AI Vault - A fully sandboxed AI demo w/ memory

Project Overview

Cipher AI Vault is a proof of concept that integrates an in-memory VectorDB and LLM within a canister on the Internet Computer. Designed for developers and researchers working at the intersection of AI and blockchain technology, our project addresses the growing demand for secure, scalable AI tools by operating within a fully sandboxed environment.

Key Features:

  • In-memory VectorDB for efficient data retrieval
  • In-memory LLM for natural language processing
  • Secure asset and data storage
  • Dual sandboxing for enhanced security
  • Integration with multiple wallet options

Web3 Advantages

Cipher AI Vault stands out from traditional Web2 AI platforms by leveraging the power of blockchain technology:

  1. Sandboxed Environment: Our solution operates within the secure confines of both the browser and the canister, providing strong data isolation.
  2. Decentralized Processing: Unlike centralized AI solutions, all data processing occurs within these sandboxed environments, eliminating the risks associated with external data centers.
  3. Tamper-Proof Data: The blockchain-based infrastructure ensures that your data remains intact and unaltered.
  4. High Availability: The decentralized approach avoids single-provider downtime and keeps the application continuously accessible.

Technical Architecture

Built using the Azle framework, Cipher AI Vault enables TypeScript-based AI development for the Internet Computer. Our architecture emphasizes in-memory operations within a secure, sandboxed environment:

  1. Frontend Canister: The main entry point for user interactions.
  2. In-memory VectorDB: Manages embeddings for lightning-fast retrieval.
  3. In-memory LLM: Processes natural language queries and interacts seamlessly with the VectorDB.
  4. Secure Asset Storage: A dedicated module leveraging the Internet Computer’s asset layer.
  5. Secure Data Store: An Azle-based canister for robust data management in stable memory (see the sketch after this list).
  6. Cycles Distro Canister: Efficiently manages cycles and top-ups.
  7. ic-auth: Handles authentication with various wallets (Plug, Stoic, NFID, and Internet Identity).
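
To make the Secure Data Store concrete, here is a minimal sketch of an Azle canister that persists key/value entries in stable memory. This is illustrative only, not Cipher AI Vault's actual code, and it assumes Azle's functional Canister API (circa v0.20); constructor and method signatures vary between Azle releases.

```typescript
// Minimal sketch of an Azle data-store canister using stable memory.
// Not the project's real code; Azle's API differs between versions.
import { Canister, Opt, query, StableBTreeMap, text, update, Void } from 'azle';

// Memory id 0 identifies this map in stable memory, so entries
// survive canister upgrades.
const files = StableBTreeMap<text, text>(0);

export default Canister({
    // Store or overwrite a named data file.
    addFile: update([text, text], Void, (name, contents) => {
        files.insert(name, contents);
    }),
    // Look up a file by name; returns None when absent.
    getFile: query([text], Opt(text), (name) => {
        return files.get(name);
    }),
});
```

Once deployed, addFile and getFile are exposed as Candid methods, and the map's contents persist across upgrades because they live in stable memory rather than the heap.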

Internet Computer Superpowers

Cipher AI Vault harnesses the unique capabilities of the Internet Computer:

  • Dual Sandboxing: Combines browser-based and canister sandboxing for highly secure, isolated AI operations.
  • Secure Asset & Data Storage: Protects all information from tampering and ensures continuous availability.
  • Multi-Wallet Authentication: Implements the ic-auth module for flexible and secure login options.
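
ic-auth's exact API isn't reproduced here, but as a rough illustration of the kind of flow it wraps, an Internet Identity login with the standard @dfinity/auth-client package looks like this; treat the details as assumptions rather than the module's own interface:

```typescript
// Illustrative Internet Identity login using @dfinity/auth-client.
// ic-auth wraps flows like this for Plug, Stoic, NFID, and Internet
// Identity; its actual API may differ from this sketch.
import { AuthClient } from '@dfinity/auth-client';

export async function loginWithInternetIdentity(): Promise<void> {
    const authClient = await AuthClient.create();
    await new Promise<void>((resolve, reject) =>
        authClient.login({
            identityProvider: 'https://identity.ic0.app',
            onSuccess: resolve,
            onError: reject,
        })
    );
    // The authenticated identity can now be handed to an agent/actor.
    const principal = authClient.getIdentity().getPrincipal().toText();
    console.log(`Logged in as ${principal}`);
}
```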

Project Status and Achievements

We’re proud to announce that Cipher AI Vault has reached a fully functional proof-of-concept stage, with several key milestones:

  • Operational in-memory VectorDB and LLM within the canister environment
  • Secure asset storage utilizing the Internet Computer’s asset layer
  • Robust data storage using stable memory
  • Integration and open-sourcing of the ic-auth npm module
  • Efficient cycle management with a developer-friendly open-source module

Future Roadmap

We’re committed to continuous improvement and expansion. Our future plans include:

  1. Data Store backup canister
  2. Edit functionality for Data Store file entries
  3. Support for multiple in-memory LLMs
  4. Model storage in asset canisters
  5. Embeddings backup in stable memory
  6. Document-to-Data File generation using in-memory LLM
  7. In-memory Stable Diffusion for image generation and storage

Get Involved

We invite developers, researchers, and blockchain enthusiasts to explore Cipher AI Vault and join us on this journey:

  1. Developers: Contribute to our open-source projects and help shape the future of AI on the Internet Computer.
  2. Researchers: Leverage our platform for your AI experiments in a secure, decentralized environment.
  3. Blockchain Enthusiasts: Explore the intersection of AI and blockchain technology through our innovative solution.

Let’s revolutionize AI together with Cipher AI Vault, where security meets innovation on the Internet Computer!


Have questions or want to collaborate? Drop a comment below or reach out to us directly. We’re eager to hear your thoughts and ideas!

This is cool! You should connect with @patnorris to see if you can present at the DeAI Technical Working Group!

A few follow-up questions:

  1. What was it like working with Phi-3-mini-4k-instruct?
  2. It looks like you are using your own vector DB implementation. Is there a reason for that as opposed to using the existing vector DBs on ICP?

Sure! I’ll definitely reach out and explore the Working Group opportunity.

  1. Working with Phi-3-mini-4k-instruct has been a positive experience. I’m impressed with the progress of in-memory models, particularly those utilizing WebGPU. Having worked with these models for a while, I’ve found that they’ve recently reached a point where they’re genuinely useful. The project is designed to support easy model swapping, and we plan to let users choose from various models in the future.

  2. Yes, we’re using a custom version of client-vector-search. We chose this over existing ICP vector DBs because our solution is fully in-memory, implemented in TypeScript with HNSW, and doesn’t rely on external bindings. It runs entirely on the client side, is lightweight, and developer-friendly. This approach enables us to maintain a completely sandboxed environment.
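
For anyone curious, the upstream client-vector-search package exposes a small embed-and-search API along these lines; our version is a customized fork, so details may differ:

```typescript
// Basic usage of the upstream client-vector-search package.
// Cipher AI Vault uses a customized fork, so its API may differ.
import { getEmbedding, EmbeddingIndex } from 'client-vector-search';

async function demo() {
    // Embeddings are computed client-side in the browser.
    const initialObjects = [
        { id: 1, name: 'apple', embedding: await getEmbedding('apple') },
        { id: 2, name: 'banana', embedding: await getEmbedding('banana') },
    ];
    const index = new EmbeddingIndex(initialObjects);

    // Embed the query string, then retrieve the nearest neighbors.
    const queryEmbedding = await getEmbedding('fruit');
    const results = await index.search(queryEmbedding, { topK: 2 });
    console.log(results);
}
```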

Thank you for the follow-up.

  1. I am glad that Phi-3-mini-4k-instruct is working well! How’s the overall performance? Did you hit the instruction limit at any point?
  2. Cool, I’ll look more into client-vector-search. I’ve been trying to learn more about how Vector DBs work!

cool stuff :+1: if you like, we’ve got a weekly DeAI working group call for ICP in the Discord

this is the event for this week: ICP Developer Community

The Phi-3-mini-4k-instruct model has been working surprisingly well, delivering fast and relevant responses. So far, we haven’t hit instruction limits in our tests, and the setup has been efficient for most use cases. The use of client-vector-search helps by pulling relevant context from an embedding space before the LLM responds, which lightens the load on the model and helps avoid those limits, especially with more complex queries.

We’re also exploring several other models, like Phi-3-mini-128k-instruct-onnx, to handle larger prompts and datasets, which could further enhance scalability and performance.
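
In sketch form, that retrieve-then-generate flow looks roughly like this; embed, searchIndex, and generate are hypothetical placeholders standing in for the project's actual helpers:

```typescript
// Hedged sketch of the retrieve-then-generate flow described above.
// These declarations are placeholders, not the project's real functions.
declare function embed(text: string): Promise<number[]>;
declare function searchIndex(
    query: number[],
    opts: { topK: number }
): Promise<{ object: { text: string } }[]>;
declare function generate(prompt: string): Promise<string>;

export async function answer(question: string): Promise<string> {
    // Pull only the most relevant chunks from the embedding index.
    const hits = await searchIndex(await embed(question), { topK: 3 });
    const context = hits.map((h) => h.object.text).join('\n');
    // Sending just the retrieved context keeps the prompt well under the
    // model's window (e.g. 4k tokens for Phi-3-mini-4k-instruct).
    return generate(`Context:\n${context}\n\nQuestion: ${question}`);
}
```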

Thank you!

I’ll definitely check out the group! I’m very interested in joining and will likely make time for it soon.

Isn’t the app doing the inference on the user’s side, that is, in the user’s browser, loading the LLM models and running them in the frontend using WebGPU?

How are you going to hit any instruction limits with that? :thinking:

Edit: Yeah, it’s Cipher-AI-Vault/frontend/frontend/hooks/modelManager/llm.js (main branch of supaIC/Cipher-AI-Vault on GitHub).

You’re right, the inference happens client-side using WebGPU, so canister instruction limits don’t apply; the relevant constraint is the model’s own context window, which is fixed regardless of the execution environment. Each model can only process a set number of tokens in one pass (4k tokens for Phi-3-mini-4k-instruct, for example), so even in the browser, larger prompts can hit that ceiling. That’s why we’re considering models like Phi-3-mini-128k-instruct-onnx for handling larger datasets.
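
For context, in-browser WebGPU inference with a Phi-3 model can look like the following @mlc-ai/web-llm snippet; our llm.js may use a different runtime or model build, so take this as an assumption-laden sketch rather than our actual code:

```typescript
// Illustrative in-browser inference over WebGPU via @mlc-ai/web-llm.
// Cipher AI Vault's llm.js may use a different runtime or model build.
import { CreateMLCEngine } from '@mlc-ai/web-llm';

async function run() {
    // Downloads and compiles the model for WebGPU on first use.
    const engine = await CreateMLCEngine('Phi-3-mini-4k-instruct-q4f16_1-MLC', {
        initProgressCallback: (p) => console.log(p.text),
    });
    const reply = await engine.chat.completions.create({
        messages: [{ role: 'user', content: 'Summarize my vault contents.' }],
    });
    console.log(reply.choices[0].message.content);
}
```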

Awesome work on Cipher AI Vault!
