Hi everyone, thank you for today’s call (2024.02.15). This is the generated summary (short version, please find a link to the comprehensive version below):
The Decentralized AI Technical Working Group for the Internet Computer’s call focused on two main areas: establishing benchmarks for AI on the IC and presenting a Vector DB implementation by the ELNA team.
Benchmarking and AI Deployment on the IC:
- The session opened with introductions to foster community connections, followed by discussions on deploying TensorFlow models using Azle and TensorFlow JS, highlighting the IC’s potential for decentralized AI services.
- @icpp detailed the memory requirements for large language models (LLMs), emphasizing the need for accurate benchmarking to understand computational demands and optimization strategies, including half-precision computation to reduce resource requirements.
- Technical challenges of AI deployment were discussed, focusing on memory limitations and the search for decentralized alternatives to centralized cloud services.
- The group explored technical solutions to these challenges, such as the potential shift to WASM 64 support for improved efficiency and the importance of collaborative tool development for model deployment.
Vector DB Implementation and Future Directions:
- The ELNA team (@branbuilder) introduced Vector DB, emphasizing its role in managing high-dimensional vectors for AI models and facilitating fast, accurate similarity searches essential for tasks like semantic search.
- A technical overview explained Vector DB’s operation, including the use of the HNSW algorithm for efficient indexing and cosine similarity for measuring vector closeness.
- Future enhancements for Vector DB were discussed, including supporting various similarity measures and integrating more efficient vector embedding models to improve search capabilities.
- Challenges of deploying AI models on the IC due to computational constraints were acknowledged, with strategies discussed for optimizing deployments through smaller models or task segmentation.
- The session concluded with a focus on collaborative opportunities, the development of educational resources, and the potential for community-driven efforts to advance decentralized AI on the IC.
Long summary: 2024.02.15_ICDeAITWG_CallSummary - Google Docs