
Technology

Supported LLMs

Our interface currently supports the major open-source models listed below, and we plan to add more open-source models over time.

| Model | Current Speed |
| --- | --- |
| Llama 2 70B (4096 Context Length) | ~300 tokens/s |
| Llama 2 7B (2048 Context Length) | ~750 tokens/s |
| Mixtral 8x7B SMoE (32K Context Length) | ~480 tokens/s |
| Gemma 7B (8K Context Length) | ~820 tokens/s |
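As a rough illustration of how these figures might be used, the sketch below estimates generation time from the listed speeds and checks which models can hold a given prompt plus reply in their context window. It is not part of the QuickAi interface: the `MODELS` dictionary and both helper functions are hypothetical names introduced here, the speeds are the approximate values from the table, and "32K" and "8K" are assumed to mean 32768 and 8192 tokens.

```python
# Approximate figures copied from the table above; real throughput will vary.
# Assumption: "32K" = 32768 tokens and "8K" = 8192 tokens.
MODELS = {
    "Llama 2 70B": {"context_tokens": 4096, "tokens_per_s": 300},
    "Llama 2 7B": {"context_tokens": 2048, "tokens_per_s": 750},
    "Mixtral 8x7B SMoE": {"context_tokens": 32768, "tokens_per_s": 480},
    "Gemma 7B": {"context_tokens": 8192, "tokens_per_s": 820},
}


def estimate_seconds(model: str, output_tokens: int) -> float:
    """Rough time to generate output_tokens at the listed speed."""
    return output_tokens / MODELS[model]["tokens_per_s"]


def models_that_fit(prompt_tokens: int, output_tokens: int) -> list[str]:
    """Models whose context window can hold the prompt plus the reply."""
    needed = prompt_tokens + output_tokens
    return [
        name
        for name, spec in MODELS.items()
        if spec["context_tokens"] >= needed
    ]


if __name__ == "__main__":
    # Hypothetical request: a 3000-token prompt with a 600-token reply.
    for name in models_that_fit(prompt_tokens=3000, output_tokens=600):
        seconds = estimate_seconds(name, 600)
        print(f"{name}: ~{seconds:.2f} s for 600 tokens")
```

The estimate covers generation time only; it ignores network latency, queueing, and prompt processing, so treat it as a lower bound rather than a measured response time.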

llama2-70b-4096
gemma-7b-it
mixtral-8x7b-32768


Last updated 11 months ago
