Octopus v3

The smallest, most powerful on-device multimodal model for super AI agents: fast, accurate, and energy-efficient.

Request Access

Introducing Octopus v3

The smallest, most powerful on-device multimodal model for super AI agents.

Compact size

Fewer than 1 billion parameters.

Multimodal

Processes both text and images for function calling.

High Performance

On par with a combination of GPT-4V and GPT-4.

Multilingual

Fluent in English and Mandarin.

Request Acess

Cool things Octopus v3 can do:

Octopus v3 processes both visual and textual inputs, executing tasks swiftly and precisely. Its compact design and integration of visual data ensure highly accurate and context-aware function calls. Additionally, it is energy-efficient and maintains robust data privacy.

Octopus v3 Demo Video

Octopus model on VR

Compact Multimodal AI for Edge Devices

Discover EdgeAI, a compact AI model designed for edge devices that handles text, visuals, and audio in English and Chinese. It's efficient on low-power devices. Access demos and tools for research.

Technical Report

Explore our collection of 200+ Premium Webflow Templates