Device License Pricing
Full-stack AI voice solution for hardware, licensed per device per year. Includes ASR + LLM + TTS + Visual Understanding — all in one SDK.
Choose Your Tier
Each device is bound to a single License, valid for 1 year from activation. Voice, LLM inference, and visual understanding are all included and integrated through a single SDK.
| Included per License | Basic | (second tier) | (third tier) |
|---|---|---|---|
| Voice time | 9 hours | 19 hours | 60 hours |
| LLM inference tokens | 1,350,000 | 2,850,000 | 9,000,000 |
| Visual conversation * | 50 rounds | 200 rounds | 1,000 rounds |
| ASR + LLM + TTS + Vision | Yes | Yes | Yes |
| Single SDK integration | Yes | Yes | Yes |
10 free Basic Licenses included upon signup (valid for 1 year). All devices under one product must use the same tier.
* Visual conversation rounds only count visual understanding API calls. Voice time and LLM tokens consumed during the conversation are deducted from their respective quotas normally. Based on 720p (1280×720) standard resolution; higher resolutions are scaled proportionally by pixel area.
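The pixel-area scaling described in the footnote can be sketched as a small helper. This is only an illustration: the function name `visual_rounds` is hypothetical, and treating frames at or below 720p as one full round is an assumption, since the footnote only describes higher resolutions.

```python
# Reference frame for one visual conversation round: 720p (1280x720).
BASE_PIXELS = 1280 * 720


def visual_rounds(width: int, height: int) -> float:
    """Estimate the rounds charged for one visual understanding call.

    Assumption: frames above 720p scale linearly with pixel area, and
    frames at or below 720p are charged as one round.
    """
    if width <= 0 or height <= 0:
        raise ValueError("width and height must be positive")
    return max(1.0, (width * height) / BASE_PIXELS)


if __name__ == "__main__":
    for label, (w, h) in {"720p": (1280, 720), "1080p": (1920, 1080), "4K": (3840, 2160)}.items():
        print(f"{label}: {visual_rounds(w, h)} rounds")
```

Under these assumptions a 1080p frame comes out at 2.25 rounds and a 4K frame at 9 rounds. Actual deductions are determined by the API response and may differ.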
Pay-As-You-Go, Only for What You Use
When License quotas are exhausted, purchase resource packs to continue. Resource packs are shared across all devices in a product and remain valid until fully consumed.
| Resource | Billing Unit | Unit Price |
|---|---|---|
| LLM Inference | 1 Billion tokens | $40 |
| Voice Time (ASR + TTS) | 100 hours | $19 |
| Visual Conversation * | 1,000 rounds | $2.99 |
| Voice Cloning | voice / year | $21 |
* Visual conversation rounds only count visual understanding API calls, excluding voice and LLM consumption during the conversation. Based on 720p; 1080p ≈ 2.25 rounds, 4K ≈ 9 rounds. Actual usage may vary per API response.
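As a rough planning aid, the cost of covering usage beyond the License quotas can be estimated from the table above. The sketch below assumes resource packs are sold only in whole billing units, which the table does not state explicitly. The names `PACKS`, `packs_needed`, and `overage_cost_usd` are hypothetical and not part of any SDK.

```python
import math

# Resource packs from the price table: (units per pack, price in US cents).
PACKS = {
    "llm_tokens": (1_000_000_000, 4000),  # 1 billion tokens for $40
    "voice_hours": (100, 1900),           # 100 hours for $19
    "visual_rounds": (1_000, 299),        # 1,000 rounds for $2.99
}


def packs_needed(resource: str, overage: float) -> int:
    """Whole packs required to cover the given overage (assumption: no partial packs)."""
    size, _ = PACKS[resource]
    return math.ceil(overage / size) if overage > 0 else 0


def overage_cost_usd(overage: dict[str, float]) -> float:
    """Total pack cost in USD for usage beyond the License quotas."""
    cents = sum(packs_needed(name, amount) * PACKS[name][1]
                for name, amount in overage.items())
    return cents / 100


if __name__ == "__main__":
    # Example: a product needs 2M extra tokens and 250 extra voice hours across its devices.
    print(overage_cost_usd({"llm_tokens": 2_000_000, "voice_hours": 250, "visual_rounds": 0}))
```

Because packs are shared across all devices in a product, the overage passed in should be the product-wide total rather than a per-device figure.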
Full-Stack Cost at Equivalent Usage
Based on Basic tier usage (9 hours voice + 1.35M tokens), comparing the equivalent full-stack cost using each vendor's paid LLM.
* Based on publicly listed prices as of February 2026. OpenAI uses gpt-realtime audio token billing (input $32 + output $64 per million tokens). The OpenAI bar in the chart is truncated; its actual cost is about 90× that of the next-highest vendor. All CNY prices are converted at $1 ≈ ¥7. See the table below for detailed breakdowns.
| Vendor | Billing Model | ASR | LLM | TTS | Total |
|---|---|---|---|---|---|
| Adventists AI | Full-Stack License | — | — | — | $0.99 |
| Alibaba Qwen | ASR+LLM+TTS Separate | $0.37 | $0.27 | $1.00 | $1.64 |
| Doubao | Full-Stack License | — | — | — | $1.99 |
| iFlytek | Install Base + LLM | $0.43 (one-time install) | incl. | | $4.48 |
| OpenAI | Realtime API per token (audio input $32 + output $64 / M tokens) | — | — | — | ~$410 |
* Alibaba Qwen uses Paraformer + Qwen-Plus + CosyVoice-flash. iFlytek uses offline install base + Spark Pro/Max. OpenAI estimated using gpt-realtime standard model audio tokens. All CNY-denominated vendor prices converted at $1 ≈ ¥7.
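The arithmetic behind the comparison can be checked with a short sketch using the figures listed above. Two points are assumptions: the iFlytek figure of $4.48 is read as that vendor's total, and the OpenAI figure is the approximate estimate from the table. The names `totals` and `multiple_of_license` are illustrative only.

```python
from decimal import Decimal

# Alibaba Qwen is billed per component, so its total is the sum of the parts.
qwen_parts = {"ASR": Decimal("0.37"), "LLM": Decimal("0.27"), "TTS": Decimal("1.00")}
qwen_total = sum(qwen_parts.values(), Decimal("0"))

# Totals in USD at Basic tier usage (9 hours voice + 1.35M tokens).
totals = {
    "Adventists AI": Decimal("0.99"),
    "Alibaba Qwen": qwen_total,
    "Doubao": Decimal("1.99"),
    "iFlytek": Decimal("4.48"),  # assumption: read as the row total
    "OpenAI": Decimal("410"),    # approximate estimate from the table
}


def multiple_of_license(vendor: str) -> Decimal:
    """Vendor total as a multiple of the $0.99 License, rounded to one decimal place."""
    return (totals[vendor] / totals["Adventists AI"]).quantize(Decimal("0.1"))


# Ratio of the OpenAI estimate to the next-highest total.
openai_vs_next = (totals["OpenAI"] / totals["iFlytek"]).quantize(Decimal("0.1"))

if __name__ == "__main__":
    for vendor in totals:
        print(f"{vendor}: ${totals[vendor]} ({multiple_of_license(vendor)}x the License)")
    print(f"OpenAI vs next highest: {openai_vs_next}x")
```

With these inputs the OpenAI estimate is roughly 90 times the next-highest total, consistent with the note on the comparison.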
Ready to Get Started?
Whether you're integrating APIs or building hardware, we're ready.