Grok Imagine
88.5
Performance
★ 4.5
Rating
680
Reviews
Video GenerationProprietary
About
xAI's video generation model with native synchronized audio, producing 720p clips up to 15 seconds with dialogue, music, and sound effects in a single pass.
Strengths
Native audio-video synchronization — generates dialogue, background music, and sound effects alongside video in one step. Fast generation (~30 seconds per clip). Up to 15 seconds at 720p. Aggressively priced API at $0.05/second. Extend from Frame feature for chaining clips. Strong lip-sync for character dialogue. Available via X Premium subscriptions and standalone API.
Specifications
- Context window
- —
- Parameters
- —
Available On
xAI APIX (Twitter)Grok WebImagineArt
Features
text to videoimage to videovideo editingnative audiolip syncvideo extensiontext to image
Performance Trend
Benchmark score trends over time for the top 5 benchmarks.
Loading history...
Benchmarks
Scores from various benchmark tests; higher is better.
| Test | Score | Percentile | Source |
|---|---|---|---|
| VBench | 84.0 | p94 | seed |