Back to all features
Word-Level Timestamps
Precise start and end times for every word—ideal for subtitles and video editing.
Pro plans and above include word-level timestamps in JSON exports. Each word has exact start and end times, enabling frame-accurate subtitles, precise video editing, and advanced applications like keyword highlighting or searchable video. Use them for SRT/VTT generation, caption sync, or custom tools that need timing at the word level.
Key Benefits
- Exact start and end time for every word in JSON export
- Frame-accurate subtitles and caption sync
- Precise video editing and clip extraction
- Support for advanced apps and searchable video
Use Cases
- Professional subtitle creation
- Video editing with word-level cuts
- Searchable video and keyword highlights
- Accessibility captions with precise timing
Technical Details
Included in JSON export on Pro, Studio, Agency, and Enterprise. Not available on Free tier. Compatible with standard subtitle and editing tools.
Available Plans
Monthly
Annual
Pro
$25/month
For creators & power users
- STT: 1,000 minutes
- TTS: 25,000 characters
- Summaries & exports
- Fast processing queue
- Premium voices ≤ 10%
Most Popular
Studio
$79/month
For serious creators & small teams
- STT: 3,000 minutes
- TTS: 90,000 characters
- Up to 3 users
- Batch uploads
- Priority queue
- Premium voices ≤ 25%
- Export formats (SRT, DOCX, JSON)
Agency
$149/month
For teams, SMBs & API users
- STT: 5,000 minutes
- TTS: 150,000 characters
- Team workspaces (up to 10 users)
- API access (rate-limited)
- Premium voices ≤ 35%
- Usage analytics
Enterprise
Custom(Starting at $300/month)
Custom ($300+ / month)
- Custom volumes
- BYO provider keys
- SLAs & compliance
- Dedicated routing & support