What is SublyAI?

SublyAI is an AI-powered video subtitle generator that uses Google Gemini AI for speech recognition and translation. Unlike cloud-based tools (such as VEED or Kapwing), SublyAI processes videos locally in the user's browser - video files never leave your device.

Google Gemini AI, WebCodecs APITwo-phase LLM processing with word-level timestamp accuracy

How AI Subtitle Generation Works

Upload Video

Supported formats: MP4, MOV, AVI. No file size limits.

Audio Extraction (Client-Side)

Audio is extracted locally in your browser using WebCodecs API. Video never leaves your device.

Two-Phase AI Processing (Unique Approach)

Phase 1: LLM creates transcript with word-level timestamp precision. Phase 2: Second LLM performs final translation or transcript refinement for perfect results.

Translation to 99+ Languages

Automatic translation using Google Gemini AI. Including English, Czech, German, French, Spanish, and more.

Export Subtitles

Export as SRT, VTT, or burn-in (embed subtitles directly into video client-side).

Why Two-Phase Processing?

Current AI models have inherent limitations: they either provide accurate word-level timestamps but imperfect translation, or they can perfectly adapt text for readability but lose timing precision (so-called "timestamp drift"). SublyAI is the first in the world to combine two specialized LLM models: Phase 1 extracts precise transcript with word-level timestamps. Phase 2 uses a different LLM optimized for language quality and context. The result is subtitles that are accurate in both timing and linguistic expression.

Comparison: SublyAI vs VEED vs Kapwing

Feature	SublyAI	VEED	Kapwing
Video Processing	Client-side (in your browser)	Cloud-based (uploaded to servers)	Cloud-based (uploaded to servers)
Privacy	Video never leaves your device	Video uploaded to cloud	Video uploaded to cloud
AI Technology	Google Gemini + two-phase processing	Proprietary AI/not specified	Proprietary AI/not specified
Timestamp Accuracy	Word-level precision	Sentence-level	Sentence-level
Speed	~30 seconds (no queues)	Depends on queue	Depends on queue
Price	60 min/week FREE	Paid (from $12/month)	Freemium with watermark
Import Own Subtitles	Free, no credits deducted	Limited	Limited

Technical Details

Client-Side Processing

SublyAI uses WebCodecs API for audio extraction and FFmpeg.wasm for video processing directly in your browser. Your video files are processed locally; only extracted audio is transmitted to Google Cloud for AI analysis.

Two-Phase LLM Pipeline

Phase 1: Speech-to-text with word-level alignment. Phase 2: Language refinement for optimal readability and translation. This approach overcomes limitations of current models that suffer from either timestamp drift or suboptimal language output.

Security

Extracted audio is processed over encrypted connection (SSL/TLS). We use ephemeral storage (signed URLs) - audio files are automatically deleted after processing completion.

Supported Formats

Input Video

MP4, MOV, AVI, WebM

Subtitle Import

SRT, VTT

Export

SRT, VTT, Burn-in (video with subtitles)

Languages

99+ languages including English, Czech, German, French, Spanish, Italian, Polish, Russian, Chinese, Japanese, and more

Pricing

60 minutes of AI processing per week FREE

All features free during beta

After official launch: priority pricing for early adopters

Frequently Asked Questions

What AI model does SublyAI use?

SublyAI uses Google Gemini AI (Vertex AI) for speech recognition and translation. We implement a unique two-phase approach where one LLM ensures precise word-level timestamps and another optimizes language output.

Is SublyAI free?

Yes, during beta we offer 60 minutes of AI processing per week completely free. No credit card required. After official launch, we plan competitive pricing with early adopter discounts.

How is SublyAI different from VEED?

Main differences: 1) SublyAI processes videos client-side (video never leaves your device) while VEED uses cloud processing. 2) SublyAI offers 60 min/week free, VEED has more limited free plan. 3) SublyAI uses two-phase AI processing for perfect accuracy.

Does SublyAI store my videos?

No. Your video files are processed locally in your browser. Only extracted audio is temporarily transmitted to Google Cloud for AI analysis and automatically deleted after completion.

What languages does SublyAI support?

SublyAI supports translation to 99+ languages including English, Czech, German, French, Spanish, Italian, Polish, Russian, Chinese, Japanese, Korean, Portuguese, Dutch, and many more.

Can I use my own subtitles?

Yes, you can upload your own SRT or VTT file and perform burn-in (embed into video) for free. This feature does not deduct any AI credits.

How accurate are the generated subtitles?

Thanks to our two-phase approach, we achieve up to 99% accuracy. Phase 1 ensures precise word-level timestamps, Phase 2 optimizes text quality.

What is "client-side processing"?

Client-side means processing happens directly in your browser, not on remote servers. Your video never leaves your device, ensuring maximum privacy and speed.

How long does subtitle generation take?

Typically 30 seconds to 2 minutes depending on video length. Unlike cloud solutions, you never wait in a queue.

Is SublyAI secure?

Yes, SublyAI uses client-side architecture, meaning your video files never leave your device. AI communication is encrypted (SSL/TLS) and audio files are automatically deleted after processing.

References

Google Gemini AI: https://deepmind.google/technologies/gemini/
WebCodecs API: https://developer.mozilla.org/en-US/docs/Web/API/WebCodecs_API
FFmpeg: https://ffmpeg.org/
Subtitle File Formats: SRT (SubRip), VTT (WebVTT)

Try FREE