Revolutionary real-time screen translation powered by Gemini & Gemma models – premium quality at record-low prices
Check out my other professional tool:
OHLC Forge (Crypto Data Tool)
✨ The defining feature of version 4 ✨
Simple mode is designed for immediate operation – just pick your languages, position your overlays, and start translating.
Custom mode provides granular control over every aspect of the tool, allowing power users to fine-tune quality, performance, and cost.
Industry-first AI text recognition. Use Gemini for ultimate precision or Gemma 4 (DeepInfra) for 4x lower cost (~$0.16/hour) while maintaining near-perfect accuracy.
Multiple models for top-quality, context-aware translation in 100+ languages. Choose Gemini 3 Flash for best results, or Gemma 4 (DeepInfra) for a fast, cost-effective alternative.
High-quality, context-aware translation for over 100 languages. Delivers elite precision for Japanese, Chinese, and European scripts, as well as minority languages such as Welsh, Icelandic, Maori, and Burmese. Context subtitles are provided free and don't count towards your quota.
Send up to 5 previous subtitles with every request. Maintains character names, grammatical flow, and narrative coherence across dialogue.
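A rolling context window like this can be sketched with a bounded queue. The snippet below is only an illustration, not the tool's actual code: the class name `SubtitleContext` and its methods are hypothetical, and the duplicate check is an assumption about how a subtitle that stays on screen across several scans might be handled.

```python
from collections import deque


class SubtitleContext:
    """Keeps the most recent subtitles so they can accompany each request."""

    def __init__(self, max_items: int = 5):
        # A deque with maxlen silently drops the oldest entry when full.
        self._history = deque(maxlen=max_items)

    def add(self, subtitle: str) -> None:
        # Ignore empty text and immediate repeats (same subtitle still on screen).
        if subtitle and (not self._history or self._history[-1] != subtitle):
            self._history.append(subtitle)

    def as_list(self) -> list:
        # Oldest first, newest last.
        return list(self._history)


ctx = SubtitleContext()
for line in ["A", "B", "B", "C", "D", "E", "F"]:
    ctx.add(line)
print(ctx.as_list())  # ['B', 'C', 'D', 'E', 'F']
```

Six distinct lines were added, so the oldest one ("A") falls out and the five most recent remain.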
Real-time token-level analytics: cost per call, per minute, and cumulative cost for Google, DeepInfra and DeepL APIs tracked simultaneously.
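As a rough illustration of per-minute and cumulative tracking across several providers, here is a minimal sketch. `CostTracker` and its method names are hypothetical, and the 60-second sliding window is an assumption about what "per minute" means here.

```python
import time
from collections import deque


class CostTracker:
    """Tracks cumulative and last-minute cost per provider (illustrative only)."""

    def __init__(self, window_seconds: float = 60.0):
        self.window = window_seconds
        self.total = {}         # provider -> cumulative cost in USD
        self._recent = deque()  # (timestamp, provider, cost), oldest first

    def record(self, provider: str, cost: float, now: float = None) -> None:
        now = time.monotonic() if now is None else now
        self.total[provider] = self.total.get(provider, 0.0) + cost
        self._recent.append((now, provider, cost))
        self._trim(now)

    def last_minute(self, provider: str, now: float = None) -> float:
        now = time.monotonic() if now is None else now
        self._trim(now)
        return sum(c for _, p, c in self._recent if p == provider)

    def _trim(self, now: float) -> None:
        # Drop entries that have fallen out of the sliding window.
        while self._recent and now - self._recent[0][0] > self.window:
            self._recent.popleft()


tracker = CostTracker()
tracker.record("google", 0.001, now=0.0)
tracker.record("google", 0.002, now=30.0)
tracker.record("deepinfra", 0.0005, now=61.0)
```

After the third call, the first Google entry is older than 60 seconds, so only the second one still counts towards Google's last-minute figure, while the cumulative total keeps both.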
In-memory LRU cache + optional file cache. Repeated phrases cost zero API credits – they are served instantly from memory or, when the file cache is enabled, from disk.
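A two-level cache of this kind could look roughly like the sketch below. It is an assumption-laden illustration rather than the tool's implementation: `TranslationCache`, `translate_cached`, the JSON file format, and the default capacity are all hypothetical.

```python
import json
from collections import OrderedDict
from pathlib import Path
from typing import Callable, Optional


class TranslationCache:
    """In-memory LRU cache with an optional JSON file behind it (illustrative)."""

    def __init__(self, capacity: int = 1000, path: Optional[Path] = None):
        self.capacity = capacity
        self.path = path
        self._mem = OrderedDict()  # least recently used entry first
        self._disk = {}
        if path is not None and path.exists():
            self._disk = json.loads(path.read_text(encoding="utf-8"))

    def get(self, key: str) -> Optional[str]:
        if key in self._mem:
            self._mem.move_to_end(key)  # mark as most recently used
            return self._mem[key]
        if key in self._disk:
            self._remember(key, self._disk[key])  # promote disk hit to memory
            return self._disk[key]
        return None

    def put(self, key: str, value: str) -> None:
        self._remember(key, value)
        if self.path is not None:
            self._disk[key] = value
            self.path.write_text(
                json.dumps(self._disk, ensure_ascii=False), encoding="utf-8"
            )

    def _remember(self, key: str, value: str) -> None:
        self._mem[key] = value
        self._mem.move_to_end(key)
        while len(self._mem) > self.capacity:
            self._mem.popitem(last=False)  # evict the least recently used entry


def translate_cached(cache: TranslationCache, text: str,
                     translate: Callable[[str], str]) -> str:
    """Calls `translate` only when the text is not cached yet."""
    hit = cache.get(text)
    if hit is not None:
        return hit
    result = translate(text)
    cache.put(text, result)
    return result
```

Rewriting the whole JSON file on every `put` keeps the sketch short; a real implementation would more likely batch writes or use an append-friendly store.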
Automatically scans the full screen for a set period to detect where subtitles appear, then locks the capture area.
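One simple way to turn a scan period into a locked capture area is to take the union of every text box seen during the scan. How the tool actually decides is not documented here, so treat this as a guess: `Rect` and `lock_capture_area` are hypothetical names, and the boxes are assumed to come from whatever OCR pass runs during the scan.

```python
from typing import Iterable, NamedTuple, Optional


class Rect(NamedTuple):
    left: int
    top: int
    right: int
    bottom: int


def lock_capture_area(boxes: Iterable[Rect]) -> Optional[Rect]:
    """Returns the smallest rectangle covering all detected text boxes."""
    boxes = list(boxes)
    if not boxes:
        return None  # nothing detected during the scan period
    return Rect(
        left=min(b.left for b in boxes),
        top=min(b.top for b in boxes),
        right=max(b.right for b in boxes),
        bottom=max(b.bottom for b in boxes),
    )


seen = [Rect(100, 900, 800, 950), Rect(150, 880, 700, 940)]
print(lock_capture_area(seen))  # Rect(left=100, top=880, right=800, bottom=950)
```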
Dynamically expands the OCR capture area to prevent edge-of-frame word truncation and AI hallucinations from tight crops.
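The idea of padding a capture region can be sketched as follows. The function name `expand_region` and the 10% default margin are assumptions for illustration; the tool's real margin logic is described above only as "dynamic".

```python
def expand_region(left, top, right, bottom, screen_w, screen_h, margin=0.10):
    """Pads a capture rectangle by a fraction of its size, clamped to the screen."""
    pad_x = round((right - left) * margin)
    pad_y = round((bottom - top) * margin)
    return (
        max(0, left - pad_x),
        max(0, top - pad_y),
        min(screen_w, right + pad_x),
        min(screen_h, bottom + pad_y),
    )


# A 1720x160 subtitle strip on a 1920x1080 screen
print(expand_region(100, 900, 1820, 1060, 1920, 1080))  # (0, 884, 1920, 1076)
```

The horizontal padding would push the box past both screen edges, so it is clamped to 0 and 1920, while the vertical padding fits and is applied in full.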
Automatically overlays the translation directly onto the original subtitle area for seamless, immersive reading.
Even with this option disabled, a PRO user can drag the target window manually over the subtitles.
Inject a custom instruction into every translation request. Define the tone, style, or game-specific context to ensure consistent character names and immersive dialogue.
Inject a custom instruction into every AI OCR call – ignore HUD elements, strip speaker names, focus on dialogue only.
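Conceptually, both custom prompts amount to appending user text to a fixed base instruction before each request. The sketch below shows that idea only; `build_system_prompt`, the base prompt wording, and the separator line are hypothetical and not taken from the tool.

```python
def build_system_prompt(base_prompt: str, custom_instruction: str = "") -> str:
    """Appends the user's custom instruction to the base prompt, if any."""
    custom = custom_instruction.strip()
    if not custom:
        return base_prompt
    return f"{base_prompt}\n\nAdditional instructions from the user:\n{custom}"


ocr_prompt = build_system_prompt(
    "Extract the subtitle text from the image.",  # hypothetical base prompt
    "Ignore HUD elements and strip speaker names.",
)
print(ocr_prompt)
```

Leaving the base prompt untouched when the custom field is empty keeps requests identical to the default behaviour, which also keeps cached results valid.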
Fully redesigned interface with Simple and Custom modes. Clean, responsive, and self-contained. Built-in High-DPI scaling ensures a perfectly crisp and consistent interface across all screen resolutions.
The advanced native RTL engine powered by PySide6. Flawless character shaping, cursive joining, and bidirectional rendering for Arabic, Hebrew, Pashto, and Persian. Punctuation and numbering are handled with pixel-perfect accuracy.
Unparalleled transparency with dual-layer logging. Short logs provide a quick overview of costs, while Long logs capture the entire API exchange – including system prompts, context subtitles, and raw model responses.
Comprehensive audit trail for every vision-based request. Track image metadata, token efficiency, and model latencies for both Gemini and Gemma pipelines.
For the best balance of quality and cost, we recommend Gemini 3 Flash or Gemini 3.1 Flash-Lite for translation and Gemma 4 for OCR. OCR is the primary cost driver (5-6x more expensive than translation), so switching screen scanning to Gemma cuts your total hourly cost to roughly a third.
• Translation: Gemini 3 Flash
• OCR: Gemma 4 (DeepInfra)
• Estimated cost: ~$0.29 / hour
• Translation: Gemma 4
• OCR: Gemma 4 (DeepInfra)
• Estimated cost: ~$0.18 / hour
Disclaimer: Costs are estimates based on a 750ms Scan Interval and are provided for illustrative purposes only. Actual costs depend on screen content, token density, and current API pricing. You are solely responsible for monitoring and managing your own API usage and billing through your official Google or DeepInfra provider accounts.
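The roughly threefold saving quoted in the recommendation can be sanity-checked with simple arithmetic. The sketch below takes translation cost as one unit, uses the 5-6x OCR-to-translation ratio from the recommendation and the 4x OCR saving quoted for Gemma 4 in the feature list, and assumes translation cost itself stays unchanged. The function name is hypothetical.

```python
def total_cost_ratio(ocr_to_translation: float, ocr_saving: float) -> float:
    """How many times cheaper the whole pipeline gets when only OCR gets cheaper."""
    before = 1.0 + ocr_to_translation               # translation + OCR
    after = 1.0 + ocr_to_translation / ocr_saving   # translation + cheaper OCR
    return before / after


for ratio in (5, 6):
    print(ratio, round(total_cost_ratio(ratio, 4), 2))
# 5 2.67
# 6 2.8
```

Both ends of the range land a little under 3, which is consistent with "approximately" and with the disclaimer that real costs vary with screen content.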
| Model Provider / Name | Input (Image / Text), $ per 1M tokens | Output (OCR / Translation), $ per 1M tokens |
|---|---|---|
| Gemma 4 26B A4B (DeepInfra) | $0.07 | $0.34 |
| Gemini 3.1 Flash-Lite (Google) | $0.25 | $1.50 |
| Gemini 3 Flash (Google) | $0.50 | $3.00 |
⚠️ Note on OCR Resolution (LOW vs MEDIUM):
Although the price per token is identical, the MEDIUM resolution setting captures more pixels and sends a larger data payload per image compared to LOW. Consequently, each screenshot in MEDIUM mode consumes more tokens, resulting in higher actual hourly expenses. Stick to LOW resolution unless you encounter extremely complex or tiny fonts.
*Pricing data verified as of May 2026.
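To see how the table translates into a per-call cost, here is a small sketch. It assumes the listed prices are USD per 1M tokens; the dictionary keys are shorthand labels chosen for this example, not official API model IDs, and the token counts in the usage line are made up.

```python
# (input price, output price) in USD per 1M tokens, copied from the table above.
# Keys are illustrative labels, not provider model identifiers.
PRICES = {
    "gemma-4-26b-a4b": (0.07, 0.34),
    "gemini-3.1-flash-lite": (0.25, 1.50),
    "gemini-3-flash": (0.50, 3.00),
}


def call_cost(model: str, input_tokens: int, output_tokens: int) -> float:
    """Cost in USD of a single API call for the given token counts."""
    price_in, price_out = PRICES[model]
    return (input_tokens * price_in + output_tokens * price_out) / 1_000_000


# Hypothetical call: 1,000 input tokens and 100 output tokens
for model in PRICES:
    print(f"{model}: ${call_cost(model, 1000, 100):.6f}")
```

Because the input price is multiplied by the input token count, a MEDIUM resolution screenshot that consumes more image tokens costs more per call even though the per-token price is the same.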
🧙‍♂️ The Witcher 3: Watch our revolutionary AI OCR translate challenging subtitles in real-time!
⚔️ Kingdom Come: Deliverance II
Czech to English Translation
🌌 Star Wars: The Old Republic
French to English Translation
Whether you're playing games with foreign subtitles, learning languages through entertainment, or translating any screen content – our revolutionary AI OCR delivers accuracy that traditional tools can't match.
Minimum Requirements:
Quick Start: Download → Install → Select areas → Start translating!