Translation API Guide
The tool integrates 8 classic translation APIs and 21 large language models, so you can pick whichever fits your text type, budget, and privacy needs.
Which API should I pick?
For detailed comparisons and how to get API keys, keep reading ↓
Classic Translation APIs
Notes:
- DeepL can't be called directly from the browser — the tool routes through a built-in proxy by default. If you have your own proxy, fill it in the API URL field.
- Qwen-MT is Alibaba Cloud's translation-specialized model. See Qwen-MT essentials below.
- TranslateGemma is Google's open-source translation-specialized Gemma model. You'll need to run it locally with LM Studio / Ollama / llama.cpp — see Local Model Setup.
- GTX API/Web are free but rate-limited. Use a paid API for long-running jobs.
For more reliable service, apply for a commercial API key — see the API application guide.
Large Language Models (LLMs)
Supported: DeepSeek, OpenAI, Claude, Gemini, Qwen, Moonshot (Kimi), Zhipu GLM, Doubao, Xiaomi MiMo, MiniMax, Tencent Hunyuan, Baidu ERNIE, Cohere, xAI (Grok), Mistral, Perplexity, OpenRouter, Groq, SiliconFlow, GitHub Models, Nvidia NIM, Azure OpenAI, plus any OpenAI-compatible endpoint.
LLMs work best for:
- Literature and technical documentation that needs deeper understanding
- Multilingual content where consistent terminology matters
- Custom prompts to control translation style
Key parameters:
- Model: enter the model name from your provider; for Azure OpenAI, enter the deployment name.
- Temperature: defaults to 0.7. Try 0.2 for technical content, 0.9 for marketing or creative paraphrasing.
- Thinking mode: lets the AI think before translating — higher quality, slower and pricier. Supported models show a toggle in the UI. The toggle is stored per model so switching models preserves each one's setting independently. The exact UI form depends on the provider:
- Three levels (off / low / medium / high): Claude, Gemini, DeepSeek, OpenAI GPT-5, Qwen3, Azure OpenAI, Nvidia NIM, Perplexity Sonar Deep Research
- Binary toggle (off / on): Doubao, Zhipu GLM, Moonshot (Kimi), MiniMax, Xiaomi MiMo, Hunyuan TurboS, Mistral (Medium 3.5 / Small 4), Cohere Command A Reasoning (underlying APIs accept on/off but not an effort param)
- Low / high only: xAI Grok 4.3 (API limitation)
- Three-state (off / on / auto): when you enter an unlisted custom model on a thinking-capable provider (incl. Mistral, Perplexity, and the custom OpenAI-compatible endpoint). Auto omits the thinking param to follow the model's built-in default — a fallback for strict providers that 422 on a non-thinking SKU; defaults to Off
- Always-on SKUs (no toggle shown): ERNIE 5.0-Thinking, Hunyuan 2.0-Thinking, Mistral Magistral, Perplexity Sonar Reasoning Pro, Grok 4.20 Reasoning / Multi-Agent
- No thinking toggle: GitHub Models (the gateway doesn't support reasoning params)
Regional Endpoint Switcher
Many providers run separate endpoints for Mainland China, International, and US regions. The official endpoints appear as quick-pick chips above the URL field — click to switch:
URL Auto-Completion
OpenAI-compatible URL fields auto-complete to the full path when focus leaves the field — paste http://host:port or http://host:port/v1 and the tool fills in the rest. This catches the most common "incomplete URL → connection failure" mistake.
Qwen-MT Essentials
Qwen-MT is a machine translation service (not a general LLM). It has no system-prompt concept and works purely with source/target language codes — so the Prompt settings don't apply.
Picking a Model
You'll need to fill in the Model field manually:
Domain Hint
The domains field tells the model what industry the text is from, so terminology lands closer to the field. Important: write a short English description, not a keyword list. Alibaba's official example:
Leave empty if you don't need it.
Unsupported Languages
Qwen-MT covers ~92 languages; a number of low-resource ones aren't covered and the UI auto-blocks them with a clear message (the in-app blocklist is authoritative): e.g. Kyrgyz (ky), Turkmen (tk), Tajik (tg), Mongolian (mn), Malayalam (ml), Uyghur (ug), Amharic (am), and dozens more.
Built-in API Proxy
DeepL, Nvidia NIM, and similar providers can't be called from the browser due to CORS. The tool routes those through a built-in proxy by default. If you specify a custom API URL in settings, the proxy is bypassed and requests go directly to your URL.
Local Model Setup
Want to run models locally for privacy? The tool works with any OpenAI-compatible local server. For decent translation quality, use qwen3-14b or larger (32B / 70B works even better).
Default Endpoints
These appear as quick-pick chips next to the URL field.
TranslateGemma
Google's translation-specialized Gemma model, trained specifically for translation quality. Quick notes:
- Pick "TranslateGemma" directly from the service list — don't go through "Custom (OpenAI-compatible)" with
translategemma-4b-itas the model name. The two take entirely different code paths: the dedicated TranslateGemma service makes line-by-line calls tailored to the Gemma translation model's I/O format, while Custom uses the generic LLM pipeline with batching and context markers — which causes dropped lines and slower runs on small (under 14B) models. - The default URL points to LM Studio on port 1234; one click switches to Ollama / llama.cpp
- Recommended models:
translategemma-4b-it(compact and fast),translategemma-12b-it(better quality), ortranslategemma-27b-it(best quality) - Source language must be explicit — auto-detect isn't supported
- Limited language coverage: only ~55 mainstream languages (Google's WMT24++ benchmark scope). About 68 low-resource languages — including Cantonese (yue), Bhojpuri (bho), Wolof (wo), Aragonese (an), Guarani (gn), Kurdish (ckb/kmr) — are blocked by the UI. Use DeepL / Google / Azure / Qwen-MT for broader coverage
Solving CORS Issues
If a local model can't be reached, the two usual culprits:
Step 1: Disable ad/privacy extensions, then refresh and retry.
Step 2: Enable CORS on the local server.
Ollama
Run this once in PowerShell (Win + X to open Terminal) to enable it permanently:
*allows all origins. For tighter security, use a specific domain likehttp://192.168.2.20:3000.
Restart the Ollama service for the change to take effect. To enable temporarily, set the variable when starting:
LM Studio
- Open the "Developer" icon in the left menu
- Go to the local server settings page, click "Settings" at the top
- Check the "Enable CORS" box

That's it — local models should work now. If you're still stuck, check for port conflicts and look at the browser console for the actual error. (Special thanks to mrfragger for the configuration tips.)
Language Support
This tool supports translation between 120+ major languages, organized by region.
Language Code Reference
Use the language codes below for batch multi-language configuration (e.g., en, zh, ja, ko):
Common
Europe
Middle East
Central Asia
South Asia
Southeast Asia
Africa
Americas & Oceania
API Support Documentation
LLMs support all languages. Machine translation API language support:

