Translation API Guide

The tool integrates 7 classic translation APIs and 24 large language models, so you can pick whichever fits your text type, budget, and privacy needs.

Which API should I pick?

Use caseRecommendedWhy
First-time user, no API keyGTX (Free)Free, zero-config — the default
GTX won't connect, still want freeEdge (Free) or DeepLX (Free)Also zero-config — just a different route
Subtitles, UI strings, lots of short sentencesGTX Free or Qwen-MT FlashBatched chunks, cheap
Long text, business / literatureDeepLBest overall quality and flow
Entry-level AI translation / budget-conscious LLM useDeepSeekBest price/value LLM — strong CN/EN quality at a fraction of Claude / GPT per-token cost
Chinese ↔ English, reliabilityDeepSeek / Qwen-MT PlusDirect access from mainland CN, great CN/EN quality
Technical docs needing consistent terminologyDeepSeek (best value) / Claude (premium) / GPT / GeminiLLMs let you control style via custom prompts
Privacy-sensitive, local-onlyTranslateGemmaSelf-hosted, free, purpose-built translation model
Low-resource languages (Wolof, Bhojpuri, etc.)DeepLCovers many low-resource languages reliably (e.g. wo / an / gn that Google doesn't carry)

For detailed comparisons and how to get API keys, keep reading ↓

Classic Translation APIs

APIQualityStabilityBest forFree tier
DeepL(X)★★★★★★★★★☆Long text, business / literature500K characters/month
Google Translate★★★★☆★★★★★UI strings, short sentences500K characters/month
Azure Translate★★★★☆★★★★★Widest language coverage (90+)2M characters/month for the first year
Qwen-MT★★★★★★★★★★CN/EN translation, domain-tunedPay-per-token, free tier for new users
TranslateGemma★★★★☆★★★★☆Local translation, privacy-firstFree (self-hosted)
GTX API (Free)★★★☆☆★★★★☆General text, zero-config defaultShared service, auto-slows when throttled
Edge API (Free)★★★★☆★★★★☆Free GTX backup, different routeShared service, auto-slows when throttled

Notes:

  • DeepL can't be called directly from the browser — the tool routes through a built-in proxy by default. If you have your own proxy, fill it in the API URL field.
  • Qwen-MT is Alibaba Cloud's translation-specialized model. See Qwen-MT essentials below.
  • TranslateGemma is Google's open-source translation-specialized Gemma model. You'll need to run it locally with LM Studio / Ollama / llama.cpp — see Local Model Setup.
  • GTX API / Edge API are both zero-config free machine translation that back each other up — if one can't connect, switch to the other. See Free machine translation essentials below.

For more reliable service, apply for a commercial API key — see the API application guide.

Large Language Models (LLMs)

Supported: DeepSeek, OpenAI, Claude, Gemini, Qwen, Moonshot (Kimi), Zhipu GLM, Doubao, Xiaomi MiMo, MiniMax, Tencent Hunyuan, Baidu ERNIE, Cohere, xAI (Grok), Mistral, Perplexity, YandexGPT (AI Studio), OpenRouter, Groq, SiliconFlow, GitHub Models, Nvidia NIM, Azure OpenAI, LiteLLM (self-hosted gateway), plus any OpenAI-compatible endpoint.

LLMs work best for:

  • Literature and technical documentation that needs deeper understanding
  • Multilingual content where consistent terminology matters
  • Custom prompts to control translation style

Key parameters:

  • Model: enter the model name from your provider; for Azure OpenAI, enter the deployment name.
  • Temperature: defaults to 0.7. Try 0.2 for technical content, 0.9 for marketing or creative paraphrasing.
  • Thinking mode: lets the AI think before translating — higher quality, slower and pricier. Supported models show a toggle in the UI. The toggle is stored per model so switching models preserves each one's setting independently. The exact UI form depends on the provider:
    • Three levels (off / low / medium / high): Claude, Gemini, OpenAI GPT-5, Qwen3, Azure OpenAI, Nvidia NIM, OpenRouter, Groq (GPT-OSS), Perplexity Sonar Deep Research
    • Binary toggle (off / on): DeepSeek (ON always uses the top tier — the API's middle tiers add nothing for translation), Doubao, Zhipu GLM, Moonshot (Kimi), Xiaomi MiMo, SiliconFlow, ERNIE 5.0 Thinking, Mistral (Medium 3.5 / Small 4), Cohere Command A Reasoning (underlying APIs accept on/off but not an effort param)
    • Low / high only: xAI Grok 4.3 (API limitation)
    • Three-state (off / on / auto): when you enter an unlisted custom model on a thinking-capable provider (incl. Mistral, Perplexity, and the custom OpenAI-compatible endpoint). Auto omits the thinking param to follow the model's built-in default — a fallback for strict providers that 422 on a non-thinking SKU; defaults to Off
    • Always-on SKUs (thinking is intrinsic to the model; no toggle shown): MiniMax M2.x, Tencent Hunyuan (TurboS / 2.0 Thinking / T1), Mistral Magistral, Perplexity Sonar Reasoning Pro, Grok 4.20 Reasoning / Multi-Agent
    • No thinking toggle: GitHub Models, YandexGPT (the gateway/API doesn't support reasoning params), LiteLLM (the gateway injects no thinking params — upstream model defaults apply)

Provider-Specific Notes

  • GitHub Models: authenticates with a GitHub PAT (needs the models:read scope); free quota is tiered per model — a good entry point if you don't have a paid key.
  • Tencent Hunyuan: the official endpoint currently rejects browser CORS preflights (direct calls always fail), so API Relay is ON by default to make it work out of the box. If Tencent fixes this, you can switch back to direct in API Settings. See API Relay.
  • YandexGPT (AI Studio): needs a Folder ID in addition to the API key (grab it from the folders page in the Yandex AI Studio console). For the model field, enter a SKU name (e.g. yandexgpt-5.1; open-weight SKUs like Qwen3, DeepSeek, and GPT-OSS are also offered) or paste a full gpt://<folder_id>/<model>/latest URI. Yandex's API sends no CORS headers, so API Relay is ON by default (the switch stays user-controllable, and you can point the URL at your own relay instead).

Regional Endpoint Switcher

Many providers run separate endpoints for Mainland China, International, and US regions. The official endpoints appear as quick-pick chips above the URL field — click to switch:

ProviderAvailable regions
Qwen / Qwen-MTMainland CN / International / US
Moonshot (Kimi)Mainland CN / International
Zhipu GLMMainland CN / International (Z.ai)
MiniMaxMainland CN / International
Xiaomi MiMoPay-as-you-go / Token Plan (CN / Singapore / Europe)
Doubao (Volcano Ark)Standard / Coding Plan

URL Auto-Completion

Self-hosted and multi-region providers (Custom, LiteLLM, TranslateGemma, Qwen / Qwen-MT, Doubao, Nvidia) complete the URL to the full path the moment focus leaves the field; for every other provider, a custom URL is normalized automatically before each request is sent. Paste http://host:port or http://host:port/v1 and the tool fills in the rest — the classic "missing /v1/chat/completions → connection failure" mistake can't happen.

Free machine translation essentials

The tool ships three zero-config free machine-translation servicesGTX (Free), Edge (Free), and DeepLX (Free). None need an API key; they call the official endpoints directly from your browser and your text never touches this tool's servers. They take different routes, so they back each other up: if one can't connect, just switch to another in the dropdown — no key required. GTX is the default.

ServiceBackendDefault gateway
GTX (Free)Google Translatetranslate-pa.googleapis.com (same gateway as Google's web-translate widget)
Edge (Free)Microsoft Edge's built-in translator (Azure engine)edge.microsoft.com free token
DeepLX (Free)Community DeepLX public endpointBuilt-in public address

GTX gateway is switchable

GTX defaults to translate-pa.googleapis.com (the gateway behind Google's web-translate widget — CORS-correct, good availability). Quick-switch chips sit above the URL field:

  • translate-pa (default): recommended, works in most network environments
  • Legacy gtx: the old translate.googleapis.com/translate_a endpoint. Google has tightened anti-abuse on it (many IPs get redirected to a captcha page, which the browser reports as CORS), but the block is IP-reputation-based and some regions/networks still pass — kept as a fallback
  • Self-hosted mirror: paste your own mirror URL (e.g. a Cloudflare Worker); the tool auto-detects the protocol from the address shape

Rate limits and automatic slowdown

The shared free endpoints are rate-limited per user. GTX now translates in batched chunks — many lines packed into ~5000-character blocks, one request per block — so request volume drops sharply versus line-by-line, and everyday use rarely triggers throttling. Huge bursts can still hit limits; the tool handles it automatically:

  • When throttled it pauses all requests to the service and resumes on its own shortly after, showing "Rate limited — pausing briefly, will retry automatically"
  • Translation slows down but keeps going; in most cases no action is needed
  • If the failure panel keeps showing rate-limit messages: wait a few minutes and hit "Retry" (the cache skips completed lines), switch to the Edge (Free) / DeepLX (Free) backups, or move long batch jobs to a keyed service like DeepL / Qwen-MT / DeepSeek

Can't connect, or seeing CORS errors in the console? First switch the gateway or try Edge (Free) — that fixes most cases in one step. Still failing? Check your network environment (mainland China blocking, corporate network interception, browser extensions) — see FAQ → GTX Free cannot connect for the checklist.

Qwen-MT Essentials

Qwen-MT is a machine translation service (not a general LLM). It has no system-prompt concept and works purely with source/target language codes — so the Prompt settings don't apply.

Picking a Model

You'll need to fill in the Model field manually:

ModelCharacteristicsBest for
qwen-mt-plusHighest quality, slower, premium pricingLiterature, legal, medical translation
qwen-mt-turboBalanced speed and qualityGeneral documents, batch jobs
qwen-mt-flashFastest and cheapest (default)Subtitles, UI strings, large short-text volumes

Domain Hint

The domains field tells the model what industry the text is from, so terminology lands closer to the field. Important: write a short English description, not a keyword list. Alibaba's official example:

The sentence is from Ali Cloud IT domain. It mainly involves computer-related
software development and usage methods, including many terms related to computer
software and hardware. Pay attention to professional troubleshooting terminologies
and sentence patterns when translating. Translate into this IT domain style.

Leave empty if you don't need it.

Native Glossary Channel

Qwen-MT is one of the few MT services with native glossary support: with the Glossary enabled, matched terms are sent through the official translation_options.terms parameter and applied by the model itself — more reliable than prompt injection.

Unsupported Languages

Qwen-MT covers ~92 languages; a number of low-resource ones aren't covered and the UI auto-blocks them with a clear message (the in-app blocklist is authoritative): e.g. Kyrgyz (ky), Turkmen (tk), Tajik (tg), Mongolian (mn), Malayalam (ml), Uyghur (ug), Amharic (am), and dozens more.

API Relay & Built-in Proxy

Some providers' official endpoints block direct browser calls (CORS). The tool offers two proxy channels; text is never stored on our servers.

API Relay (user-controlled)

12 providers — DeepSeek, OpenAI, Claude, Qwen, Moonshot (Kimi), Doubao, Zhipu GLM, Mistral, xAI (Grok), Perplexity, Tencent Hunyuan, and YandexGPT — expose an "API Relay" switch in API Settings. When on, requests route through our Cloudflare relay (only the request body and auth headers are forwarded):

  • Most providers default to OFF — when a direct call hits a CORS / 403 wall, the UI shows an actionable hint to enable it
  • Hunyuan and YandexGPT default to ON: both official endpoints currently can't be reached from browsers, so the relay is what makes them work out of the box. The switch stays available — flip back to direct if the upstream ever fixes CORS
  • Self-hosted relay: prefer your own relay? Put its address in the URL field. The precedence is fixed: custom URL > relay switch > official direct — with a URL filled in, the relay switch is grayed out with a note ("clear the URL to re-enable")
  • The relay passes through the server's Retry-After header, so rate-limit auto-slowdown is exactly as precise as direct calls

Built-in Proxy (no switch)

DeepL and Nvidia NIM route through a separate built-in proxy by default. If you specify a custom API URL in settings, the proxy is bypassed and requests go directly to your URL.

Local Model Setup

Want to run models locally for privacy? The tool works with any OpenAI-compatible local server. For decent translation quality with a generic LLM, use qwen3-14b or larger (32B-class works even better); on limited VRAM, switch to the translation-specialized TranslateGemma — solid quality from 4B up.

In mainland China, download models from ModelScope — far faster than direct Hugging Face access or LM Studio's built-in downloader, and the official TranslateGemma repos are mirrored there.

Default Endpoints

ToolDefault endpoint
LM Studiohttp://127.0.0.1:1234/v1/chat/completions
Ollamahttp://127.0.0.1:11434/v1/chat/completions
llama.cpphttp://127.0.0.1:8080/v1/chat/completions

These appear as quick-pick chips next to the URL field.

LiteLLM Self-Hosted Gateway

LiteLLM is a self-hosted proxy that fronts 100+ upstream models behind one OpenAI-compatible API. Pick LiteLLM directly from the service list — versus Custom, it gives you a dedicated config slot, so running LiteLLM as your daily driver doesn't force you to rewrite Custom's URL every time you switch to another self-hosted endpoint.

  • Default address http://127.0.0.1:4000/v1/chat/completions (litellm's default port); the URL is the credential
  • API Key optional: not needed for a bare local proxy; fill it in if your proxy has a master / virtual key configured
  • Model can stay empty: when started with litellm --model X (the official quick-start) or with a completion_model server default, an empty model field follows the server's default; for multi-model config.yaml deployments, enter the model alias
  • LiteLLM allows browser cross-origin requests by default — no extra CORS setup

TranslateGemma

Google's translation-specialized Gemma model, trained specifically for translation quality. Quick notes:

  • Pick "TranslateGemma" directly from the service list — don't go through "Custom (OpenAI-compatible)" with translategemma-4b-it as the model name. The two take entirely different code paths: the dedicated TranslateGemma service makes line-by-line calls tailored to the Gemma translation model's I/O format, while Custom uses the generic LLM pipeline with batching and context markers — which causes dropped lines and slower runs on small (under 14B) models.
  • The default URL points to LM Studio on port 1234; one click switches to Ollama / llama.cpp
  • API key is optional: leave it empty for a plain local server; if your deployment requires auth (LM Studio's "require API key", vLLM's --api-key, or a reverse proxy in front), fill it in and requests carry an Authorization: Bearer header
  • Recommended models: translategemma-4b-it (compact and fast), translategemma-12b-it (better quality), or translategemma-27b-it (best quality)
  • Prompt settings don't apply: like Qwen-MT, it's a machine-translation service — the prompt is built into the call format; system/user prompts only affect LLM providers
  • Source language must be explicit — auto-detect isn't supported
  • Limited language coverage: only ~55 mainstream languages (Google's WMT24++ benchmark scope). About 68 low-resource languages — including Cantonese (yue), Bhojpuri (bho), Wolof (wo), Aragonese (an), Guarani (gn), Kurdish (ckb/kmr) — are blocked by the UI. Use DeepL / Google / Azure / Qwen-MT for broader coverage

Solving CORS Issues

If a local model can't be reached, the two usual culprits:

Step 1: Disable ad/privacy extensions, then refresh and retry.

Step 2: Enable CORS on the local server.

Ollama

Run this once in PowerShell (Win + X to open Terminal) to enable it permanently:

[System.Environment]::SetEnvironmentVariable('OLLAMA_ORIGINS', '*', 'User')

* allows all origins. For tighter security, use a specific domain like http://192.168.2.20:3000.

Restart the Ollama service for the change to take effect. To enable temporarily, set the variable when starting:

OLLAMA_ORIGINS="*" ollama serve

LM Studio

  1. Open the "Developer" icon in the left menu
  2. Go to the local server settings page, click "Settings" at the top
  3. Check the "Enable CORS" box

LM Studio CORS settings screenshot

That's it — local models should work now. If you're still stuck, check for port conflicts and look at the browser console for the actual error. (Special thanks to mrfragger for the configuration tips.)

Language Support

This tool supports translation between 120+ major languages, organized by region.

Language Code Reference

Use the language codes below for batch multi-language configuration (e.g., en, zh, ja, ko):

Common

CodeNativeEnglish中文
enEnglishEnglish英语
zh简体Simplified Chinese中文
zh-hant繁體Traditional Chinese繁体中文
esEspañolSpanish西班牙语
frFrançaisFrench法语
deDeutschGerman德语
ja日本語Japanese日语
ko한국어Korean韩语
hiहिन्दीHindi印地语
arالعربيةArabic阿拉伯语
ruРусскийRussian俄语
pt-brPortuguês (Brasil)Portuguese (Brazil)葡萄牙语(巴西)
idBahasa IndonesiaIndonesian印尼语
viTiếng ViệtVietnamese越南语
itItalianoItalian意大利语
yue粵語Cantonese粤语

Europe

CodeNativeEnglish中文
plPolskiPolish波兰语
ukУкраїнськаUkrainian乌克兰语
nlNederlandsDutch荷兰语
roRomânăRomanian罗马尼亚语
elΕλληνικάGreek希腊语
huMagyarHungarian匈牙利语
svSvenskaSwedish瑞典语
csČeštinaCzech捷克语
pt-ptPortuguês (Portugal)Portuguese (Portugal)葡萄牙语(葡萄牙)
caCatalàCatalan加泰罗尼亚语
srСрпскиSerbian塞尔维亚语
bgБългарскиBulgarian保加利亚语
hyՀայերենArmenian亚美尼亚语
daDanskDanish丹麦语
sqShqipAlbanian阿尔巴尼亚语
fiSuomiFinnish芬兰语
nbNorsk bokmålNorwegian Bokmål挪威语
skSlovenčinaSlovak斯洛伐克语
hrHrvatskiCroatian克罗地亚语
beБеларускаяBelarusian白俄罗斯语
scnSicilianuSicilian西西里语
kaქართულიGeorgian格鲁吉亚语
lmoLombardLombard伦巴第语
ltLietuviųLithuanian立陶宛语
glGalegoGalician加利西亚语
bsBosanskiBosnian波斯尼亚语
slSlovenščinaSlovenian斯洛文尼亚语
mkМакедонскиMacedonian马其顿语
lvLatviešuLatvian拉脱维亚语
etEestiEstonian爱沙尼亚语
isÍslenskaIcelandic冰岛语
mtMaltiMaltese马耳他语
cyCymraegWelsh威尔士语
gaGaeilgeIrish爱尔兰语
brBrezhonegBreton布列塔尼语
euEuskaraBasque巴斯克语
yiייִדישYiddish意第绪语
lbLëtzebuergeschLuxembourgish卢森堡语
ocOccitanOccitan奥克语
anAragonésAragonese阿拉贡语
laLatinaLatin拉丁语
eoEsperantoEsperanto世界语

Middle East

CodeNativeEnglish中文
trTürkçeTurkish土耳其语
heעבריתHebrew希伯来语
faفارسیPersian波斯语
urاردوUrdu乌尔都语
psپښتوPashto普什图语
prsدریDari达里语
ckbکوردیCentral Kurdish中库尔德语
kmrKurdîNorthern Kurdish北库尔德语

Central Asia

CodeNativeEnglish中文
uzOʻzbekchaUzbek乌兹别克语
kkҚазақ тіліKazakh哈萨克语
kyКыргызчаKyrgyz吉尔吉斯语
tkTürkmençeTurkmen土库曼语
azAzərbaycanAzerbaijani阿塞拜疆语
tgТоҷикӣTajik塔吉克语
mnМонголMongolian蒙古语
baБашҡортсаBashkir巴什基尔语
ttТатар телеTatar鞑靼语
ugئۇيغۇرچەUyghur维吾尔语

South Asia

CodeNativeEnglish中文
bnবাংলাBengali孟加拉语
mrमराठीMarathi马拉地语
teతెలుగుTelugu泰卢固语
taதமிழ்Tamil泰米尔语
guગુજરાતીGujarati古吉拉特语
knಕನ್ನಡKannada卡纳达语
paਪੰਜਾਬੀPunjabi旁遮普语
mlമലയാളംMalayalam马拉雅拉姆语
bhoभोजपुरीBhojpuri博杰普尔语
maiमैथिलीMaithili迈蒂利语
neनेपालीNepali尼泊尔语
siසිංහලSinhala僧伽罗语
asঅসমীয়াAssamese阿萨姆语
gomकोंकणीKonkani孔卡尼语
saसंस्कृतम्Sanskrit梵语

Southeast Asia

CodeNativeEnglish中文
thไทยThai泰语
loລາວLao老挝语
myမြန်မာBurmese缅甸语
msBahasa MelayuMalay马来语
filTagalogFilipino(Tagalog)菲律宾语(塔加拉语)
jvBasa JawaJavanese爪哇语
suBasa SundaSundanese巽他语
aceAcèhAcehnese亚齐语
pagSalitan PangasinanPangasinan邦阿西楠语
pamKapampanganPampangan邦板牙语
cebCebuanoCebuano宿务语

Africa

CodeNativeEnglish中文
swKiswahiliSwahili斯瓦希里语
haهَرْشٜىٰن هَوْسَاHausa豪萨语
amአማርኛAmharic阿姆哈拉语
igIgboIgbo伊博语
woWolofWolof沃洛夫语
xhisiXhosaXhosa科萨语
zuisiZuluZulu祖鲁语
afAfrikaansAfrikaans南非荷兰语
omAfaan OromooOromo奥罗莫语
stSesothoSouthern Sotho南索托语
tnSetswanaTswana茨瓦纳语
tsXitsongaTsonga聪加语
mgMalagasyMalagasy马拉加西语
lnLingálaLingala林加拉语

Americas & Oceania

CodeNativeEnglish中文
htKreyòl ayisyenHaitian Creole海地克里奥尔语
quRuna SimiQuechua克丘亚语
ayAymar aruAymara艾马拉语
gnAvañe'ẽGuarani瓜拉尼语
miMāoriMaori毛利语

API Support Documentation

LLMs support all languages. Machine translation API language support: