Image OCR (text extraction)

Photos and screenshots → text. Tesseract.js WASM running entirely in your browser.

Images never leave your device. OCR runs in the browser via a WASM engine. The engine and language data (~10–13MB) are downloaded once from a CDN, then cached in IndexedDB.

⚠️ Accuracy notice — OCR accuracy depends on image quality, fonts, and language (typically 80–95%). Always review the result by hand, and for legal documents or precise figures, double-check manually.

Drag an image here, or click to choose

JPG · PNG · WebP — one image at a time

Recognition language

First run downloads training data — English ~10MB · Korean ~13MB · Chinese Traditional ~16MB · Simplified ~15MB · Japanese ~13MB. Wi-Fi recommended.

Factors that affect OCR accuracy

Factor	Good	Bad
Resolution	1000+ px wide	under 500 px
Contrast	White background, black text	Colored background, faint text
Angle	Frontal, level	Skewed, perspective-distorted
Font	Standard print fonts (sans / serif)	Handwriting, cursive
Layout	Single-column horizontal text	Vertical text, multi-column, tables
Noise	None	Lines, smudges, glare, shadows

How to choose the recognition language

OCR only looks for characters within the language model you select. Picking a language the image doesn't contain adds misreads; leaving out a language you need garbles that part. Match the mode to the scripts actually in your image.

Image content	Recommended mode	Note
Korean only	Korean	Lightest model (~13MB)
Latin letters / digits only	English	Good for invoices, codes, URLs
Korean body with Latin / digits mixed in	Korean + English (mixed)	Downloads both models, slower first run
Classical / Traditional Han characters	Chinese (Traditional)	For old Korean texts, inscriptions
Simplified Chinese	Chinese (Simplified)	—
Japanese (kana + kanji)	Japanese	—
Korean mixed with Han characters	Korean + Han (mixed)	Old newspapers, academic notation

Mixed modes download both language models, so the first run takes longer, and the larger candidate set leaves slightly more room for misreads than a single language. If only one language is present, a single mode is faster and more accurate. Running several images in the same language back-to-back reuses the engine, so the second image onward is quicker. Extracted text is automatically cleaned of trailing spaces and excess blank lines and runs of spaces.

Related tools

About Tesseract.js

Tesseract.js is the Tesseract OCR engine (originally Google's) compiled to WebAssembly for the browser. Apache-2.0 license. Runs 100% client-side.

What is "traineddata"?

A model file containing per-language character-shape statistics. English ~10MB, Korean ~13MB. This tool downloads them from the jsdelivr CDN (tessdata.projectnaptha.com) and caches them in IndexedDB — instant start on subsequent runs.

Does it work offline?

Yes, once the training data is cached. IndexedDB may be cleared after long inactivity or by browser data clearing — that triggers a re-download next time.

Processing time for large images

Time grows with resolution. 4000×3000+ images can take a minute or more. The accuracy/time sweet spot is around 1500–2000px wide. Use the resize tool to shrink first, then OCR — it's significantly faster.

Frequently Asked Questions

Are my images uploaded to a server?

Images stay on your device. OCR runs entirely in the browser via the Tesseract.js WASM engine. The OCR worker code and language training data (around 10–13MB) are downloaded once from the jsdelivr CDN on first use, then cached in IndexedDB for offline use. The site operator cannot see your images.

Why is the first run slow?

The OCR engine core (WASM) and language training data (10–13MB) download on first use. After that, IndexedDB caches them and subsequent runs start instantly. Avoid the first run on cellular data — use Wi-Fi once to warm the cache.

Is accuracy 100%?

OCR is a statistical inference task and does not guarantee 100%. With Tesseract.js, clean printed/digital text typically scores 80–95%. Handwriting, vertical text, or distorted documents drop significantly. Always have a human review the results — for legal or critical material, type it in manually.

Which languages are supported?

English, Korean, English + Korean mixed, Traditional Chinese (Han), Simplified Chinese, Japanese, and Korean + Han mixed — seven modes in total. Mixed modes download both language models, so the first run takes longer. See "How to choose the recognition language" above for which mode to pick.

Output looks garbled.

① Re-shoot if the photo is blurry or skewed ② crop just the text area to avoid background noise ③ enlarge the image if the font is too small ④ if your text mixes English with another language, use the mixed mode.

Can I OCR a PDF?

This tool currently accepts images (JPG / PNG / WebP) only. If your PDF already has digital text, copy it directly from a PDF viewer. For scanned PDFs, export each page as an image (e.g. screenshot) and bring it here.

References

Last reviewed: 2026-05-09 / Tesseract.js (Apache-2.0) WASM engine.

Tesseract.js (Apache-2.0) — GitHub
Tesseract OCR engine — GitHub (Apache-2.0)
Training data — tessdata

⚠️ The recognized text is not warranted by this tool. Users must review and correct the output before relying on it.