boxtool.io

OCR — Image to Text

Extract text from images using Tesseract OCR. 12 languages. Runs in your browser — no upload, free.

Drop an image or click to browse

PNG, JPG, WEBP, BMP, GIF, TIFF

All processing happens in your browser — your images never leave your device. First run downloads language data (~4 MB/language), cached afterward.

About OCR — Image to Text

The OCR tool extracts text from images using Tesseract — a leading open-source OCR engine — running entirely in your browser via WebAssembly. Upload a PNG, JPG, or other image format, select the language, and get the extracted text in seconds. Nothing is uploaded to a server; your images stay completely private.

Use Cases

  • Extract text from a scanned document, invoice, or receipt
  • Copy text from a screenshot or image that cannot be selected normally
  • Digitize printed notes, book pages, or handwritten text
  • Read and capture text from photos of signs, labels, or printed materials

Tips

  • Higher-contrast, sharper images produce significantly better OCR accuracy
  • The first recognition may take longer as the OCR engine and language data are downloaded and cached
  • Select the correct language before starting — mixed-language documents may need separate passes

Frequently Asked Questions

PNG, JPG/JPEG, WEBP, BMP, GIF and TIFF. For best results use high-resolution images with clear, high-contrast text.

English, Portuguese, Spanish, French, German, Italian, Dutch, Polish, Russian, Japanese, Chinese (Simplified) and Arabic.

For clean, printed text with good contrast, expect 95–99% accuracy. For handwritten text or low-quality scans, accuracy will be lower.

No. OCR runs entirely in your browser using WebAssembly (Tesseract.js). Your images never leave your device.

Not directly — the tool accepts image files. Convert PDF pages to images first, then run OCR on each image.

On first use, the OCR engine (~10 MB) and language data (~4 MB) are downloaded and cached. Subsequent recognitions start immediately.

Related Tools

Ad