Back to Blog
Review 9 min

DeepL PDF Translation: We Tested 100 Pages and Here Is What Happened

DeepL PDF Translation: We Tested 100 Pages and Here Is What Happened

The Short Version

DeepL translates text beautifully. Nobody is arguing that. But when you hand it a PDF — a real one, with tables and columns and the occasional formula — what you get back is a well-translated mess. The words are right. The document is broken.

We learned this the hard way, after running 100 pages of actual work documents through DeepL Pro and PDFTranslate side by side.

What We Tested (And Why These Documents)

No cherry-picked examples. We grabbed files from real projects on our desks that week:

  • A 22-page arXiv paper on transformer architectures (English to Chinese, double-column layout with equations)
  • A 15-page SaaS vendor contract (Chinese to English, heavy legal terminology)
  • A 28-page scanned hardware manual from a Japanese manufacturer (scanned PDF, image quality was rough)
  • A 20-page annual financial report (English to Chinese, dense with data tables)
  • A 15-page clinical trial summary (English to Chinese, medical jargon, numbered references)

Two bilingual team members scored each output on four axes: accuracy, terminology consistency, layout fidelity, and readability. We were not gentle.

Translation Accuracy: DeepL Earns Its Reputation

On pure reading comprehension, DeepL is hard to beat. The arXiv paper came through with natural-sounding Chinese that actually made sense — no small feat for academic writing dense with niche terminology. Take this sentence: "While our approach demonstrates competitive BLEU scores across high-resource pairs, degradation on low-resource languages persists." DeepL nailed it in Chinese without mangling the meaning or producing something that read like a machine.

PDFTranslate held its own here. Using DeepSeek V3.2 as its engine, the translations were in the same ballpark — occasionally better on CJK-specific phrasing, occasionally more literal. The gap was small enough that for most purposes, you would not notice.

Layout: Where Everything Falls Apart for DeepL

Here is where the story changes. Fast.

The financial report was the clearest example. The original PDF has a two-column layout with eight data tables across 20 pages. DeepL returned... text. Just text. The tables collapsed into a pile of numbers with no structure. The columns merged into a single stream. The headers and footers disappeared. Two of us spent nearly three hours manually reconstructing the tables so we could actually use the translation.

Three hours to fix what took two minutes to translate. That math does not work for anyone processing documents at scale.

PDFTranslate handled the same file differently. The output PDF looked like the original — same table layout, same column structure, charts in the same positions. It is not magic; the tool analyzes document structure separately from the text, translates the content, then reassembles both layers. Whether that is worth paying for depends on whether you have three spare hours per document.

Scanned Documents: Dealbreaker

The Japanese hardware manual was a scanned PDF. We uploaded it to DeepL Pro. Got an error message: "This file cannot be processed." No details, no workaround.

This is not really a DeepL failure — scanned PDFs need OCR (optical character recognition) before any translation can happen, and DeepL simply does not include that step. Fair enough. But if your workflow includes any scanned documents — and in our experience, a surprising number of business PDFs are scans — this is a hard stop.

PDFTranslate processed the same file end-to-end. The OCR was not perfect (the scan quality was genuinely poor — some pages looked like they had been faxed twice), but the result was readable and usable as a reference. About what you would expect from a decent OCR pass followed by translation.

The Terminology Trap

One thing that caught our attention during the contract test: DeepL translated "不可抗力" (force majeure) correctly in clause 7, but then used "Act of God" for the exact same term in clause 12. Both are legitimate English translations. Both should not appear in the same contract.

In legal work, this kind of inconsistency is a real problem. DeepL does not currently offer a glossary or terminology lock feature for document translations. PDFTranslate does — you can pin specific term translations and they stay consistent throughout. Small feature, big consequences in professional settings.

What It Costs

DeepL Pro starts at €25/month (roughly $27). That buys you unlimited text translation and the ability to upload documents. The output, again, is plain text regardless of how complex your original formatting was.

For comparison, PDFTranslate's free tier handles 200 pages per month with full layout preservation. The Pro plan ($9.99/month) covers 2,000 pages. If you translate more than a handful of PDFs each month and need them to look right, the pricing difference adds up fast.

Our Honest Take

Use DeepL when you need to understand what a document says. The translation quality is genuinely good — possibly the best raw translation available right now.

Use PDFTranslate when you need the translated document to actually look like a document. Tables stay tables. Columns stay columns. Formulas stay where they belong.

Honestly? Our team uses both. DeepL for quick comprehension tasks — skimming foreign-language emails, reading abstracts, understanding the gist of a report. PDFTranslate for anything that needs to leave our desk — client deliverables, internal reports, translated reference documents. The right tool depends entirely on what "done" looks like for your particular task.

Want to try AI-powered PDF translation?

Start Translating Free