[ Source: ocrmypdf  ]

Package: ocrmypdf (10.3.1+dfsg-1)

add an OCR text layer to PDF files

OCRmyPDF adds an OCR text layer to a regular PDF that contains only images and generates a searchable PDF/A file from it.

It uses the Tesseract OCR engine and so supports all the languages that Tesseract does.

Some other main features:

  * Places OCR text accurately below the image to ease copy / paste
  * Keeps the exact resolution of the original embedded images
  * When possible, inserts OCR information as a lossless operation
    without rendering vector information
  * Keeps file size about the same
  * If requested, deskews and/or cleans the image before performing OCR
  * Validates input and output files
  * Provides debug mode to enable easy verification of the OCR results
  * Processes pages in parallel when more than one CPU core is
    available
  * Battle-tested on thousands of PDFs and backed by a test suite and
    continuous integration
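
As a rough illustration of how the optional features above map onto a command line, the following Python sketch assembles an argument list for the `ocrmypdf` program. The helper name `build_ocrmypdf_command` and its parameters are hypothetical and exist only for this example. The flags it emits (`--language`, `--deskew`, `--clean`, `--jobs`) are assumed to match the installed version, so check `ocrmypdf --help` before relying on them.

```python
import shlex
from typing import List, Optional, Sequence


def build_ocrmypdf_command(
    input_pdf: str,
    output_pdf: str,
    languages: Sequence[str] = ("eng",),
    deskew: bool = False,
    clean: bool = False,
    jobs: Optional[int] = None,
) -> List[str]:
    """Return an argv list for ocrmypdf (hypothetical helper, not part of the package)."""
    if not languages:
        raise ValueError("at least one Tesseract language code is required")

    # Several Tesseract languages are joined with '+', for example "eng+deu".
    cmd = ["ocrmypdf", "--language", "+".join(languages)]

    # Deskewing and cleaning are optional and only applied on request.
    if deskew:
        cmd.append("--deskew")
    if clean:
        cmd.append("--clean")

    # Limit the number of parallel workers only when the caller asks for it.
    if jobs is not None:
        if jobs < 1:
            raise ValueError("jobs must be a positive integer")
        cmd += ["--jobs", str(jobs)]

    cmd += [input_pdf, output_pdf]
    return cmd


if __name__ == "__main__":
    argv = build_ocrmypdf_command(
        "scan.pdf", "searchable.pdf", languages=["eng", "deu"], deskew=True, jobs=4
    )
    # Print the command instead of running it; pass argv to subprocess.run()
    # on a system where ocrmypdf and the needed language data are installed.
    print(shlex.join(argv))
```

Building the command as a list rather than a single string avoids shell quoting problems when file names contain spaces.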

Download ocrmypdf

Download for all available architectures

Architecture  Package Size  Installed Size
all           141.5 kB      558.0 kB