Paket: ocrmypdf (14.0.1+dfsg1-1)

Länkar för ocrmypdf

Debianresurser:

Hämta källkodspaketet ocrmypdf:

Ansvariga:

Debian Python Team (QA-sida)
Anton Gladky (QA-sida)

Externa resurser:

Hemsida [github.com]

Liknande paket:

add an OCR text layer to PDF files

OCRmyPDF generates a searchable PDF/A file from a regular PDF containing only images, allowing it to be searched.

It uses the Tesseract OCR engine and so supports all the languages that Tesseract does.

Some other main features:

  * Places OCR text accurately below the image to ease copy / paste
  * Keeps the exact resolution of the original embedded images
  * When possible, inserts OCR information as a lossless operation
    without rendering vector information
  * Keeps file size about the same
  * If requested deskews and/or cleans the image before performing OCR
  * Validates input and output files
  * Provides debug mode to enable easy verification of the OCR results
  * Processes pages in parallel when more than one CPU core is
    available
  * Battle-tested on thousands of PDFs, a test suite and continuous
    integration.

Andra paket besläktade med ocrmypdf

beror

rekommenderar

föreslår

enhances

dep: ghostscript (>= 9.18~dfsg~)

interpreter for the PostScript language and for PDF
dep: icc-profiles-free

ICC color profiles for use with color profile aware software
dep: python3

interactive high-level object-oriented language (default python3 version)
dep: python3-coloredlogs

colored terminal output for Python 3's logging module
dep: python3-deprecation

Library to handle automated deprecations
dep: python3-img2pdf (>= 0.3.0)

Lossless conversion of raster images to PDF (library)
dep: python3-importlib-resources

Read resources from Python packages

eller python3 (>> 3.9)

interactive high-level object-oriented language (default python3 version)
dep: python3-packaging

core utilities for python3 packages
dep: python3-pdfminer (>= 20181108+dfsg-3)

PDF parser and analyser (Python3)
dep: python3-pikepdf (>= 5.0.1)

Python library to read and write PDFs with QPDF
dep: python3-pil

Python Imaging Library (Python3)
dep: python3-pkg-resources

Package Discovery and Resource Access using pkg_resources
dep: python3-pluggy

plugin and hook calling mechanisms for Python - 3.x
dep: python3-reportlab

ReportLab library to create PDF documents using Python3
dep: python3-tqdm

fast, extensible progress bar for Python 3 and CLI tool
dep: python3-typing-extensions

Backported and Experimental Type Hints for Python

eller python3 (>> 3.10)

interactive high-level object-oriented language (default python3 version)
dep: tesseract-ocr (>= 4.0.0)

Tesseract command line OCR tool
dep: zlib1g

Kompressionsbibliotek - körtidspaket

rec: pngquant

PNG (Portable Network Graphics) image optimising utility
rec: unpaper

post-processing tool for scanned pages

sug: img2pdf

Lossless conversion of raster images to PDF
sug: ocrmypdf-doc

add an OCR text layer to PDF files - documentation
sug: python-watchdog

Paketet inte tillgängligt

Hämta ocrmypdf

Hämtningar för alla tillgängliga arkitekturer
Arkitektur	Paketstorlek	Installerad storlek	Filer
all	148,1 kbyte	555,0 kbyte	[filförteckning]