Summary
Optical Character Recognition (OCR)
Debian Accessibility Optical Character Recognition (OCR)
This metapackage will install packages which are useful for
Optical Character Recognition (OCR).
Description
For a better overview of the project's availability as a Debian package, each head row has a color code according to this scheme:
If you discover a project which looks like a good candidate for Debian Accessibility
to you, or if you have prepared an unofficial Debian package, please do not hesitate to
send a description of that project to the Debian Accessibility mailing list
Links to other tasks
|
Debian Accessibility Optical Character Recognition (OCR) packages
Official Debian packages with high relevance
ebook-speaker
eBook reader that reads aloud in a synthetic voice
|
Versions of package ebook-speaker |
Release | Version | Architectures |
buster | 5.0.0-1 | amd64,arm64,armhf,i386 |
jessie | 2.8.1-1+deb8u1 | amd64,armel,armhf,i386 |
sid | 6.2.0-6 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
trixie | 6.2.0-6 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
bookworm | 6.2.0-4+deb12u1 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
bullseye | 5.5.2-1 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
stretch | 4.1.0-2 | amd64,arm64,armel,armhf,i386,mips,mips64el,mipsel,ppc64el,s390x |
Debtags of package ebook-speaker: |
accessibility | speech |
interface | commandline |
role | program |
scope | utility |
sound | player |
works-with | file |
works-with-format | epub |
|
License: DFSG free
|
This package provides a command-line e-reader that reads out
electronic text using speech synthesis. It has a simple user
interface appropriate for Braille terminals.
Currently the following formats are supported (some formats need
additional packages as suggested by this package):
AportisDoc
ASCII mail text
ASCII text
Broadband eBooks (BBeB)
Composite Document File (Microsoft Office Word)
DAISY3 DTBook
EPUB ebook data
GIF image data
GutenPalm zTXT
GNU gettext message catalogue
HTML document
ISO-8859 text
JPEG image data
Microsoft Reader eBook Data
Microsoft Windows HtmlHelp Data
Microsoft Word 2007+
Mobipocket E-book
MS Windows HtmlHelp Data
Netpbm PPM data
OpenDocument Text
PDF document
PeanutPress PalmOS
PNG image data
POSIX shell script text
PostScript document
Rich Text Format
troff or preprocessor text (e.g. Linux man-pages)
UTF-8 Unicode mail text
UTF-8 Unicode text
WordPerfect
XML document text
|
|
gocr
|
Versions of package gocr |
Release | Version | Architectures |
stretch | 0.49-2 | amd64,arm64,armel,armhf,i386,mips,mips64el,mipsel,ppc64el,s390x |
jessie | 0.49-2 | amd64,armel,armhf,i386 |
sid | 0.52-6.1 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
trixie | 0.52-6.1 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
bookworm | 0.52-6 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
bullseye | 0.52-3 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
buster | 0.52-1 | amd64,arm64,armhf,i386 |
Debtags of package gocr: |
accessibility | ocr |
interface | commandline |
role | program |
scope | application |
use | converting |
works-with | image, image:raster, text |
|
License: DFSG free
|
This is a multi-platform OCR (Optical Character Recognition) program.
It can read pnm, pbm, pgm, ppm, some pcx and tga image files.
Currently the program should be able to handle well scans that have their text
in one column and do not have tables. Font sizes of 20 to 60
pixels are supported.
If you want to write your own OCR, libgocr is provided in a separate
package. Documentation and graphical wrapper are provided in separated
packages, too.
|
|
hocr-gtk
GTK+ frontend for Hebrew OCR
|
Versions of package hocr-gtk |
Release | Version | Architectures |
jessie | 0.10.17-2 | all |
buster | 0.10.18-3 | all |
Debtags of package hocr-gtk: |
accessibility | ocr |
culture | hebrew |
interface | x11 |
role | program |
scope | application |
uitoolkit | gtk |
use | converting |
works-with | image, image:raster, text |
x11 | application |
|
License: DFSG free
|
Hocr-gtk is a GTK+ based graphical interface to the libhocr library. It
can open multiple image formats and uses aspell for internal spell checking.
|
|
lios
Linux intelligent OCR solution
|
Versions of package lios |
Release | Version | Architectures |
stretch | 2.1-2 | all |
sid | 2.7.2-8 | all |
trixie | 2.7.2-8 | all |
bookworm | 2.7.2-6 | all |
bullseye | 2.7.2-2 | all |
buster | 2.7-3 | all |
experimental | 2.7.2+git20221124-0.1 | all |
|
License: DFSG free
|
Lios provides a graphical interface on top of the Cuneiform and
Tesseract OCR backends to make OCR processing easier for impaired users,
with full autorotation, brightness optimization, rectangle selection,
audio feedback, etc.
|
|
tesseract-ocr
Tesseract command line OCR tool
|
Versions of package tesseract-ocr |
Release | Version | Architectures |
sid | 5.3.4-1.4 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
bookworm | 5.3.0-2 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
jessie | 3.03.03-1 | amd64,armel,armhf,i386 |
stretch | 3.04.01-5 | amd64,arm64,armel,armhf,i386,mips,mips64el,mipsel,ppc64el,s390x |
stretch-backports | 4.0.0-2~bpo9+1 | amd64,arm64,armel,armhf,i386,mips,mips64el,mipsel,ppc64el,s390x |
buster | 4.0.0-2 | amd64,arm64,armhf,i386 |
trixie | 5.3.4-1.4 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
bullseye | 4.1.1-2.1 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
upstream | 5.5.0 |
Debtags of package tesseract-ocr: |
accessibility | ocr |
interface | commandline |
role | program |
|
License: DFSG free
|
Tesseract is an open source Optical Character Recognition (OCR)
Engine. It can be used directly, or (for programmers) using an API to
extract printed text from images. It supports a wide variety of
languages. This package includes the command line tool.
|
|
Debian packages in contrib or non-free
cuneiform
multi-language OCR system
|
Versions of package cuneiform |
Release | Version | Architectures |
buster | 1.1.0+dfsg-7 (non-free) | amd64,arm64,armhf,i386 |
jessie | 1.1.0+dfsg-5 (non-free) | amd64,i386 |
sid | 1.1.0+dfsg-11 (non-free) | amd64,arm64,armel,armhf,i386,mips64el,ppc64el |
trixie | 1.1.0+dfsg-11 (non-free) | amd64,arm64,armel,armhf,i386,mips64el,ppc64el |
bookworm | 1.1.0+dfsg-9 (non-free) | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el |
bullseye | 1.1.0+dfsg-8 (non-free) | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el |
Debtags of package cuneiform: |
accessibility | ocr |
interface | commandline |
role | program |
scope | utility |
use | converting |
works-with | image, image:raster |
|
License: non-free
|
Cuneiform is an OCR system. In addition to text recognition it also does
layout analysis and text format recognition.
The following languages are supported: Bulgarian, Croatian, Czech, Danish,
Dutch, English, Estonian, French, German, Hungarian, Italian, Latvian,
Lithuanian, Polish, Portuguese, Romanian, Russian, Serbian, Slovenian,
Spanish, Swedish, Turkish and Ukrainian.
|
|