<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE pkgmetadata SYSTEM "https://www.gentoo.org/dtd/metadata.dtd">
<pkgmetadata>
<maintainer type="person">
<email>tupone@gentoo.org</email>
<name>Tupone Alfredo</name>
</maintainer>
<longdescription>
pdfsandwich generates "sandwich" OCR PDF files: PDF files that contain
only images (no text) are processed by optical character recognition
(OCR), and the recognized text is added to each page invisibly "behind"
the images.

pdfsandwich is a command-line tool intended for OCR of scanned books or
journals. It can recognize the page layout, even for multicolumn text.

Essentially, pdfsandwich is a wrapper script that calls the following
binaries: unpaper, convert, gs, and tesseract. It supports parallel
processing on multiprocessor systems.
</longdescription>
<upstream>
<remote-id type="sourceforge">pdfsandwich</remote-id>
</upstream>
</pkgmetadata>