You can not select more than 25 topics Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.
gentoo-overlay/app-text/pdfsandwich/metadata.xml

26 lines
940 B

<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE pkgmetadata SYSTEM "https://www.gentoo.org/dtd/metadata.dtd">
<pkgmetadata>
<maintainer type="person">
<email>tupone@gentoo.org</email>
<name>Tupone Alfredo</name>
</maintainer>
<longdescription>
pdfsandwich generates "sandwich" OCR pdf files, i.e. pdf files which
contain only images (no text) will be processed by optical character
recognition (OCR) and the text will be added to each page invisibly
"behind" the images.
pdfsandwich is a command line tool which is supposed to be useful to
OCR scanned books or journals. It is able to recognize the page layout
even for multicolumn text.
Essentially, pdfsandwich is a wrapper script which calls the following
binaries: unpaper, convert, gs, and tesseract. It supports
parallel processing on multiprocessor systems.
</longdescription>
<upstream>
<remote-id type="sourceforge">pdfsandwich</remote-id>
</upstream>
</pkgmetadata>