1 | # $Id$ |
---|
2 | |
---|
3 | PortSystem 1.0 |
---|
4 | |
---|
5 | name pdfsandwich |
---|
6 | version 0.0.2 |
---|
7 | platforms darwin |
---|
8 | maintainers tobias-elze.de:macports |
---|
9 | categories textproc, graphics |
---|
10 | description pdfsandwich is a tool to make "sandwich" OCR pdf files |
---|
11 | homepage http://pdfsandwich.origo.ethz.ch/ |
---|
12 | master_sites http://download.origo.ethz.ch/pdfsandwich/1809/ |
---|
13 | |
---|
14 | use_bzip2 yes |
---|
15 | |
---|
16 | checksums md5 483a3dbf8f37e101818e8b094ad6741d \ |
---|
17 | sha1 6e9fea89fa9c81f15e177d23ddb4a3e88e59ca0b \ |
---|
18 | rmd160 a4d7c00cfec9db16f52bccd9ecf4b7691bb29aea |
---|
19 | |
---|
20 | long_description \ |
---|
21 | pdfsandwich generates "sandwich" OCR pdf files, i.e. pdf files which contain only images \ |
---|
22 | (no text) will be processed by optical character recognition (OCR) and the text will be \ |
---|
23 | added to each page invisibly "behind" the images. pdfsandwich is a command line \ |
---|
24 | tool which is supposed to be useful to OCR scanned books or journals. \ |
---|
25 | It is able to recognize the page layout even for multicolumn text. \ |
---|
26 | Essentially, pdfsandwich is a wrapper script which calls the following binaries: \ |
---|
27 | convert, cuneiform, gs, and hocr2pdf. |
---|
28 | |
---|
29 | depends_build port:gawk \ |
---|
30 | port:ocaml |
---|
31 | depends_run port:cuneiform \ |
---|
32 | port:exact-image \ |
---|
33 | port:ghostscript |
---|
34 | |
---|