[TriLUG] PDF to text

Ian Kilgore ian at trilug.org
Wed Feb 1 12:01:44 EST 2006


-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA1

Owen Berry wrote:
| Anyone know of a command line utility for extracting text from a pdf
| file, other than the one included in xpdf (pdftotext)? pdftotext does
| exactly what I want, but I would like to avoid pulling in the rest of
| xpdf, if possible, as this is for a server.
|
| BTW, I'm using it combined with the perlfect search engine, so the text
| does not need to be formatted nicely or anything.
|
| Thanks,
| Owen
|
You can pipe pdf2ps into ps2ascii. Both come with Ghostscript, so you
don't need to pull in xpdf.
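
Here is a rough sketch of that pipeline as a shell function, assuming
Ghostscript's pdf2ps and ps2ascii scripts are on your PATH. The name
pdf_to_text is just something made up for illustration, and the "-"
output argument and stdin behaviour are worth checking against the
Ghostscript version on your server.

```shell
#!/bin/sh
# Sketch: extract text from a PDF using only Ghostscript's helper scripts.
# pdf_to_text is a hypothetical wrapper name, not part of Ghostscript.
pdf_to_text() {
    # Refuse early if the input file is missing or unreadable.
    if [ ! -r "$1" ]; then
        echo "pdf_to_text: cannot read '$1'" >&2
        return 1
    fi
    # "-" asks pdf2ps to write PostScript to stdout;
    # ps2ascii with no arguments reads PostScript from stdin.
    pdf2ps "$1" - | ps2ascii
}
```

Something like "pdf_to_text input.pdf > input.txt" would then give you a
text file to feed the indexer. The output is not nicely formatted, which
should be fine for a search engine.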
-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1.4.1 (GNU/Linux)

iD8DBQFD4Ol3wsRpgTiXSOERAvIjAJ9wvRL5r3AckTR9t0afnhwrzHnJkgCgut0v
v1GYZwwlecRLi1KoSakaHsE=
=wfbu
-----END PGP SIGNATURE-----


