[TriLUG] PDF to text
Ian Kilgore
ian at trilug.org
Wed Feb 1 12:01:44 EST 2006
-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA1
Owen Berry wrote:
| Anyone know of a command line utility for extracting text from a pdf
| file, other than the one included in xpdf (pdftotext)? pdftotext does
| exactly what I want, but I would like to avoid pulling in the rest of
| xpdf, if possible, as this is for a server.
|
| BTW, I'm using it combined with the perlfect search engine, so the text
| does not need to be formatted nicely or anything.
|
| Thanks,
| Owen
|
You can pipe pdf2ps | ps2ascii
-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1.4.1 (GNU/Linux)
iD8DBQFD4Ol3wsRpgTiXSOERAvIjAJ9wvRL5r3AckTR9t0afnhwrzHnJkgCgut0v
v1GYZwwlecRLi1KoSakaHsE=
=wfbu
-----END PGP SIGNATURE-----
More information about the TriLUG
mailing list