[TriLUG] More linux pdf questions
Owen Berry
oberry at trilug.org
Fri Sep 1 13:53:32 EDT 2006
Came across this in an article on lwn.net: http://jocr.sourceforge.net
No idea whether it would help, but take a look.
BTW, the article was about how a Spamassassin plugin is using this to
scan for spammery in images. Quite interesting.
Owen
On Tue, Aug 22, 2006 at 12:05:34PM -0400, James Tuttle wrote:
> Hello:
>
> I'm working to script some processes at work and have come across a
> problem creating pdfs from tiff images. I can easily do it with
> 'convert' from imagemagick, but this yields an image pdf rather than a
> text pdf which means one can't search it, select text, or full-text
> index the pdf. I was wondering if anyone has any advice about how to
> integrate ocr into the process. Alternately, I've been given a copy of
> Acrobat 7 Pro, but it doesn't seem to have a scriptable API.
>
> Any ideas?
>
> Thanks,
> Jim
More information about the TriLUG
mailing list