On Wed, Apr 30, 2014 at 11:41 AM, Scott Chilcote <scottchilcote at ncrrbiz.com> wrote:

> What we found was that PDF
> often mangles the text data that it contains in ways that make it very
> painful to extract.

Second this. PDFs can have the plaintext in them in very strange order.
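
To illustrate the ordering problem: a PDF stores text as positioned fragments, and the order they appear in the file need not match reading order. One common workaround is to sort the fragments by position before joining them. The sketch below is only a rough illustration and assumes you already have the fragments as (x, y, text) tuples from whatever extraction tool you use. The function name reading_order, the line_tolerance parameter, and the sample data are all made up for this example, and y is assumed to increase down the page (real PDF coordinates run the other way, so you would flip them first).

```python
def reading_order(fragments, line_tolerance=2.0):
    """Rebuild reading order from positioned text fragments.

    fragments: iterable of (x, y, text) tuples (hypothetical format),
    with y increasing down the page.
    line_tolerance: fragments whose y is within this distance of the
    first fragment of a line are treated as part of that line.
    """
    lines = []  # each entry is (y_of_first_fragment, [(x, text), ...])
    # Visit fragments top to bottom, then left to right.
    for x, y, text in sorted(fragments, key=lambda f: (f[1], f[0])):
        if lines and abs(lines[-1][0] - y) <= line_tolerance:
            lines[-1][1].append((x, text))
        else:
            lines.append((y, [(x, text)]))
    # Within each line, order fragments left to right and join them.
    return "\n".join(
        " ".join(text for _, text in sorted(frags)) for _, frags in lines
    )


# Fragments in the scrambled order they might appear in the file.
scrambled = [
    (50, 10, "world"),
    (60, 29.5, "line"),
    (10, 10, "hello"),
    (10, 30, "second"),
]
print(reading_order(scrambled))
```

This only handles simple single-column pages. Multi-column layouts, tables, and rotated text need more than a positional sort, which is part of why extraction gets painful.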