[TriLUG] Where do I get PDF internals information?
Steve Litt
slitt at troubleshooters.com
Thu Mar 15 10:12:38 EDT 2007
Hi all,
Where do I get PDF internals information? I've been writing a C program to
tweak my Ebooks, and have suddenly come to the realization that PDF files are
a lot more than a list of objects -- there's a hierarchy, there are linked
lists of objects, there are keywords (beyond obj and endobj, stream and
endstream), all horizontal and hierarchical links point bidirectionally, and
there's lots of redundancy. Any modification made to any object changes the
byte offset, so that in the byte address table in the xref section, such
changes must be accounted for.
Perhaps worst, different PDF files use different keywords, so following the
hierarchy to a page, a font, or content, is not easy.
And it's even more complex than what I just described, but I don't understand
it.
Anyone know where I can learn more about PDF internals. Reverse engineering it
with Vim can only get me so far, and I'm there :-).
Thanks
SteveT
Steve Litt
Author: Universal Troubleshooting Process books and courseware
http://www.troubleshooters.com/
More information about the TriLUG
mailing list