Software & Apps

Pdfsyntax / docs / browse.md in main · desgeeko / pdfsynexx · gitHub

Inspection of the internal structure of a PDF file involves many items (decompression, indexing indexing, etc …) to understand raw bytes.

PDFSyntax takes care of processing and suggests a visit procedure consisting of add information and hyperlinks to top a text-printed PDF data printable. It respects the physical flow of the file while offering a reasonable navigation between changes (additional updates) and between things.

PDFSyntax is a Python Python Package – with no trust – and mainly a low level at the PDF library. the browse The command is its highest and the most visible side. It creates static html content that offers adequate interviews: JavaScript can be disabled.

Please try the Live Demo In a perfect static HTML output you can browse, at https: //pdfsyntax.devple_text_string.html (hosted by github pages).

Here’s the same example, as a partial screenshot:
Pdfsyntax screenshot

NB: This is the output made for Simple text string Example file from PDF detail.

PDFSyntax can be installed from Github Repo (no trust) or from PYPI:

Change the standard output to a file you can open in your browser:

python3 -m pdfsyntax browse file.pdf > inspection_file.html

The HTML has become “as an extra raw file in PDF with the following additional work:

  • Add a Reverse Index: Links to where something is used
  • Add a page to the page to a navigation menu
  • Add a physical minimap to a navigation menu
  • Indent of things beautiful dictionary printing
  • Get things that are in things in the streams and insert them into the flow like regular items
  • Decompress streams and show a small part of it
  • Return indirect object references to hyperlinks
  • Offset references (eg a / prev entry) in hyperlinks
  • Display files offsets on items in a left margin
  • Put some color to important names (eg / type)
  • Put some color to warnings (eg the presence of / js)
  • Light and dark modes

Warning: Encrypted files have not been supported

Progress Work: New features are on the roadmap


https://opengraph.githubassets.com/a8c330d1c9b98ce8aeb36e1d99964eaa2e913059b9ca87a6bc5112afc4458067/desgeeko/pdfsyntax

2025-02-10 16:52:00

Related Articles

Leave a Reply

Your email address will not be published. Required fields are marked *

Back to top button