Welcome to PDF4QT

I would like to introduce new software for processing PDF documents. Why start a new project from scratch when we already have poppler or ghostscript, and also popular viewers such as Evince or Okular? All of these are good, but are quite old and lack many functionalities compared to commercial solutions.

It is for this reason that, in 2018, I decided to start a new project - PDF4QT. PDF4QT offers much more - it aims to be an alternative to commercial pdf editors. The initial version of the software was released for public testing in 2021.

Properties

Future long-term goals

I would like to achieve the following goals in the next two years:

What is unique?

I have implemented some features that are unique to this project compared to other open source projects (commercial applications, of course, implement them). The first is PDF4QT DocDiff, which can compare two similar PDF documents on the basis of content. It examines the content of both PDF documents and displays the results of the comparison in a well-formed GUI. Advanced algorithms are used for the comparison. The text is compared as a whole, not on a per page basis, so it can handle text movements (e.g. a paragraph is on a new page in the second document). Images and vector graphics are also compared.

The second is redaction (removal of sensitive content from a PDF document), which comes in the form of a plug-in. The user can use various tools to mark areas of the page to be redacted. The text can be selected using a mouse. A rectangular area (useful for image redacting) and whole pages can also be redacted. Users can also use search results to redact multiple text occurences (also regular expressions can be used), as well as create redacted documents in which sensitive content has been removed (works also for images).

The third is audio book creation, which also comes in the form of a plug-in. This plug-in has the following workflow: the user creates a text stream with text blocks, which are then transformed into an audio stream. The text stream is automatically created by collecting text blocks from the PDF document (it is better to have a tagged PDF). The user can add or delete text blocks from the audio stream, or modify them. When a text stream has been prepared, audio stream can be created from it.