Find and Replace Text in PDF using C++

PDF is a popular format for sharing documents between organizations and individuals. Sometimes you need to find and replace text in PDF documents before sharing them. You can do this manually, but that is slow and inefficient, especially across many files. The better and faster option is to do it programmatically. In this article, you will learn how to find and replace text in PDF files using C++.

C++ API to Find and Replace Text in PDF files

Aspose.PDF for C++ is a C++ library for working with PDF files. It provides many features that help you automate various aspects of your PDF workflows, including finding and replacing text in PDF files. You can either install the API through NuGet or download it directly from the downloads section.

PM> Install-Package Aspose.PDF.Cpp

Find and Replace Text in PDF using C++

Aspose.PDF for C++ provides the TextFragmentAbsorber class for searching text in PDF documents. The process has three steps: initialize the class with the text you want to find, use it to retrieve all the matching text fragments, and then loop over the fragments and replace their text.
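To make the collect-then-replace flow concrete, here is a minimal, library-free sketch. It does not use Aspose.PDF for C++ at all: pages are modeled as plain strings, and `Match`, `find_matches`, and `replace_matches` are hypothetical names invented for this illustration. It only shows why it helps to gather all matches first and replace them afterwards.

```cpp
#include <cstddef>
#include <string>
#include <vector>

// Hypothetical stand-in for a matched text fragment (not an Aspose type).
struct Match {
    std::size_t page;    // zero-based index of the page that contains the match
    std::size_t offset;  // character offset of the match within that page's text
};

// Phase 1: collect every occurrence of `needle` across all pages.
std::vector<Match> find_matches(const std::vector<std::string>& pages,
                                const std::string& needle) {
    std::vector<Match> matches;
    if (needle.empty()) return matches;  // an empty needle would match everywhere
    for (std::size_t p = 0; p < pages.size(); ++p) {
        std::size_t pos = pages[p].find(needle);
        while (pos != std::string::npos) {
            matches.push_back({p, pos});
            pos = pages[p].find(needle, pos + needle.size());
        }
    }
    return matches;
}

// Phase 2: replace each collected match. Walking the list backwards keeps the
// earlier offsets valid when the replacement is longer or shorter than the needle.
void replace_matches(std::vector<std::string>& pages,
                     const std::vector<Match>& matches,
                     std::size_t needle_length,
                     const std::string& replacement) {
    for (auto it = matches.rbegin(); it != matches.rend(); ++it) {
        pages[it->page].replace(it->offset, needle_length, replacement);
    }
}
```

Calling `find_matches(pages, "PDF")` and then `replace_matches(pages, matches, 3, "TEXT")` rewrites every occurrence on every page. Restricting the search to a single page amounts to running phase 1 over that page only.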

The following is the sample code to find and replace text in the whole PDF file using C++.

C++ Find and Replace Text in a Specific PDF Page

There might be situations where you only want to find and replace text on a specific page rather than in the whole document. In that case, have only that page accept the TextFragmentAbsorber object, so the search is limited to that page. Retrieving the fragments and replacing their text then works the same way as for the whole document.

The following is the sample code to find and replace text on a specific PDF page using C++.

Replace Text in PDF Page Region using C++

Instead of searching the whole page, you can restrict the search to a region of the page. For this, the API provides the Rectangle class, which describes the area you want to search. Only text inside that rectangle is found and replaced.
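The idea of a region-limited replacement can be sketched without any PDF library. In the model below, each piece of text carries the coordinates of its origin, and only pieces that fall inside a rectangle are replaced. `Region`, `PlacedText`, `contains`, and `replace_in_region` are hypothetical names for this illustration, not the Aspose.PDF types, and the sketch assumes the usual PDF convention of an origin at the bottom-left of the page.

```cpp
#include <string>
#include <vector>

// Hypothetical axis-aligned region: lower-left and upper-right corners.
struct Region {
    double llx, lly, urx, ury;
};

// Hypothetical piece of text placed at (x, y) on a page.
struct PlacedText {
    double x, y;
    std::string text;
};

// True when the point (x, y) lies inside the region, borders included.
bool contains(const Region& r, double x, double y) {
    return x >= r.llx && x <= r.urx && y >= r.lly && y <= r.ury;
}

// Replace the text of every fragment that equals `needle` and sits inside
// `region`. Returns the number of fragments that were changed.
int replace_in_region(std::vector<PlacedText>& fragments, const Region& region,
                      const std::string& needle, const std::string& replacement) {
    int replaced = 0;
    for (auto& fragment : fragments) {
        if (fragment.text == needle && contains(region, fragment.x, fragment.y)) {
            fragment.text = replacement;
            ++replaced;
        }
    }
    return replaced;
}
```

A fragment at (50, 700) is replaced when the region is `{0, 600, 300, 800}`, while an identical fragment at (400, 100) on the same page is left untouched.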

The following is the sample code to find and replace text in a specific PDF page region.

Find and Replace Text in PDF files using Regular Expressions

Aspose.PDF for C++ can also search text using regular expressions. With regular expressions, you can find text that follows a pattern, such as email addresses or phone numbers. To do so, specify the regular expression instead of the search string and use the TextSearchOptions class to indicate that the search string is a regular expression. Replacing the matched text then works as it does for a plain search string.
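If you have not worked with regular expressions before, the following sketch shows pattern-based find and replace on an ordinary string using the standard `<regex>` header. It is independent of Aspose.PDF for C++, and the regular expression syntax the library accepts may differ from the ECMAScript syntax that `std::regex` uses by default. The phone number pattern and the helper names `find_all` and `replace_all` are assumptions made for this example.

```cpp
#include <regex>
#include <string>
#include <vector>

// Simplified pattern for phone numbers written as 555-123-4567.
// Real-world phone formats vary, so treat this as an illustration only.
const char* const kPhonePattern = R"(\d{3}-\d{3}-\d{4})";

// Collect every substring of `text` that matches `pattern`.
std::vector<std::string> find_all(const std::string& text,
                                  const std::regex& pattern) {
    std::vector<std::string> found;
    for (std::sregex_iterator it(text.begin(), text.end(), pattern), end;
         it != end; ++it) {
        found.push_back(it->str());
    }
    return found;
}

// Return a copy of `text` with every match of `pattern` replaced.
std::string replace_all(const std::string& text, const std::regex& pattern,
                        const std::string& replacement) {
    return std::regex_replace(text, pattern, replacement);
}
```

With `std::regex pattern(kPhonePattern)`, `find_all` returns each phone number in the input and `replace_all` substitutes the same placeholder for all of them, which is the kind of match a literal search string cannot express.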

The following is the sample code to find and replace text in PDF files using a regular expression.

Get a Free License

You can try the API without evaluation limitations by requesting a free temporary license.

Conclusion

In this article, you have learned how to find and replace text in PDF files using C++. You have seen how to replace text in the whole PDF document, a specific PDF page, or a particular region of the page. Furthermore, you have learned how to search and replace text using a regular expression. Aspose.PDF for C++ is a powerful API with many additional features that make working with PDF documents a piece of cake. You can explore the API in detail by using the official documentation. If you have any questions, please feel free to contact us on the forum.
