Category Archive: Aspose.HTML Product Family

The official blog of the Aspose.HTML product family, with news about newly supported features, hotfixes, technical articles, tips, and videos.

Create a Web Scraper in C#

Web scraping, also known as web crawling, web harvesting, or data scraping, is used to extract data from websites. A web scraper uses data selectors such as CSS selectors, XPath, or both to extract data from web pages. Both kinds of selector are efficient for collecting and analyzing information from the web. This article covers how to create a web scraper in C#, focusing on HTML navigation, XPath queries, and CSS selectors.

Posted in Aspose.HTML Product Family

Convert XHTML to PDF in Java

XHTML is a text-based file format whose markup follows XML syntax. It is widely used because it is generic and involves less scripting. PDF, by contrast, is a fixed-layout format that is popular because it is platform-independent. In certain cases, you may need to convert XHTML files to PDF. This article demonstrates how to convert XHTML to PDF in Java.

Posted in Aspose.HTML Product Family

Convert EPUB to a PNG or JPG Image in C#

EPUB is a widely used format for e-books and other digital publications. In certain scenarios, you may need to preview the contents of an EPUB file as an image. This article explains how to convert an EPUB file to a PNG or JPG image programmatically in C#.

Posted in Aspose.HTML Product Family

Create a Web Scraper in Java

Web scraping, also called data scraping, web harvesting, or web crawling, is used to extract data from web pages. A web scraper can use different approaches to extract information, for instance XPath, CSS selectors, custom filters, or HTML navigation. This article covers how to create a web scraper programmatically in Java.
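To give a rough idea of the XPath approach mentioned above, here is a minimal sketch that uses only the JDK's built-in XML and XPath classes, not the Aspose.HTML API that the article itself covers. It assumes the input is well-formed, XML-style markup without a default namespace; the class name `XPathScrapeSketch`, the method `extractLinks`, and the sample markup are hypothetical and exist only for illustration.

```java
import java.io.StringReader;
import java.util.ArrayList;
import java.util.List;
import javax.xml.parsers.DocumentBuilderFactory;
import javax.xml.xpath.XPathConstants;
import javax.xml.xpath.XPathFactory;
import org.w3c.dom.Document;
import org.w3c.dom.NodeList;
import org.xml.sax.InputSource;

// Hypothetical illustration: collects link targets from well-formed markup with XPath.
public class XPathScrapeSketch {

    // Returns the value of every href attribute found on <a> elements.
    public static List<String> extractLinks(String markup) throws Exception {
        // Parse the markup into a DOM tree (requires well-formed XML-style input).
        Document doc = DocumentBuilderFactory.newInstance()
                .newDocumentBuilder()
                .parse(new InputSource(new StringReader(markup)));

        // Select the href attribute nodes of all anchor elements.
        NodeList hrefs = (NodeList) XPathFactory.newInstance()
                .newXPath()
                .evaluate("//a/@href", doc, XPathConstants.NODESET);

        List<String> links = new ArrayList<>();
        for (int i = 0; i < hrefs.getLength(); i++) {
            links.add(hrefs.item(i).getNodeValue());
        }
        return links;
    }

    public static void main(String[] args) throws Exception {
        // Sample markup assumed for this sketch; a real scraper would download the page first.
        String page = "<html><body>"
                + "<a href=\"/first\">First</a>"
                + "<p>Some text <a href=\"/second\">Second</a></p>"
                + "</body></html>";
        System.out.println(extractLinks(page));
    }
}
```

Real-world HTML is often not well-formed XML, so a plain XML parser like this one would reject many pages; that is one reason to use a dedicated HTML library for scraping.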

Posted in Aspose.HTML Product Family

Convert HTML to MHT File in C#

In some situations, you may need to convert an HTML file to MHT format. This article discusses how to convert HTML to MHT programmatically in C#.

Posted in Aspose.HTML Product Family

Convert MHT or MHTML to Word DOCX File in C#

In certain cases, you may need to convert an MHT or MHTML file to a Word document. Accordingly, this article covers how to convert an MHT or MHTML file to a Word document in DOCX format programmatically in C#.

Posted in Aspose.HTML Product Family

Convert XHTML to PDF in C#

XHTML is a stricter, XML-based version of HTML. It was developed to enhance interoperability with other data formats. PDF files, in turn, are popular because they are supported across multiple platforms and operating systems. In some cases, you may need to convert an XHTML file to PDF format. Accordingly, this article covers how to convert an XHTML file to a PDF document programmatically in C#.

Posted in Aspose.HTML Product Family

Convert MHT to an Image in Java

In certain situations, you may need to convert an MHT file to an image format. Accordingly, this article covers how to convert an MHT or MHTML file to a JPG or PNG image programmatically in Java.

Posted in Aspose.HTML Product Family

Convert MHT to PDF File in C#

MHT is a web page archive format that bundles text, graphics, animations, CSS, JavaScript, and all linked resources into a single file. Some web pages load images or other resources from a server; saving such a page in MHT or MHTML format lets you view that content even when you are offline. In certain cases, you may need to convert an MHT file to PDF format. This article covers how to convert an MHT file to a PDF file in C#.

Posted in Aspose.HTML Product Family

Convert EPUB to a PNG or JPG Image in Java

The EPUB file format is popular for publishing e-books, brochures, pamphlets, etc. Sometimes you may need to convert an EPUB file to an image to display its contents. This article shows how to convert an EPUB file to a JPG or PNG image programmatically in Java.

Posted in Aspose.HTML Product Family