Convert Word and PDF files to HTML while maintaining the original formatting and styling using APIs

We had to design and build a content managed literature section for a health-related website. The client team was going to create and manage the content in MS Word but wanted to make the same available on their website. Registered users could not only browse the content but download individual articles in PDF format. Additionally, they could combine multiple articles to compile their own collection of articles, and download it all as one PDF file.

Requirements Scenario

Since this was a web application, we needed a solution that would be stable and scalable in a server environment, and also powerful enough to handle a moderate-to-large number of requests. The solution would need to convert uploaded Word documents to HTML and PDF while maintaining the original formatting and styling.

Solution Implementation

After evaluating multiple products, and even considering server-side office automation, we finally chose Aspose.Words, Aspose.PDF and Aspose.Pdf.Kit for our solution. This set of components together did exactly what we needed them to do, work well together, and were extremely easy to implement.

The client uploads Word documents that are immediately converted to HTML using Aspose.Words, and PDF using Aspose.Pdf. When the registered users are browsing the site, they can either download the individual PDFs or create their own compilation. When they choose to download their compilation, Aspose.Pdf.Kit kicks in and combines the individual PDFs into a single one for download.

Benefits

The Aspose suite of components is a feature-rich and light-weight suite that were:

  • Simple to use, with few lines of code
  • Excellent documentation, wiki, forum and code samples
  • Very good response times by their email support
  • Scalable, perfect for server-side deployments
  • Developer OEM Subscription is a big plus for royalty-free distributions

Conclusion

We hit the ground running, and were able to get an Aspose-based solution working within a week. We’re also looking to Aspose for other potential projects that require document creation and manipulation. They have a complete suite of products, and each one is intuitive, powerful and works in both 32 and 64-bit environments. We would highly recommend this suite to any serious developer.

Content Management Screen where word documents are uploaded and converted to PDF + HTML
PDF version – retains formatting from MS Word
HTML version – converted from Word

Chandan Kanadia
Technical Manager,
www.concept46.com

Download this success story in PDF format.