Monday, February 11, 2008

Strip and Format text from PDF documents


PDFTextOnline is a free online utility that:
  • Converts PDF text quickly and accurately
  • Handles all fonts and languages (including Chinese, Japanese, Korean, and more)
  • Provides easy access to form data, document properties, and bookmarks
  • Doesn't require a software download -- it works in your browser!

Upload your PDF document and the service will present it to you in text format and let you change some formatting elements, such as the font and the page layout of the text. In the end, you get either a text document that you can save for later or plain text that you can copy and paste without having to worry about the formatting.

The developers of PDFTextOnline built an application called PDFTextStream, which they've incorporated into the PDFTextOnline Web app. The app uses Ajax to smoothly upload your document, strip out the images, and give you a clean and simple way to get to the document's text. The tool even retains any bookmarks and document information that might be included with the PDF.

If you just need to copy and paste the text, you don't need to save the document; you can copy the text straight from the Web app. If you need to edit the text, you'll have to save the document as a text file. Click "save all text," and the service will present you with a zip archive that contains the text of your PDF document. Once you've unzipped the file, you can use any text editor or word processor to work with the document.
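
If you would rather not unzip the archive by hand, the same step can be scripted. The following is only a minimal Python sketch using the standard zipfile module; it is not part of the service. The archive name "pdf-text.zip", the helper name read_text_from_archive, and the assumptions that the text is stored in .txt files and encoded as UTF-8 are all hypothetical, so adjust them to match what you actually download.

```python
import zipfile
from pathlib import Path


def read_text_from_archive(archive_path):
    """Return {member name: decoded text} for every .txt file in a zip archive."""
    texts = {}
    with zipfile.ZipFile(archive_path) as archive:
        for name in archive.namelist():
            # Assumption: the extracted text lives in files ending in .txt.
            if name.lower().endswith(".txt"):
                # Assumption: the text is UTF-8; undecodable bytes are replaced.
                texts[name] = archive.read(name).decode("utf-8", errors="replace")
    return texts


if __name__ == "__main__":
    # "pdf-text.zip" is a hypothetical name for the downloaded archive.
    archive = Path("pdf-text.zip")
    if archive.exists():
        for name, text in read_text_from_archive(archive).items():
            print(name, len(text), "characters")
```

The helper reads the text directly from the archive, so nothing is written to disk until you decide where to save it.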