The Path From Print to Web Has Just Become Much Shorter
If you have print assets that you want to publish on the web, until now you had two options. The first one: the scanned material is OCR-ed, the text is then extracted into an editable text format (Word or XML), formatting and structure are applied throughout, and then the document is converted to a web-friendly format, like HTML. This is an expensive process even if done offshore. The second option: you can simply publish an OCR-ed scanned PDF file and accept that it will be clumsily rendered, not interactive, somewhat searchable, and in the case of large documents, your browser . . . [more]
