First page Back Continue Last page Text

Notes:


The IGranulator interface provides APIs to extract tables from a document, to extract images from a document and to extract paragraphs from a document.

Implementation is normally based on a analysis of a base format (ex. ODT, HTML). Initial conversion to that base format may thus be required. Output of granulation is provided in a standard XML-RPC output. For images, it is provided in any image format and can then be converted by a conversion handler.