As a result, most of the Parser implementation classes are just adapters to such external libraries. When parsing a document, Tika attempts to reuse existing parser libraries such as Apache POI or PDFBox as much as possible. The parse method throws an IOException if it fails to read from the input stream, a TikaException if the document taken from the stream cannot be parsed and a SAXException if the handler is unable to process an event.
0 Comments
Leave a Reply. |