DocWire DocToText - Powered by Silvercoders 5.0.5
A multifaceted, data extraction software development toolkit that converts all sorts of files to plain text and html. Written in C++, this data extraction tool has a parser able to convert PST & OST files along with a brand new API for better file processing. To enhance its utility, DocToText, as a data extraction tool, can be integrated with other data mining and data analytics applications. It comes equipped with a high grade, scriptable and trainable OCR that has LSTM neural networks based character recognition. This document parser is able to extract metadata along with annotations and supports a list of formats that include: DOC, XLS, XLSB, PPT, RTF, ODF (ODT, ODS, ODP), OOXML (DOCX, XLSX, PPTX), iWork (PAGES, NUMBERS, KEYNOTE), ODFXML (FODP, FODS, FODT), PDF, EML, HTML, Outlook (PST, OST), Image (JPG, JPEG, JFIF, BMP, PNM, PNG, TIFF, WEBP) and DICOM (DCM)
Class List
Here are the classes, structs, unions and interfaces with brief descriptions:
[detail level 12]
 Ndoctotext
 CEncryptedFileException
 CException
 CExporterExporter class is responsible for exporting the parsed data from importer or transformer to an output stream
 CFormattingStyle
 CHtmlExporterExporter class for HTML output
 CHtmlWriterThe HTMLWriter class
 CImporterThe Importer class. This class is used to import a file and parse it using available parsers
 CInfo
 CListStyle
 CMetadata
 CMetaDataExporterExporter class for meta data. Important: Exports only meta data as a plain text
 CMetaDataWriterWrites the meta data of the document as plain text to an output stream
 CParserAbstract class for all parsers
 Cparser_creator
 CParserBuilder
 CParserBuilderWrapperProvides the basic mechanism to build any parser
 CParserManagerParser manager class. Loads all available parsers and provides access to them
 CParserParametersStores list of parsers parameters. Every parser can query ParserParameter for a specific parameter. For example OCRParser queries ParserParameters for a language. Every parser contains ParserParameters and recursively passes it to another parser
 CParserProviderThe ParserProvider class
 CParserWrapper
 CParsingChainParsingChain class is a wrapper for all defined steps of the parsing process
 CPlainTextExporterExporter class for plain text output
 CPlainTextWriter
 CSimpleExtractorBasic functionality for extracting text from a document
 CStandardFilterSets of standard filters to use in parsers. example of use:
 CStandardTagContains set of basic tags using in parsers
 CTransformerThe Transformer transforms data from Importer or from another Transformer
 CTransformerFuncWraps single function (doctotext::NewNodeCallback) into Transformer object
 Cwrapper_parser_creator
 CCustomParser
 CCustomParserBuilder
 CCustomParserProvider[plugin_example_1]