DocWire DocToText - Powered by Silvercoders 5.0.5
A multifaceted, data extraction software development toolkit that converts all sorts of files to plain text and html. Written in C++, this data extraction tool has a parser able to convert PST & OST files along with a brand new API for better file processing. To enhance its utility, DocToText, as a data extraction tool, can be integrated with other data mining and data analytics applications. It comes equipped with a high grade, scriptable and trainable OCR that has LSTM neural networks based character recognition. This document parser is able to extract metadata along with annotations and supports a list of formats that include: DOC, XLS, XLSB, PPT, RTF, ODF (ODT, ODS, ODP), OOXML (DOCX, XLSX, PPTX), iWork (PAGES, NUMBERS, KEYNOTE), ODFXML (FODP, FODS, FODT), PDF, EML, HTML, Outlook (PST, OST), Image (JPG, JPEG, JFIF, BMP, PNM, PNG, TIFF, WEBP) and DICOM (DCM)
doctotext::StandardTag Member List

This is the complete list of members for doctotext::StandardTag, including all inherited members.

TAG_ATTACHMENTdoctotext::StandardTaginlinestatic
TAG_Bdoctotext::StandardTaginlinestatic
TAG_BRdoctotext::StandardTaginlinestatic
TAG_CLOSE_ATTACHMENTdoctotext::StandardTaginlinestatic
TAG_CLOSE_Bdoctotext::StandardTaginlinestatic
TAG_CLOSE_FOLDERdoctotext::StandardTaginlinestatic
TAG_CLOSE_Idoctotext::StandardTaginlinestatic
TAG_CLOSE_LINKdoctotext::StandardTaginlinestatic
TAG_CLOSE_LISTdoctotext::StandardTaginlinestatic
TAG_CLOSE_LIST_ITEMdoctotext::StandardTaginlinestatic
TAG_CLOSE_MAILdoctotext::StandardTaginlinestatic
TAG_CLOSE_MAIL_BODYdoctotext::StandardTaginlinestatic
TAG_CLOSE_Pdoctotext::StandardTaginlinestatic
TAG_CLOSE_PAGEdoctotext::StandardTaginlinestatic
TAG_CLOSE_STYLEdoctotext::StandardTaginlinestatic
TAG_CLOSE_TABLEdoctotext::StandardTaginlinestatic
TAG_CLOSE_TDdoctotext::StandardTaginlinestatic
TAG_CLOSE_TRdoctotext::StandardTaginlinestatic
TAG_CLOSE_Udoctotext::StandardTaginlinestatic
TAG_COMMENTdoctotext::StandardTaginlinestatic
TAG_FOLDERdoctotext::StandardTaginlinestatic
TAG_Idoctotext::StandardTaginlinestatic
TAG_LINKdoctotext::StandardTaginlinestatic
TAG_LISTdoctotext::StandardTaginlinestatic
TAG_LIST_ITEMdoctotext::StandardTaginlinestatic
TAG_MAILdoctotext::StandardTaginlinestatic
TAG_MAIL_BODYdoctotext::StandardTaginlinestatic
TAG_METADATAdoctotext::StandardTaginlinestatic
TAG_Pdoctotext::StandardTaginlinestatic
TAG_PAGEdoctotext::StandardTaginlinestatic
TAG_STYLEdoctotext::StandardTaginlinestatic
TAG_TABLEdoctotext::StandardTaginlinestatic
TAG_TDdoctotext::StandardTaginlinestatic
TAG_TEXTdoctotext::StandardTaginlinestatic
TAG_TRdoctotext::StandardTaginlinestatic
TAG_Udoctotext::StandardTaginlinestatic