|
DocWire DocToText - Powered by Silvercoders 5.0.5
A multifaceted, data extraction software development toolkit that converts all sorts of files to plain text and html. Written in C++, this data extraction tool has a parser able to convert PST & OST files along with a brand new API for better file processing. To enhance its utility, DocToText, as a data extraction tool, can be integrated with other data mining and data analytics applications. It comes equipped with a high grade, scriptable and trainable OCR that has LSTM neural networks based character recognition. This document parser is able to extract metadata along with annotations and supports a list of formats that include: DOC, XLS, XLSB, PPT, RTF, ODF (ODT, ODS, ODP), OOXML (DOCX, XLSX, PPTX), iWork (PAGES, NUMBERS, KEYNOTE), ODFXML (FODP, FODS, FODT), PDF, EML, HTML, Outlook (PST, OST), Image (JPG, JPEG, JFIF, BMP, PNM, PNG, TIFF, WEBP) and DICOM (DCM)
|
This is the complete list of members for doctotext::Metadata, including all inherited members.
| addField(const std::string &field_name, const Variant &field_value) (defined in doctotext::Metadata) | doctotext::Metadata | |
| author() (defined in doctotext::Metadata) | doctotext::Metadata | |
| authorType() (defined in doctotext::Metadata) | doctotext::Metadata | |
| creationDate() (defined in doctotext::Metadata) | doctotext::Metadata | |
| creationDateType() (defined in doctotext::Metadata) | doctotext::Metadata | |
| DataType enum name (defined in doctotext::Metadata) | doctotext::Metadata | |
| EXTRACTED enum value (defined in doctotext::Metadata) | doctotext::Metadata | |
| getField(const std::string &field_name) const (defined in doctotext::Metadata) | doctotext::Metadata | |
| getFields() const (defined in doctotext::Metadata) | doctotext::Metadata | |
| getFieldsAsAny() const (defined in doctotext::Metadata) | doctotext::Metadata | |
| hasField(const std::string &field_name) const (defined in doctotext::Metadata) | doctotext::Metadata | |
| lastModificationDate() (defined in doctotext::Metadata) | doctotext::Metadata | |
| lastModificationDateType() (defined in doctotext::Metadata) | doctotext::Metadata | |
| lastModifiedBy() (defined in doctotext::Metadata) | doctotext::Metadata | |
| lastModifiedByType() (defined in doctotext::Metadata) | doctotext::Metadata | |
| Metadata() (defined in doctotext::Metadata) | doctotext::Metadata | |
| Metadata(const Metadata &r) (defined in doctotext::Metadata) | doctotext::Metadata | |
| NONE enum value (defined in doctotext::Metadata) | doctotext::Metadata | |
| operator=(const Metadata &r) (defined in doctotext::Metadata) | doctotext::Metadata | |
| pageCount() (defined in doctotext::Metadata) | doctotext::Metadata | |
| pageCountType() (defined in doctotext::Metadata) | doctotext::Metadata | |
| setAuthor(const std::string &author) (defined in doctotext::Metadata) | doctotext::Metadata | |
| setAuthorType(DataType type) (defined in doctotext::Metadata) | doctotext::Metadata | |
| setCreationDate(const tm &creation_date) (defined in doctotext::Metadata) | doctotext::Metadata | |
| setCreationDateType(DataType type) (defined in doctotext::Metadata) | doctotext::Metadata | |
| setLastModificationDate(const tm &last_modification_date) (defined in doctotext::Metadata) | doctotext::Metadata | |
| setLastModificationDateType(DataType type) (defined in doctotext::Metadata) | doctotext::Metadata | |
| setLastModifiedBy(const std::string &last_modified_by) (defined in doctotext::Metadata) | doctotext::Metadata | |
| setLastModifiedByType(DataType type) (defined in doctotext::Metadata) | doctotext::Metadata | |
| setPageCount(int page_count) (defined in doctotext::Metadata) | doctotext::Metadata | |
| setPageCountType(DataType type) (defined in doctotext::Metadata) | doctotext::Metadata | |
| setWordCount(int word_count) (defined in doctotext::Metadata) | doctotext::Metadata | |
| setWordCountType(DataType type) (defined in doctotext::Metadata) | doctotext::Metadata | |
| wordCount() (defined in doctotext::Metadata) | doctotext::Metadata | |
| wordCountType() (defined in doctotext::Metadata) | doctotext::Metadata | |
| ~Metadata() (defined in doctotext::Metadata) | doctotext::Metadata |