DocWire DocToText - Powered by Silvercoders 5.0.5
A multifaceted, data extraction software development toolkit that converts all sorts of files to plain text and html. Written in C++, this data extraction tool has a parser able to convert PST & OST files along with a brand new API for better file processing. To enhance its utility, DocToText, as a data extraction tool, can be integrated with other data mining and data analytics applications. It comes equipped with a high grade, scriptable and trainable OCR that has LSTM neural networks based character recognition. This document parser is able to extract metadata along with annotations and supports a list of formats that include: DOC, XLS, XLSB, PPT, RTF, ODF (ODT, ODS, ODP), OOXML (DOCX, XLSX, PPTX), iWork (PAGES, NUMBERS, KEYNOTE), ODFXML (FODP, FODS, FODT), PDF, EML, HTML, Outlook (PST, OST), Image (JPG, JPEG, JFIF, BMP, PNM, PNG, TIFF, WEBP) and DICOM (DCM)
doctotext::ParserBuilderWrapper< ParserCreator > Class Template Reference

Provides the basic mechanism to build any parser. More...

#include <parser_wrapper.h>

Inheritance diagram for doctotext::ParserBuilderWrapper< ParserCreator >:
Collaboration diagram for doctotext::ParserBuilderWrapper< ParserCreator >:

Public Member Functions

std::unique_ptr< doctotext::Parserbuild (const std::string &inFileName) const override
 Builds new parser object. More...
 
std::unique_ptr< doctotext::Parserbuild (const char *buffer, size_t size) const override
 Builds new parser object. More...
 
doctotext::ParserBuilderwithLogStream (std::ostream *log_stream) override
 Sets log stream for parser. More...
 
doctotext::ParserBuilderwithVerboseLogging (bool verbose) override
 Turns on/off verbose logging. More...
 
doctotext::ParserBuilderwithOnNewNodeCallbacks (const std::vector< doctotext::NewNodeCallback > &callbacks) override
 
doctotext::ParserBuilderwithParserManager (const std::shared_ptr< doctotext::ParserManager > &inParserManager) override
 
doctotext::ParserBuilderwithParameters (const ParserParameters &inParameter) override
 Sets parser parameters. More...
 
- Public Member Functions inherited from doctotext::ParserBuilder
virtual std::unique_ptr< Parserbuild (const std::string &inFileName) const =0
 Builds new parser object. More...
 
virtual std::unique_ptr< Parserbuild (const char *buffer, size_t size) const =0
 Builds new parser object. More...
 
virtual ParserBuilderwithLogStream (std::ostream *log_stream)=0
 Sets log stream for parser. More...
 
virtual ParserBuilderwithVerboseLogging (bool verbose)=0
 Turns on/off verbose logging. More...
 
virtual ParserBuilderwithOnNewNodeCallbacks (const std::vector< NewNodeCallback > &callbacks)=0
 Adds callback function. More...
 
virtual ParserBuilderwithParserManager (const std::shared_ptr< ParserManager > &inParserManager)=0
 Sets parser manager. More...
 
virtual ParserBuilderwithParameters (const ParserParameters &inParameters)=0
 Sets parser parameters. More...
 

Detailed Description

template<typename ParserCreator>
class doctotext::ParserBuilderWrapper< ParserCreator >

Provides the basic mechanism to build any parser.

Template Parameters
ParserCreatortype of parser to build

Definition at line 123 of file parser_wrapper.h.

Constructor & Destructor Documentation

◆ ParserBuilderWrapper()

template<typename ParserCreator >
doctotext::ParserBuilderWrapper< ParserCreator >::ParserBuilderWrapper ( )
inline

Definition at line 126 of file parser_wrapper.h.

Member Function Documentation

◆ build() [1/2]

template<typename ParserCreator >
std::unique_ptr< doctotext::Parser > doctotext::ParserBuilderWrapper< ParserCreator >::build ( const char *  buffer,
size_t  size 
) const
inlineoverridevirtual

Builds new parser object.

Parameters
bufferraw data of file to be parsed
sizefile size
Returns
pointer to new parser object

Implements doctotext::ParserBuilder.

Definition at line 147 of file parser_wrapper.h.

◆ build() [2/2]

template<typename ParserCreator >
std::unique_ptr< doctotext::Parser > doctotext::ParserBuilderWrapper< ParserCreator >::build ( const std::string &  inFileName) const
inlineoverridevirtual

Builds new parser object.

Parameters
inFileNamepath to file
Returns
pointer to new parser object

Implements doctotext::ParserBuilder.

Definition at line 135 of file parser_wrapper.h.

◆ withLogStream()

template<typename ParserCreator >
doctotext::ParserBuilder & doctotext::ParserBuilderWrapper< ParserCreator >::withLogStream ( std::ostream *  log_stream)
inlineoverridevirtual

Sets log stream for parser.

Parameters
log_stream

Implements doctotext::ParserBuilder.

Definition at line 160 of file parser_wrapper.h.

◆ withOnNewNodeCallbacks()

template<typename ParserCreator >
doctotext::ParserBuilder & doctotext::ParserBuilderWrapper< ParserCreator >::withOnNewNodeCallbacks ( const std::vector< doctotext::NewNodeCallback > &  callbacks)
inlineoverride

Definition at line 174 of file parser_wrapper.h.

◆ withParameters()

template<typename ParserCreator >
doctotext::ParserBuilder & doctotext::ParserBuilderWrapper< ParserCreator >::withParameters ( const ParserParameters inParameters)
inlineoverridevirtual

Sets parser parameters.

Parameters
inParameters

Implements doctotext::ParserBuilder.

Definition at line 188 of file parser_wrapper.h.

◆ withParserManager()

template<typename ParserCreator >
doctotext::ParserBuilder & doctotext::ParserBuilderWrapper< ParserCreator >::withParserManager ( const std::shared_ptr< doctotext::ParserManager > &  inParserManager)
inlineoverride

Definition at line 181 of file parser_wrapper.h.

◆ withVerboseLogging()

template<typename ParserCreator >
doctotext::ParserBuilder & doctotext::ParserBuilderWrapper< ParserCreator >::withVerboseLogging ( bool  verbose)
inlineoverridevirtual

Turns on/off verbose logging.

Parameters
verbose

Implements doctotext::ParserBuilder.

Definition at line 167 of file parser_wrapper.h.


The documentation for this class was generated from the following file: