DocWire DocToText - Powered by Silvercoders 5.0.5
A multifaceted data extraction software development toolkit that converts a wide range of files to plain text and HTML. Written in C++, it includes a parser for PST and OST files and a brand new API for better file processing. DocToText can be integrated with other data mining and data analytics applications. It comes with a high-grade, scriptable and trainable OCR engine that uses LSTM neural network based character recognition. The document parser extracts metadata and annotations and supports the following formats: DOC, XLS, XLSB, PPT, RTF, ODF (ODT, ODS, ODP), OOXML (DOCX, XLSX, PPTX), iWork (PAGES, NUMBERS, KEYNOTE), ODFXML (FODP, FODS, FODT), PDF, EML, HTML, Outlook (PST, OST), Image (JPG, JPEG, JFIF, BMP, PNM, PNG, TIFF, WEBP) and DICOM (DCM).
doctotext::ParserParameters Class Reference

Stores a list of parser parameters. Every parser can query ParserParameters for a specific parameter; for example, OCRParser queries ParserParameters for a language. Every parser holds a ParserParameters object and passes it recursively to other parsers.

#include <parser_parameters.h>

Public Member Functions

 ParserParameters ()
 Creates empty ParserParameters.
 
 ParserParameters (const ParserParameters &parameters)
 
 ParserParameters (const std::string &name, const std::any value)
 
void operator+= (const ParserParameters &parameters)
 Adds all parameters from the second ParserParameters.
 
ParserParameters & operator= (const ParserParameters &parameters)
 
template<typename T >
std::optional< T > getParameterValue (const std::string &name) const
 Returns the requested parameter if it exists.
 

Detailed Description

Stores a list of parser parameters. Every parser can query ParserParameters for a specific parameter; for example, OCRParser queries ParserParameters for a language. Every parser holds a ParserParameters object and passes it recursively to other parsers.

Example:

ParserParameters parameters("language", doctotext::Language::english); // Important: explicit type
parameters += ParserParameters("TESSDATA_PREFIX", "tessdata"); // Adds a second parameter
Importer importer("img.jpg");
importer.add_parameters(parameters); // Passes the parameters on to the parsers

Definition at line 60 of file parser_parameters.h.
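
The "explicit type" remark in the example can be illustrated with plain std::any, independent of DocToText. The sketch below assumes that a stored value is only retrievable under exactly the type it was stored with, which is how std::any_cast behaves; whether the library retrieves values this way is an assumption. The names value_as, literal_value and string_value are hypothetical helpers, not part of the DocToText API.

```cpp
#include <any>
#include <cassert>
#include <optional>
#include <string>

// Hypothetical helper (not part of DocToText): returns the stored value only
// when the std::any holds exactly type T, otherwise an empty optional.
template <typename T>
std::optional<T> value_as(const std::any& stored) {
    if (const T* p = std::any_cast<T>(&stored)) {
        return *p;
    }
    return std::nullopt;
}

// A string literal decays to const char* when it is copied into std::any.
inline std::any literal_value() { return std::any("tessdata"); }

// Wrapping the literal in std::string fixes the stored type explicitly.
inline std::any string_value() { return std::any(std::string("tessdata")); }
```

Under this assumption, value_as<std::string>(literal_value()) is empty while value_as<const char*>(literal_value()) holds a value, so the type used when storing a parameter has to match the type the reader asks for.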

Constructor & Destructor Documentation

◆ ParserParameters() [1/2]

doctotext::ParserParameters::ParserParameters ( const ParserParameters &  parameters)
inline

Definition at line 68 of file parser_parameters.h.

◆ ParserParameters() [2/2]

doctotext::ParserParameters::ParserParameters ( const std::string &  name, const std::any  value )

Creates ParserParameters with one defined parameter.

Parameters
name    name of the parameter
value   value of the parameter

Member Function Documentation

◆ getParameterValue()

template<typename T >
std::optional< T > doctotext::ParserParameters::getParameterValue ( const std::string &  name) const
inline

Returns the requested parameter if it exists.

Parameters
name    name of the requested parameter
Returns
the found parameter, or an empty optional if the parameter does not exist

Definition at line 104 of file parser_parameters.h.
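
A lookup that returns std::optional lets the caller supply a fallback when a parameter is missing. The following is a minimal sketch of that pattern, assuming a name-to-std::any map as storage; SketchParameters, set, get, dpi_or_default and the "dpi" parameter name are all hypothetical stand-ins and do not reflect the actual DocToText implementation.

```cpp
#include <any>
#include <cassert>
#include <map>
#include <optional>
#include <string>
#include <utility>

// Hypothetical stand-in for illustration; not the DocToText implementation.
class SketchParameters {
public:
    // Stores or replaces the value kept under `name`.
    void set(const std::string& name, std::any value) {
        values_[name] = std::move(value);
    }

    // Returns the value when `name` exists and holds exactly type T,
    // otherwise an empty optional.
    template <typename T>
    std::optional<T> get(const std::string& name) const {
        auto it = values_.find(name);
        if (it == values_.end()) return std::nullopt;
        if (const T* p = std::any_cast<T>(&it->second)) return *p;
        return std::nullopt;
    }

private:
    std::map<std::string, std::any> values_;
};

// Reads the hypothetical "dpi" parameter, falling back to 300 when it is
// missing or stored under a different type.
inline int dpi_or_default(const SketchParameters& params) {
    return params.get<int>("dpi").value_or(300);
}
```

In this sketch a type mismatch is treated the same as a missing parameter; that is a design choice of the example, not a documented property of getParameterValue().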

◆ operator+=()

void doctotext::ParserParameters::operator+= ( const ParserParameters &  parameters)

Adds all parameters from the second ParserParameters.

Parameters
parameters
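
Merging two parameter sets can be pictured as copying every entry of one map into another. The sketch below uses a plain std::map rather than the library class. The page does not say what happens when both sets contain the same name, so the overwrite behaviour shown here is an assumption, and ParamMap and merge_into are hypothetical names.

```cpp
#include <any>
#include <cassert>
#include <map>
#include <string>

// Hypothetical storage type for illustration; not the DocToText internals.
using ParamMap = std::map<std::string, std::any>;

// Copies every entry of `extra` into `base`.
// ASSUMPTION: on a name clash the entry from `extra` replaces the one in `base`.
inline void merge_into(ParamMap& base, const ParamMap& extra) {
    for (const auto& [name, value] : extra) {
        base[name] = value;
    }
}
```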

◆ operator=()

ParserParameters & doctotext::ParserParameters::operator= ( const ParserParameters &  parameters)
inline

Definition at line 89 of file parser_parameters.h.


The documentation for this class was generated from the following file: