DocWire DocToText - Powered by Silvercoders 5.0.5
A multifaceted, data extraction software development toolkit that converts all sorts of files to plain text and html. Written in C++, this data extraction tool has a parser able to convert PST & OST files along with a brand new API for better file processing. To enhance its utility, DocToText, as a data extraction tool, can be integrated with other data mining and data analytics applications. It comes equipped with a high grade, scriptable and trainable OCR that has LSTM neural networks based character recognition. This document parser is able to extract metadata along with annotations and supports a list of formats that include: DOC, XLS, XLSB, PPT, RTF, ODF (ODT, ODS, ODP), OOXML (DOCX, XLSX, PPTX), iWork (PAGES, NUMBERS, KEYNOTE), ODFXML (FODP, FODS, FODT), PDF, EML, HTML, Outlook (PST, OST), Image (JPG, JPEG, JFIF, BMP, PNM, PNG, TIFF, WEBP) and DICOM (DCM)
doctotext::Exporter Class Reference

Exporter class is responsible for exporting the parsed data from importer or transformer to an output stream. More...

#include <exporter.h>

Inheritance diagram for doctotext::Exporter:

Public Member Functions

 Exporter (std::unique_ptr< Writer > writer)
 
 Exporter (std::unique_ptr< Writer > writer, std::ostream &out_stream)
 
 Exporter (const Exporter &other)
 
 Exporter (const Exporter &&other)
 
virtual Exporterclone () const
 Creates clone of this exporter. More...
 
void set_out_stream (std::ostream &out_stream)
 Sets output stream. More...
 
bool is_valid () const
 Check if exporter contains valid output. More...
 
void export_to (doctotext::Info &info) const
 Exxports data from Info structure to output stream. More...
 
void begin () const
 Sets writer to use.
 
void end () const
 Ends writing.
 

Protected Member Functions

std::ostream & get_output () const
 

Detailed Description

Exporter class is responsible for exporting the parsed data from importer or transformer to an output stream.

Importer(parser_manager, "file.pdf") | PlainTextExporter() | std::cout; // Imports file.pdf and exports it to std::cout as plain text
The Importer class. This class is used to import a file and parse it using available parsers.
Definition: importer.h:57
Exporter class for plain text output.
Definition: exporter.h:137

Definition at line 58 of file exporter.h.

Constructor & Destructor Documentation

◆ Exporter() [1/2]

doctotext::Exporter::Exporter ( std::unique_ptr< Writer >  writer)
Parameters
writerwriter to use.

◆ Exporter() [2/2]

doctotext::Exporter::Exporter ( std::unique_ptr< Writer >  writer,
std::ostream &  out_stream 
)
Parameters
writerwriter to use.
out_streamExporter output stream. Exporter will be writing to this stream.

Member Function Documentation

◆ clone()

virtual Exporter * doctotext::Exporter::clone ( ) const
virtual

Creates clone of this exporter.

Returns
new exporter

◆ export_to()

void doctotext::Exporter::export_to ( doctotext::Info info) const

Exxports data from Info structure to output stream.

Parameters
infodata from callback function.

◆ is_valid()

bool doctotext::Exporter::is_valid ( ) const

Check if exporter contains valid output.

True if output is valid.

◆ set_out_stream()

void doctotext::Exporter::set_out_stream ( std::ostream &  out_stream)

Sets output stream.

Parameters
out_streamreference to output stream.

The documentation for this class was generated from the following file: