![]() |
SG++-Doxygen-Documentation
|
ArffFileSampleProvider allows reading data in ARFF format into a sgpp::datadriven::Dataset object. More...
#include <ArffFileSampleProvider.hpp>
Public Member Functions | |
| ArffFileSampleProvider (DataShufflingFunctor *shuffling=nullptr) | |
| Default constructor. More... | |
| SampleProvider * | clone () const override |
| Clone Pattern to allow copying of derived classes. More... | |
| Dataset * | getAllSamples () override |
| Asks to return all available samples. More... | |
| size_t | getDim () const override |
| Returns the maximal dimensionality of the data. More... | |
| Dataset * | getNextSamples (size_t howMany) override |
| Lets the user request a certain amount of samples. More... | |
| size_t | getNumSamples () const override |
| Returns the number of samples availible or throws if not possible. More... | |
| void | readFile (const std::string &filePath, bool hasTargets, size_t readinCutoff=-1, std::vector< size_t > readinColumns=std::vector< size_t >(), std::vector< double > readinClasses=std::vector< double >()) override |
| Open an existing ARFF file, parse it and store its contents inside this class. More... | |
| void | readString (const std::string &input, bool hasTargets, size_t readinCutoff=-1, std::vector< size_t > readinColumns=std::vector< size_t >(), std::vector< double > readinClasses=std::vector< double >()) override |
| Parse contents of a string containing information in ARFF format, parse it and store its contents inside this class. More... | |
| void | reset () override |
| Resets the state of the sample provider (e.g. More... | |
Public Member Functions inherited from sgpp::datadriven::SampleProvider | |
| SampleProvider & | operator= (const SampleProvider &rhs)=default |
| SampleProvider & | operator= (SampleProvider &&rhs)=default |
| SampleProvider ()=default | |
| SampleProvider (const SampleProvider &rhs)=default | |
| SampleProvider (SampleProvider &&rhs)=default | |
| virtual | ~SampleProvider ()=default |
ArffFileSampleProvider allows reading data in ARFF format into a sgpp::datadriven::Dataset object.
Data can currently be either be a string formatted in ARFF or a file containing ARFF data.
|
explicit |
Default constructor.
| shuffling | functor to permute the training data indexes |
|
overridevirtual |
Clone Pattern to allow copying of derived classes.
Implements sgpp::datadriven::SampleProvider.
|
overridevirtual |
Asks to return all available samples.
This functionality is designed for returning all available samples from an entire file.
Implements sgpp::datadriven::SampleProvider.
References sgpp::datadriven::Dataset::getDimension(), getNextSamples(), and sgpp::datadriven::Dataset::getNumberInstances().
|
overridevirtual |
Returns the maximal dimensionality of the data.
Implements sgpp::datadriven::SampleProvider.
References sgpp::datadriven::Dataset::getDimension().
Referenced by python.uq.dists.SGDEdist.SGDEdist::__str__(), and python.uq.dists.KDEDist.KDEDist::getBandwidths().
|
overridevirtual |
Lets the user request a certain amount of samples.
This functionality is is designed for streaming algorithms where data is processed in batches.
| howMany | number requested amount of samples. The amount of actually provided samples can be smaller, if there is not sufficient data. |
Implements sgpp::datadriven::SampleProvider.
References sgpp::datadriven::Dataset::getDimension().
Referenced by getAllSamples().
|
overridevirtual |
Returns the number of samples availible or throws if not possible.
Implements sgpp::datadriven::SampleProvider.
References sgpp::datadriven::Dataset::getDimension(), and sgpp::datadriven::Dataset::getNumberInstances().
|
overridevirtual |
Open an existing ARFF file, parse it and store its contents inside this class.
Throws if file can not be opened or parsed.
| filePath | Path to an existing file. |
| hasTargets | whether the file has targest (i.e. supervised learning) |
| readinCutoff | see FileSampleProvider.hpp |
| readinColumns | see FileSampleProvider.hpp |
| readinClasses | see FileSampleProvider.hpp |
Implements sgpp::datadriven::FileSampleProvider.
References sgpp::datadriven::ARFFTools::readARFFFromFile().
|
overridevirtual |
Parse contents of a string containing information in ARFF format, parse it and store its contents inside this class.
Throws if string can not be parsed.
| input | string containing information in ARFF file format |
| hasTargets | whether the file has targest (i.e. supervised learning) |
| readinCutoff | see FileSampleProvider.hpp |
| readinColumns | see FileSampleProvider.hpp |
| readinClasses | see FileSampleProvider.hpp |
Implements sgpp::datadriven::FileSampleProvider.
References sgpp::datadriven::Dataset::getData(), sgpp::datadriven::Dataset::getDimension(), sgpp::datadriven::Dataset::getNumberInstances(), sgpp::datadriven::Dataset::getTargets(), python.statsfileInfo::i, sgpp::datadriven::ARFFTools::readARFFFromString(), and sgpp::base::DataMatrix::setRow().
|
overridevirtual |
Resets the state of the sample provider (e.g.
to start a new epoch)
Implements sgpp::datadriven::SampleProvider.