Patterns in static

Apophenia

Notes on input text file formatting

Each row of the file will be converted to one record in the database or one row in the matrix. Values on one row are separated by delimiters. Fixed-width input is also OK; see below.

By default, the delimiters are set to "|,\t", meaning that a pipe, comma, or tab will delimit separate entries. To change the default, please use an argument to apop_text_to_db or apop_text_to_data like .delimiters=" \t" or .delimiters="|". apop_opts.input_delimiters is deprecated.
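
To make the default delimiter set concrete, here is a minimal sketch in plain C of splitting one row on any character in "|,\t". It is not Apophenia's parser: split_row is a hypothetical helper written for this note, it ignores quoting, comments, and escapes, and its treatment of adjacent delimiters (an empty field) is a choice made for the sketch that may differ from the library.

```c
#include <assert.h>
#include <stddef.h>
#include <string.h>

/* Hypothetical helper, not part of Apophenia.
   Split `row` in place on any character found in `delims`, storing up to
   `max` field pointers in `fields`. Returns the number of fields stored.
   In this sketch, two adjacent delimiters produce an empty field. */
int split_row(char *row, const char *delims, char **fields, int max) {
    assert(row != NULL && delims != NULL && fields != NULL);
    int n = 0;
    char *p = row;
    while (n < max) {
        fields[n++] = p;                    /* a field starts here */
        size_t len = strcspn(p, delims);    /* length up to the next delimiter */
        if (p[len] == '\0') break;          /* end of row: last field */
        p[len] = '\0';                      /* terminate the field in place */
        p += len + 1;                       /* step past the delimiter */
    }
    return n;
}
```

Called on a writable copy of a row such as "alpha|2.5,three\t4" with delims set to "|,\t", the helper should report four fields; passing "|" alone would leave the comma and tab inside the second field.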

The input text file must be encoded as UTF-8 or traditional ASCII. Delimiters must be ASCII characters. If your data is in another encoding, try the POSIX-standard iconv program to convert the data to UTF-8.
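
Before reaching for iconv, it can help to check whether a file's bytes are already well-formed UTF-8. The following is a minimal sketch under that assumption; is_valid_utf8 is a hypothetical helper, not an Apophenia function, and it only checks byte structure, so it cannot tell you which other encoding a failing file uses.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>

/* Hypothetical helper, not part of Apophenia.
   Return true if the `len` bytes at `s` are well-formed UTF-8.
   Pure ASCII is a subset of UTF-8, so it passes as well. */
bool is_valid_utf8(const unsigned char *s, size_t len) {
    assert(s != NULL || len == 0);
    /* Smallest code point that legitimately needs 0..3 continuation bytes. */
    static const unsigned long min_cp[] = {0, 0x80, 0x800, 0x10000};
    size_t i = 0;
    while (i < len) {
        unsigned char c = s[i];
        size_t extra;        /* continuation bytes expected after the lead byte */
        unsigned long cp;    /* code point being decoded */
        if (c < 0x80)                { i++; continue; }   /* ASCII */
        else if ((c & 0xE0) == 0xC0) { extra = 1; cp = c & 0x1F; }
        else if ((c & 0xF0) == 0xE0) { extra = 2; cp = c & 0x0F; }
        else if ((c & 0xF8) == 0xF0) { extra = 3; cp = c & 0x07; }
        else return false;   /* stray continuation byte or invalid lead byte */
        if (extra > len - i - 1) return false;             /* truncated sequence */
        for (size_t k = 1; k <= extra; k++) {
            if ((s[i + k] & 0xC0) != 0x80) return false;   /* not a continuation */
            cp = (cp << 6) | (s[i + k] & 0x3F);
        }
        /* Reject overlong forms, UTF-16 surrogates, and values past U+10FFFF. */
        if (cp < min_cp[extra] || cp > 0x10FFFF || (cp >= 0xD800 && cp <= 0xDFFF))
            return false;
        i += extra + 1;
    }
    return true;
}
```

A Latin-1 file containing an accented letter will typically fail this check, which is a hint that it needs conversion before being read.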

If this rule doesn't work for your situation, you can explicitly insert a note that there is a missing data point. E.g., try:

perl -pi.bak -e 's/,,/,NaN,/g' data_file
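
One caveat with the substitution above: its matches do not overlap, so a run of three commas such as "1,,,2" becomes "1,NaN,,2" after a single pass and needs a second pass to fill the remaining gap. As a hedged alternative, here is a small C sketch that fills every empty field between adjacent delimiters in one pass. The name fill_empty_fields is hypothetical, and the sketch does not understand quoted fields, so delimiters inside quotes would be treated like any others.

```c
#include <assert.h>
#include <stddef.h>
#include <string.h>

/* Hypothetical helper, not part of Apophenia.
   Copy `in` to `out`, writing `marker` wherever two `delim` characters are
   adjacent. Returns the length of the result, or -1 if `out` (of size
   `outsize`, including the terminating NUL) is too small.
   Empty fields at the very start or end of the line are left alone. */
long fill_empty_fields(const char *in, char delim, const char *marker,
                       char *out, size_t outsize) {
    assert(in != NULL && marker != NULL && out != NULL);
    size_t mlen = strlen(marker);
    size_t o = 0;
    for (size_t i = 0; in[i] != '\0'; i++) {
        if (o + 1 >= outsize) return -1;         /* keep room for the NUL */
        out[o++] = in[i];
        if (in[i] == delim && in[i + 1] == delim) {
            if (o + mlen >= outsize) return -1;
            memcpy(out + o, marker, mlen);       /* fill the empty field */
            o += mlen;
        }
    }
    if (o >= outsize) return -1;
    out[o] = '\0';
    return (long)o;
}
```

With delim set to ',' and marker set to "NaN", the input "1,,,2" comes out as "1,NaN,NaN,2" in a single call.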

If your data file uses its own marker for missing data, you will need to set apop_opts.nan_string to text that matches that marker. E.g.,

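//Apophenia's default NaN string, matching NaN, nan, or NAN, but not Nancy:
apop_opts.nan_string = "NaN";
//An alternative, for files that mark missing data with the word Missing:
apop_opts.nan_string = "Missing";
//Or, turn off nan-string checking entirely with:
apop_opts.nan_string = NULL;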

SQLite stores these NaN-type values internally as NULL; that means that functions like apop_query_to_data will convert both your nan_string string and NULL to an NaN value.
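
The conversion described here can be pictured with a short sketch in plain C. It assumes, based only on the comment about NaN, nan, NAN, and Nancy, that the marker is compared against the whole field without regard to case. Both function names are hypothetical and this is not Apophenia's code; treating a NULL marker as "no text counts as missing" is likewise an assumption of the sketch.

```c
#include <assert.h>
#include <ctype.h>
#include <math.h>
#include <stdbool.h>
#include <stdlib.h>

/* Hypothetical helper: true if `field` equals `marker` in full, ignoring case.
   Sketch assumption: a NULL marker means no text is treated as missing. */
bool matches_marker(const char *field, const char *marker) {
    assert(field != NULL);
    if (marker == NULL) return false;
    while (*field != '\0' && *marker != '\0') {
        if (tolower((unsigned char)*field) != tolower((unsigned char)*marker))
            return false;
        field++;
        marker++;
    }
    /* Both strings must end together, so "Nancy" does not match "NaN". */
    return *field == '\0' && *marker == '\0';
}

/* Hypothetical helper: convert one text field from a query result to a double.
   A NULL field (the database's NULL) and the missing-data marker both map to
   NaN; anything else goes to strtod, which applies its own parsing rules and
   is not validated further in this sketch. */
double field_to_double(const char *field, const char *marker) {
    if (field == NULL || matches_marker(field, marker)) return NAN;
    return strtod(field, NULL);
}
```

With the marker set to "Missing", both a NULL field and the text "missing" would come back as NaN, while "3.5" would come back as 3.5.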

Autogenerated by doxygen on Wed Oct 15 2014 (Debian 0.999b+ds3-2).