[v1.3.2][1.3.2]

Bug fixes

fix potentially incorrect generation of vocabulary map files on 32-bit systems (this appears to have only impacted non-default block sizes)

v1.3.1

Bug fixes

fix calculation of average precision in ir_eval (the denominator was incorrect)
specify that labels are required for the file_corpus document list; this allows spaces in the path to each document

v1.3

New features

additions to the graph library:
- myopic search
- BFS
- preferential attachment graph generation model (supports node attractiveness from different distributions)
- betweenness centrality
- eigenvector centrality
added a new natural language parsing library:
- parse tree library (visitor-based)
- shift-reduce constituency parser for generating phrase structure trees
- reimplementation of evalb metrics for evaluating parsers
- new filter for Penn Treebank-style normalization
added a greedy averaged Perceptron-based tagger
demo application for various basic text processing (profile)
basic iostreams that support gzip compression (if compiled with ZLib support)
added iteration method for stats::multinomial seen events
added expected value and entropy functions to stats namespace
added linear_model: a generic multiclass classifier storage class
added gz_corpus: a compressed version of line_corpus
added macros for generating type safe identifiers with user defined literal suffixes
added a persistent stack data structure to meta::util

Enhancements

added operator== for util::optional<T>
better CMake support for building the libsvm modules
better CMake support for downloading unit-test data
improved setup guide in README (for OS X, Ubuntu, Arch, and EWS/ENGRIT)
tree analyzers refactored to use the new parser library (removes dependency on outside toolkits for generating tree files)
analyzers that are not part of the "core" have been moved into their respective folders (so ngram_pos_analyzer is in src/sequence, tree_analyzer is in src/parser)
make_index now checks if the files exist before loading an index, and if they are missing creates a new one (as opposed to just throwing an exception on a nonexistent file)
cpptoml upgraded to support TOML v0.4.0
enable extra warnings (-Wextra) for clang++ and g++

Bug fixes

fix sequence_analyzer::analyze() const when applied to untagged sequences (was throwing when it shouldn't)
ensure that the inverted index object is destroyed first before uninverting occurs in the creation of a forward_idnex
fix bug where icu_tokenizer would output spaces as tokens
fix bugs where index objects were not destroyed before trying to delete their files in the unit tests
fix bug in sparse_vector::find() where it would return a non-end iterator when asked to find an element that does not exist

v1.2

New features

demo application for CRF-based POS tagging
nearest_centroid classifier
basic statistics library for representing relevant probability distributions
sparse_vector utility class

Enhancements

ngram_pos_analyzer now uses the CRf internally (see issue #46)
knn classifier new supports weighted knn
filesystem::copy_file() no longer hangs without progress reporting with large files
CMake build system now includes INTERFACE targets (better inclusion as a subproject in external projects)
MeTA can now (optionally) be built with C++14 support

Bug fixes

language_model_ranker scoring function corrected (see issue #50)
naive_bayes classifier scoring corrected
several incorrect instances of numeric_limits<double>::min() replaced with the intended numeric_limits<double>::lowest()
fix compilation with versions of ICU < 4.4

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[v1.3.2][1.3.2]

Bug fixes

v1.3.1

Bug fixes

v1.3

New features

Enhancements

Bug fixes

v1.2

New features

Enhancements

Bug fixes

v1.1

Changes

v1.0

FilesExpand file tree

CHANGELOG.md

Latest commit

History

CHANGELOG.md

File metadata and controls

[v1.3.2][1.3.2]

Bug fixes

v1.3.1

Bug fixes

v1.3

New features

Enhancements

Bug fixes

v1.2

New features

Enhancements

Bug fixes

v1.1

Changes

v1.0