Skip to content

MeTA v2.4.0

Compare
Choose a tag to compare
@smassung smassung released this 07 Sep 21:17
· 192 commits to master since this release
v2.4.0
9f55ba4

New features

  • Add a minimal perfect hashing implementation for language_model, and unify
    the querying interface with the existing language model.

  • Add a CMake install() command to install MeTA as a library (issue #143). For
    example, once the library is installed, users can do:

    find_package(MeTA 2.4 REQUIRED)
    
    add_executable(my-program src/my_program.cpp)                                
    target_link_libraries(my-program meta-index) # or whatever libs needed from MeTA
  • Feature selection functionality added to multiclass_dataset and
    binary_dataset and views (issues #111, #149 and PR #150 thanks to @siddshuk).

    auto selector = features::make_selector(*config, training_vw);               
    uint64_t total_features_selected = 20;                                       
    selector->select(total_features_selected);                                   
    auto filtered_dset = features::filter_dataset(dset, *selector);              
  • Users can now, similar to hash_append, declare standalone functions in the
    same scope as their type called packed_read and packed_write which will be
    called by io::packed::read and io::packed::write, respectively, via argument-dependent lookup.

Bug fixes

  • Fix edge-case bug in the succinct data structures
  • Fix off-by-one error in lm::diff

Enhancements

  • Added functionality to the meta::hashing library: hash_append overload for
    std::vector, manually-seeded hash function
  • Further isolate ICU in MeTA to allow CMake to install()
  • Updates to EWS (UIUC) build guide
  • Add std::vector operations to io::packed
  • Consolidated all variants of chunk iterators into one template
  • Add MeTA's citation to the README!

Model File Checksums (sha256)

d29bf8b4cbeef21db087cf8042efe5afe25c7bd3c460997728d58b92c24ec283  beam-search-constituency-parser-4.tar.gz
ce44c7d96a8339ff4b597f35a35534ccf93ab99b7d45cbbdddffe7e362b9c20e  crf.tar.gz
2a75ab9750ad2eabfe1b53889b15a31f79bd2315f71c2a4a62f6364586a6042d  gigaword-embeddings-50d.tar.gz
40cd87901eb29b69e57e4bca14bc2539d7d6b4ad5c186d6f3b1532a60c5163b0  greedy-constituency-parser.tar.gz
a0a3814c1f82780f1296d600eba260f474420aa2d93f000e390c71a0ddac42d9  greedy-perceptron-tagger.tar.gz