
Releases: dssg/triage

White peaches

16 Oct 17:15

Bug Fixes

  • Fixes a bug related to the random_state variable in Triage's configuration file. The random seeds generated from the defined random state were not propagating correctly during the training of individual models.
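As a rough illustration of what correct propagation means here, the sketch below derives one reproducible seed per model from a single master random state. It is not Triage's actual implementation; `derive_model_seeds` and its parameters are hypothetical names used only for this example.

```python
import random


def derive_model_seeds(random_state, n_models):
    """Derive one seed per model from a single master random state.

    Hypothetical helper for illustration only; not part of Triage's API.
    """
    # A dedicated generator keeps the derivation independent of global state.
    rng = random.Random(random_state)
    # Each model gets its own seed, but the whole list is reproducible
    # from the master random_state.
    return [rng.randint(0, 2**31 - 1) for _ in range(n_models)]


seeds = derive_model_seeds(random_state=42, n_models=3)
```

Running the function twice with the same `random_state` yields the same list of seeds, which is the property that makes individual model training reproducible.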

Documentation

  • Colab example has been updated with warnings when installing Triage.

Update on required packages

  • Some packages have been updated with newer versions.
  • Some packages have been removed as requirements since they are now included in other packages in the requirements file.

Pusadee's (Patch 2)

02 Feb 18:42

Bug Fixes

  • Fixes saving csv.gz matrices to S3, which was not working correctly. The complete file is now put to S3 instead of being streamed in chunks.

Pusadee's (Patch 1)

01 Feb 17:20

Bug Fixes

  • Fixes saving the csv.gz feature matrices to S3.

Pusadee's

22 Jan 21:19

New Functionality

  • Changes how the feature matrix is created: a CSV is now generated for each feature group, and the CSVs are stitched together on disk.
  • CSVs are now read and loaded with Polars instead of pandas DataFrames, giving a 10x improvement in load time.
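The stitching step can be sketched roughly as follows. This is a minimal illustration and not Triage's code: it uses the standard-library `csv` module rather than Polars, and it assumes that every feature-group CSV starts with the same key columns (`entity_id`, `as_of_date`) and lists its rows in the same order. `stitch_feature_csvs` and `KEY_COLUMNS` are hypothetical names.

```python
import csv
from contextlib import ExitStack

# Assumed index columns shared by every feature-group CSV (illustrative).
KEY_COLUMNS = ["entity_id", "as_of_date"]


def stitch_feature_csvs(group_paths, out_path):
    """Join per-feature-group CSVs column-wise into one matrix CSV on disk."""
    n_keys = len(KEY_COLUMNS)
    with ExitStack() as stack:
        readers = [
            csv.reader(stack.enter_context(open(path, newline="")))
            for path in group_paths
        ]
        writer = csv.writer(stack.enter_context(open(out_path, "w", newline="")))

        # Header: the key columns once, then each group's feature columns.
        headers = [next(reader) for reader in readers]
        writer.writerow(KEY_COLUMNS + [c for h in headers for c in h[n_keys:]])

        # Read all files in lockstep, one row at a time, so the full
        # matrix never has to be held in memory.
        for rows in zip(*readers):
            keys = rows[0][:n_keys]
            if any(row[:n_keys] != keys for row in rows):
                raise ValueError(f"row keys differ across feature groups: {keys}")
            writer.writerow(keys + [v for row in rows for v in row[n_keys:]])
```

Reading the files in lockstep keeps memory use flat regardless of matrix size, which is the main appeal of stitching on disk.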

Dried Peach (Patch 3)

19 Sep 21:19

New Functionality

  • Adds functionality to deal with the cohort being defined within the label query for predicting lists

Bug Fixes

  • Fixes a normalization bug in the Baseline Multi-Feature Ranker (#934)

Refactoring/Documentation

  • Updates different packages

Dried Peach (Patch 2)

08 Nov 16:28

Bug Fixes

  • Fixes a bug with logging existing cohort dates when using the labels query for the cohort and running with replace=False (#915)

Dried Peach (Patched)

25 Oct 22:00

Bug Fixes

  • Fixes a bug that caused the old default behavior to be used when the cohort section of the config was omitted in order to use the label config for both (#911)
  • Raises the intended deprecation warning (rather than a cryptic error) when specifying a groups key to feature aggregations (#907)

Refactoring/Documentation

  • Adds a Colab-based tutorial for a quicker introduction to using triage (#878)
  • Removes a duplicated cohort section from the example experiment config (#902)
  • Various minor documentation updates

Dried Peach

20 May 22:45

New Functionality

  • Adds Python 3.10 support (NOTE: drops Python 3.7 support) (via #893)
  • Support (and prefer) specifying a SQL file path for cohort and label queries (#883)
  • Adds information about cohort, label, and bias hashes to the triage_runs table (#888)
  • Makes specifying a cohort optional, defaulting to the label query (#877)

Bug Fixes

  • Fixes an issue with installing on macOS with Python 3.9 (via #893)
  • Removes (buggy) support for groupings other than entity_id in feature generation (#887)

Refactoring/Documentation

  • Various dependency updates (#893)
  • Raises an error on cohort or label duplicates (#889)
  • Various updates to documentation

Dried Mango (Patched)

08 Feb 22:19

New Functionality

  • Adds support for postgres 12 and 13 (#882)

Refactoring/Documentation

  • Fixes typos in documentation

Dried Mango

07 Dec 18:07

NOTE: This release adds Python 3.9 support, but unfortunately drops Python 3.6 support.

New Functionality

  • Adds python 3.9 support (#865)
  • Adds BaselineRankMultiFeature baseline model to allow for ranking by more than one feature (#871)

Bug Fixes

  • Ensure attributes passed to aequitas are strings to avoid issues with crosstab calculation (#876)

Refactoring/Usability

  • Renames the descend parameter to low_value_high_score to improve clarity (#871; see also #869)
  • Use a threading backend for sklearn parallelization nested within triage multiprocessing to suppress a warning about nested parallelization (#866)
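To make the meaning of low_value_high_score concrete, here is a minimal sketch of ranking by more than one feature. It is an illustration under stated assumptions, not the BaselineRankMultiFeature implementation: `rank_by_features`, the `(feature, low_value_high_score)` rule tuples, and the linear rank-to-score mapping are all hypothetical.

```python
def rank_by_features(rows, rules):
    """Score rows by ranking on several features in priority order.

    rows:  list of dicts mapping feature name to value.
    rules: list of (feature, low_value_high_score) tuples, most
           important first. Hypothetical structure for illustration.
    """
    order = list(range(len(rows)))
    # Python's sort is stable, so sorting by the least important rule
    # first leaves the most important rule as the primary ordering.
    for feature, low_value_high_score in reversed(rules):
        order.sort(
            key=lambda i: rows[i][feature],
            # low_value_high_score=True: low values come first (ascending).
            reverse=not low_value_high_score,
        )
    n = len(rows)
    scores = [0.0] * n
    for rank, i in enumerate(order):
        # The top-ranked row scores 1.0; scores fall linearly with rank.
        scores[i] = 1.0 - rank / n
    return scores


rows = [
    {"age": 30, "visits": 2},
    {"age": 20, "visits": 5},
    {"age": 20, "visits": 1},
]
# Rank young entities first; break ties by more visits.
scores = rank_by_features(rows, [("age", True), ("visits", False)])
```

With these rules, the second row (age 20, 5 visits) receives the highest score, followed by the third row, then the first. Naming the flag for its effect on the score, rather than for a sort direction, is what the rename from descend is meant to clarify.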