Detector components (costs, scores etc.) as classes #24

Tveten · 2024-10-20T15:00:19Z

Goal: Unify the detector components. Make them safer. Make the extension pattern simpler and clearer.

See #23 for discussions.

New features:

The BaseIntervalScorer class, inheriting from sktime.BaseEstimator. Public methods:
- fit(self, X, y=None) -> self
- evaluate(self, cuts: ArrayLike) -> np.ndarray
Four sub base classes inheriting from BaseIntervalScorer:
- skchange.costs.BaseCost. Expects 2 columns in cuts: start, end.
- skchange.change_scores.BaseChangeScore. Expects 3 columns in cuts: start, split, end.
- skchange.anomaly_scores.BaseSaving. Expects 2 columns in cuts: start, end.
- skchange.anomaly_scores.BaseLocalAnomalyScore: Expects 4 columns in cuts: outer_start, inner_start, inner_end, outer_end.
Classes for automatically converting costs to any of the three other score classes.
Convenience functions allowing either costs or an appropriate score to be used as input to all the detectors.

All existing functionality is implemented within the new design + additions.

Common pattern for initialisers.

…ction to scores.mean

Leave it to the numba compiler for now. Checking it in a good way is complicated due to the generic classes such as CostBasedChangeScore

Decoupled from numba and the detectors as opposed to other suggested designs.

Tveten · 2024-11-25T15:48:40Z

Naming decision:

Rename BaseIntervalEvaluator to BaseIntervalScorer based on discussion with @fkiraly
Rename the intervals argument to .evaluate to cuts based on offline discussion with @johannvk .
- intervals suggests the input should be two entries (start, end], and hides the potential splitting info.
- splits suggests the whole data should be split in two or more parts, and hides the interval subsetting info.
- cuts is a word that can cover both interval subsetting and splitting. It's up to each sub base class to define what the cuts mean and how they are used internally during .evaluate.

Tveten and others added 30 commits October 17, 2024 21:45

feat: add option to append a zero-row at the beginning of cumsum output

14e68d1

Common pattern for initialisers.

feat: add initialiser utilities

5c1de5d

refactor(api): move all functions related to mean change/anomaly dete…

fdc95a7

…ction to scores.mean

feat(api)!: add initial structure for detector components as classes

51ba5d8

docs: fix module docstring

03ba2a5

refactor: Rename MeanCost to L2Cost

446b864

fix: precomputed type to tuple

36f6ff8

refactor: rename precomputed_params -> precomputed

dae0376

feat: add first version of cost based anomaly score

7ff49d2

feat: remove check on the precompute pipeline

0a2c92e

Leave it to the numba compiler for now. Checking it in a good way is complicated due to the generic classes such as CostBasedChangeScore

refactor: precomputed typing and internal cost name

6a098ec

Add new base class for costs with more functionality

8e97e69

feat: add identify function

1838af9

refactor: name output scores

fe05137

feat: make cost based anomaly score work with new base cost

07621e7

feat: add cost based savings

7492e98

rename identify_func to identity

bbff69a

feat: add numba config and njit_configured decorator

e473dec

fix: njit_configured

037929b

fix: set default use_njit to None in update_config

a944f9c

feat: add function for converting to 2d array

dd38e35

feat: improve check_jitted error message

35a0936

delete experimental detector_components module

8f26238

feat: readd initialiser utilities

97fc3b0

rename mean_change_score

0083eff

add experimental numba subset evaluators

35e77f7

add experimental numba subset evaluators

aeca7a1

feat: add initial version of interval evaluators

234bbab

Decoupled from numba and the detectors as opposed to other suggested designs.

feat: add base cost class and l2 cost as interval evaluators

5ca63ab

feat: add interactive script for exploring evaluators

278015b

Tveten added 22 commits November 25, 2024 16:59

tests: add tests for erroneous covariance matrix inputs

f976b90

tests: add tests for erroneous variance inputs

912a52b

refactor: rename to BaseIntervalScorer and cuts argument

4027208

refactor: intervals -> cuts

de718aa

docs: small fixes

30c84c9

refactor: intervals -> cuts

b4db8d3

refactor: intervals -> cuts

dc7c2e4

refactor: intervals -> cuts

b47e2ba

refactor: intervals -> cuts

9d54e31

refactor: intervals -> cuts

f7904e6

refactor: score -> anomaly_score in circular binseg

e0e501a

docs: update base

819ec0f

docs: update to new interval scorer based structure

6f3f0ad

refactor: evaluator -> scorer in score converters

c1e8f1e

docs: update score -> anomaly_score in circular binseg

ef39f0d

docs: remove MoscoreAnomaly

731d00e

docs: add currentmodule for cost utilities

0ba4b7b

docs: fix typo

f7b823d

docs: add currentmodule

6cdfdb3

fix: failing tests

f6eee07

docs: add check_cuts_array

72c03ad

delete old njit module

f529c08

This was referenced Nov 25, 2024

Specializing numba score functions #19

Closed

Cost classes #23

Closed

Tveten added 4 commits November 26, 2024 00:53

fix: point_saving checking

ee31efb

tests: add more mvcapa tests for better coverage

8987914

tests: move comment

dd7dfda

fix: get evaluation_type from costs in cost converters

c08b0bb

Tveten merged commit 9591d33 into main Nov 26, 2024
7 of 8 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Detector components (costs, scores etc.) as classes #24

Detector components (costs, scores etc.) as classes #24

Tveten commented Oct 20, 2024 •

edited

Loading

Tveten commented Nov 25, 2024 •

edited

Loading

Detector components (costs, scores etc.) as classes #24

Detector components (costs, scores etc.) as classes #24

Conversation

Tveten commented Oct 20, 2024 • edited Loading

Tveten commented Nov 25, 2024 • edited Loading

Tveten commented Oct 20, 2024 •

edited

Loading

Tveten commented Nov 25, 2024 •

edited

Loading