Multi-fidelity tabular benchmark compatible additions #112

Closed
Wants to merge 53 commits

Conversation

@Neeratyoy
Collaborator

Main changes/updates (primarily w.r.t. the ML benchmarks):

  • Edits the abstract benchmark function signature to include an option for choosing the fidelity space type
    • Adds an extra parameter, fidelity_choice=None, which should not break existing code
    • Allows future extension of a benchmark to a different fidelity (e.g. dataset fraction) or even multi-multi-fidelity
  • Adds an ML-benchmark-specific template that can be inherited to easily add more parameter spaces by defining only their ConfigurationSpace and the model definition (see the sketch after this list)
  • The above naturally implies that the data preprocessing and the model pipeline for the ML models are fixed/standardized
    • More decisions are needed for the final design, but under this class design any future change to the pipeline would affect all ML parameter spaces
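
A rough sketch of these two ideas together; every name and method body below is illustrative rather than the merged API:

    class AbstractBenchmark:
        # Existing-style interface, extended with the proposed fidelity_choice
        # parameter; the default None preserves current behaviour, so existing
        # callers keep working unchanged.
        @staticmethod
        def get_fidelity_space(seed=None, fidelity_choice=None):
            raise NotImplementedError


    class MLBenchmark(AbstractBenchmark):
        """Template for ML benchmarks: a subclass defines only its
        ConfigurationSpace and its model; preprocessing and the training
        pipeline live here, shared by all ML parameter spaces."""

        @staticmethod
        def get_configuration_space(seed=None):
            raise NotImplementedError

        def init_model(self, config, fidelity=None, rng=None):
            raise NotImplementedError

        def objective_function(self, configuration, fidelity=None, rng=None, **kwargs):
            # Shared pipeline: data preprocessing, fitting, and evaluation
            # would happen here once, for every ML parameter space.
            model = self.init_model(configuration, fidelity, rng)
            ...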

TODO:

  • Verify parameter spaces
  • Hist Gradient Boosting is untested and included only as a placeholder

KEggensperger and others added 18 commits March 26, 2021 11:46
* Scikit-learn emitted a warning every time the benchmark was created, saying the surrogate had been created with a different scikit-learn version.
However, we can't suppress one further deprecation warning, since scikit-learn makes sure its deprecation warnings are not suppressed. (!)
* Increase benchmark version
* Add a reduced configuration space for the paramnet benchmark
* Codestyle: Add new line at end of file
SVM Surrogate Benchmark

* Initial commit of an SVM Surrogate Benchmark.

* Remove some incorrect information from the docstrings (also in the Paramnet benchmark)

* Change an old test case which tested for actual runtime (SVM).
  This test compared the actual wall-clock time needed to run an SVM configuration, so it often failed on different machines.

* SVM Surrogate: Add more references + min dataset fraction

* SVM: Add Container Recipe + ClientInterface
* Readme + small fix in client

* Test different singularity versions

* Print Container Version + HPOBench Version in Log.
Improve Container Integration

* Container-Client-Communication: The container no longer reads the hpobenchrc file. We now bind the directories (socket, data, ...) to fixed paths in the container.

* Increase Version Number for each Container

* Make the client's abstract benchmark function calls consistent.

* Update Logging Procedure on Client and in Container

* Update Recipe Files: 
  Add a clean-up step to the recipe to reduce the size of the containers. 

* Add a Container Configuration test

* Update Recipe Template + Fix Typo
@KEggensperger self-requested a review July 7, 2021 08:35
@KEggensperger
Contributor

Could you please rebase onto development?

@PhMueller
Contributor

Hey Neeratyoy,

thanks for your work!
Before changing the interface, maybe a quick question:

Couldn't we also solve this via inheritance?

I'm thinking of having a BaseMLBenchmark() and creating new classes which override the corresponding fidelity space.
Is this something you could use?
Something like this:

class BaseMLBenchmark:
    def __init__(self):
        do_stuff()

    def objective_function(self):
        do_compute()

    @staticmethod
    def get_fidelity_space(seed=None):
        raise NotImplementedError()


class SmallFidelityBenchmark(BaseMLBenchmark):
    @staticmethod
    def get_fidelity_space(seed=None):
        return SmallFidelitySpace()


class LargeFidelityBenchmark(BaseMLBenchmark):
    @staticmethod
    def get_fidelity_space(seed=None):
        return LargeFidelitySpace()
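
For concreteness, one such override might return a ConfigSpace object along these lines; dataset_fraction and its bounds are invented for illustration, not taken from the PR:

    import ConfigSpace as CS

    class SubsampleFidelityBenchmark(BaseMLBenchmark):
        # Concrete override of the hook above with a dataset-fraction fidelity.
        @staticmethod
        def get_fidelity_space(seed=None):
            cs = CS.ConfigurationSpace(seed=seed)
            cs.add_hyperparameter(
                CS.UniformFloatHyperparameter('dataset_fraction', lower=0.1, upper=1.0))
            return cs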

Since the other benchmarks don't use the fidelity_choice, it would be cool to talk a little bit about it.

@Neeratyoy
Collaborator Author

it would be cool to talk a little bit about it.

Sure, I think we should.

I went with that design since I believe there are checks on the abstract class's function signatures. And to avoid potentially affecting containerization, I thought of changing the abstract class definition itself, where I presumed a default of None would keep things safe.
I did try inheritance (which I am already using) but went back to this design for a reason I can't recall.

As for defining multiple classes for different fidelity spaces: sure, that is an alternative solution. However, I felt it makes the code more verbose, and it might also make things more brittle for code using these classes. Hence, I went with the parameterization approach (sketched below).

In any case, these are probably design choices more than functionality, so we just need to come to an agreement w.r.t. the scope of the package. Happy to discuss!
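
For comparison, a minimal sketch of that parameterization, reusing the BaseMLBenchmark name from the snippet above; the fidelity names and bounds are made up for illustration:

    import ConfigSpace as CS

    class ParameterizedFidelityBenchmark(BaseMLBenchmark):
        # One class serves several fidelity spaces; fidelity_choice=None falls
        # back to the default space, so existing callers are unaffected.
        @staticmethod
        def get_fidelity_space(seed=None, fidelity_choice=None):
            cs = CS.ConfigurationSpace(seed=seed)
            if fidelity_choice is None or fidelity_choice == 'iterations':
                cs.add_hyperparameter(
                    CS.UniformIntegerHyperparameter('n_estimators', lower=16, upper=512))
            elif fidelity_choice == 'subsample':
                cs.add_hyperparameter(
                    CS.UniformFloatHyperparameter('dataset_fraction', lower=0.1, upper=1.0))
            elif fidelity_choice == 'both':  # joint, "multi-multi-fidelity" space
                cs.add_hyperparameters([
                    CS.UniformIntegerHyperparameter('n_estimators', lower=16, upper=512),
                    CS.UniformFloatHyperparameter('dataset_fraction', lower=0.1, upper=1.0),
                ])
            else:
                raise ValueError(f'Unknown fidelity_choice: {fidelity_choice}')
            return cs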

@Neeratyoy
Collaborator Author

@PhMueller shall we close this?

@Neeratyoy
Collaborator Author

Deprecated.
Subsumed in #121.

@Neeratyoy closed this Nov 5, 2021