
Updated Tuner #26

Merged: 31 commits into dev from fix/tuner, Jun 13, 2024
Conversation

klemen1999 (Collaborator) commented:
  • Updated Optuna requirements to a newer version compatible with pytorch-lightning>=2.0
  • Added a continue_existing_study: bool = True parameter to the config. If True (the default), the Tuner reuses an existing study with the given name; if the name already exists and this is set to False, it throws an error.
  • Added logging to the Tuner that reports the best study parameters at the end
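The continue_existing_study behavior can be sketched roughly as follows. This is an illustrative stand-in with a simple in-memory registry, not the actual Tuner code; in practice the Tuner would delegate study creation to Optuna (whose create_study accepts a load_if_exists flag for the same purpose):

```python
# Hypothetical sketch of the continue_existing_study logic; names are
# illustrative, and the dict below stands in for Optuna's storage backend.
_existing_studies: dict[str, dict] = {}

def get_or_create_study(name: str, continue_existing_study: bool = True) -> dict:
    if name in _existing_studies:
        if not continue_existing_study:
            # Mirrors the new config option: refuse to silently reuse a study.
            raise ValueError(f"Study '{name}' already exists")
        return _existing_studies[name]  # resume the existing study
    study = {"name": name, "trials": []}
    _existing_studies[name] = study
    return study
```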

github-actions bot commented May 13, 2024

Test Results

6 files, 6 suites, 37m 35s ⏱️
57 tests: 25 ✅ 25 💤 7 ❌
342 runs: 178 ✅ 150 💤 14 ❌

For more details on these failures, see this check.

Results for commit 88b5eab.

♻️ This comment has been updated with latest results.

@tersekmatija (Collaborator) left a comment:

Let's see if we can nest better than recursively in MLFlow, otherwise looks good.

@kozlov721 kozlov721 added the fix Fixing a bug label May 15, 2024
@kozlov721 (Collaborator) left a comment:

LGTM

@klemen1999 (Collaborator, Author) commented:

Tuning runs are now nested together under a common parent experiment and can be compared more easily in MLflow. The parent tracker is only used to report the hyperparameters that are currently the best in the whole study. CC: @tersekmatija
(screenshots: nested tuning runs in the MLflow UI)

```python
    **tracker_params,
)
if self.parent_tracker.is_mlflow:
    run = self.parent_tracker.experiment["mlflow"].active_run()
```
Collaborator commented:

Will parent run remain active long enough for this to not cause any issues?

Collaborator Author replied:

Yes, this run stays active until the very end, when the whole study finishes. Basically, every trial in the study is its own training run (as if we ran luxonis_train train ...), and every trial gets its own tracker. To nest them together we need one top-level tracker, the parent tracker. In the backend, MLflow keeps a stack of all active runs, and when nest=True it binds a new run to the previous run on the stack. So this top-level tracker is only used for binding all trials together and for logging the best hyperparameters in the whole study: for ease of use, to quickly get the results of the study.
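The stack behavior described above can be mimicked in a few lines. This is a toy mock of MLflow's active-run stack (not MLflow's actual implementation): starting a run with nested=True attaches it to whichever run is currently on top of the stack, which is why the parent tracker must stay active for the whole study:

```python
# Toy model of MLflow's active-run stack; all names are illustrative.
_run_stack: list[dict] = []

def start_run(name: str, nested: bool = False) -> dict:
    # A nested run binds to the run currently on top of the stack, if any.
    parent = _run_stack[-1] if (nested and _run_stack) else None
    run = {"name": name, "parent": parent["name"] if parent else None}
    _run_stack.append(run)
    return run

def end_run() -> None:
    _run_stack.pop()

parent_run = start_run("study")              # the Tuner's parent tracker
trial_0 = start_run("trial_0", nested=True)  # binds to "study"
end_run()                                    # trial ends; "study" stays active
trial_1 = start_run("trial_1", nested=True)  # also binds to "study"
```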

```python
)
if self.parent_tracker.is_mlflow:
    run = self.parent_tracker.experiment["mlflow"].active_run()
    self.parent_run_id = run.info.run_id
```
Collaborator commented:

Is this used somewhere? What is this set to if MLFlow is not used?

Collaborator Author replied:

If MLflow is not used, this first tracker will still report the best hyperparameters, but nesting won't be done because it is not supported by e.g. Tensorboard. So every trial will have its own Tensorboard logs with no connection between them. That's why MLflow is recommended when tuning.
The whole part in this if block seems like it is not actually needed, but if I remove it then nesting is not done for some reason. Looking into it, and I'll update you.

Collaborator replied:

Ok, sounds good, not critical. I think it looks good; maybe we just add a comment in the code noting this is required for MLflow to work correctly, then we can merge.

Collaborator Author replied:

I figured it out: the way the tracker is written, it creates the actual MLflow run only once it is interacted with. Since in our case we need the parent tracker to be created first (to then nest all children runs under it), we need to keep this line in the code. I added a note to the code.
CC: @kozlov721 I think we can merge now

Collaborator replied:

Is this specific to "mlflow" then, or should this be present for other underlying trackers as well? @klemen1999

@klemen1999 (Collaborator Author) commented on May 20, 2024:

It's the case for all trackers. LuxonisTracker is structured very similarly to the other default PL loggers (e.g. MLFlowLogger), where the actual objects are created when the .experiment property is first called. This ensures that LuxonisTracker integrates nicely with PL, but we have to be mindful of it when using the tracker like this. Although I don't think there will be many cases where one would create just a placeholder tracker, like we are doing for this specific case, so it shouldn't cause too many problems.
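The lazy-creation pattern described above, shared with PL's built-in loggers, looks roughly like this. The class and attribute names are illustrative, not the real LuxonisTracker internals:

```python
# Sketch of the lazy `.experiment` pattern used by Lightning-style loggers:
# the backing run/client is only created on first property access, which is
# why the Tuner has to touch `.experiment` to force the parent run to exist.
class LazyTracker:
    def __init__(self, run_name: str):
        self.run_name = run_name
        self._experiment = None  # nothing created yet

    @property
    def experiment(self):
        if self._experiment is None:
            # In a real tracker this would create the MLflow run, etc.
            self._experiment = {"run_name": self.run_name, "active": True}
        return self._experiment

tracker = LazyTracker("parent")
created_early = tracker._experiment is not None  # False: still a placeholder
_ = tracker.experiment                           # first access forces creation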

Collaborator replied:

So should we always call this, instead of only on:

```python
if self.parent_tracker.is_mlflow:
```

Collaborator Author replied:

MLflow is the only tracker this nesting applies to, which is why I'm doing it only for MLflow.

Collaborator replied:

Sounds good, I think we can proceed with the merge in this case.

@kozlov721 kozlov721 changed the title Fix/tuner Updated Tuner May 16, 2024
@klemen1999 (Collaborator, Author) commented:

I've added a graceful stop for LuxonisTrackerPL via the finalize() function, which is automatically called when the PL process finishes. This should remove the need for an explicit mlflow.end_run in #30. @kozlov721
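A minimal sketch of what such a hook can look like, modeled on Lightning's Logger.finalize(status) interface (which PL calls when training ends); the class and attribute names here are hypothetical, not the real LuxonisTrackerPL:

```python
# Illustrative graceful-stop hook in the shape of PL's Logger.finalize(status).
class TrackerSketch:
    def __init__(self):
        self.run_active = True
        self.final_status = None

    def finalize(self, status: str = "success") -> None:
        # Idempotent: only the first call closes the run, so a later explicit
        # end-run (like mlflow.end_run in #30) becomes unnecessary.
        if self.run_active:
            self.run_active = False
            self.final_status = status

tracker = TrackerSketch()
tracker.finalize("success")
```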

github-actions bot commented May 23, 2024

☂️ Python Coverage

current status: ✅

Overall Coverage

| Lines | Covered | Coverage | Threshold | Status |
|-------|---------|----------|-----------|--------|
| 4923  | 3787    | 77%      | 0%        | 🟢     |

New Files

No new covered files...

Modified Files

| File                           | Coverage | Status |
|--------------------------------|----------|--------|
| luxonis_train/core/tuner.py    | 81%      | 🟢     |
| luxonis_train/utils/config.py  | 95%      | 🟢     |
| luxonis_train/utils/tracker.py | 50%      | 🟢     |
| TOTAL                          | 76%      | 🟢     |

updated for commit: 6a9bceb by action🐍

CaptainTrojan and others added 20 commits June 2, 2024 00:27
…put_sources, which tells the LuxonisModel which loader sub-elements it wants to load, and LoaderConfig has an images_name parameter which identifies the image-like input among the sub-elements for compatibility with visualizers etc.
…to code structure requirements. Added missing tests for evaluation, export, and inference.
@kozlov721 kozlov721 merged commit bb9b01d into dev Jun 13, 2024
6 checks passed
@kozlov721 kozlov721 deleted the fix/tuner branch June 13, 2024 03:15
@kozlov721 kozlov721 mentioned this pull request Jul 30, 2024
kozlov721 added a commit that referenced this pull request Oct 9, 2024
Co-authored-by: Martin Kozlovsky <[email protected]>
Co-authored-by: Michal Sejak <[email protected]>
Co-authored-by: GitHub Actions <[email protected]>
Labels: fix (Fixing a bug)

6 participants