Make it easier to run studies from git checkouts #235

mikix · 2024-05-10T20:05:35Z

When loading a builder, we add the study dir's parent folder into sys.path, so that builders can import helper code. This is basically equivalent to doing PYTHONPATH=. from a checkout.
When searching for the study dir, and we find a manifest.toml, actually read the manifest to find the study_prefix instead of using the directory name for the study name. This allows more natural python package layouts (rather than module_name/study_name, you can just do module_name)

This also bumps ruff's version to 0.4.4 and has the commit hook just auto-commit any formatting changes.

Checklist

Consider if documentation (like in docs/) needs to be updated
Consider if tests should be added
Update template repo if there are changes to study configuration

mikix · 2024-05-10T20:11:31Z

cumulus_library/template_sql/sql_utils.py

-class BaseConfig(abc.ABC):
-    """Abstract ase class for handling table detection/denormalization"""
+class BaseConfig:
+    """Base class for handling table detection/denormalization"""


ruff complained about this - there are no abstract methods or properties now that this is a simple dataclass.

mikix · 2024-05-10T20:13:15Z

pyproject.toml

-    "ruff == 0.2.1",
+    "ruff",


I chose to unpin this here, maybe I shouldn't have.

My thinking was:

The commit hook has its own checkout of ruff, so the developer's version could diverge, though they may see different errors if they manually run ruff.

But I liked that oddity more than I liked developers having to manage different packages wanting different ruff versions.

Thoughts?

Hmm, though then what version should the CI install...? Should that be hardcoded in the CI to match the hook? Or should we hardcode the version here in the pyproject.toml... hmmm.... Maybe I should leave this alone. But I don't love it.

I started doing explicit pins on these in response to a SQLFluff linting error that caused malformed code in some cases. I think the astral folks are less likely to make the same mistake, but i sort of like 'a human has decided to increment the version on the autoformatting tools'. could be talked out of this.

mikix · 2024-05-10T20:13:45Z

pyproject.toml

-[tool.ruff]
-target-version = "py310"


Looking at the docs for this, ruff will look at requires-python and actually recommends just setting that instead of this.

- When loading a builder, we add the study dir's parent folder into sys.path, so that builders can import helper code. This is basically equivalent to doing PYTHONPATH=. from a checkout. - When searching for the study dir, and we find a manifest.toml, actually read the manifest to find the study_prefix instead of using the directory name for the study name. This allows more natural python package layouts (rather than `module_name/study_name`, you can just do `module_name`) This also bumps ruff's version to 0.4.4 and has the commit hook just auto-commit any formatting changes.

mikix · 2024-05-10T20:32:07Z

cumulus_library/cli.py

-            manifest_paths[path.name] = path
+            try:
+                manifest = study_parser.StudyManifestParser(path)
+                manifest_paths[manifest.get_study_prefix()] = path
+            except errors.StudyManifestParsingError as exc:
+                rich.print(f"[bold red] Ignoring study in '{path}': {exc}")


I did not offer any kind of compatibility / migration for the old way of doing things. Like I could have also added the directory name into manifest_paths.

But... I figured any existing studies (outside of some of our test data studies) would have realistically needed to have the study_prefix match the directory already. If you didn't have that setup, that meant that your --target name didn't match your study_prefix, and that seems like something we don't really want to encourage / support?

mikix · 2024-05-10T20:37:54Z

tests/test_data/study_python_local_template/module1.py

+from study_python_local_template import local_template
+
 from cumulus_library.base_table_builder import BaseTableBuilder
-from tests.test_data.study_python_local_template import local_template


This was done to test the new sys.path injection

mikix · 2024-05-10T20:45:04Z

I didn't see any changes needed in the template repo, mostly because (1) it doesn't include any builders that try to import anything, (2) it doesn't use the two-level directory setup that some released studies like covid have had to use, and (3) its instructions talk about renaming the study folder already to match a python module name and to also rename study_prefix.

So I think it should be fine both before and after this change, but your naming can be more flexible after this PR

dogversioning · 2024-05-13T12:34:06Z

I didn't see any changes needed in the template repo, mostly because (1) it doesn't include any builders that try to import anything, (2) it doesn't use the two-level directory setup that some released studies like covid have had to use, and (3) its instructions talk about renaming the study folder already to match a python module name and to also rename study_prefix.

So I think it should be fine both before and after this change, but your naming can be more flexible after this PR

We should maybe chat about whether maintaining the template repo even makes sense - The example config file in the docs might be the 'right' way to telegraph how to make studies at this point.

mikix force-pushed the mikix/study-search branch from 34cf6ca to b8258f8 Compare May 10, 2024 20:10

mikix commented May 10, 2024

View reviewed changes

mikix force-pushed the mikix/study-search branch from b8258f8 to b05c1dc Compare May 10, 2024 20:16

mikix force-pushed the mikix/study-search branch from b05c1dc to 45d712f Compare May 10, 2024 20:21

mikix commented May 10, 2024

View reviewed changes

dogversioning approved these changes May 13, 2024

View reviewed changes

mikix merged commit 1932655 into main May 13, 2024
3 checks passed

mikix deleted the mikix/study-search branch May 13, 2024 12:37

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Make it easier to run studies from git checkouts #235

Make it easier to run studies from git checkouts #235

mikix commented May 10, 2024 •

edited

Loading

mikix May 10, 2024

mikix May 10, 2024

mikix May 10, 2024

dogversioning May 13, 2024

mikix May 10, 2024

mikix May 10, 2024

mikix May 10, 2024

mikix commented May 10, 2024 •

edited

Loading

dogversioning commented May 13, 2024

		[tool.ruff]
		target-version = "py310"

Make it easier to run studies from git checkouts #235

Make it easier to run studies from git checkouts #235

Conversation

mikix commented May 10, 2024 • edited Loading

Checklist

mikix May 10, 2024

Choose a reason for hiding this comment

mikix May 10, 2024

Choose a reason for hiding this comment

mikix May 10, 2024

Choose a reason for hiding this comment

dogversioning May 13, 2024

Choose a reason for hiding this comment

mikix May 10, 2024

Choose a reason for hiding this comment

mikix May 10, 2024

Choose a reason for hiding this comment

mikix May 10, 2024

Choose a reason for hiding this comment

mikix commented May 10, 2024 • edited Loading

dogversioning commented May 13, 2024

mikix commented May 10, 2024 •

edited

Loading

mikix commented May 10, 2024 •

edited

Loading