Better support for loading a consistent python module #4059

billsacks · 2021-08-02T19:25:34Z

It is currently easy to inadvertently use different python versions for different parts of a single CIME command. When you run a CIME command, it starts with whatever python3 version it finds in your path. However, at some point it may call load_env (e.g., in the course of doing a build). At that point your module environment is reset and, depending on what is defined in this machine's section of config_machines.xml, the same or a different python module may be loaded, or no python module is loaded and the machine default is used. From that point forward, any subprocess call that invokes python (notably calls to components' buildlib commands, but possibly others) will use this possibly different version of python. As long as the two python versions are close enough, the user probably won't notice and there will be no ill effects. But this behavior opens the door to some really subtle issues.
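To make the mismatch concrete, here is a minimal sketch of how one could compare the interpreter running the current script with the interpreter a subprocess would pick up. This is not CIME code; the helper names `interpreter_version` and `versions_match` are hypothetical, and the sketch assumes child processes find python via a plain `python3` lookup on `PATH`.

```python
import shutil
import subprocess
import sys


def interpreter_version(python_exe):
    """Ask the given interpreter for its (major, minor, micro) version."""
    out = subprocess.run(
        [python_exe, "-c",
         "import sys; print('.'.join(map(str, sys.version_info[:3])))"],
        check=True, capture_output=True, text=True,
    ).stdout.strip()
    return tuple(int(part) for part in out.split("."))


def versions_match(child_exe):
    """True if child_exe reports the same version as the running interpreter."""
    return interpreter_version(child_exe) == tuple(sys.version_info[:3])


if __name__ == "__main__":
    # Hypothetical check: what would a subprocess calling "python3" get?
    child = shutil.which("python3")
    if child is None:
        print("no python3 found on PATH")
    elif not versions_match(child):
        print("warning: subprocesses would use a different python:", child)
```

Running a check like this before and after the module environment is reset would show whether the two stages of a command see the same python.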

The idea @mnlevy1981 raised at a CSEG meeting in 2017 (but I think was never recorded in a CIME issue) is: Have the cime scripts initially determine the desired python module, load that, then do everything else in a python subprocess. This strategy would ensure that you get a well-defined, consistent operation of the cime scripts across all users of a given machine: no need for them to load a python module or virtual environment before running the cime scripts. This would be particularly helpful as we add support for 3rd party python packages like yaml and netcdf. For better or for worse (probably mostly for better, but there could be some downsides), that would mean that your python environment is in the hands of CIME's config_machines, and not in your own hands.
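A rough sketch of the "do everything else in a python subprocess" part of that idea, under stated assumptions: the desired interpreter path has already been determined (the module-loading step is machine-specific and omitted here), and a guard variable, hypothetically named `CIME_PYTHON_PINNED`, prevents the child from re-launching itself. None of these names exist in CIME.

```python
import os
import subprocess
import sys

# Hypothetical guard variable; not a real CIME setting.
GUARD = "CIME_PYTHON_PINNED"


def run_under_desired_python(desired_python, argv, environ):
    """Re-run argv under desired_python unless we are already the child.

    Returns the child's exit code, or None if the guard shows that this
    process is already running under the pinned interpreter.
    """
    if environ.get(GUARD) == "1":
        return None
    child_env = dict(environ)
    child_env[GUARD] = "1"
    completed = subprocess.run([desired_python] + list(argv), env=child_env)
    return completed.returncode


if __name__ == "__main__":
    # For illustration only: "pin" to the interpreter we are already using.
    code = run_under_desired_python(sys.executable, ["-c", "print('child')"], os.environ)
    print("exit code:", code)
```

The guard is the important design point: without it, a script that re-launches itself at startup would recurse indefinitely.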

If that strategy doesn't work or is too difficult to implement, then an alternative strategy that would address the problem in the first paragraph, but not the broader issues addressed by @mnlevy1981's suggestion, would be to somehow maintain the currently-loaded python environment across calls to load_env. I'm not sure how feasible that is, though. (Could you query the currently-loaded python module before resetting the module environment, then reload that python module afterwards?)
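As a sketch of the "query, then reload" question, the function below picks the python module out of a one-name-per-line listing of loaded modules. It assumes such a listing is available as text (for example from a terse `module list`, whose exact output format varies between module systems) and that the module is named `python/<version>`; both are assumptions, and `find_python_module` is a hypothetical helper.

```python
def find_python_module(terse_listing):
    """Return the first loaded module named python or python/<version>, else None."""
    for line in terse_listing.splitlines():
        tokens = line.split()
        # Skip blank lines and header lines such as "Currently Loaded Modulefiles:"
        if not tokens or line.rstrip().endswith(":"):
            continue
        name = tokens[0]
        if name.split("/")[0].lower() == "python":
            return name
    return None


if __name__ == "__main__":
    listing = "ncarenv/1.3\npython/3.7.9\n"
    print(find_python_module(listing))
```

The returned name could then be passed back to the module system after the reset. A python provided by some other module (for example a conda module) would not be found by this name-based check.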

Another alternative would be to move towards requiring or strongly encouraging the use of python virtual environments. At least on cheyenne, if you have already loaded a python virtual environment then it seems that doing a module reset doesn't impact the python version you're using. In the past, there has been some (reasonable, in my mind) resistance to requiring users to load a python virtual environment before running cime scripts, but if it's too difficult to ensure a consistent python environment in other ways, we may want to reconsider this.
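If virtual environments were required or strongly encouraged, a script could check for one at startup. A minimal sketch, assuming a standard `venv`-style environment (where `sys.prefix` differs from `sys.base_prefix`) or an activated conda environment (where `CONDA_PREFIX` is set); the function names are hypothetical:

```python
import os
import sys


def in_venv():
    """True when running inside a venv/virtualenv-style environment."""
    return sys.prefix != getattr(sys, "base_prefix", sys.prefix)


def in_conda_env(environ=None):
    """True when an activated conda environment is visible in the environment."""
    environ = os.environ if environ is None else environ
    return bool(environ.get("CONDA_PREFIX"))


if __name__ == "__main__":
    if not (in_venv() or in_conda_env()):
        print("warning: no python virtual environment appears to be active")
```

A warning rather than a hard error would match the "strongly encouraging" option; exiting with an error would match "requiring".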


billsacks · 2021-11-03T16:46:25Z

As mentioned in ESCOMP/CESM#188 (comment), it might make sense to wait to resolve this until after the CIME7 reorganization (#3886), as long as that isn't too far out.

billsacks · 2021-12-14T22:34:31Z

From discussion at last week's cime meeting:

- We will support two methods for ensuring you have a working python environment for CESM/CIME: (1) containers, and (2) manually loading the appropriate python environment on your system prior to running any cime scripts (via a conda or pip environment).
  - Note that, for the sake of (2), we will soon remove the module loads of python on our machines, since this breaks the ability to control your own python environment.
- We will not spend time trying to work out how we would support a separate pip/conda installation of cime: cime's relationship to the rest of the model will remain as it currently is (just some inline code).

From discussion at today's cseg meeting, this strategy proposed in 2017 no longer feels like the right thing to do:

Have the cime scripts initially determine the desired python module, load that, then do everything else in a python subprocess.

Since our proposed solution involves changes to the workflow but not really to cime itself, I am going to close this as a wontfix.

billsacks added ty: enhancement tp: CIMElib labels Aug 2, 2021

billsacks mentioned this issue Oct 27, 2021

Add python module load command for izumi #4116

Closed

billsacks added this to CESM: infrastructure / cross-component SE priorities Nov 3, 2021

billsacks moved this to Needs prioritization in CESM: infrastructure / cross-component SE priorities Nov 3, 2021

billsacks mentioned this issue Nov 3, 2021

Add more metadata on compset-grid compatibility ESCOMP/CESM#188

Open

billsacks closed this as completed Dec 14, 2021

Repository owner moved this from Needs Prioritization to Done in CESM: infrastructure / cross-component SE priorities Dec 14, 2021

billsacks added the st: wontfix label Dec 14, 2021

billsacks mentioned this issue Aug 29, 2023

conda fails to load for SystemTests ESCOMP/CTSM#2111

Closed
