Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add (optional) Carbon Emissions entry #10

Open
robknapen opened this issue May 4, 2023 · 10 comments
Open

Add (optional) Carbon Emissions entry #10

robknapen opened this issue May 4, 2023 · 10 comments
Labels
enhancement New feature or request

Comments

@robknapen
Copy link

robknapen commented May 4, 2023

Hugging Face has an interesting option in their Model Cards to add info on carbon emissions, see:
https://huggingface.co/docs/hub/model-cards-co2

It is based on this paper: https://arxiv.org/abs/1906.02243 and https://mlco2.github.io/impact/

Perhaps something to include as well?

@robknapen robknapen added enhancement New feature or request a/p resource metadata metadata for analysis/processing resource and removed a/p resource metadata metadata for analysis/processing resource labels May 4, 2023
@KathiSchleidt
Copy link
Member

Cool idea, my only worry is how we can calculate the carbon emissions, as this is not only dependent on the amount of processing resources, but also where the processing takes place.

@Schpidi @pebau : are either of you aware of such carbon intensity metrics on the HW you provide? "How much CO2 is produced by KwH of electricity"

@robknapen
Copy link
Author

It's a best estimate, for sure.

In the calculator you can specify which cloud provider or on-prem infra, and which region. With more work we could customise the calculations I think.

https://github.com/Green-Software-Foundation/sci/blob/main/Software_Carbon_Intensity/Software_Carbon_Intensity_Specification.md

On Linux the perf tool can be used to get energy usage of a process. And cloud providers have also started to provide tools, e.g. https://aws.amazon.com/aws-cost-management/aws-customer-carbon-footprint-tool/.

@cozzolinoac11
Copy link
Member

I think this is a great idea. We at EPSIT agree (should anyone be against this addition please comment here asap).
Should we all agree on doing this addition, we should also agree on how to document the use and purpose of the new field in the form. As Rob suggested regarding the emission value calculation, we can use ML CO2 Impact and the tools that some providers make available, at least in these early stages. But how do we explain to the user of the form? Let’s not forget that we are creating a STAC extension to be proposed to the STAC community so it is important that we document and justify in as much details as possible.
@robknapen suggestions more than welcomed 😉

@robknapen
Copy link
Author

For simplicity I would just (more or less) copy the explanation that Hugging Face gives, it is quite understandable.

I would expect the estimates at first to be about orders of magnitude and for creating awareness, perhaps later users would use them to compare models when selecting between alternatives and compute locations (specifically for ML training).

For the infrastructures that FAIRiCUBE controls we maybe can provide more accurate estimates and fill them automatically?

@pebau
Copy link

pebau commented May 4, 2023

we can happily add such information to any metadata providers want to make available (see discussion on metadata structure), but we will not invest own activities into this.

@KathiSchleidt
Copy link
Member

@pebau does this confirm that you can provide CO2 is produced by KwH of electricity for your servers, in addition, KwH per some processing metric? This is what we'd need from the infrastructure providers to enable this

@pebau
Copy link

pebau commented May 4, 2023

@KathiSchleidt no, nothing can be done on our side here.

@KathiSchleidt
Copy link
Member

@Schpidi what's the status from EOX on this issue, can you provide such info on used resources?

@Schpidi
Copy link
Member

Schpidi commented May 5, 2023

EOxHub is running on NILU resources so maybe @jetschny can derive some infos but I'm not aware this information is provided by AWS. would need investigation

@KathiSchleidt
Copy link
Member

To my understanding, the static web page is not what eats resources, it's the processing done on AWS, so please investigate!

I also see this as a nice trigger towards AWS to start thinking about such issues!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

5 participants