
meeting 2024 11 07

Kenneth Hoste edited this page Nov 7, 2024 · 1 revision

Notes for 2024-11-07 meeting

  • date & time: Thu 7 Nov 2024 - 14:00 CET (13:00 UTC)
    • (every first Thursday of the month)
  • venue: (online, see mail for meeting link, or ask in Slack)
  • agenda:
    • Quick introduction by new people
    • Progress update per EESSI layer
    • Update on EESSI production repository software.eessi.io
    • Modulefile for initializing the EESSI stack
    • Update on EESSI documentation + test suite + build-and-deploy bot
    • AWS/Azure sponsorship update
    • Upcoming/recent events
    • Q&A

Slides

Meeting notes

(by Bob/Kenneth)

Quick introduction by new people

  • (none)

Progress update per EESSI layer

Filesystem layer

(see slides)

  • monitoring of CVMFS network of servers
    • we should share more info on this in the next meeting

Compatibility layer

(see slides)

  • compat layer for EESSI version
    • likely to happen early 2025
    • another good reason is the significant performance improvements in the latest glibc versions, in particular for Arm CPUs

Software layer

(see slides)

  • logistics of GPU builds are a concern, being looked into
    • one option is adopting service accounts on on-premise systems, where the cost of using GPUs is less of a problem
    • this is actively being looked into at HPC-UGent, Snellius @ SURF, and EuroHPC system Deucalion
  • Q: Can additional CUDA software installations for other GPU types be done easily via the EESSI-extend mechanism?
    • yes, to some extent
    • easier if you're willing to also install CUDA, cuDNN, etc. yourself
    • the installation prefix used by EESSI-extend is currently not GPU-aware; EESSI-extend would need to be made GPU-aware
    • complication here is that our way of splitting CPU/GPU builds into separate installation prefixes won't easily work for EESSI-extend
    • can the EasyBuild hook be enhanced to automatically rewrite the installation prefix based on whether or not it's CUDA software?
    • should we also be worried about the GPU/CUDA driver version?
  • dev.eessi.io
    • Worth looking into cloud access in Jülich to have a MC cluster there for dev.eessi.io
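
On the EESSI-extend question above (rewriting the installation prefix depending on whether something is CUDA software), here is a minimal sketch of what the decision logic could look like. It is plain Python and not tied to the actual EasyBuild hooks API or to the EESSI-extend module. The function names, the set of dependency names, and the `gpu_subdir` layout are all hypothetical and only serve as illustration.

```python
from pathlib import PurePosixPath

# Assumption: dependency names that mark an installation as GPU/CUDA software.
# This set is illustrative only, it is not taken from EESSI or EasyBuild.
CUDA_DEP_NAMES = {"CUDA", "cuDNN", "NCCL"}


def is_cuda_software(name, dependency_names):
    """Return True if the software itself or any of its dependencies is in CUDA_DEP_NAMES."""
    return name in CUDA_DEP_NAMES or any(dep in CUDA_DEP_NAMES for dep in dependency_names)


def select_install_prefix(base_prefix, name, dependency_names, gpu_subdir):
    """Return the CPU-only prefix, or a GPU-specific subdirectory of it for CUDA software.

    base_prefix: installation prefix that would be used without GPU awareness
    gpu_subdir:  hypothetical relative path identifying the GPU target
    """
    base = PurePosixPath(base_prefix)
    if is_cuda_software(name, dependency_names):
        return str(base / gpu_subdir)
    return str(base)


if __name__ == "__main__":
    # Hypothetical example values, chosen only to show both branches.
    print(select_install_prefix("/tmp/eessi-extend", "GROMACS", ["CUDA", "OpenMPI"], "accel/nvidia/cc80"))
    print(select_install_prefix("/tmp/eessi-extend", "zlib", [], "accel/nvidia/cc80"))
```

A real hook would additionally need to obtain the dependency list from the easyconfig being installed and to determine the GPU target. The question of the GPU/CUDA driver version is not covered by this sketch.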

software.eessi.io repository

(see slides)

  • ...

Modulefile for initializing the EESSI stack

(see slides)

  • ...

Build-and-deploy bot

(see slides)

  • also an open PR (#283) to allow specifying that the script that should be used is located in a different repository

EESSI documentation

(see slides)

EESSI test suite

(see slides)

  • next time we should report more extensively on the dashboard for the EESSI test suite

AWS/Azure sponsored credits

(see slides)

  • ...

Events

(see slides)

  • ...

Frequency of EESSI update meeting

  • ...

Q&A
