Micro2023-MPMXU-expr

Microbenchmark kernels

Clone the CUTLASS repository

$ git clone https://github.com/NVIDIA/cutlass.git

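Follow the CUTLASS README steps to configure the build. Run these commands from inside the cloned cutlass directory; CUTLASS_NVCC_ARCHS=80 targets compute capability 8.0 (Ampere).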

$ export CUDACXX=${CUDA_INSTALL_PATH}/bin/nvcc
$ mkdir build && cd build
$ cmake .. -DCUTLASS_NVCC_ARCHS=80

Copy the profiler build scripts into the build directory (run from the directory that contains both cutlass_profile_cmds and cutlass)

$ cp cutlass_profile_cmds/* cutlass/build/

Build the CUTLASS profiler with the targeted kernels

$ cd cutlass/build/
$ chmod a+x build_profile
$ chmod a+x run_profile.sh
$ ./build_profile
$ make cutlass_profiler -j16

Run the microbenchmark kernels

$ ./run_profile.sh

Adjust Device clock frequency

Set "coolbits" to 28 (other values may work but have not been tested)

$ sudo nvidia-xconfig --cool-bits=28

Reboot the machine if the following step does not work.

Enable persistence mode (required for adjusting the memory and graphics clocks)

$ sudo nvidia-smi -pm 1

List all supported memory clock and graphics (SM) clock pairs

$ nvidia-smi -q -d SUPPORTED_CLOCKS

Note: A typical device supports only a few memory clock rates, while each memory clock rate supports a wide range of graphics/SM clock rates.

        Memory                            : 9501 MHz
            Graphics                      : 2100 MHz
            Graphics                      : 2085 MHz
            Graphics                      : 2070 MHz
        ...
        Memory                            : 9251 MHz
            Graphics                      : 2100 MHz
            Graphics                      : 2085 MHz
            Graphics                      : 2070 MHz
        ...

Parse the output into a more readable format

$ nvidia-smi -q -d SUPPORTED_CLOCKS | python3 scripts/get_all_clk_mode.py
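For readers who want to see what such parsing involves, here is a minimal Python sketch. It is not the repository's scripts/get_all_clk_mode.py, whose contents are not shown here. The function name parse_supported_clocks and the SAMPLE text are hypothetical; SAMPLE is the truncated listing above with the "..." lines dropped, so the ranges it prints are illustrative only.

```python
import re

# Matches lines such as "Memory : 9501 MHz" or "Graphics : 2100 MHz".
CLOCK_LINE = re.compile(r"^\s*(Memory|Graphics)\s*:\s*(\d+)\s*MHz\s*$")


def parse_supported_clocks(text):
    """Map each memory clock (MHz) to the graphics clocks (MHz) listed under it."""
    table = {}
    current_mem = None
    for line in text.splitlines():
        match = CLOCK_LINE.match(line)
        if match is None:
            continue  # skip headers, "..." and any other unrelated lines
        kind, mhz = match.group(1), int(match.group(2))
        if kind == "Memory":
            current_mem = mhz
            table.setdefault(current_mem, [])
        elif current_mem is not None:
            table[current_mem].append(mhz)
    return table


# Hypothetical sample: the truncated excerpt from this README, not a full listing.
SAMPLE = """\
        Memory                            : 9501 MHz
            Graphics                      : 2100 MHz
            Graphics                      : 2085 MHz
            Graphics                      : 2070 MHz
        Memory                            : 9251 MHz
            Graphics                      : 2100 MHz
            Graphics                      : 2085 MHz
            Graphics                      : 2070 MHz
"""

for mem, graphics in parse_supported_clocks(SAMPLE).items():
    if graphics:
        print(f"mem {mem} MHz: gr {min(graphics)}-{max(graphics)} MHz "
              f"({len(graphics)} steps)")
```

To use it on a real device, replace SAMPLE with the captured output of the nvidia-smi command above.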

Adjust graphics clock:

$ sudo nvidia-smi -lgc [MINCLK],[MAXCLK]

Adjust memory clock:

$ sudo nvidia-smi -lmc [MINCLK],[MAXCLK]

Note: Use the actual minimum supported clock as MINCLK.
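The MINCLK/MAXCLK choice can be sketched in Python as follows. The helper lock_clock_commands and the supported table are hypothetical. The table reuses the truncated excerpt shown earlier, so its minimum values are not a real device's minimums. The sketch only builds the command lines and does not execute them.

```python
def lock_clock_commands(supported, mem_mhz, gr_mhz):
    """Build nvidia-smi argument lists that cap memory and graphics clocks.

    supported maps a memory clock (MHz) to the graphics clocks (MHz) listed
    under it, shaped like the SUPPORTED_CLOCKS listing (hypothetical input).
    """
    if mem_mhz not in supported:
        raise ValueError(f"unsupported memory clock: {mem_mhz} MHz")
    graphics = supported[mem_mhz]
    if gr_mhz not in graphics:
        raise ValueError(f"unsupported graphics clock: {gr_mhz} MHz")
    min_mem = min(supported)  # lowest memory clock the device lists
    min_gr = min(graphics)    # lowest graphics clock for this memory clock
    return [
        ["sudo", "nvidia-smi", "-lmc", f"{min_mem},{mem_mhz}"],
        ["sudo", "nvidia-smi", "-lgc", f"{min_gr},{gr_mhz}"],
    ]


# Illustrative table only; a real device lists many more graphics clocks.
supported = {
    9501: [2100, 2085, 2070],
    9251: [2100, 2085, 2070],
}

for argv in lock_clock_commands(supported, mem_mhz=9501, gr_mhz=2085):
    print(" ".join(argv))
```

Returning argument lists rather than running them keeps the sketch side-effect free; they could be passed to subprocess.run once the values have been checked against the device's real listing.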

Monitor device clock rate:

$ watch -n 1 nvidia-smi -q -d CLOCK
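For scripted checks, a machine-readable query may be easier to consume than the watch output. The sketch below is an alternative to the command above, not part of this repository. It assumes the nvidia-smi --query-gpu fields clocks.sm and clocks.mem with --format=csv,noheader,nounits. The helper name parse_clock_rows and the sample line are hypothetical, and the sample values are illustrative rather than measured.

```python
import csv
import io

# Assumed query: one CSV row per GPU, values in MHz without units.
QUERY_CMD = [
    "nvidia-smi",
    "--query-gpu=clocks.sm,clocks.mem",
    "--format=csv,noheader,nounits",
]


def parse_clock_rows(csv_text):
    """Parse 'sm, mem' CSV rows into one dict of integer MHz values per GPU."""
    rows = []
    for record in csv.reader(io.StringIO(csv_text), skipinitialspace=True):
        if not record:
            continue  # ignore blank lines
        sm, mem = (int(value.strip()) for value in record)
        rows.append({"sm_mhz": sm, "mem_mhz": mem})
    return rows


# Illustrative line in the shape the query is expected to print.
sample_output = "2085, 9501\n"

print(" ".join(QUERY_CMD))
for index, clocks in enumerate(parse_clock_rows(sample_output)):
    print(f"GPU {index}: SM {clocks['sm_mhz']} MHz, memory {clocks['mem_mhz']} MHz")
```

On a real machine, sample_output would be replaced with the text returned by running QUERY_CMD, for example through subprocess.run with capture_output=True and text=True.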
