Skip to content

enerGyPU 0.1.1-rc0

Pre-release
Pre-release
Compare
Choose a tag to compare
@jagh jagh released this 22 Oct 20:14
· 63 commits to master since this release

Release 0.1.1

This version of enerGyPU monitor tool is used as a submodule of DiagnoseNET application-framework for energy-workload characterization while the deep neural networks are training.

Major Features And Improvements

  • Added enerGyPU_availabledevice.sh into the launcher script to get the GPU available in a multi-GPU node and pass for a high library as TensorFlow.
  • Extend the energy-workload characterization in a distributed training DNN on NVIDIA Jetson TX2 capturing
    power traces with tegrastats and managing the GPU identification by each worker enerGyPU_record-cluster.sh.
  • Added enerGyPU_bandwidth.sh bandwidth monitor to record between the master and each worker.

Adding enerGyPU into DiagnoseNET repository

  • Clone a enerGyPUTesting branch:
    git clone -b enerGyPUTesting https://github.com/jagh/enerGyPU.git

  • Build a enerGyPUTesting as submodule:
    git submodule add -b enerGyPUTesting https://github.com/jagh/enerGyPU.git

  • Fix a Git detached head in a submodule:
    git checkout enerGyPUTesting

  • Updated DiagnoseNet to the latest commit on enerGyPUTesting:
    git pull origin enerGyPUTesting
    ## Go back to DiagnoseNet directory
    git add enerGyPU/
    git commit -m "submodule updated"