Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[事前学習] - v4モデル環境構築実験 #111

Open
odashi opened this issue Jan 24, 2025 · 0 comments
Open

[事前学習] - v4モデル環境構築実験 #111

odashi opened this issue Jan 24, 2025 · 0 comments
Assignees
Labels
pretrain Experiment of model pretrain

Comments

@odashi
Copy link
Member

odashi commented Jan 24, 2025

Overview

次期事前学習モデルのための環境構築を行います。

Details

Llama 3.1 準拠の学習を行うためには、LLM-jp-3 よりも新しいバージョンのMegatronに導入された一部機能が必要となる。
このオプションが有効かつSakuraおよびABCI上で学習可能な設定の探索を行う。

Resources

  • 計算機
    • クラスタ: FIXME Sakura (Ishikari)
    • ノード種別: FIXME gpu-small (H100x8)
    • ノード台数: FIXME 32
  • コード
  • 入力データ:
    • {name}: {physical path}
  • 出力データ:
    • 保存先: {cluster}:/data/experiments/{number}
    • データ内訳:
      • {name}: xxx TB (バッファ容量を含む)
  • W&B ログ:
  • 開始日: YYYY-MM-DD
  • 終了予定日: YYYY-MM-DD (バッファ期間を含む)
@odashi odashi added the pretrain Experiment of model pretrain label Jan 24, 2025
@odashi odashi self-assigned this Jan 24, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
pretrain Experiment of model pretrain
Projects
None yet
Development

No branches or pull requests

1 participant