GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection

pixeli99/_GaLore

 
 

bash run_7b.sh pre 3e-4
bash run_7b.sh post 3e-4
bash run_7b.sh post_pre 3e-4 # 8:24
bash run_7b.sh sandwich 3e-4
# scale w2 and wo
bash run_7b.sh scale_post_pre 3e-4
bash run_7b.sh scale_pre 3e-4
