Alkemet News
MegaTrain: Full Precision Training of 100B+ Parameter LLMs on a Single GPU
(arxiv.org)
222
points
bychrsw
8 hours ago |
42
comments
Invalid date
Invalid date
Invalid date
Invalid date
Invalid date
Invalid date
Invalid date
Invalid date
Invalid date
Invalid date
Invalid date
Invalid date
Invalid date
Invalid date
Invalid date
Invalid date
Invalid date