Alkemet News
Run 70B LLM Inference on a Single 4GB GPU with This New Technique
(ai.gopubby.com)
111
points
bygardenfelder
2 years ago |
57
comments
Invalid date
Invalid date
Invalid date
Invalid date
Invalid date
Invalid date
Invalid date
Invalid date
Invalid date
Invalid date
Invalid date
Invalid date
Invalid date
Invalid date
Invalid date
Invalid date
Invalid date
Invalid date
Invalid date
Invalid date
Invalid date