Alkemet News
ZAYA1-8B: An 8B MoE Model with 760M Active Params Matching DeepSeek-R1 on Math
(firethering.com)
18 points by steveharing1 2 hours ago | 18 comments