Better Hardware Could Turn Zeros into AI Heroes
Sparse computing enables leaner, faster AI
When it comes to AI models, size matters.
Even though some artificial-intelligence experts warn that scaling up large language models (LLMs) is hitting diminishing performance returns, companies are still coming out with ever larger AI tools. Meta’s latest Llama release had a staggering 2 trillion parameters...
The Stanford team’s work on Onyx represents a significant step toward addressing the growing energy and computational demands of AI models. By focusing on sparsity—a property where most parameters in neural networks are zero or near-zero—they’ve demonstrated that hardware can be re-architected to skip unnecessary calculations, drastically improving efficiency. This approach challenges the prevailing trend of simply scaling up AI models, which has led to diminishing returns in performance while e...
