News
May 21, 2025
Red Hat Launches the llm-d Community, Powering Distributed Gen AI Inference at Scale
“We are pleased to see Red Hat build upon the established success of vLLM, which originated in our lab to help address the speed and memory challenges that come with running large AI models. Open source projects like vLLM, and now llm-d anchored in vLLM, are at the frontier of AI innovation tackling the most demanding AI inference requirements and moving the needle for the industry at large.” -Ion Stoica, Professor and Director of Sky Computing Lab, University of California, Berkeley
May 7, 2025
PyTorch Foundation Welcomes vLLM as a Hosted Project
The PyTorch Foundation is excited to welcome vLLM as a PyTorch Foundation-hosted project. Contributed by the University of California – Berkeley, vLLM is a high-throughput, memory-efficient inference and serving engine designed for LLMs.
April 17, 2025
Broadcom expands support for UC Berkeley’s Sky Computing Lab
Following VMware’s long-standing sponsorship, Broadcom is deepening its collaboration with UC Berkeley’s Sky Computing Lab through a $4 million gift. This expanded collaboration reinforces a shared commitment to innovation and interoperability, building on both organizations’ histories of addressing complex industry challenges through cutting-edge research.
April 7, 2025
Ion Stoica and John Schulman recognized with UC Berkeley Achievement Awards
Stoica, a professor of electrical engineering and computer sciences, is the recipient of the 2025 Fiat Lux Faculty Award. The award recognizes a Berkeley faculty member whose extraordinary contributions advance the university’s philanthropic mission and transform its research, teaching. and programs.
February 21, 2025
Sky Computing Lab receives NVIDIA DGX B200 for AI research
This week, the Sky Computing Lab at UC Berkeley EECS became the first research institution in the nation to receive NVIDIA’s cutting-edge DGX B200 system.
February 19, 2025
Prof. Natacha Crooks named Sloan Fellow
The awards honor early career researchers who have demonstrated innovation and creativity.
February 4, 2025
NBC News: Ion Stoica on DeepSeek
January 29, 2025
How Chinese A.I. Start-Up DeepSeek Is Competing With Silicon Valley Giants
The company built a cheaper, competitive chatbot with fewer high-end computer chips than U.S. behemoths like Google and OpenAI, showing the limits of chip export control.
January 14, 2025
Researchers open source Sky-T1, a ‘reasoning’ AI model that can be trained for less than $450
So-called reasoning AI models are becoming easier — and cheaper — to develop.
December 12, 2024
The UC Berkeley Project That Is the AI Industry’s Obsession
Ranking AI is tricky, so two students developed a way to make the best bots battle