May 21, 2025

Red Hat Launches the llm-d Community, Powering Distributed Gen AI Inference at Scale

“We are pleased to see Red Hat build upon the established success of vLLM, which originated in our lab to help address the speed and memory challenges that come with running large AI models. Open source projects like vLLM, and now llm-d anchored in vLLM, are at the frontier of AI innovation tackling the most demanding AI inference requirements and moving the needle for the industry at large.” -Ion Stoica, Professor and Director of Sky Computing Lab, University of California, Berkeley

May 7, 2025

PyTorch Foundation Welcomes vLLM as a Hosted Project

The PyTorch Foundation is excited to welcome vLLM as a PyTorch Foundation-hosted project. Contributed by the University of California – Berkeley, vLLM is a high-throughput, memory-efficient inference and serving engine designed for LLMs.

April 17, 2025

Broadcom expands support for UC Berkeley’s Sky Computing Lab

Following VMware’s long-standing sponsorship, Broadcom is deepening its collaboration with UC Berkeley’s Sky Computing Lab through a $4 million gift. This expanded collaboration reinforces a shared commitment to innovation and interoperability, building on both organizations’ histories of addressing complex industry challenges through cutting-edge research.

April 7, 2025

Ion Stoica and John Schulman recognized with UC Berkeley Achievement Awards

Stoica, a professor of electrical engineering and computer sciences, is the recipient of the 2025 Fiat Lux Faculty Award. The award recognizes a Berkeley faculty member whose extraordinary contributions advance the university’s philanthropic mission and transform its research, teaching. and programs.

February 21, 2025

Sky Computing Lab receives NVIDIA DGX B200 for AI research

This week, the Sky Computing Lab at UC Berkeley EECS became the first research institution in the nation to receive NVIDIA’s cutting-edge DGX B200 system.

February 19, 2025

Prof. Natacha Crooks named Sloan Fellow

The awards honor early career researchers who have demonstrated innovation and creativity.

February 4, 2025

NBC News: Ion Stoica on DeepSeek

January 29, 2025

How Chinese A.I. Start-Up DeepSeek Is Competing With Silicon Valley Giants

The company built a cheaper, competitive chatbot with fewer high-end computer chips than U.S. behemoths like Google and OpenAI, showing the limits of chip export control.

January 14, 2025

Researchers open source Sky-T1, a ‘reasoning’ AI model that can be trained for less than $450

So-called reasoning AI models are becoming easier — and cheaper — to develop.

December 12, 2024

The UC Berkeley Project That Is the AI Industry’s Obsession

Ranking AI is tricky, so two students developed a way to make the best bots battle