Projects

R2E

R2E

A framework that turns static GitHub repository into a dynamic runnable environments to evaluate the performance of code-generating systems, both static and interactive

Rollbaccine

A General Solution to Rollback Attacks in TEEs
Auto-Whittaker

Auto-Whittaker

Automatically Rewriting Distributed Protocols for Scalability

Scrooge

Enabling Replicated State Machines to Communicate Efficiently

SVR3

The SVR3 project aims to store client-side secrets server-side protected by a human-remembered (and thus, low-entropy) pin.
Skydentity

Skydentity

Let orchestrators run your workloads on your cloud resources without handing over your cloud credentials and data.
Flock

Flock

A Framework for Deploying On-Demand Distributed Trust
POET

POET

Training Neural Networks on Tiny Devices with Integrated Rematerialization and Paging

SkyPIE

A Fast & Accurate Oracle for Object Placement
LiveCodeBench

LiveCodeBench

Holistic and Contamination Free Evaluation of Large Language Models for Code
Skyplane

Skyplane

Blazing Fast Bulk Data Transfers Between Any Cloud
RAFT

RAFT

“Retrieval-Augmented Fine-Tuning” combines the benefits of Retrieval-Augmented Generation and Fine-Tuning for better domain adaptation
Arena Hard

Arena Hard

An Automatic Pipeline to Build High-Quality LLM Benchmarks with High Separability and Agreement to Human Preference from Live Data
SGLang

SGLang

SGLang is a structured generation language designed for large language models (LLMs). It makes your interaction with LLMs faster and more controllable by co-designing the frontend language and the runtime system.
vLLM

vLLM

Building the fastest and easiest-to-use inference engine for LLMs
Vicuna

Vicuna

An Open-Source Chatbot Impressing GPT-4 with 90%* ChatGPT Quality
Chatbot Arena

Chatbot Arena

An Open Platform for Evaluating LLMs by Human Preference
GoEx

GoEx

A Runtime for LLM-Generated Actions like Code, API Calls, and More.
Embarcadero

Embarcadero

A Totally Ordered, High Throughput, Pub/Sub System with Disaggregated Memory
Gorilla

Gorilla

Gorilla is an open-source, state-of-the-art LLM that invokes API calls to interact with services!
Hydro

Hydro

The Hydro Project at UC Berkeley is developing cloud-native programming models that allow anyone to develop scalable and resilient distributed applications.
MemGPT

MemGPT

Teach LLMs to manage their own memory for unbounded context!
SkyPilot

SkyPilot

SkyPilot is a framework for running LLMs, AI, and batch jobs on any infrastructure, offering maximum cost savings, highest GPU availability, and managed execution.