Arena Hard
An Automatic Pipeline to Build High-Quality LLM Benchmarks with High Separability and Agreement to Human Preference from Live Data
Gorilla
Gorilla is an open-source, state-of-the-art LLM that invokes API calls to interact with services!
LOTUS
Easily Build Knowledge-Intensive LLM Applications That Reason Over Your Data With LOTUS!
RAFT
“Retrieval-Augmented Fine-Tuning” combines the benefits of Retrieval-Augmented Generation and Fine-Tuning for better domain adaptation
RouteLLM
A Framework for Serving and Evaluating LLM Routers – Save LLM Costs Without Compromising Quality!
Skydentity
Let orchestrators run your workloads on your cloud resources without handing over your cloud credentials and data.
SkyPilot
SkyPilot is a framework for running LLMs, AI, and batch jobs on any infrastructure, offering maximum cost savings, highest GPU availability, and managed execution.
Stylus
We introduce Stylus, which efficiently selects and automatically composes task-specific adapters based on a prompt’s keywords.
SVR3
The SVR3 project aims to store client-side secrets server-side protected by a human-remembered (and thus, low-entropy) pin.