Highlights
- Pro
Pinned
- eth-sri/language-model-arithmetic (Public): Controlled Text Generation via Language Model Arithmetic
- eth-sri/matharena (Public): Evaluation of LLMs on latest math competitions
- eth-sri/polyrating (Public): Implementation of Polyrating: A Cost-Effective and Bias-Aware Rating System for LLM Evaluation
Python 4
- ChessImageBench (Public): A Benchmark for Chessboard Image Generation and Error Detection
Jupyter Notebook 4

