Learning in LLMs and MLsys, recently focused on RL training
-
SGLang | AMD | Tsinghua University
- California, USA
-
23:27
(UTC -08:00) - https://yushengsu-thu.github.io/
- @thu_yushengsu
Highlights
- Pro
Pinned Loading
-
OpenBMB/AgentVerse
OpenBMB/AgentVerse Public🤖 AgentVerse 🪐 is designed to facilitate the deployment of multiple LLM-based agents in various applications, which primarily provides two frameworks: task-solving and simulation
-
sglang
sglang PublicForked from sgl-project/sglang
SGLang is a fast serving framework for large language models and vision language models.
Python 1
-
torch_memory_saver
torch_memory_saver PublicForked from fzyzcjy/torch_memory_saver
Allow torch tensor memory to be released and resumed later
Python 2
-
RLsys-Foundation/APRIL
RLsys-Foundation/APRIL PublicAPRIL: Active Partial Rollouts in Reinforcement Learning to Tame Long-tail Generation. A system-level optimization for scalable LLM training.
-
-
slime
slime PublicForked from THUDM/slime
slime is an LLM post-training framework for RL Scaling.
Python
If the problem persists, check the GitHub status page or contact support.




