HydraQYH

Follow

🤣

Qi Yuhang HydraQYH

🤣

Follow

AI Infra/MLSys Performance Analysis & Tuning. Now in AML@Bytedance, used to be in AIACC Team@Alibaba Cloud.

118 followers · 44 following

Bytedance
Hangzhou, China
00:40 (UTC +08:00)
https://www.zhihu.com/people/anonymous-76-65-9

Achievements

Achievements

Pinned Loading

sglang sglang Public

Forked from sgl-project/sglang

SGLang is a fast serving framework for large language models and vision language models.

Python 2
cutlass cutlass Public

Forked from NVIDIA/cutlass

CUDA Templates for Linear Algebra Subroutines

C++
expert_specialization_moe expert_specialization_moe Public

Expert Specialization MoE Solution based on CUTLASS

Cuda 25 1
flash-attention flash-attention Public

Forked from Dao-AILab/flash-attention

Fast and memory-efficient exact attention

Python
hp_rms_norm hp_rms_norm Public

High performance RMSNorm Implement by using SM Core Storage(Registers and Shared Memory)

Cuda 25 1