YouliangYuan

Follow

😃

hello, have a good day

Youliang Yuan YouliangYuan

😃

hello, have a good day

Follow

PhD student at CUHK-SZ. Currently interested in LLM safety.

76 followers · 61 following

The Chinese University of Hong Kong, Shenzhen
Shenzhen China
04:57 (UTC +08:00)
https://youliangyuan.github.io
https://scholar.google.com/citations?user=cd-wSAsAAAAJ&hl=zh-CN&oi=ao

Achievements

Achievements

YouliangYuan/README.md

Hi 😄， if you want to join us, email Professor Pinjia He, a guy as nice as me.

Codes for my work CipherChat and DeRTa are below 👇.

Pinned Loading

RobustNLP/CipherChat RobustNLP/CipherChat Public

A framework to evaluate the generalization capability of safety alignment for LLMs

Python 621 69
RobustNLP/DeRTa RobustNLP/DeRTa Public

A novel approach to improve the safety of large language models, enabling them to transition effectively from unsafe to safe state.

Python 71 2
rrm-cure-miracle-steps rrm-cure-miracle-steps Public

Rubric Reward Model to reduce “miracle steps” and unfaithful CoT in math; SFT+PPO training and verified evaluation.

Python 8