add hunyuanv1 dense and moe model #940
@Kingsleyandher could you kindly update `hunyuan_v1.py` to use the output types that also return the metric? See the other models for that refactoring.
d9cd6a8 to e269841
e269841 to 55474c4
55474c4 to f62b40b
@kashif Please review it, thanks.
@shimizust Hi, this PR's workflow is pending maintainer approval. Could someone please approve running the workflows? Thanks!
shimizust left a comment:
verified hunyuan tests pass, lgtm!
## Summary

This pull request adds support for the HunyuanV1 dense and MoE models to the Liger-Kernel framework.

[HunyuanV1 Model PR](huggingface/transformers#39606)

## Testing Done

<img width="3482" height="595" alt="1763112849386" src="https://github.com/user-attachments/assets/eaab7b15-7737-4285-9f23-1d01cc09ee91" />

And a simple test:

<img width="2748" height="1051" alt="image" src="https://github.com/user-attachments/assets/fa6155f3-9456-4054-a874-41e1c25e4b47" />

- Hardware Type: H20
- [x] run `make test` to ensure correctness
- [x] run `make checkstyle` to ensure code style
- [x] run `make test-convergence` to ensure convergence

Co-authored-by: aureli <[email protected]>
Co-authored-by: Steven Shimizu <[email protected]>


