Skip to content
/ D2PO Public

The official PyTorch implementation of the paper "Earlier Tokens Contribute More: Learning Direct Preference Optimization From Temporal Decay Perspective".

License

Notifications You must be signed in to change notification settings

LotuSrc/D2PO

About

The official PyTorch implementation of the paper "Earlier Tokens Contribute More: Learning Direct Preference Optimization From Temporal Decay Perspective".

Resources

License

Code of conduct

Contributing

Security policy

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Contributors 102

Languages