Comparing changes

* Update release.yml * add release to ref * Update release.yml * overwrite * add repo * remove unneed args * fix env * Update repo & ref usage * Print env * CUDA_VISIBLE_DEVICES: 0

Co-authored-by: LRL-ModelCloud <[email protected]>

* support for minicpm3 * Update auto.py

First of all, kudos for this project. It's the only project I found that properly supports modern models like llama-3.1 out of the box. Also the speed, and other factors, seem better. Fixing in this PR a small bug - the previous link led to a missing page.

Co-authored-by: LRL-ModelCloud <[email protected]>

* add grinmoe support * not quantize "block_sparse_moe.gate" * not deepcopy model * mod README.md * README.md add minicpm3 --------- Co-authored-by: LRL-ModelCloud <[email protected]>

Commits on Aug 28, 2024

transformer 4.44.2 update (#382 )

Qubitium authored Aug 28, 2024

Configuration menu

View commit details

Copy full SHA for cc9dcc0

Browse repository at this point

Copy the full SHA

cc9dcc0 View commit details

Browse the repository at this point in the history

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Comparing changes

Open a pull request

Commits on Aug 17, 2024

Commits on Aug 20, 2024

Commits on Aug 28, 2024

Commits on Sep 10, 2024

Commits on Sep 15, 2024

Commits on Sep 19, 2024

This comparison is taking too long to generate.

Uh oh!