Skip to content
Permalink

Comparing changes

Choose two branches to see what’s changed or to start a new pull request. If you need to, you can also or learn more about diff comparisons.

Open a pull request

Create a new pull request by comparing changes across two branches. If you need to, you can also . Learn more about diff comparisons here.
base repository: ModelCloud/GPTQModel
Failed to load repositories. Confirm that selected base ref is valid, then try again.
Loading
base: v1.0.2
Choose a base ref
...
head repository: ModelCloud/GPTQModel
Failed to load repositories. Confirm that selected head ref is valid, then try again.
Loading
compare: v1.0.3
Choose a head ref
  • 11 commits
  • 12 files changed
  • 6 contributors

Commits on Aug 17, 2024

  1. [CI] update user input env (#378)

    * Update release.yml
    
    * add release to ref
    
    * Update release.yml
    
    * overwrite
    
    * add repo
    
    * remove unneed args
    
    * fix env
    
    * Update repo & ref usage
    
    * Print env
    
    * CUDA_VISIBLE_DEVICES: 0
    CSY-ModelCloud authored Aug 17, 2024
    Configuration menu
    Copy the full SHA
    67c73d9 View commit details
    Browse the repository at this point in the history
  2. should be allow gptq/gptq_v2 convert to marlin (#377)

    Co-authored-by: LRL-ModelCloud <[email protected]>
    LRL-ModelCloud and LRL2-ModelCloud authored Aug 17, 2024
    Configuration menu
    Copy the full SHA
    242da1e View commit details
    Browse the repository at this point in the history

Commits on Aug 20, 2024

  1. Update version.py (#379)

    Qubitium authored Aug 20, 2024
    Configuration menu
    Copy the full SHA
    07a0ff0 View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    ec9b07b View commit details
    Browse the repository at this point in the history

Commits on Aug 28, 2024

  1. Configuration menu
    Copy the full SHA
    cc9dcc0 View commit details
    Browse the repository at this point in the history

Commits on Sep 10, 2024

  1. support for minicpm3 (#385)

    * support for minicpm3
    
    * Update auto.py
    LDLINGLINGLING authored Sep 10, 2024
    Configuration menu
    Copy the full SHA
    06b5c4e View commit details
    Browse the repository at this point in the history

Commits on Sep 15, 2024

  1. Fix missing link (#386)

    First of all, kudos for this project.
    It's the only project I found that properly supports modern models like llama-3.1 out of the box.
    Also the speed, and other factors, seem better.
    
    Fixing in this PR a small bug - the previous link led to a missing page.
    mrT23 authored Sep 15, 2024
    Configuration menu
    Copy the full SHA
    365955a View commit details
    Browse the repository at this point in the history

Commits on Sep 19, 2024

  1. fix minicpm3 support (#387)

    Co-authored-by: LRL-ModelCloud <[email protected]>
    LRL-ModelCloud and LRL2-ModelCloud authored Sep 19, 2024
    Configuration menu
    Copy the full SHA
    6fe5378 View commit details
    Browse the repository at this point in the history
  2. [MODEL] add grinmoe support (#388)

    * add grinmoe support
    
    * not quantize "block_sparse_moe.gate"
    
    * not deepcopy model
    
    * mod README.md
    
    * README.md add minicpm3
    
    ---------
    
    Co-authored-by: LRL-ModelCloud <[email protected]>
    LRL-ModelCloud and LRL2-ModelCloud authored Sep 19, 2024
    Configuration menu
    Copy the full SHA
    d09e17e View commit details
    Browse the repository at this point in the history
  3. build 1.0.3 (#389)

    Qubitium authored Sep 19, 2024
    Configuration menu
    Copy the full SHA
    72364c2 View commit details
    Browse the repository at this point in the history
  4. Update README.md (#390)

    Qubitium authored Sep 19, 2024
    Configuration menu
    Copy the full SHA
    44b9df7 View commit details
    Browse the repository at this point in the history
Loading