Skip to content
Permalink

Comparing changes

Choose two branches to see what’s changed or to start a new pull request. If you need to, you can also or learn more about diff comparisons.

Open a pull request

Create a new pull request by comparing changes across two branches. If you need to, you can also . Learn more about diff comparisons here.
base repository: ModelCloud/GPTQModel
Failed to load repositories. Confirm that selected base ref is valid, then try again.
Loading
base: v1.2.3
Choose a base ref
...
head repository: ModelCloud/GPTQModel
Failed to load repositories. Confirm that selected head ref is valid, then try again.
Loading
compare: v1.3.0
Choose a head ref
  • 18 commits
  • 47 files changed
  • 6 contributors

Commits on Nov 25, 2024

  1. Configuration menu
    Copy the full SHA
    4be54ca View commit details
    Browse the repository at this point in the history
  2. [CI] remove zen4 name in CI (#658)

    * [CI] make name short
    
    * [CI] rename to public server
    
    * [CI] rename to task list
    
    * [CI] rename to public server
    
    * [CI] remove -zen4 suffix
    
    * [CI] remove tag
    
    * [CI] rename to runner
    CSY-ModelCloud authored Nov 25, 2024
    Configuration menu
    Copy the full SHA
    4158042 View commit details
    Browse the repository at this point in the history
  3. Configuration menu
    Copy the full SHA
    e5048e1 View commit details
    Browse the repository at this point in the history
  4. [CI] fix env.GPU not exists (#661)

    * [CI] fix env.GPU not exists
    
    * [CI] don't run at last step
    CSY-ModelCloud authored Nov 25, 2024
    Configuration menu
    Copy the full SHA
    874095e View commit details
    Browse the repository at this point in the history
  5. Update README.md (#662)

    Qubitium authored Nov 25, 2024
    Configuration menu
    Copy the full SHA
    941c1f0 View commit details
    Browse the repository at this point in the history
  6. Update README.md (#663)

    Qubitium authored Nov 25, 2024
    Configuration menu
    Copy the full SHA
    9ce786e View commit details
    Browse the repository at this point in the history

Commits on Nov 26, 2024

  1. move lm-eval to utils to make it optional, fixed #664 (#666)

    * move lm-eval to utils
    
    * rename
    
    * add hint to install lm-eval
    
    * update unit tests
    CSY-ModelCloud authored Nov 26, 2024
    Configuration menu
    Copy the full SHA
    a1469f7 View commit details
    Browse the repository at this point in the history
  2. do ruff (#667)

    CSY-ModelCloud authored Nov 26, 2024
    Configuration menu
    Copy the full SHA
    29a3c1e View commit details
    Browse the repository at this point in the history
  3. add ipex bench code (#660)

    * add ipex bench code.
    
    * update batch
    
    * use cli arg
    
    * print all arg values before benchmark starts
    
    * add quantized model support
    
    * cleanup
    
    * Rename prompts.json to prompts.jsonl
    
    ---------
    
    Co-authored-by: LRL-ModelCloud <[email protected]>
    Co-authored-by: Qubitium-ModelCloud <[email protected]>
    3 people authored Nov 26, 2024
    Configuration menu
    Copy the full SHA
    878f58b View commit details
    Browse the repository at this point in the history
  4. Add progress bar to replace tqdm (#668)

    * add ProgressBar
    
    * add time countdown
    
    * replace with new progress bar
    
    * remove suffix text
    
    * should't print \n at last
    
    * replace all tqdm usages
    
    * remove tqdm in req
    
    * rename func
    
    * fix current not +1
    
    * fix current not +1
    
    * remove name
    
    * fix iteration = 0
    CSY-ModelCloud authored Nov 26, 2024
    Configuration menu
    Copy the full SHA
    bbb8cab View commit details
    Browse the repository at this point in the history
  5. [MODEL] add hymba support (#651)

    * add hymba support
    
    * fix kv_last_layer
    
    * fix layer cache
    
    * padding outfeatures
    
    * don't quantize "mamba.x_proj.0" and "mamba.dt_proj.0". Otherwise the quant model will output empty.
    
    * add TODO
    
    * change RREADME.md
    
    ---------
    
    Co-authored-by: LRL-ModelCloud <[email protected]>
    Co-authored-by: ZX-ModelCloud <[email protected]>
    3 people authored Nov 26, 2024
    Configuration menu
    Copy the full SHA
    e55e749 View commit details
    Browse the repository at this point in the history
  6. Update README.md (#669)

    * Update README.md
    
    * Update README.md
    
    * Update README.md
    Qubitium authored Nov 26, 2024
    Configuration menu
    Copy the full SHA
    2cbe7bd View commit details
    Browse the repository at this point in the history
  7. Configuration menu
    Copy the full SHA
    0322ea9 View commit details
    Browse the repository at this point in the history
  8. Set the "OMP_NUM_THREADS" environment variable to the specified numbe…

    …r of cores to control the number of threads used by OpenMP (#671)
    
    Co-authored-by: LRL-ModelCloud <[email protected]>
    LRL-ModelCloud and LRL2-ModelCloud authored Nov 26, 2024
    Configuration menu
    Copy the full SHA
    f5ff1f1 View commit details
    Browse the repository at this point in the history
  9. add load/opt time print (#672)

    Co-authored-by: LRL-ModelCloud <[email protected]>
    LRL-ModelCloud and LRL2-ModelCloud authored Nov 26, 2024
    Configuration menu
    Copy the full SHA
    c460bf5 View commit details
    Browse the repository at this point in the history
  10. Configuration menu
    Copy the full SHA
    c360d1e View commit details
    Browse the repository at this point in the history
  11. Configuration menu
    Copy the full SHA
    a519eb3 View commit details
    Browse the repository at this point in the history
  12. prep for 1.3.0 release (#675)

    * prep for 1.3.0 release
    
    * Update README.md
    Qubitium authored Nov 26, 2024
    Configuration menu
    Copy the full SHA
    8819f51 View commit details
    Browse the repository at this point in the history
Loading