Skip to content
Permalink

Comparing changes

Choose two branches to see what’s changed or to start a new pull request. If you need to, you can also or learn more about diff comparisons.

Open a pull request

Create a new pull request by comparing changes across two branches. If you need to, you can also . Learn more about diff comparisons here.
base repository: ModelCloud/GPTQModel
Failed to load repositories. Confirm that selected base ref is valid, then try again.
Loading
base: v1.6.1
Choose a base ref
...
head repository: ModelCloud/GPTQModel
Failed to load repositories. Confirm that selected head ref is valid, then try again.
Loading
compare: v1.7.0
Choose a head ref
  • 18 commits
  • 22 files changed
  • 6 contributors

Commits on Jan 9, 2025

  1. Configuration menu
    Copy the full SHA
    8761c08 View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    24895b7 View commit details
    Browse the repository at this point in the history
  3. Add backend.MLX (#1061)

    * add export
    
    * cleanup
    
    * new nn.linear
    
    * export to mlx
    
    * move dequantize to troch linear
    
    * cleanup
    
    * desc_act=True is not supported
    
    * fix lm_head
    
    * should allow  gptq_v2
    
    * add TestExport
    
    * save tokenizer after save model
    
    * fix torch.dequantize_weight
    
    * fix bias
    
    * fix test_export
    
    * add dynamic check
    
    * add convert_gptq_to_mlx_weights
    
    * add backend.mlx
    
    * load backend.mlx
    
    * fix load
    
    * fix group size
    
    * add mlx_generate
    
    * fix generate
    
    * fix load mlx model
    
    * Update loader.py
    
    * Rename test_export.py to test_mlx.py
    
    * Update backend.py
    
    * Revert "Update loader.py"
    
    This reverts commit 1366d35.
    
    * Update setup.py
    
    * Update loader.py
    
    * add mlx check
    
    ---------
    
    Co-authored-by: LRL-ModelCloud <[email protected]>
    Co-authored-by: CL-ModelCloud <[email protected]>
    Co-authored-by: Qubitium-ModelCloud <[email protected]>
    4 people authored Jan 9, 2025
    Configuration menu
    Copy the full SHA
    3dd1466 View commit details
    Browse the repository at this point in the history
  4. add mlx generate test (#1069)

    * mlx test
    
    * add generate test
    CL-ModelCloud authored Jan 9, 2025
    Configuration menu
    Copy the full SHA
    0b42b1a View commit details
    Browse the repository at this point in the history

Commits on Jan 10, 2025

  1. [CI] upload source in build step (#1070)

    * [CI] upload source & use same codes to test
    
    * [CI] disable show-statistics
    
    * [CI] fix dir exists
    
    * [CI] fix dir exists
    
    * [CI] fix hash
    
    * [CI] fix file name
    
    * [CI] print tags
    
    * [CI] print enve
    
    * [CI] fix compress
    
    * [CI] print files name
    
    * [CI] print files name
    
    * [CI] always run build, but skip compile
    
    * [CI] rename step
    
    * [CI] update uploading source
    CSY-ModelCloud authored Jan 10, 2025
    Configuration menu
    Copy the full SHA
    071ceb8 View commit details
    Browse the repository at this point in the history
  2. code review (#1072)

    CL-ModelCloud authored Jan 10, 2025
    Configuration menu
    Copy the full SHA
    3d72d5e View commit details
    Browse the repository at this point in the history
  3. [CI] install mlx (#1071)

    * [CI] install mlx
    
    * [CI] remove mlx
    CSY-ModelCloud authored Jan 10, 2025
    Configuration menu
    Copy the full SHA
    196afce View commit details
    Browse the repository at this point in the history
  4. Add option to quantize lm_head (#1037)

    * quantize lm_head
    
    * update
    
    * Fix incorrect call to layer.forward()
    
    * lm_head uses a special quantize config
    
    * remove store_lm_head_input_hook()
    
    * added code of save/load lm_head_layer_inputs.pt
    
    * fix pack_module()
    
    * remove pack_module()
    
    * Check if quant lm_head supports
    
    * cleanup
    
    * add only_quant_lm_head
    
    * fix only_quant_lm_head
    
    * add store_lm_head_input_hook()
    
    * fix lm_head layer forward error with marlin
    
    * Revert "add store_lm_head_input_hook()"
    
    This reverts commit 10c97a8.
    
    * cleanup
    
    * QuantizeConfig add "lm_head_low_gpu_mem_usage" field
    
    * add TestLmHeadQuant
    
    * fix merge error
    ZX-ModelCloud authored Jan 10, 2025
    Configuration menu
    Copy the full SHA
    98d1a05 View commit details
    Browse the repository at this point in the history
  5. fix test_packing (#1073)

    * revert marlin dequanitze code
    
    * move dequantize_weight -> qlinear.utils
    LRL-ModelCloud authored Jan 10, 2025
    Configuration menu
    Copy the full SHA
    b292bf9 View commit details
    Browse the repository at this point in the history
  6. [CI] add mlx test (#1074)

    * [CI] move mlx test to m4
    
    * [CI] fix syntax
    
    * [CI] update build if
    
    * [CI] fix mlx-files
    
    * [CI] check not ''
    
    * [CI] update build if
    
    * [CI]  if
    
    * [CI]  if
    
    * [CI] print if env
    
    * [CI] remove alywas
    
    * [CI] remove cancel
    
    * [CI] Print conditions and parameters
    
    * [CI] update outputs
    
    * [CI] use _
    
    * [CI] add needs
    
    * [CI] add needs
    
    * [CI] rename test sh
    
    * [CI] fix parameter not received
    
    * [CI] rename
    
    * [CI] update
    
    * [CI] update if
    
    * [CI] remove ignore
    
    * [CI] clean local
    
    * [CI] clean local
    
    * [CI] update
    
    * darwin BUILD_CUDA_EXT false
    
    * [CI] add test var
    
    * [CI] append .py
    
    * [CI] fix regex
    CSY-ModelCloud authored Jan 10, 2025
    Configuration menu
    Copy the full SHA
    492076a View commit details
    Browse the repository at this point in the history

Commits on Jan 11, 2025

  1. Configuration menu
    Copy the full SHA
    3add320 View commit details
    Browse the repository at this point in the history
  2. update mlx test (#1079)

    * update prompt
    
    * no need redo quant model
    
    * fix import
    
    * [CI] replace model path
    
    * [CI] replace model path
    
    * [CI] fix path
    
    * [CI] update path
    
    * [CI] update path
    
    * [CI] fix replace
    
    * fix not export
    
    * remove     deprecated repetition_penalty
    
    * check none
    
    * [CI] remove all in clean cache step
    
    * print repo & ref
    
    * use zen3 public ip
    
    * [CI] install with index
    
    * [CI] print dir at top
    
    * update prompt
    
    * import at top
    
    * fix python not activated
    
    * force update
    
    * [CI] deelte hidden files
    
    * [CI] update rm
    
    * [CI] fix clean error
    
    * [CI] fix clean error
    CSY-ModelCloud authored Jan 11, 2025
    Configuration menu
    Copy the full SHA
    37018fc View commit details
    Browse the repository at this point in the history

Commits on Jan 16, 2025

  1. convert to mlx support desc_act true (#1082)

    LRL-ModelCloud authored Jan 16, 2025
    Configuration menu
    Copy the full SHA
    2f04c05 View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    976e27e View commit details
    Browse the repository at this point in the history

Commits on Jan 17, 2025

  1. catch module error for setup.py (#1084)

    * [CI] check monster dir is mounted
    
    * [CI] check monster dir is mounted
    
    * add ModuleNotFoundError
    
    * sys.exit with error msg
    
    * Update setup.py
    
    ---------
    
    Co-authored-by: Qubitium-ModelCloud <[email protected]>
    CSY-ModelCloud and Qubitium authored Jan 17, 2025
    Configuration menu
    Copy the full SHA
    066f727 View commit details
    Browse the repository at this point in the history
  2. 1.7.0 release (#1085)

    * prepare for v1.7.0 release
    
    * Update version.py
    
    * Update README.md
    Qubitium authored Jan 17, 2025
    Configuration menu
    Copy the full SHA
    55dc91d View commit details
    Browse the repository at this point in the history
  3. Update README.md (#1086)

    Qubitium authored Jan 17, 2025
    Configuration menu
    Copy the full SHA
    6a245a8 View commit details
    Browse the repository at this point in the history
  4. typo (#1087)

    Qubitium authored Jan 17, 2025
    Configuration menu
    Copy the full SHA
    d247fd0 View commit details
    Browse the repository at this point in the history
Loading