Skip to content
Permalink

Comparing changes

Choose two branches to see what’s changed or to start a new pull request. If you need to, you can also or learn more about diff comparisons.

Open a pull request

Create a new pull request by comparing changes across two branches. If you need to, you can also . Learn more about diff comparisons here.
base repository: vectorch-ai/ScaleLLM
Failed to load repositories. Confirm that selected base ref is valid, then try again.
Loading
base: v0.2.0
Choose a base ref
...
head repository: vectorch-ai/ScaleLLM
Failed to load repositories. Confirm that selected head ref is valid, then try again.
Loading
compare: v0.2.1
Choose a head ref
  • 11 commits
  • 64 files changed
  • 1 contributor

Commits on Aug 24, 2024

  1. Configuration menu
    Copy the full SHA
    865861b View commit details
    Browse the repository at this point in the history

Commits on Aug 25, 2024

  1. build: speed up compilation for marlin kernels (#316)

    generate the instantiations for the marlin kernels to speed up
    compilation
    guocuimi authored Aug 25, 2024
    Configuration menu
    Copy the full SHA
    d49ff0b View commit details
    Browse the repository at this point in the history

Commits on Aug 27, 2024

  1. Configuration menu
    Copy the full SHA
    8101d44 View commit details
    Browse the repository at this point in the history

Commits on Aug 28, 2024

  1. Configuration menu
    Copy the full SHA
    83357e1 View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    4425f74 View commit details
    Browse the repository at this point in the history
  3. cmake: make includes private and disable jinja2cpp build (#320)

    This diff would fix following build error:
    ```
    /home/michael/code/ScaleLLM/src/kernels/quantization/marlin/./memory.h:17:41: error: there are no arguments to ‘__cvta_generic_to_shared’ that depend on a template parameter, so a declaration of ‘__cvta_generic_to_shared’ must be available [-fpermissive]
    [build]    17 |   uint32_t smem = static_cast<uint32_t>(__cvta_generic_to_shared(smem_ptr));
    [build]       |                                         ^~~~~~~~~~~~~~~~~~~~~~~~
    [build] /home/michael/code/ScaleLLM/src/kernels/quantization/marlin/./memory.h:17:41: note: (if you use ‘-fpermissive’, G++ will accept your code, but allowing the use of an undeclared name is deprecated)
    [build] /home/michael/code/ScaleLLM/src/kernels/quantization/marlin/./memory.h: In function ‘void marlin::cp_async4_pred(void*, const void*, bool)’:
    [build] /home/michael/code/ScaleLLM/src/kernels/quantization/marlin/./memory.h:50:41: error: ‘__cvta_generic_to_shared’ was not declared in this scope
    [build]    50 |   uint32_t smem = static_cast<uint32_t>(__cvta_generic_to_shared(smem_ptr));
    [build]       |                                         ^~~~~~~~~~~~~~~~~~~~~~~~
    ```
    guocuimi authored Aug 28, 2024
    Configuration menu
    Copy the full SHA
    43d913c View commit details
    Browse the repository at this point in the history

Commits on Aug 29, 2024

  1. Configuration menu
    Copy the full SHA
    ea0f6bf View commit details
    Browse the repository at this point in the history

Commits on Aug 30, 2024

  1. fix: put item into asyncio.Queue in a thread-safe way (#324)

    `asyncio.Queue` is not thread-safe, so we need to use
    `loop.call_soon_threadsafe` to schedule callbacks from other threads.
    This diff fix the potential contention issue.
    
    Thanks @tp-nan for reporting the issue
    #323.
    guocuimi authored Aug 30, 2024
    Configuration menu
    Copy the full SHA
    bb739f6 View commit details
    Browse the repository at this point in the history

Commits on Sep 3, 2024

  1. Configuration menu
    Copy the full SHA
    2a947e1 View commit details
    Browse the repository at this point in the history

Commits on Sep 4, 2024

  1. Configuration menu
    Copy the full SHA
    5c8c82a View commit details
    Browse the repository at this point in the history
  2. bump up version to 0.2.1

    guocuimi committed Sep 4, 2024
    Configuration menu
    Copy the full SHA
    c28c441 View commit details
    Browse the repository at this point in the history
Loading