Skip to content
Permalink

Comparing changes

Choose two branches to see what’s changed or to start a new pull request. If you need to, you can also or learn more about diff comparisons.

Open a pull request

Create a new pull request by comparing changes across two branches. If you need to, you can also . Learn more about diff comparisons here.
base repository: scaleapi/llm-engine
Failed to load repositories. Confirm that selected base ref is valid, then try again.
Loading
base: v0.0.0beta33
Choose a base ref
...
head repository: scaleapi/llm-engine
Failed to load repositories. Confirm that selected head ref is valid, then try again.
Loading
compare: v0.0.0beta34
Choose a head ref
  • 7 commits
  • 18 files changed
  • 3 contributors

Commits on May 21, 2024

  1. Hardcode llama 3 70b endpoint param (#524)

    * Hardcode some tuning for endpoints
    
    * remove mixtral 8x22b hardcode
    
    * test
    yunfeng-scale authored May 21, 2024
    Configuration menu
    Copy the full SHA
    fe56840 View commit details
    Browse the repository at this point in the history

Commits on May 22, 2024

  1. Configuration menu
    Copy the full SHA
    9a0e071 View commit details
    Browse the repository at this point in the history

Commits on May 28, 2024

  1. Option to read Redis URL from AWS Secret (#526)

    Add an option to have the pods read Redis auth info from an AWS secret.
    Note: there are two places the redis auth info needs to be added, since Redis is used for both the model endpoint creation request message queue and a cache for endpoint info
    
    The secret is formatted as follows:
    It must contain a few keys, namely host, port, scheme (optional, defaults to redis://), auth_token (optional), query_params (optional). These control which Redis gets used as the message queue for the endpoint builder. Also must contain a key cache-url, the full Redis url of the redis to be used as a cache.
    seanshi-scale authored May 28, 2024
    Configuration menu
    Copy the full SHA
    5a3e339 View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    0fbb062 View commit details
    Browse the repository at this point in the history

Commits on Jun 3, 2024

  1. Configuration menu
    Copy the full SHA
    1a68da1 View commit details
    Browse the repository at this point in the history

Commits on Jun 4, 2024

  1. Configuration menu
    Copy the full SHA
    7d4cc3e View commit details
    Browse the repository at this point in the history
  2. [Client] Add guided_grammar and other missing fields (#532)

    Add guided_grammar to the client, + add some missing fields to some codepaths
    seanshi-scale authored Jun 4, 2024
    Configuration menu
    Copy the full SHA
    9a937f8 View commit details
    Browse the repository at this point in the history
Loading