Releases: ggml-org/llama.cpp

b7406

15 Dec 04:15
4aced7a

Warning

Release Format Update: Linux releases will soon use .tar.gz archives instead of .zip. Please make the necessary changes to your deployment scripts.
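One way to prepare a deployment script for this change is to pick the extractor from the archive's file extension, so the same script keeps working before and after the switch. The following is a minimal sketch using only the Python standard library; the function name `extract_release_archive` and the example paths are hypothetical and are not part of llama.cpp or its release tooling.

```python
import tarfile
import zipfile
from pathlib import Path


def extract_release_archive(archive: Path, dest: Path) -> str:
    """Unpack a release archive into dest and return the detected format.

    Hypothetical helper: the format is chosen from the file extension only.
    """
    name = archive.name.lower()
    if name.endswith((".tar.gz", ".tgz")):
        dest.mkdir(parents=True, exist_ok=True)
        # New-style Linux archive: gzip-compressed tarball.
        with tarfile.open(archive, "r:gz") as tf:
            tf.extractall(dest)
        return "tar.gz"
    if name.endswith(".zip"):
        dest.mkdir(parents=True, exist_ok=True)
        # Old-style archive: zip file.
        with zipfile.ZipFile(archive) as zf:
            zf.extractall(dest)
        return "zip"
    raise ValueError(f"unsupported archive format: {archive.name}")
```

A script would call it as `extract_release_archive(Path("release.tar.gz"), Path("llama"))`, where both paths are placeholders. Only extract archives from a source you trust, since neither extractor validates the member paths here.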

[SYCL] Support gpt-oss by OPs add-id, mul_mat for mxfp4, swiglu_oai (#17826)

  • support gpt-oss GPU by OP add-id, mul_mat for mxfp4, swiglu_oai, fix warning

  • fix faulty UT case, update ops.md

  • rebase, fix format issue

b7405

15 Dec 04:09
745fa0e

model : add glm-asr support (#17901)

  • [model] add glm-asr support

  • fix format for ci

  • fix convert format for ci

  • update glm_asr convert script & use build_ffn for glm_asr clip & use build_stack for padding and review

  • check root architecture for convert hf script

  • fix conflict with upstream

  • fix convert script for glm asr & format clip-impl

  • format

  • restore hparams text

  • improved conversion


Co-authored-by: Sigbjørn Skjæret

b7404

14 Dec 22:34
5239229

preset: handle negated arg, reverse the meaning if needed (#18041)

b7402

14 Dec 19:48
37f5a10

mtmd: enhance image resizing in llava_uhd (#18014)

b7401

14 Dec 19:19
9e6649e

vulkan: fix mul_mat_vec_iq1_s formatting (#18026)

b7400

14 Dec 18:25
0759b09

graph: add f_attn_temp_offset (#18025)

b7399

14 Dec 13:17
254098a

common : refactor common_sampler + grammar logic changes (#17937)

  • common : refactor common_sampler + grammar logic changes

  • tests : increase max_tokens to get needed response

  • batched : fix uninitialized samplers

b7398

14 Dec 13:08
3238b14

vulkan: Fix data race/hang in scalar/cm1 flash attention (#17887)

b7397

14 Dec 12:36
4722671

vulkan: improve mul_mat_vec_iq1_s speed (#17874)

b7394

14 Dec 11:22
609a2d0

models : fix YaRN regression + consolidate logic (#18006)

  • models : fix YaRN regression + consolidate logic

  • cont : fix the fix

  • cont : remove header

  • cont : add header
