Default Branch

ebd048fc5e · opencl: flash attention improvement (#25069) · Updated 2026-06-27 17:36:06 -05:00

Branches

b9421898b6 · add for Q4_0 · Updated 2026-04-23 02:33:19 -05:00    jdelony

1045
2

a5355a0226 · server: keep router model refcount to avoid unloading models that have running requests · Updated 2026-04-22 03:07:13 -05:00    jdelony

958
15

35df147d80 · cont : remove /api/tags · Updated 2026-04-20 07:45:42 -05:00    jdelony

973
2

4943e3a396 · gen-libllama-abi: compile sort-key regex once outside the lambda · Updated 2026-04-15 07:04:44 -05:00    jdelony

1028
4

4cabbe36e0 · state · Updated 2026-04-09 06:00:31 -05:00    jdelony

1125
16

a30369d515 · cpu: fix ARM NEON nvfp4 vec dot · Updated 2026-04-06 03:27:03 -05:00    jdelony

1156
1

c30e012253 · contrib : rewrite AGENTS.md, make it more clear about project values (#21270) · Updated 2026-04-01 16:31:51 -05:00    jdelony

1201
0
Included

2985be3324 · update hw info · Updated 2026-03-30 20:24:40 -05:00    jdelony

1438
2

f0fea264b0 · cont : rand hadamard matrices · Updated 2026-03-27 13:11:47 -05:00    jdelony

1290
4

ff76c6731d · cont : cache shift support · Updated 2026-03-27 07:39:14 -05:00    jdelony

1290
4

4cd732f445 · better wording · Updated 2026-03-25 13:46:17 -05:00    jdelony

1300
2

07a6fd8775 · kleidiai: removed cpu feature detection from CI run script · Updated 2026-03-24 12:24:41 -05:00    jdelony

1540
3

203eec25c0 · releases : disable s390x builds · Updated 2026-03-20 12:31:25 -05:00    jdelony

1371
1

4af94d9afe · gguf : fix division by zero · Updated 2026-03-18 05:39:19 -05:00    jdelony

1421
1

15324f905b · cont : reduce paths · Updated 2026-03-15 06:03:20 -05:00    jdelony

1472
3

0776a6a039 · remove event pending stage · Updated 2026-03-15 03:00:06 -05:00    jdelony

1503
10

5ec6569eb5 · unify scalar+vector and fix reduce function · Updated 2026-03-13 03:23:03 -05:00    jdelony

1504
6

95ae9982d3 · Merge branch 'master' into compilade/imatrix-neutral-prior · Updated 2026-03-12 12:20:00 -05:00    jdelony

1506
6

7ded1269ab · unify matmul_id shader selection · Updated 2026-03-12 08:55:12 -05:00    jdelony

1507
2

20fbf04cd6 · metal : fix capture_started flag · Updated 2026-03-11 16:15:16 -05:00    jdelony

1529
2