llama.cpp

mirror of https://github.com/ggml-org/llama.cpp.git synced 2026-06-27 23:50:20 -05:00

History

* enable qwen to llama.cpp

* llama : do not GPU split bias tensors

---------

Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>

2023-12-01 20:16:31 +02:00

alpaca.txt

2023-04-14 22:58:43 +03:00

assistant.txt

2023-10-18 16:21:57 +03:00

chat-with-baichuan.txt

2023-09-14 12:32:10 -04:00

chat-with-bob.txt

2023-04-14 22:58:43 +03:00

chat-with-qwen.txt

2023-12-01 20:16:31 +02:00

chat-with-vicuna-v0.txt

2023-05-03 20:58:11 +03:00

chat-with-vicuna-v1.txt

2023-05-03 20:58:11 +03:00

chat.txt

2023-05-03 20:58:11 +03:00

dan-modified.txt

2023-05-11 18:10:19 +03:00

dan.txt

2023-05-11 18:10:19 +03:00

LLM-questions.txt

2023-10-06 16:16:38 +03:00

mnemonics.txt

2023-10-12 09:35:30 +03:00

parallel-questions.txt

2023-10-06 16:36:32 +03:00

reason-act.txt

2023-04-13 11:33:16 +02:00