diff --git a/tools/export-lora/README.md b/tools/export-lora/README.md index 7dce99c9a9..f0729341f3 100644 --- a/tools/export-lora/README.md +++ b/tools/export-lora/README.md @@ -6,11 +6,10 @@ Apply LORA adapters to base model and export the resulting model. usage: llama-export-lora [options] options: - -m, --model model path from which to load base model (default '') - --lora FNAME path to LoRA adapter (can be repeated to use multiple adapters) - --lora-scaled FNAME S path to LoRA adapter with user defined scaling S (can be repeated to use multiple adapters) - -t, --threads N number of threads to use during computation (default: 4) - -o, --output FNAME output file (default: 'ggml-lora-merged-f16.gguf') + -m, --model FNAME model path from which to load base model + --lora FNAME path to LoRA adapter (use comma-separated values to load multiple adapters) + --lora-scaled FNAME:SCALE,... path to LoRA adapter with user defined scaling (format: FNAME:SCALE,...) + -o, --output, --output-file FNAME output file (default: 'ggml-lora-merged-f16.gguf') ``` For example: @@ -22,12 +21,11 @@ For example: --lora lora-open-llama-3b-v2-english2tokipona-chat-LATEST.gguf ``` -Multiple LORA adapters can be applied by passing multiple `--lora FNAME` or `--lora-scaled FNAME S` command line parameters: +Multiple LORA adapters can be applied by passing comma-separated values to `--lora FNAME` or `--lora-scaled FNAME:SCALE,...`: ```bash ./bin/llama-export-lora \ -m your_base_model.gguf \ -o your_merged_model.gguf \ - --lora-scaled lora_task_A.gguf 0.5 \ - --lora-scaled lora_task_B.gguf 0.5 + --lora-scaled lora_task_A.gguf:0.5,lora_task_B.gguf:0.5 ```