One of the core design goals Georgi Gerganov had with GGUF was to *not* need oth...

rahimnathwani · 2025-04-30T18:03:41

Most of the parameters you would put in Ollama's Modelfile are things you would pass to llama.cpp as command-line flags:

https://github.com/ggml-org/llama.cpp/blob/master/examples/m...

If you only ever have one set of configuration parameters per model (same temp, top_p, system prompt...), then I guess you can put them in the GGUF file itself (the format is extensible).

But what if you want two different sets? You still need to keep them somewhere. That could be a shell script for llama.cpp, or a Modelfile for Ollama.

(Assuming you don't want to create a new (massive) gguf file for each permutation of parameters.)
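
To make the "keep them somewhere" point concrete, here is a minimal sketch of holding two parameter sets outside the model file and turning each into a llama.cpp invocation. The preset names, their values, the `model.gguf` path, and the `build_command` helper are all made up for illustration; `llama-cli`, `-m`, `--temp` and `--top-p` are the binary and flags I believe current llama.cpp builds use, so check them against your version.

```python
import shlex

# Hypothetical presets: the names and values are illustrative only.
PRESETS = {
    "precise": {"temp": 0.2, "top_p": 0.9},
    "creative": {"temp": 1.0, "top_p": 0.95},
}


def build_command(model_path, preset_name, binary="llama-cli"):
    """Return an argv list for one preset; the GGUF file itself is never modified."""
    preset = PRESETS[preset_name]
    return [
        binary,
        "-m", model_path,                    # same model file for every preset
        "--temp", str(preset["temp"]),       # sampling temperature
        "--top-p", str(preset["top_p"]),     # nucleus sampling threshold
    ]


if __name__ == "__main__":
    # Print one shell-ready command line per preset.
    for name in PRESETS:
        print(shlex.join(build_command("model.gguf", name)))
```

Each preset costs a few bytes of text, which is roughly what a shell script or a Modelfile gives you as well, just in a different syntax.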

novaRom · 2025-04-30T21:44:25

This is why we use xdelta3, rdiff, and git