RAG
- Updating facts – edit the document, reindex – done in minutes
- Citing sources – native; citations come straight from the retrieval step
- Cost – vector database + LLM inference; predictable
- Compliance posture – easy; source-of-truth is your database, auditable, deletable on request
- Style and format – limited; RAG does not train the model, so output format and voice rely on prompting
Fine-tuning
- Updating facts – retrain the model, validate, redeploy – hours to days, every time
- Citing sources – not possible; the model cannot point at a source for a fact it absorbed in training
- Cost – training compute up front, plus inference; higher floor, harder to estimate
- Compliance posture – hard; removing a fact from a fine-tuned model is essentially impossible
- Style and format – this is where it shines; consistent output format or voice that prompting cannot achieve
