What is RAG? Retrieval-Augmented Generation, explained in plain language and grounded in production work
The short answer
RAG – retrieval-augmented generation – is the standard pattern for getting an LLM to answer questions about your own documents without making things up. The system first retrieves the relevant passages from your knowledge base, then asks the LLM to write an answer using those passages as source material, with citations back to the originals. It is the architecture behind almost every production AI knowledge assistant we have shipped, including the GDV system serving 400+ insurance companies and the chatbot of a leading member network with 1,000+ HumHub members.
How RAG works in three steps
- Index – We chunk your documents (policies, manuals, member content, internal wikis), embed each chunk into a vector, and store the vectors in a vector database such as Qdrant or pgvector.
- Retrieve – When a user asks a question, the question is embedded the same way and the database returns the most relevant chunks – usually the top 5 to 20.
- Generate – Those chunks plus the original question are sent to an LLM such as GPT-4o via Microsoft AI Foundry. The LLM is instructed to answer using only the retrieved passages and to cite them. If the passages do not contain the answer, the LLM is told to say so.
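The index and retrieve steps can be sketched in a few lines. This is a toy illustration under loud assumptions, not our production code: a bag-of-words token count stands in for a real embedding model, an in-memory list stands in for a vector database such as Qdrant or pgvector, and the sample policy chunks are invented.

```python
import math
import re
from collections import Counter

def embed(text):
    # Toy stand-in for a real embedding model: bag-of-words token counts.
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))

def cosine(a, b):
    # Cosine similarity between two sparse count vectors.
    dot = sum(count * b[token] for token, count in a.items())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0

# Step 1, Index: chunk the documents and store (vector, text) pairs.
chunks = [
    "Policy A covers water damage up to 10,000 euros.",
    "Policy B excludes flood damage entirely.",
    "Members can cancel with three months notice.",
]
index = [(embed(chunk), chunk) for chunk in chunks]

# Step 2, Retrieve: embed the question the same way, return the top-k chunks.
def retrieve(question, k=5):
    query = embed(question)
    ranked = sorted(index, key=lambda pair: cosine(query, pair[0]), reverse=True)
    return [text for _, text in ranked[:k]]

print(retrieve("Does Policy A cover water damage?", k=1))
```

A real vector database replaces the sorted scan with an approximate-nearest-neighbour index so retrieval stays fast over millions of chunks.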
The third step is the part that prevents hallucination. The LLM is not "thinking" up an answer from its training data – it is reading the passages we just handed it.
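The grounding lives in the instruction itself. Here is a sketch of how the retrieved passages and that rule might be assembled into a chat request; the exact wording and message format vary by deployment, and this is not our production prompt.

```python
# Hypothetical system prompt; real deployments tune this wording carefully.
SYSTEM_PROMPT = (
    "You are a knowledge assistant. Answer the question using ONLY the "
    "numbered passages provided, and cite every claim like [2]. If the "
    "passages do not contain the answer, say that you could not find it."
)

def grounded_messages(passages, question):
    # Assemble OpenAI-style chat messages: the retrieved passages are
    # numbered so the model can cite them back.
    context = "\n".join(f"[{i + 1}] {p}" for i, p in enumerate(passages))
    return [
        {"role": "system", "content": SYSTEM_PROMPT},
        {"role": "user", "content": f"Passages:\n{context}\n\nQuestion: {question}"},
    ]
```

The message list would then be sent to the model via whatever chat-completion API the deployment uses; nothing outside the numbered passages reaches the model as source material.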
When to use RAG
Use RAG when the answer lives in documents your team controls and you need answers grounded in those documents – not in whatever the LLM happened to absorb during training. Knowledge assistants over policy archives, internal wikis, product documentation, member portals and regulatory frameworks all fit.
Do not use RAG when the task is generative writing with no factual constraint (just use a plain LLM), or when the data is so structured that a normal database query would do the job better (an LLM is overkill for "list all members in Hamburg").
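For the structured-data case, a plain query really is the whole job. A sketch with a hypothetical members table (sqlite3 and the sample rows are purely for illustration):

```python
import sqlite3

# Hypothetical schema: a members table with a city column.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE members (name TEXT, city TEXT)")
conn.executemany(
    "INSERT INTO members VALUES (?, ?)",
    [("Anna", "Hamburg"), ("Ben", "Berlin"), ("Clara", "Hamburg")],
)

# "List all members in Hamburg" needs no LLM, no embeddings, no retrieval:
rows = conn.execute(
    "SELECT name FROM members WHERE city = ? ORDER BY name", ("Hamburg",)
).fetchall()
print([name for (name,) in rows])  # ['Anna', 'Clara']
```

The query is exact, auditable and cheap; an LLM would only add latency and a chance of error on top of it.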
Do not use fine-tuning instead of RAG for fact retrieval. Fine-tuning bakes facts into model weights, which makes them hard to update, expensive to verify and impossible to cite. RAG keeps the source-of-truth in your database where you can update, audit and govern it.
Why N3XTCODER
We bring a decade of impact-tech experience and over 160 AI projects since 2019. Through our free AI for Impact course, more than 100,000 people have learned to use AI for the common good. Our default stack: n8n in Berlin, Qdrant in the EU, Azure OpenAI via Microsoft EU Sovereignty.
Talk through your AI project
Tell us what you are trying to ship. We will reply with a proposal and a date, usually within a working day.

Simon Stegemann
Co-Founder and CEO