Hi all,
I’m building a RAG (Retrieval-Augmented Generation) application over my dataset of reports. The goal: given a problem statement, return the reports that match it most closely.
Current Approach
- Chunking strategy:
  - Initially, I converted each report into one chunk.
  - Each chunk is vectorized, then stored in FAISS for dense retrieval.
  - Retrieval is done by embedding the problem statement and searching for top matches (rough sketch of this pipeline below).
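For reference, this is roughly what the current setup looks like. It's a simplified sketch, not my exact code: the model name, report texts, and k are placeholders.

```python
# Rough sketch of the current setup: one chunk per report, dense FAISS retrieval.
import faiss
import numpy as np
from sentence_transformers import SentenceTransformer

reports = ["report text 1 ...", "report text 2 ..."]  # in reality: many long reports

model = SentenceTransformer("all-MiniLM-L6-v2")  # example embedding model
embeddings = np.asarray(
    model.encode(reports, normalize_embeddings=True), dtype="float32"
)

# Inner product over L2-normalized vectors == cosine similarity
index = faiss.IndexFlatIP(embeddings.shape[1])
index.add(embeddings)

query = "problem statement goes here"
query_vec = np.asarray(
    model.encode([query], normalize_embeddings=True), dtype="float32"
)

k = min(5, index.ntotal)
scores, ids = index.search(query_vec, k)  # top-k dense retrieval, no score cutoff
for score, i in zip(scores[0], ids[0]):
    print(f"{score:.3f}", reports[i][:80])
```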
- Variants I tried:
  - Dense FAISS search only → Works, but sometimes returns unrelated reports.
  - Sparse search (BM25) → Slight improvement in keyword matching, but still misses some exact mentions.
  - Hybrid dense + sparse search → Combined scores, still inconsistent results (rough sketch of the score fusion below).
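The hybrid variant combines the two score lists roughly like this. Again a simplified sketch: the min-max normalization and the 0.5/0.5 weights are just what I picked by hand, and the dense scores are hard-coded placeholders standing in for the FAISS results above.

```python
# Rough sketch of the hybrid scoring: min-max normalize both score lists,
# then take a weighted sum. Weights and dense scores are placeholders.
import numpy as np
from rank_bm25 import BM25Okapi

def min_max(scores):
    scores = np.asarray(scores, dtype="float32")
    rng = scores.max() - scores.min()
    return (scores - scores.min()) / rng if rng > 0 else np.zeros_like(scores)

reports = ["report text 1 ...", "report text 2 ...", "report text 3 ..."]
query = "problem statement goes here"

# Sparse side: BM25 over whitespace-tokenized reports
bm25 = BM25Okapi([r.lower().split() for r in reports])
sparse_scores = bm25.get_scores(query.lower().split())

# Dense side: these would come from the FAISS search above (one score per report)
dense_scores = np.array([0.42, 0.31, 0.55], dtype="float32")  # placeholder values

combined = 0.5 * min_max(dense_scores) + 0.5 * min_max(sparse_scores)
for i in np.argsort(combined)[::-1]:
    print(f"{combined[i]:.3f}", reports[i][:60])
```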
- Keyword column approach:
  - I added a separate column with keywords extracted from the problem.
  - Retrieval sometimes improved, but it’s still not perfect: some unrelated reports are returned and, worse, some exact matches are not returned. (Rough sketch of how the keyword column is used below.)
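The keyword column is used along these lines. This is a minimal sketch that assumes a simple overlap score; the example keywords are made up, and the extraction itself is a separate step.

```python
# Rough sketch of the keyword-column idea: each report row carries a list of
# extracted keywords, and overlap with the query's keywords gives an extra score.
import pandas as pd

df = pd.DataFrame({
    "report": ["report text 1 ...", "report text 2 ..."],
    "keywords": [["pump", "vibration", "bearing"], ["valve", "leak", "pressure"]],
})

query_keywords = {"pump", "vibration", "noise"}  # extracted from the problem statement

def keyword_overlap(report_keywords):
    # Fraction of query keywords that also appear in the report's keyword column
    return len(query_keywords & set(report_keywords)) / len(query_keywords)

df["keyword_score"] = df["keywords"].apply(keyword_overlap)
print(df[["keywords", "keyword_score"]])
```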
Main Problems
- Low retrieval accuracy: irrelevant chunks sometimes show up in the top results.
- Missed obvious matches: even if the problem statement is literally mentioned in a report, that report is sometimes not returned.
- No control over the similarity threshold: FAISS returns the top-k results, but I’d like to set a minimum similarity score so irrelevant matches can be filtered out.
Questions
- Is there a better chunking strategy for long reports to improve retrieval accuracy?
- Are there embedding models better suited for exact + semantic matching (dense + keyword) in my case?
- How can I set a similarity threshold in FAISS so that results below a certain score are discarded?
- Any tips for re-ranking results after retrieval to boost accuracy?