Cognitive Architecture [2] #

CRAG #

Corrective-RAG (CRAG) is a strategy for RAG that incorporates self-reflection / self-grading on retrieved documents.

In the paper here, a few steps are taken:

If at least one document exceeds the threshold for relevance, then it proceeds to generation
Before generation, it performs knowledge refinement
This partitions the document into “knowledge strips”
It grades each strip, and filters our irrelevant ones
If all documents fall below the relevance threshold or if the grader is unsure, then the framework seeks an additional datasource
It will use web search to supplement retrieval

We will implement some of these ideas from scratch using LangGraph:

Let’s skip the knowledge refinement phase as a first pass. This can be added back as a node, if desired.
If any documents are irrelevant, let’s opt to supplement retrieval with web search.
We’ll use Tavily Search for web search.
Let’s use query re-writing to optimize the query for web search.

Self-RAG 则是更加主动和智能的实现方式，主要步骤概括如下：

Self-RAG is a strategy for RAG that incorporates self-reflection / self-grading on retrieved documents and generations.

In the paper, a few decisions are made:

Are the LLM generation from each chunk in D is relevant to the chunk (hallucinations, etc) -

Input: x (question), d (chunk), y (generation) for d in D
All of the verification-worthy statements in y (generation) are supported by d
Output: {fully supported, partially supported, no support

The LLM generation from each chunk in D is a useful response to x (question) -

We will implement some of these ideas from scratch using LangGraph.

1xx. 【社区第十三讲】老刘说NLP线上交流