我要投稿

PlanRAG：基于迭代计划再RAG解决复杂数据分析任务，比迭代RAG提升15.8%

发布日期：2024-06-30 19:32:08 浏览次数： 2166 来源：PaperAgent

如何将大型语言模型（LLMs）应用于需要复杂数据分析的决策制定问题？

定义了一种新的任务类型，称为决策问答（Decision QA），它要求模型回答给定的决策问题、业务规则和数据库的最佳决策。

决策QA示例。一个红点代表一个交易节点。在Deccan框中的盈利表示每个决策可能带来的盈利变化。请注意，潜在的盈利变化不在数据库中，应该从数据库中计算得出。每个国家只有一个主要的（家庭）交易节点。在表格中，下划线列名表示表格的键。

为了有效解决决策问答任务，提出了一种新的基于迭代计划然后检索增强生成（PlanRAG）的技术：

PlanRAG技术：它通过两个主要的推理类型来回答问题：首先是制定计划，其次是基于检索结果的推理。PlanRAG技术的核心在于，它使用单个语言模型来执行这两种类型的推理，以减少使用不同语言模型可能带来的副作用。
PlanRAG的推理过程：PlanRAG的推理过程包括三个主要步骤：

规划（Planning）：语言模型接收决策问题、数据库架构和业务规则作为输入，生成一个初始的数据分析计划。
检索与回答（Retrieving & Answering）：与之前的RAG技术不同，PlanRAG在这一步骤中不仅考虑问题和规则，还包括初始计划，以更有效地生成数据分析查询。
重新规划（Re-planning）：如果初始计划不足以解决问题，PlanRAG会根据每次检索的结果评估当前计划，并生成新的计划或纠正先前分析的方向。

先前RAG技术与PlanRAG 技术推理过程的比较

在定位场景中，先前的迭代RAG与PlanRAG 推理过程的示例

对不同类型的问题（单次检索和多次检索问题）进行了分析，定位场景中比最先进的迭代RAG方法提高了15.8%，在构建场景中提高了7.4%。

DQA 技术准确率（%）（每个准确率是 RDB 和 GDB 中准确率的平均值）

IterRAG-LM和PlanRAGLM 在简单回答（SR）和多步回答（MR）问题上的准确率（%）

定位场景中，针对 GDB 案例的单轮 RAG 技术检索Prompt模版：

# PrefixYou are a decision-making agent answering a given question.You should collect the data to answer the question:# Tool descriptionsGraph DB: Useful for when you need to collect the data thatfollows the following schema (You MUST generate a Cypher querystatement to interact with this tool):(n:Trade_node {{name, local_value, is_inland, total_power,outgoing, ingoing}});(m:Country {{name, home_node, development}});(Trade_node)-[r:source {{flow}}]->[Trade_node](Country)-[NodeCountry{{is_home,has_merchant,base_trading_power,calculated_trading_power}}]->(Trade_node), args: {{{{'tool_input': {{{{'type': 'string'}}}}}}}}Self thinking: Useful for when there is no available tool., args:{{{{'tool_input': {{{{'type': 'string'}}}}}}}}# Format instructionsUse the following Strict format:Question: the input question you must answer.Thought: you should always think about what to do.Action: a suitable database name, MUST be one of [‘Graph DB’,‘Self-thinking’].Action input: a syntactically correct query statement only, MUSTbe written by Cypher query language.Observation: the result of the action.Thought: I now know the answer.Final answer: the final answer to the question based on theobserved data.# SuffixBegin! Keep in mind that Your response MUST follow the validformat above.

在定位场景中，针对 GDB 案例的单轮 RAG 技术答案生成Prompt模版：

# PrefixYou are a decision-making agent answering a given question.You have already collected the data to answer the question.Indeed, you should make your Final answer immediately.:# Tool descriptionsGraph DB: Useful for when you need to collect the data thatfollows the following schema (You MUST generate a Cypher querystatement to interact with this tool):(n:Trade_node {{name, local_value, is_inland, total_power,outgoing, ingoing}});(m:Country {{name, home_node, development}});(Trade_node)-[r:source {{flow}}]->[Trade_node](Country)-[NodeCountry{{is_home,has_merchant,base_trading_power,calculated_trading_power}}]->(Trade_node), args: {{{{'tool_input': {{{{'type': 'string'}}}}}}}}Self thinking: Useful for when there is no available tool., args:{{{{'tool_input': {{{{'type': 'string'}}}}}}}}# Format instructionsUse the following Strict format:Final answer: the final answer to the question based on theobserved data.# SuffixBegin!

https://arxiv.org/pdf/2406.12430PlanRAG: A Plan-then-Retrieval Augmented Generation for Generative Large Language Models as Decision Makershttps://github.com/myeon9h/PlanRAG.

53AI，企业落地应用大模型首选服务商

产品：大模型应用平台+智能体定制开发+落地咨询服务

承诺：先做场景POC验证，看到效果再签署服务协议。零风险落地应用大模型，已交付160+中大型企业

160+中大型企业正在使用53AI

立即咨询预约演示

百度智能云邀53AI：共创AI新纪元，启航智能新时代

2024-05-27

钉钉恒星计划：53AI与百余位企业家及钉钉生态伙伴，共议“AI 浪潮下的新机遇”

2024-05-22

热点资讯

GraphRAG+Ollama 本地部署，保姆教程，踩坑无数，闭坑大法

2024-07-18

RAGFlow：基于OCR和文档解析的下一代 RAG 引擎

2024-05-05

RAG框架，都在这了!

2024-06-20

一文彻底搞懂大模型 - RAG（检索、增强、生成）

2024-09-04

基于Llama 3 构建RAG语音助手：将本地 RAG 与 Qdrant、Whisper 和 LangChain 集成

2024-05-19

微软开源的GraphRAG解读

2024-07-09

深入解析 Graph RAG：提升语言模型问答能力的创新策略

2024-07-09

Prompt工程师必备：复旦重磅 | 最佳RAG实践长什么样的？

2024-07-07

下一代 RAG 技术来了！微软正式开源 GraphRAG：大模型行业将迎来新的升级？

2024-07-07

RAG 高效应用指南：Embedding 模型的选择和微调

2024-06-13

大家都在问

构建行业RAG应用系统：金融、财务、保险、医疗等行业该怎么做？

2024-12-21

为什么分块在RAG技术中很重要？但又很没有存在感？

2024-12-14

比GraphRAG还好的LightRAG到底是何方神圣？

2024-12-01

企业级RAG应用的5大技术发展趋势，你准备好了吗？

2024-11-27

RAG搭建中，如何选择最合适的向量索引？

2024-11-25

RAG vs ICL：AI大模型的记忆术和临场发挥，谁才是最强辅助？

2024-11-06

企业如何从0到1构建RAG系统？