
DiaRAG: An Intelligent Question-Answering System for the Diabetes Domain

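  • Summary: To meet the dual demands of efficiency and professionalism that the diabetes domain places on intelligent question-answering systems, this paper designs and implements DiaRAG, a diabetes-domain intelligent question-answering system that combines a knowledge graph with retrieval-augmented generation (RAG). The system proposes an auto prompt generation (APG) method that automatically generates prompt templates suited to the diabetes domain; these templates are used to extract a diabetes knowledge graph and to build the retrieval knowledge base. In addition, prompt learning is used to correct the questions posed by patients, which effectively addresses semantic and grammatical errors in complex questions. The paper also designs a fine-tuned reranker that performs a second round of filtering on the community summaries of the diabetes knowledge graph, so that retrieval results closely match the intent of the patient's question. By deeply integrating the knowledge graph with a large language model (LLM), DiaRAG makes full use of the external knowledge base and thereby significantly improves question answering over diabetes knowledge. Experimental results show that DiaRAG significantly outperforms existing systems in answer accuracy and community-summary relevance, offering an innovative solution for personalized diabetes knowledge services.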

     

    Abstract:
    To address the dual requirements of efficiency and professionalism in diabetes-related intelligent question answering, this study presents DiaRAG, a system that integrates knowledge graphs with retrieval-augmented generation (RAG). The system is tailored to the diabetes domain, in which both medical expertise and up-to-date knowledge are critical. DiaRAG introduces an auto prompt generation (APG) method that automatically synthesizes diabetes-specific prompt templates. These templates are used to extract structured information from diabetes literature and clinical data, which supports the construction of a comprehensive diabetes knowledge graph and a dedicated retrieval knowledge base. With APG, the system generates candidate prompts that enhance the extraction of relevant knowledge triples, addressing the challenges posed by ambiguous or complex medical queries and ensuring that the subsequent retrieval process is grounded in an accurate, domain-specific context.
    Furthermore, DiaRAG integrates a specialized text correction module based on PL-BART (Prompt Learning and Bidirectional Auto-Regressive Transformers). This module is designed to correct semantic and syntactic errors in patient queries. By leveraging prompt-guided correction, PL-BART improves the clarity of input questions, thus enabling the retrieval module to perform more precise matching with the underlying diabetes knowledge graph.
    In the retrieval phase, a fine-tuned re-ranker model further optimizes the ordering of the candidate community summaries. This re-ranker, built on a BERT-based cross-encoder architecture, evaluates the relevance of each retrieved summary to the patient’s query. The secondary filtering provided by this module not only improves the alignment between the query intent and the retrieved content but also mitigates the common issue of hallucinations in large language models (LLMs) by ensuring that only high-quality, domain-relevant information is passed to the generation stage.
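    The secondary filtering step can be illustrated with a minimal sketch. It assumes a pluggable scoring function that maps a (query, summary) pair to a relevance score; the names `rerank`, `overlap_score`, `top_k`, and `min_score` are hypothetical and do not come from the paper. The token-overlap scorer is only a stand-in so that the example runs without a model; in a real system this slot would presumably be filled by the fine-tuned BERT cross-encoder.

```python
from typing import Callable, List, Tuple


def rerank(
    query: str,
    summaries: List[str],
    score_fn: Callable[[str, str], float],
    top_k: int = 3,
    min_score: float = 0.0,
) -> List[Tuple[float, str]]:
    """Score every candidate summary against the query, drop low scorers,
    and return the top_k survivors in descending score order."""
    scored = [(score_fn(query, s), s) for s in summaries]
    kept = [pair for pair in scored if pair[0] >= min_score]
    kept.sort(key=lambda pair: pair[0], reverse=True)
    return kept[:top_k]


def overlap_score(query: str, passage: str) -> float:
    """Toy stand-in for a cross-encoder (an assumption, not the paper's model):
    the fraction of query tokens that also occur in the passage."""
    q_tokens = set(query.lower().split())
    p_tokens = set(passage.lower().split())
    return len(q_tokens & p_tokens) / len(q_tokens) if q_tokens else 0.0


candidates = [
    "foot care is important for patients with neuropathy",
    "metformin rarely causes low blood sugar when used alone",
    "blood sugar monitoring schedules for insulin users",
]
ranked = rerank(
    "can metformin cause low blood sugar",
    candidates,
    overlap_score,
    top_k=2,
    min_score=0.1,
)
for score, summary in ranked:
    print(f"{score:.2f}  {summary}")
```

    Only the summaries that pass the threshold reach the generation stage, which is the mechanism the paragraph above credits with reducing hallucinations. The threshold and cut-off values here are arbitrary illustration values.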
    Experimental evaluations were conducted on the DaCorp diabetes question-answering dataset. The results showed that DiaRAG outperformed state-of-the-art models such as GPT-3.5 and HuatuoGPT, as well as other retrieval-augmented frameworks such as NaiveRAG and SelfRAG. On the key evaluation metrics, ROUGE-1, ROUGE-2, and ROUGE-L, DiaRAG consistently outperformed the baseline methods in answer accuracy and community summary relevance.
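    For readers unfamiliar with the metrics, the sketch below computes ROUGE-1, ROUGE-2, and ROUGE-L for a single answer pair. It is a simplified illustration, not the evaluation code used in the study: it assumes whitespace tokenization (Chinese text would need character or word segmentation) and reports a plain F1 rather than a weighted F-measure. The example sentences are invented.

```python
from collections import Counter
from typing import List, Tuple


def _prf(overlap: int, cand_total: int, ref_total: int) -> Tuple[float, float, float]:
    """Turn an overlap count into (precision, recall, F1)."""
    p = overlap / cand_total if cand_total else 0.0
    r = overlap / ref_total if ref_total else 0.0
    f1 = 2 * p * r / (p + r) if p + r else 0.0
    return p, r, f1


def _ngrams(tokens: List[str], n: int) -> Counter:
    return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))


def rouge_n(candidate: str, reference: str, n: int) -> Tuple[float, float, float]:
    """ROUGE-N: clipped n-gram overlap between candidate and reference."""
    cand = _ngrams(candidate.split(), n)
    ref = _ngrams(reference.split(), n)
    overlap = sum((cand & ref).values())  # multiset intersection clips counts
    return _prf(overlap, sum(cand.values()), sum(ref.values()))


def lcs_length(a: List[str], b: List[str]) -> int:
    """Length of the longest common subsequence, by dynamic programming."""
    dp = [[0] * (len(b) + 1) for _ in range(len(a) + 1)]
    for i, x in enumerate(a, 1):
        for j, y in enumerate(b, 1):
            dp[i][j] = dp[i - 1][j - 1] + 1 if x == y else max(dp[i - 1][j], dp[i][j - 1])
    return dp[len(a)][len(b)]


def rouge_l(candidate: str, reference: str) -> Tuple[float, float, float]:
    """ROUGE-L: precision, recall, and F1 based on the LCS length."""
    c, r = candidate.split(), reference.split()
    return _prf(lcs_length(c, r), len(c), len(r))


answer = "insulin lowers blood glucose"
gold = "insulin lowers blood sugar levels"
r1 = rouge_n(answer, gold, 1)
r2 = rouge_n(answer, gold, 2)
rl = rouge_l(answer, gold)
print("ROUGE-1 (P, R, F1):", r1)
print("ROUGE-2 (P, R, F1):", r2)
print("ROUGE-L (P, R, F1):", rl)
```

    ROUGE-1 and ROUGE-2 reward shared words and word pairs, while ROUGE-L rewards the longest in-order match, so the three scores together capture both content overlap and ordering.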
    Ablation studies further demonstrated that each component (the APG module, the PL-BART-based text correction, and the fine-tuned re-ranker) contributed significantly to the overall system performance. Notably, iterative prompt optimization via APG and the specialized re-ranking process proved critical for handling the intricate and specialized language inherent in diabetes-related queries. In a detailed case study involving a patient inquiry about the suitability of a traditional Chinese medicine for diabetic conditions, DiaRAG provided a comprehensive answer that considered the general pharmacological properties of the medicine and also incorporated detailed clinical insights. This explanation directly addressed the complexities of diabetic complications and the specific indications of the medicine, and experts rated DiaRAG’s response significantly higher than those of competing models such as GPT-3.5 and HuatuoGPT. The experts praised DiaRAG for its precise and contextually appropriate advice, which highlights the system’s potential for delivering personalized and reliable medical guidance.
    Overall, DiaRAG represents an important advancement in the design of domain-specific intelligent question-answering systems. Seamlessly integrating structured knowledge extraction, robust text correction, and refined retrieval strategies, it offers an innovative solution for personalized medical knowledge services in diabetes care.

     
