How does a history-aware retriever work?

Published on 2024-11-08

The history-aware retriever discussed in this post is the one returned by the create_history_aware_retriever function from the LangChain package. The function takes the following inputs:

An LLM (a language model that receives a query and returns an answer);
A vector store retriever (a component that receives a query and returns a list of relevant documents);
A prompt (a template that combines the chat history, a list of messages typically exchanged between a human and an AI, with the latest user input).

When invoked, the history-aware retriever takes the user query and the chat history as input and outputs a list of relevant documents. The documents are selected based on the query combined with the context provided by the chat history.

At the end, I summarize the workflow.

Setting it up

from langchain.chains import create_history_aware_retriever
from langchain_community.document_loaders import WebBaseLoader
from langchain_text_splitters import RecursiveCharacterTextSplitter
from langchain_openai import OpenAIEmbeddings, ChatOpenAI
from langchain_core.prompts import ChatPromptTemplate, MessagesPlaceholder
from langchain_chroma import Chroma
from dotenv import load_dotenv
import bs4

load_dotenv() # To get OPENAI_API_KEY

def create_vectorstore_retriever():
    """
    Returns a vector store retriever based on the text of a specific web page.
    """
    URL = r'https://lilianweng.github.io/posts/2023-06-23-agent/'
    loader = WebBaseLoader(
        web_paths=(URL,),
        bs_kwargs=dict(
            parse_only=bs4.SoupStrainer(class_=("post-content", "post-title", "post-header"))
        ))
    docs = loader.load()
    text_splitter = RecursiveCharacterTextSplitter(chunk_size=500, chunk_overlap=0, add_start_index=True)
    splits = text_splitter.split_documents(docs)
    vectorstore = Chroma.from_documents(documents=splits, embedding=OpenAIEmbeddings())
    return vectorstore.as_retriever()

def create_prompt():
    """
    Returns a prompt instructed to produce a rephrased question based on the user's
    last question, but referencing previous messages (chat history).
    """
    system_instruction = """Given a chat history and the latest user question \
        which might reference context in the chat history, formulate a standalone question \
        which can be understood without the chat history. Do NOT answer the question, \
        just reformulate it if needed and otherwise return it as is."""

    prompt = ChatPromptTemplate.from_messages([
        ("system", system_instruction),
        MessagesPlaceholder("chat_history"),
        ("human", "{input}")])
    return prompt

llm = ChatOpenAI(model='gpt-4o-mini')
vectorstore_retriever = create_vectorstore_retriever()
prompt = create_prompt()

history_aware_retriever = create_history_aware_retriever(
    llm,
    vectorstore_retriever,
    prompt
)

Using it

Here, a question is asked without any chat history, so the retriever responds only with documents relevant to the latest question.

chat_history = []

docs = history_aware_retriever.invoke({'input': 'what is planning?', 'chat_history': chat_history})
for i, doc in enumerate(docs):
    print(f'Chunk {i + 1}:')
    print(doc.page_content)
    print()

Chunk 1:
Planning is essentially in order to optimize believability at the moment vs in time.
Prompt template: {Intro of an agent X}. Here is X's plan today in broad strokes: 1)
Relationships between agents and observations of one agent by another are all taken into consideration for planning and reacting.
Environment information is present in a tree structure.

Chunk 2:
language. Essentially, the planning step is outsourced to an external tool, assuming the availability of domain-specific PDDL and a suitable planner which is common in certain robotic setups but not in many other domains.

Chunk 3:
Another quite distinct approach, LLM+P (Liu et al. 2023), involves relying on an external classical planner to do long-horizon planning. This approach utilizes the Planning Domain Definition Language (PDDL) as an intermediate interface to describe the planning problem. In this process, LLM (1) translates the problem into “Problem PDDL”, then (2) requests a classical planner to generate a PDDL plan based on an existing “Domain PDDL”, and finally (3) translates the PDDL plan back into natural

Chunk 4:
Planning

Subgoal and decomposition: The agent breaks down large tasks into smaller, manageable subgoals, enabling efficient handling of complex tasks.
Reflection and refinement: The agent can do self-criticism and self-reflection over past actions, learn from mistakes and refine them for future steps, thereby improving the quality of final results.


Memory

Now, based on the chat history, the retriever knows that the human wants to know about task decomposition as well as planning. So it responds with chunks of text that refer to both topics.

chat_history = [
    ('human', 'when I ask about planning I want to know about Task Decomposition too.')]

docs = history_aware_retriever.invoke({'input': 'what is planning?', 'chat_history': chat_history})
for i, doc in enumerate(docs):
    print(f'Chunk {i + 1}:')
    print(doc.page_content)
    print()

Chunk 1:
Task decomposition can be done (1) by LLM with simple prompting like "Steps for XYZ.\n1.", "What are the subgoals for achieving XYZ?", (2) by using task-specific instructions; e.g. "Write a story outline." for writing a novel, or (3) with human inputs.

Chunk 2:
Fig. 1. Overview of a LLM-powered autonomous agent system.
Component One: Planning#
A complicated task usually involves many steps. An agent needs to know what they are and plan ahead.
Task Decomposition#

Chunk 3:
Planning

Subgoal and decomposition: The agent breaks down large tasks into smaller, manageable subgoals, enabling efficient handling of complex tasks.
Reflection and refinement: The agent can do self-criticism and self-reflection over past actions, learn from mistakes and refine them for future steps, thereby improving the quality of final results.


Memory

Chunk 4:
Challenges in long-term planning and task decomposition: Planning over a lengthy history and effectively exploring the solution space remain challenging. LLMs struggle to adjust plans when faced with unexpected errors, making them less robust compared to humans who learn from trial and error.

Now the question depends entirely on the chat history. We can see that the retriever responds with chunks of text that refer to the correct concept.

chat_history = [
    ('human', 'What is ReAct?'),
    ('ai', 'ReAct integrates reasoning and acting within LLM by extending the action space to be a combination of task-specific discrete actions and the language space')]

docs = history_aware_retriever.invoke({'input': 'It is a way of doing what?', 'chat_history': chat_history})
for i, doc in enumerate(docs):
    print(f'Chunk {i + 1}:')
    print(doc.page_content)
    print()

Chunk 1:
ReAct (Yao et al. 2023) integrates reasoning and acting within LLM by extending the action space to be a combination of task-specific discrete actions and the language space. The former enables LLM to interact with the environment (e.g. use Wikipedia search API), while the latter prompting LLM to generate reasoning traces in natural language.
The ReAct prompt template incorporates explicit steps for LLM to think, roughly formatted as:
Thought: ...
Action: ...
Observation: ...

Chunk 2:
Fig. 2. Examples of reasoning trajectories for knowledge-intensive tasks (e.g. HotpotQA, FEVER) and decision-making tasks (e.g. AlfWorld Env, WebShop). (Image source: Yao et al. 2023).
In both experiments on knowledge-intensive tasks and decision-making tasks, ReAct works better than the Act-only baseline where Thought: … step is removed.

Chunk 3:
The LLM is provided with a list of tool names, descriptions of their utility, and details about the expected input/output.
It is then instructed to answer a user-given prompt using the tools provided when necessary. The instruction suggests the model to follow the ReAct format - Thought, Action, Action Input, Observation.

Chunk 4:
Case Studies#
Scientific Discovery Agent#
ChemCrow (Bran et al. 2023) is a domain-specific example in which LLM is augmented with 13 expert-designed tools to accomplish tasks across organic synthesis, drug discovery, and materials design. The workflow, implemented in LangChain, reflects what was previously described in the ReAct and MRKLs and combines CoT reasoning with tools relevant to the tasks:

Conclusion

In conclusion, the workflow of the history-aware retriever is as follows when .invoke({'input': '...', 'chat_history': '...'}) is called:

It replaces the input and chat_history placeholders in the prompt with the given values, creating a new ready-to-use prompt that basically says "Take this chat history and this last input, and rephrase the last input in a way that anyone can understand without seeing the chat history".
It sends the new prompt to the LLM and receives a rephrased input.
It then sends the rephrased input to the vector store retriever and receives a list of documents relevant to this rephrased input.
Finally, it returns this list of relevant documents.
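
The steps above can be sketched in plain Python. This is a minimal illustration, not LangChain's implementation: fake_llm, fake_retriever, CORPUS and history_aware_retrieve are hypothetical stand-ins invented for this example. The shortcut for an empty chat history reflects my reading of the LangChain documentation, where the input is passed straight to the retriever in that case.

```python
from typing import Callable, Dict, List, Tuple

Message = Tuple[str, str]  # (role, text), mirroring the tuples used in this post

SYSTEM_INSTRUCTION = (
    "Given a chat history and the latest user question, "
    "formulate a standalone question. Do NOT answer it."
)

# Hypothetical mini corpus keyed by a topic keyword (illustration only).
CORPUS: Dict[str, str] = {
    "react": "ReAct integrates reasoning and acting within LLM.",
    "planning": "Planning lets an agent know the steps and plan ahead.",
    "decomposition": "Task decomposition breaks large tasks into subgoals.",
}


def build_messages(user_input: str, chat_history: List[Message]) -> List[Message]:
    """Step 1: fill the chat_history and input placeholders of the prompt."""
    return [("system", SYSTEM_INSTRUCTION), *chat_history, ("human", user_input)]


def fake_llm(messages: List[Message]) -> str:
    """Stand-in for the LLM: appends the history text to the last question."""
    history_text = " ".join(text for _, text in messages[1:-1])
    return f"{messages[-1][1]} (context: {history_text})"


def fake_retriever(query: str) -> List[str]:
    """Stand-in for the vector store retriever: plain keyword matching."""
    lowered = query.lower()
    return [text for keyword, text in CORPUS.items() if keyword in lowered]


def history_aware_retrieve(
    user_input: str,
    chat_history: List[Message],
    rephrase: Callable[[List[Message]], str],
    retrieve: Callable[[str], List[str]],
) -> List[str]:
    # Assumption: with no history there is nothing to resolve,
    # so the raw input is used as the search query.
    if not chat_history:
        return retrieve(user_input)
    messages = build_messages(user_input, chat_history)  # step 1
    standalone_question = rephrase(messages)             # step 2
    return retrieve(standalone_question)                 # steps 3 and 4


history = [
    ("human", "What is ReAct?"),
    ("ai", "ReAct integrates reasoning and acting within LLM"),
]
print(history_aware_retrieve("It is a way of doing what?", history, fake_llm, fake_retriever))
print(history_aware_retrieve("It is a way of doing what?", [], fake_llm, fake_retriever))
```

With the history, the vague question picks up the ReAct context before retrieval; without it, the keyword retriever has nothing to match on.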

Obs.: It is important to note that the embedding used to convert text into vectors is the one specified when Chroma.from_documents is called. In this post, OpenAIEmbeddings() is passed explicitly; when no embedding is specified, Chroma's default embedding is used.
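
To illustrate why this note matters, here is a toy sketch in plain Python. keyword_embedding and length_embedding are hypothetical embedding functions made up for this example (they are neither Chroma's default nor OpenAI's), and top_match is an assumed helper that ranks documents by cosine similarity. The same query retrieves a different document depending on which embedding turns the text into vectors.

```python
import math
from typing import Callable, List

Vector = List[float]

VOCAB = ["planning", "task", "memory"]  # hypothetical tiny vocabulary
DOCS = [
    "memory stores past observations",
    "planning splits a task into steps",
]


def cosine(a: Vector, b: Vector) -> float:
    """Cosine similarity between two vectors of equal length."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def keyword_embedding(text: str) -> Vector:
    """Toy embedding: how often each vocabulary word appears in the text."""
    words = text.lower().split()
    return [float(words.count(term)) for term in VOCAB]


def length_embedding(text: str) -> Vector:
    """Deliberately poor toy embedding: it only encodes the text length."""
    return [float(len(text)), 1.0]


def top_match(query: str, docs: List[str], embed: Callable[[str], Vector]) -> str:
    """Embed the query and every document with the same function, return the closest document."""
    query_vec = embed(query)
    return max(docs, key=lambda doc: cosine(query_vec, embed(doc)))


print(top_match("what is planning", DOCS, keyword_embedding))
print(top_match("what is planning", DOCS, length_embedding))
```

The keyword-based embedding selects the planning sentence, while the length-based one selects the memory sentence simply because its length is closer to the query's.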

This article is republished from: https://dev.to/guilhermecxe/how-a-history-aware-retriever-works-5e07?1
