{"id":5724,"date":"2024-01-24T13:55:37","date_gmt":"2024-01-24T05:55:37","guid":{"rendered":"https:\/\/aict.nkust.edu.tw\/digitrans\/?p=5724"},"modified":"2024-12-13T20:37:45","modified_gmt":"2024-12-13T12:37:45","slug":"rag%e5%af%a6%e4%bd%9c%e6%95%99%e5%ad%b8%ef%bc%8clangchain-llama2-%e5%89%b5%e9%80%a0%e4%bd%a0%e7%9a%84%e5%80%8b%e4%ba%ballm","status":"publish","type":"post","link":"https:\/\/aict.nkust.edu.tw\/digitrans\/?p=5724","title":{"rendered":"RAG\u5be6\u4f5c\u6559\u5b78\uff0cLangChain + Llama2 |\u5275\u9020\u4f60\u7684\u500b\u4ebaLLM"},"content":{"rendered":"\n<p>2024-01-24 | ChiChieh Huang<\/p>\n\n\n\n<p id=\"7c81\">\u5728\u9019\u7bc7\u6587\u7ae0\u4e2d\uff0c\u6703\u5e36\u4f60\u4e00\u6b65\u4e00\u6b65\u67b6\u8a2d\u81ea\u5df1\u7684 RAG\uff08Retrieval-Augmented Generation\uff09\u7cfb\u7d71\uff0c\u8b93\u4f60\u53ef\u4ee5\u4e0a\u50b3\u81ea\u5df1\u7684 PDF\uff0c\u4e26\u4e14\u8a62\u554f LLM \u95dc\u65bc PDF \u7684\u8a0a\u606f\uff0c\u9019\u7bc7\u6559\u5b78\u6ce8\u91cd\u5728\u4e0a\u5716\u85cd\u5e95\u8272\u7684\u90e8\u4efd\uff0c\u4e5f\u5c31\u662f\u5148\u4e0d\u63a5 Gradio (\u60f3\u770b\u63a5\u597d\u7684\u53ef\u4ee5\u53c3\u8003\u4e0b\u7bc7)\u3002\u76f8\u95dc\u7684 tech stack \u6709\u4ee5\u4e0b\u5e7e\u500b\uff1a<\/p>\n\n\n\n<ol class=\"wp-block-list\">\n<li><strong>LLM<\/strong>: Llama2<\/li>\n\n\n\n<li>LLM API: llama.cpp service<\/li>\n\n\n\n<li>Langchain<\/li>\n\n\n\n<li><strong>Vector DB<\/strong>: ChromaDB<\/li>\n\n\n\n<li><strong>Embeding<\/strong>: sentence-Tranformers<\/li>\n<\/ol>\n\n\n\n<p id=\"c31f\">\u5176\u4e2d\u7684\u6838\u5fc3\u5728\u65bc Langchain\uff0c\u5b83\u662f\u7528\u65bc\u958b\u767c\u7531\u8a9e\u8a00\u6a21\u578b\u652f\u63f4\u7684\u61c9\u7528\u7a0b\u5f0f\u7684\u6846\u67b6\uff0cLangChain \u5c31\u50cf\u81a0\u6c34\u4e00\u6a23\uff0c\u6709\u5404\u7a2e\u63a5\u53e3\u53ef\u4ee5\u5c07 LLM \u6a21\u578b\u8207\u5176\u4ed6\u5de5\u5177\u548c\u6578\u64da\u6e90\u9023\u63a5\uff0c\u4e0d\u904e\u73fe\u5728 LangChain \u6b63\u5728\u84ec\u52c3\u767c\u5c55\u4e2d\uff0c\u8a31\u591a\u6587\u4ef6\u6216 API \u6539\u7248\u5f88\u591a\uff0c\u4ee5\u4e0b\u6211\u4f7f\u7528\u6700\u7c21\u55ae\u7684\u65b9\u5f0f\u793a\u7bc4\u3002<\/p>\n\n\n\n<h1 class=\"wp-block-heading\" id=\"e45e\">\u6b65\u9a5f1. \u74b0\u5883\u8a2d\u7f6e<\/h1>\n\n\n\n<p id=\"282c\">\u9996\u5148\u8a2d\u7f6e python \u74b0\u5883\uff0c\u6211\u4f7f\u7528 conda \u5275\u5efa\u74b0\u5883\uff0c\u4e26\u5b89\u88dd\u4ee5\u4e0b library\uff0c\u6211\u5728 Jupyter \u74b0\u5883\u5b8c\u6210\u7bc4\u4f8b\uff0c\u5b8c\u6574 code \u53ef\u4ee5\u5728&nbsp;github&nbsp;\u627e\u5230\u3002<\/p>\n\n\n\n<pre class=\"wp-block-preformatted\"># python=3.9<br>ipykernel<br>ipywidgets<br>langchain<br>PyMuPDF<br>chromadb<br>sentence-transformers<br>llama-cpp-python<\/pre>\n\n\n\n<h1 class=\"wp-block-heading\" id=\"d0e4\">\u6b65\u9a5f2. \u8b80\u5165\u6a94\u6848\u8655\u7406\u4e26\u532f\u5165 DB<\/h1>\n\n\n<div class=\"wp-block-image\">\n<figure class=\"aligncenter is-resized\"><img data-recalc-dims=\"1\" decoding=\"async\" src=\"https:\/\/i0.wp.com\/miro.medium.com\/v2\/resize%3Afit%3A700\/1%2ApvD0sDAZyrI8w_Tyw2FdCQ.png?w=640&#038;ssl=1\" alt=\"\" style=\"width:625px;height:auto\"\/><\/figure>\n<\/div>\n\n\n<p id=\"ae6b\">\u9996\u5148\u6211\u5011\u8981\u5c07\u5916\u90e8\u8cc7\u8a0a\u8655\u7406\u5f8c\uff0c\u653e\u5230 DB \u4e2d\uff0c\u4ee5\u4f9b\u4e4b\u5f8c\u67e5\u8a62\u76f8\u95dc\u77e5\u8b58\uff0c\u9019\u908a\u7684\u6b65\u9a5f\u5c0d\u61c9\u5230\u4e0a\u5716\u6846\u8d77\u4f86\u7684\u90e8\u5206\uff0c\u4e5f\u5c31\u662f\u6a58\u8272\u7684 1. Text splitter \u8207 2. Embedding<\/p>\n\n\n\n<h1 class=\"wp-block-heading\" id=\"3e90\">a). \u4f7f\u7528\u6587\u4ef6\u8f09\u5165\u5668<\/h1>\n\n\n\n<p id=\"d213\">Langchain \u63d0\u4f9b\u4e86\u5f88\u591a\u6587\u4ef6\u8f09\u5165\u5668\uff0c\u7e3d\u8a08\u5927\u6982\u6709 55 \u7a2e\uff0c\u5305\u542b word, csv, PDF, GoogleDrive, Youtube \u7b49\uff0c\u4f7f\u7528\u65b9\u5f0f\u4e5f\u5f88\u7c21\u55ae\uff0c\u9019\u908a\u6211\u5275\u5efa\u4e86\u4e00\u500b\u865b\u64ec\u4eba\u7269 Alison Hawk \u7684 PDF \u8cc7\u8a0a\uff0c\u4e26\u4f7f\u7528&nbsp;<code>PyMuPDFLoader<\/code>\u8b80\u5165\uff0cAlison Hawk \u7684PDF \u8cc7\u8a0a\u53ef\u4ee5\u5728&nbsp;github&nbsp;\u67e5\u770b\u3002\u6ce8\u610f PyMuPDFLoader \u9700\u8981\u5b89\u88dd&nbsp;<code>PyMuPDF<\/code>&nbsp;\u624d\u80fd\u4f7f\u7528\u3002<\/p>\n\n\n\n<pre class=\"wp-block-preformatted\">from langchain.document_loaders import PyMuPDFLoader<br>loader = PyMuPDFLoader(\"LangChain\/Virtual_characters.pdf\")<br>PDF_data = loader.load()<\/pre>\n\n\n\n<h1 class=\"wp-block-heading\" id=\"1ab4\">b). \u4f7f\u7528 Text splitter \u5206\u5272\u6587\u4ef6<\/h1>\n\n\n\n<p id=\"a0fd\">Text splitter \u6703\u5c07\u6587\u4ef6\u6216\u6587\u5b57\u5206\u5272\u6210\u4e00\u500b\u500b chunk\uff0c\u7528\u4ee5\u9810\u9632\u6587\u4ef6\u7684\u8cc7\u8a0a\u8d85\u904e LLM \u7684 tokens\uff0c\u6709\u4e00\u4e9b\u7814\u7a76\u5728\u63a2\u8a0e\u5982\u4f55\u5c07 chunk \u512a\u5316\uff0c\u53ef\u4ee5\u53c3\u8003\u6211\u7684\u53e6\u4e00\u7bc7\u6587\u7ae0\u3002<\/p>\n\n\n\n<p id=\"15c6\">\u9019\u6b65\u9a5f\u5e38\u7528\u7684\u5de5\u5177&nbsp;<code>RecursiveCharacterTextSplitter<\/code>&nbsp;\u8207&nbsp;<code>CharacterTextSplitter<\/code>\uff0c\u5dee\u5225\u5728\u65bc&nbsp;<code>RecursiveCharacterTextSplitter<\/code>&nbsp;\u5982\u679c\u5340\u584a\u5927\u5c0f\u8d85\u904e\u6307\u5b9a\u95be\u503c\uff0c\u5b83\u4e5f\u6703\u905e\u6b78\u5730\u5c07\u6587\u5b57\u5206\u5272\u6210\u66f4\u5c0f\u7684\u5340\u584a\u3002LangChain 2\u7a2e\u65b9\u5f0f\u7686\u6709\u63d0\u4f9b\uff0c\u53e6\u5916\u4e3b\u8981\u53c3\u6578\u70ba\u4ee5\u4e0b\uff1a<\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li>chunk_size\uff1a\u6c7a\u5b9a\u5206\u5272\u6587\u5b57\u6642\u6bcf\u500b\u5340\u584a\u4e2d\u7684\u6700\u5927\u5b57\u5143\u6578\u3002\u5b83\u6307\u5b9a\u6bcf\u500b\u5340\u584a\u7684\u5927\u5c0f\u6216\u9577\u5ea6\u3002<\/li>\n\n\n\n<li>chunk_overlap\uff1a\u6c7a\u5b9a\u5206\u5272\u6587\u5b57\u6642\u9023\u7e8c\u5340\u584a\u4e4b\u9593\u91cd\u758a\u7684\u5b57\u5143\u6578\u3002\u5b83\u6307\u5b9a\u524d\u4e00\u500b\u5340\u584a\u7684\u591a\u5c11\u61c9\u5305\u542b\u5728\u4e0b\u4e00\u500b\u5340\u584a\u4e2d\u3002<\/li>\n<\/ul>\n\n\n\n<pre class=\"wp-block-preformatted\">from langchain.text_splitter import RecursiveCharacterTextSplitter<br>text_splitter = RecursiveCharacterTextSplitter(chunk_size=100, chunk_overlap=5)<br>all_splits = text_splitter.split_documents(PDF_data)<\/pre>\n\n\n\n<h1 class=\"wp-block-heading\" id=\"a3aa\">c). \u8f09\u5165 Embedding model<\/h1>\n\n\n\n<p id=\"0cb4\">\u63a5\u8457\u4f7f\u7528 Embedding \u5c07\u6b65\u9a5f b)\u5206\u5272\u7684 chunk \u6587\u5b57\u8f49\u63db\u70ba\u5411\u91cf\uff0cLangChain \u63d0\u4f9b\u4e86\u8a31\u591a Embedding model \u7684\u63a5\u53e3\uff0c\u5982OpenAI\u3001Cohere\u3001Hugging Face\u3001Weaviate \u7b49\uff0c\u53ef\u4ee5\u53c3\u8003&nbsp;LangChain\u5b98\u7db2\u3002<\/p>\n\n\n\n<p id=\"5732\">\u9019\u908a\u6211\u4f7f\u7528 Hugging Face \u7684 Sentence Transformers\uff0c\u5b83\u63d0\u4f9b\u4e86\u8a31\u591a\u7a2e pretrain \u6a21\u578b\uff0c\u53ef\u4ee5\u6839\u64da\u4f60\u7684\u9700\u6c42\u6216\u61c9\u7528\u60c5\u5883\u9078\u64c7\uff0c\u6211\u9078\u64c7&nbsp;<code>all-MiniLM-L6-v2<\/code>&nbsp;\uff0c\u5176\u4ed6 model \u7d30\u7bc0\u53ef\u4ee5\u770b\u5230&nbsp;SBERT.net&nbsp;\u6216&nbsp;HuggingFace\u3002\u6ce8\u610f\u8981\u5148\u5b89\u88dd&nbsp;<code>sentence-Tranformers<\/code>&nbsp;\u624d\u80fd\u4f7f\u7528\u3002<\/p>\n\n\n\n<pre class=\"wp-block-preformatted\">from langchain.embeddings import HuggingFaceEmbeddings<br>model_name = \"sentence-transformers\/all-MiniLM-L6-v2\"<br>model_kwargs = {'device': 'cpu'}<br>embedding = HuggingFaceEmbeddings(model_name=model_name,<br>                                  model_kwargs=model_kwargs)<\/pre>\n\n\n\n<h1 class=\"wp-block-heading\" id=\"853f\">d). \u5c07 Embedding \u7d50\u679c\u532f\u5165 VectorDB<\/h1>\n\n\n\n<p id=\"f646\">Embedding \u5f8c\u7684\u7d50\u679c\u6211\u5011\u6703\u5132\u5b58\u5728 VectorDB \u4e2d\uff0c\u5e38\u898b\u7684 VectorDB \u6709 Chroma\u3001Pinecone\u3001FAISS\u7b49\uff0c\u9019\u908a\u6211\u4f7f\u7528 Chroma \u4f86\u5be6\u4f5c\u3002Chroma \u8ddf LangChain \u7684\u6574\u5408\u5f97\u5f88\u597d\uff0c\u53ef\u4ee5\u76f4\u63a5\u4f7f\u7528 LangChain \u7684\u63a5\u53e3\u4f86\u505a\u3002<\/p>\n\n\n\n<pre class=\"wp-block-preformatted\"># Embed and store the texts<br># Supplying a persist_directory will store the embeddings on disk<br>from langchain.vectorstores import Chroma<br>persist_directory = 'db'<br>vectordb = Chroma.from_documents(documents=all_splits, embedding=embedding, persist_directory=persist_directory)<\/pre>\n\n\n\n<h1 class=\"wp-block-heading\" id=\"e4ee\">\u6b65\u9a5f3. \u555f\u7528 LLM \u670d\u52d9<\/h1>\n\n\n<div class=\"wp-block-image\">\n<figure class=\"aligncenter is-resized\"><img data-recalc-dims=\"1\" decoding=\"async\" src=\"https:\/\/i0.wp.com\/miro.medium.com\/v2\/resize%3Afit%3A700\/1%2ABhMBZnuZOOJtK310fgUvxw.png?w=640&#038;ssl=1\" alt=\"\" style=\"width:615px;height:auto\"\/><\/figure>\n<\/div>\n\n\n<p id=\"1946\">\u6709\u5169\u7a2e\u65b9\u6cd5\u555f\u52d5\u4f60\u7684 LLM \u6a21\u578b\u4e26\u9023\u63a5\u5230 LangChain\u3002\u4e00\u662f\u4f7f\u7528 LangChain \u7684 LlamaCpp \u63a5\u53e3\u4f86\u5be6\u4f5c\uff0c\u9019\u6642\u5019\u662f\u7531 LangChain \u5e6b\u4f60\u628a llama2 \u670d\u52d9\u555f\u52d5\uff1b\u53e6\u4e00\u500b\u65b9\u6cd5\u662f\u7528\u5176\u4ed6\u65b9\u5f0f\u67b6\u8a2d Llama2 \u7684 API \u670d\u52d9\uff0c\u4f8b\u5982\u4f7f\u7528 llama.cpp \u7684 server \u555f\u52d5 API \u670d\u52d9\u7b49\uff0c\u9019\u90e8\u5206\u7d30\u7bc0\u53ef\u4ee5\u770b\u5230\u6211\u7684\u53e6\u4e00\u7bc7&nbsp;llama.cpp\u6559\u5b78\u3002\u6211 2 \u500b\u65b9\u5f0f\u90fd\u6703\u793a\u7bc4\uff0c\u4f60\u53ef\u4ee5\u9078\u64c7\u9069\u5408\u4f60\u7684\u65b9\u6848\u3002<\/p>\n\n\n\n<h1 class=\"wp-block-heading\" id=\"7c4f\">a). \u4f7f\u7528 LangChain \u7684 LlamaCpp<\/h1>\n\n\n\n<p id=\"5cdd\">\u4f7f\u7528 LlamaCpp \u63a5\u53e3\u8f09\u5165 model\uff0c\u5b83\u6703\u5e6b\u4f60\u555f\u52d5 Llama \u7684\u670d\u52d9\uff0c\u9019\u65b9\u6cd5\u8f03\u7c21\u55ae\uff0c\u76f4\u63a5\u4f7f\u7528\u4e0b\u9762 code \u5c31\u53ef\u4ee5\u57f7\u884c\uff0cmodel_path \u6307\u5b9a\u5230\u4f60\u7684\u6a21\u578b\u4e2d\uff0c\u4f8b\u5b50\u4e2d\u6211\u4f7f\u7528\u91cf\u5316\u904e\u5f8c\u7684 Llama2 Chat\u3002\u6ce8\u610f\u9019\u908a\u8981\u5b89\u88dd&nbsp;<code>llama-cpp-python<\/code><\/p>\n\n\n\n<pre class=\"wp-block-preformatted\">from langchain.callbacks.manager import CallbackManager<br>from langchain.callbacks.streaming_stdout import StreamingStdOutCallbackHandler<br>from langchain_community.llms import LlamaCpp<br>model_path = \"llama.cpp\/models\/llama-2-7b-chat\/llama-2_q4.gguf\"<br><br>llm = LlamaCpp(<br>    model_path=model_path,<br>    n_gpu_layers=100,<br>    n_batch=512,<br>    n_ctx=2048,<br>    f16_kv=True,<br>    callback_manager=CallbackManager([StreamingStdOutCallbackHandler()]),<br>    verbose=True,<br>)<\/pre>\n\n\n<div class=\"wp-block-image\">\n<figure class=\"aligncenter\"><img data-recalc-dims=\"1\" decoding=\"async\" src=\"https:\/\/i0.wp.com\/miro.medium.com\/v2\/resize%3Afit%3A700\/1%2Ark_msRiNOe-wTq6d0ntvpw.png?w=640&#038;ssl=1\" alt=\"\"\/><\/figure>\n<\/div>\n\n\n<p id=\"cdb0\">\u53ef\u4ee5\u4f7f\u7528\u6e2c\u8a66\u770b\u770b\u6709\u6c92\u6709 llm \u670d\u52d9\u555f\u52d5\u6c92\uff1a<\/p>\n\n\n\n<pre class=\"wp-block-preformatted\">llm(\"What is Taiwan known for?\")<\/pre>\n\n\n<div class=\"wp-block-image\">\n<figure class=\"aligncenter\"><img data-recalc-dims=\"1\" decoding=\"async\" src=\"https:\/\/i0.wp.com\/miro.medium.com\/v2\/resize%3Afit%3A700\/1%2Aak-hfiup7IEv6tVZujz10A.png?w=640&#038;ssl=1\" alt=\"\"\/><\/figure>\n<\/div>\n\n\n<h1 class=\"wp-block-heading\" id=\"629f\">b). \u4f7f\u7528\u5df2\u67b6\u8a2d\u7684 API \u670d\u52d9<\/h1>\n\n\n\n<p id=\"a629\">\u5982\u679c\u4f60\u5df2\u7d93\u4f7f\u7528\u5176\u4ed6\u65b9\u5f0f\u67b6\u8a2d LLM \u7684 API \u670d\u52d9\uff0c\u6216\u8005\u662f\u4f7f\u7528 openai \u7684 API \u7684\u8a71\uff0c\u4f60\u9700\u8981\u4f7f\u7528 LangChain \u7684 ChatOpenAI \u63a5\u53e3\u3002\u6211\u9019\u908a\u793a\u7bc4\u662f llama.cpp \u7684 server \u670d\u52d9 (llama.cpp\u6559\u5b78)\uff0c\u5b83\u63d0\u4f9b\u4e86\u985e\u5225OpenAI\u7684API\uff0c\u56e0\u6b64\u6211\u5011\u80fd\u76f4\u63a5\u7528\u540c\u500b\u63a5\u53e3\u4f86\u64cd\u4f5c\uff0c\u4ee5\u4e0b\u662f\u8a72\u63a5\u53e3\u7684\u4e00\u4e9b\u76f8\u95dc\u53c3\u6578\uff1a<\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li><strong>open_ai_key<\/strong>\uff1a\u7531\u65bc\u4e26\u6c92\u6709\u4f7f\u7528\u771f\u6b63\u7684 OpenAI API\uff0c\u56e0\u6b64\u53ef\u4ee5\u96a8\u610f\u586b\u5beb\u3002<\/li>\n\n\n\n<li><strong>openai_api_base<\/strong>\uff1a\u70ba\u6a21\u578bAPI\u7684Base URL<\/li>\n\n\n\n<li><strong>max_tokens\uff1a<\/strong>\u898f\u7bc4\u6a21\u578b\u56de\u7b54\u7684\u9577\u5ea6<\/li>\n<\/ul>\n\n\n\n<pre class=\"wp-block-preformatted\">from langchain.chat_models import ChatOpenAI<br>llm = ChatOpenAI(openai_api_key='None', openai_api_base='http:\/\/127.0.0.1:8080\/v1')<\/pre>\n\n\n\n<h1 class=\"wp-block-heading\" id=\"0251\">\u6b65\u9a5f4. \u8a2d\u5b9a\u4f60\u7684 Prompt<\/h1>\n\n\n\n<p id=\"b15e\">\u4e00\u4e9b LLM \u53ef\u4ee5\u4f7f\u7528\u7279\u5b9a\u7684 Prompt\u3002\u4f8b\u5982\uff0cLlama \u53ef\u4f7f\u7528\u7279\u6b8a token\uff0c\u7d30\u7bc0\u53ef\u4ee5\u53c3\u8003&nbsp;twitter\u3002\u6211\u5011\u53ef\u4ee5\u4f7f\u7528 ConditionalPromptSelector \u6839\u64da\u6a21\u578b\u985e\u578b\u8a2d\u5b9a Prompt\uff0c\u5982\u4ee5\u4e0b\uff1a<\/p>\n\n\n\n<pre class=\"wp-block-preformatted\">from langchain.chains import LLMChain<br>from langchain.chains.prompt_selector import ConditionalPromptSelector<br>from langchain.prompts import PromptTemplate<br><br>DEFAULT_LLAMA_SEARCH_PROMPT = PromptTemplate(<br>    input_variables=[\"question\"],<br>    template=\"\"\"&lt;&lt;SYS&gt;&gt; \\n You are an assistant tasked with improving Google search \\<br>results. \\n &lt;&lt;\/SYS&gt;&gt; \\n\\n [INST] Generate THREE Google search queries that \\<br>are similar to this question. The output should be a numbered list of questions \\<br>and each should have a question mark at the end: \\n\\n {question} [\/INST]\"\"\",<br>)<br><br>DEFAULT_SEARCH_PROMPT = PromptTemplate(<br>    input_variables=[\"question\"],<br>    template=\"\"\"You are an assistant tasked with improving Google search \\<br>results. Generate THREE Google search queries that are similar to \\<br>this question. The output should be a numbered list of questions and each \\<br>should have a question mark at the end: {question}\"\"\",<br>)<br><br>QUESTION_PROMPT_SELECTOR = ConditionalPromptSelector(<br>    default_prompt=DEFAULT_SEARCH_PROMPT,<br>    conditionals=[(lambda llm: isinstance(llm, LlamaCpp), DEFAULT_LLAMA_SEARCH_PROMPT)],<br>)<br><br>prompt = QUESTION_PROMPT_SELECTOR.get_prompt(llm)<br>prompt<\/pre>\n\n\n\n<p id=\"4aa0\">\u4f7f\u7528 LLMChain \u5c07 prompt \u8207 llm \u63a5\u5728\u4e00\u8d77\uff0c\u53e6\u5916 LangChain \u6700\u8fd1\u7684\u6539\u7248\u4f7f\u7528<code>invoke<\/code>&nbsp;\u66ff\u4ee3&nbsp;<code>run<\/code>&nbsp;\uff0c\u7576\u4f60\u770b\u5230\u5176\u4ed6\u6587\u7ae0\u4f7f\u7528&nbsp;<code>run<\/code>&nbsp;\u6642\u53ef\u4ee5\u6ce8\u610f\u3002<\/p>\n\n\n\n<pre class=\"wp-block-preformatted\">llm_chain = LLMChain(prompt=prompt, llm=llm)<br>question = \"What is Taiwan known for?\"<br>llm_chain.invoke({\"question\": question})<\/pre>\n\n\n\n<p id=\"00e6\">\u5be6\u6a5f\u756b\u9762\uff1a<\/p>\n\n\n<div class=\"wp-block-image\">\n<figure class=\"aligncenter\"><img data-recalc-dims=\"1\" decoding=\"async\" src=\"https:\/\/i0.wp.com\/miro.medium.com\/v2\/resize%3Afit%3A700\/1%2AdexT6PX6eGG3MmWaLAMEoQ.png?w=640&#038;ssl=1\" alt=\"\"\/><\/figure>\n<\/div>\n\n\n<h1 class=\"wp-block-heading\" id=\"4f21\">\u6b65\u9a5f5. Text Retrieval + Query LLM<\/h1>\n\n\n<div class=\"wp-block-image\">\n<figure class=\"aligncenter\"><img data-recalc-dims=\"1\" decoding=\"async\" src=\"https:\/\/i0.wp.com\/miro.medium.com\/v2\/resize%3Afit%3A700\/1%2AMFfdNUmTPsA9EHC3FA77Ow.png?w=640&#038;ssl=1\" alt=\"\"\/><\/figure>\n<\/div>\n\n\n<p id=\"4703\">\u4e0a\u9762\u6211\u5011\u5c07 PDF \u8cc7\u8a0a\u532f\u5165 DB \u4e2d\uff0c\u4e26\u4e14\u555f\u52d5\u4e86 LLM \u670d\u52d9\uff0c\u63a5\u4e0b\u4f86\u6211\u5011\u8981\u5c07\u6574\u500b RAG \u7684\u6b65\u9a5f\u4e32\u8d77\u4f86\uff1a<\/p>\n\n\n\n<ol class=\"wp-block-list\">\n<li>User \u767c\u4f86 QA<\/li>\n\n\n\n<li>\u5f9e DB \u4e2d Text Retrieval<\/li>\n\n\n\n<li>\u7d50\u5408 QA \u8207 Text Retrieval \u767c\u9001\u7d66 LLM<\/li>\n\n\n\n<li>LLM \u6839\u64da\u8cc7\u8a0a\u56de\u7b54<\/li>\n<\/ol>\n\n\n\n<p id=\"d347\">\u9996\u5148\u6211\u5011\u8981\u5148\u5275\u5efa Retriever\uff0c\u5b83\u53ef\u4ee5\u6839\u64da\u975e\u7d50\u69cb\u5316QA \u8fd4\u56de\u76f8\u61c9\u6587\u4ef6\uff0cLangChain \u63d0\u4f9b\u4e86\u5f88\u591a\u7a2e\u65b9\u5f0f\uff0c\u4e5f\u6574\u5408\u9032\u7b2c\u4e09\u65b9\u7684\u5de5\u5177\uff0c\u76ee\u524d\u6709\u5f88\u591a\u7814\u7a76\u5728\u63a2\u8a0e\u5982\u4f55\u6839\u64da QA \u627e\u5c0b\u5c0d\u61c9\u7684\u6587\u4ef6\u3002\u6211\u9019\u908a\u4f7f\u7528\u7684\u662f\u00a0Vectorstore\u00a0\u7684\u65b9\u5f0f\uff0c\u5176\u4ed6\u7a2e\u985e\u53ef\u4ee5\u53c3\u8003\u00a0Retrievers\u3002<\/p>\n\n\n\n<p id=\"9351\">\u63a5\u8457\u4f7f\u7528 RetrievalQA \u7d50\u5408 Retriever \u8207 QA \u8207 llm\uff0c\u6ce8\u610f&nbsp;<code>VectorDBQA<\/code>&nbsp;\u7684\u529f\u80fd\u5df2\u7d93\u88ab\u68c4\u7528\uff0c\u73fe\u5728\u90fd\u4f7f\u7528&nbsp;<code>RetrievalQA<\/code>\uff0c\u5982\u679c\u770b\u5230\u5225\u7684\u6587\u7ae0\u4f7f\u7528\u53ef\u4ee5\u6ce8\u610f\u3002<\/p>\n\n\n\n<pre class=\"wp-block-preformatted\">retriever = vectordb.as_retriever()<br><br>qa = RetrievalQA.from_chain_type(<br>    llm=llm, <br>    chain_type=\"stuff\", <br>    retriever=retriever, <br>    verbose=True<br>)<\/pre>\n\n\n\n<h1 class=\"wp-block-heading\" id=\"1c38\">\u6b65\u9a5f6. \u4f7f\u7528\u4f60\u7684 RAG<\/h1>\n\n\n\n<p id=\"a551\">\u5230\u9019\u88e1\u6211\u5011\u5c31\u4e32\u597d\u6574\u500b RAG \u7684\u6d41\u7a0b\uff0c\u63a5\u4e0b\u4f86\u6211\u5011\u4f86\u554f\u554f Alison Hawk \u7684\u8a0a\u606f(PDF \u7d00\u9304\u7684\u865b\u64ec\u4eba\u7269\u540d\u7a31)<\/p>\n\n\n\n<pre class=\"wp-block-preformatted\">query = \"Tell me about Alison Hawk's career and age\"<br>qa.invoke(query)<\/pre>\n\n\n<div class=\"wp-block-image\">\n<figure class=\"aligncenter\"><img data-recalc-dims=\"1\" decoding=\"async\" src=\"https:\/\/i0.wp.com\/miro.medium.com\/v2\/resize%3Afit%3A700\/1%2A4iAZeoLqBT6TA9wnZBB0EA.png?w=640&#038;ssl=1\" alt=\"\"\/><\/figure>\n<\/div>\n\n\n<p id=\"5839\">\u53ef\u4ee5\u770b\u5230 LLM \u6709\u62ff\u5230\u5f9e DB \u4e2d\u62ff\u5230\u6211\u5011\u4e0a\u50b3\u7684 Alison Hawk \u7684 PDF \u8a0a\u606f\uff0c\u4e26\u4e14\u5f97\u77e5\u5979\u662f\u4e00\u4f4d 28 \u6b72\u7684 researcher\u3002<\/p>\n\n\n\n<p>\u8cc7\u6599\u4f86\u6e90:<a href=\"https:\/\/medium.com\/@cch.chichieh\/rag%E5%AF%A6%E4%BD%9C%E6%95%99%E5%AD%B8-langchain-llama2-%E5%89%B5%E9%80%A0%E4%BD%A0%E7%9A%84%E5%80%8B%E4%BA%BAllm-d6838febf8c4\" data-type=\"link\" data-id=\"https:\/\/medium.com\/@cch.chichieh\/rag%E5%AF%A6%E4%BD%9C%E6%95%99%E5%AD%B8-langchain-llama2-%E5%89%B5%E9%80%A0%E4%BD%A0%E7%9A%84%E5%80%8B%E4%BA%BAllm-d6838febf8c4\">https:\/\/medium.com\/@cch.chichieh\/rag%E5%AF%A6%E4%BD%9C%E6%95%99%E5%AD%B8-langchain-llama2-%E5%89%B5%E9%80%A0%E4%BD%A0%E7%9A%84%E5%80%8B%E4%BA%BAllm-d6838febf8c4<\/a><\/p>\n","protected":false},"excerpt":{"rendered":"<p>2024-01-24 | ChiChieh Huang \u5728\u9019\u7bc7\u6587\u7ae0\u4e2d\uff0c\u6703\u5e36\u4f60\u4e00\u6b65\u4e00\u6b65\u67b6\u8a2d\u81ea\u5df1\u7684 RAG\uff08Retrieval-Augmented Generation\uff09\u7cfb\u7d71\uff0c\u8b93\u4f60\u53ef\u4ee5&hellip;<\/p>\n","protected":false},"author":9,"featured_media":5725,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"_jetpack_newsletter_access":"","_jetpack_dont_email_post_to_subs":false,"_jetpack_newsletter_tier_id":0,"_jetpack_memberships_contains_paywalled_content":false,"_jetpack_memberships_contains_paid_content":false,"footnotes":"","jetpack_post_was_ever_published":false},"categories":[579,4],"tags":[26,562,40],"class_list":["post-5724","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-579","category-industry-news","tag-ai","tag-rag","tag-40"],"gutentor_comment":0,"jetpack_featured_media_url":"https:\/\/i0.wp.com\/aict.nkust.edu.tw\/digitrans\/wp-content\/uploads\/2024\/04\/1_BKiZCXCL9A4_9dthYFBGeg.webp?fit=720%2C405&ssl=1","jetpack-related-posts":[],"jetpack_sharing_enabled":true,"_links":{"self":[{"href":"https:\/\/aict.nkust.edu.tw\/digitrans\/index.php?rest_route=\/wp\/v2\/posts\/5724","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/aict.nkust.edu.tw\/digitrans\/index.php?rest_route=\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/aict.nkust.edu.tw\/digitrans\/index.php?rest_route=\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/aict.nkust.edu.tw\/digitrans\/index.php?rest_route=\/wp\/v2\/users\/9"}],"replies":[{"embeddable":true,"href":"https:\/\/aict.nkust.edu.tw\/digitrans\/index.php?rest_route=%2Fwp%2Fv2%2Fcomments&post=5724"}],"version-history":[{"count":2,"href":"https:\/\/aict.nkust.edu.tw\/digitrans\/index.php?rest_route=\/wp\/v2\/posts\/5724\/revisions"}],"predecessor-version":[{"id":5731,"href":"https:\/\/aict.nkust.edu.tw\/digitrans\/index.php?rest_route=\/wp\/v2\/posts\/5724\/revisions\/5731"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/aict.nkust.edu.tw\/digitrans\/index.php?rest_route=\/wp\/v2\/media\/5725"}],"wp:attachment":[{"href":"https:\/\/aict.nkust.edu.tw\/digitrans\/index.php?rest_route=%2Fwp%2Fv2%2Fmedia&parent=5724"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/aict.nkust.edu.tw\/digitrans\/index.php?rest_route=%2Fwp%2Fv2%2Fcategories&post=5724"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/aict.nkust.edu.tw\/digitrans\/index.php?rest_route=%2Fwp%2Fv2%2Ftags&post=5724"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}