我们需要下载一个 LangChain
官方提供的本地小数据库。
安装依赖
SQL:
https://raw.githubusercontent.com/lerocha/chinook-database/master/ChinookDatabase/DataSources/Chinook_Sqlite.sql
Shell:
pip install --upgrade --quiet langchain-core langchain-community langchain-openai
导入数据
我这里使用 Navicat
导入数据,你也可以通过别的方式导入(当然你有现成的数据库也可以,但是不要太大了,不然会消耗很多Token
)。
编写代码
这里我使用了 GPR 3.5 Turbo
,效果不理想的话可以试试GPT 4
或者 GPT 4 Turbo
from langchain_core.prompts import ChatPromptTemplate
from langchain_community.utilities import SQLDatabase
from langchain_core.output_parsers import StrOutputParser
from langchain_core.runnables import RunnablePassthrough
from langchain_openai import ChatOpenAI
template = """Based on the table schema below, write a SQL query that would answer the user's question:
{schema}
Question: {question}
SQL Query:"""
prompt = ChatPromptTemplate.from_template(template)
db = SQLDatabase.from_uri("sqlite:///./Chinook.db")
def get_schema(_):
return db.get_table_info()
def run_query(query):
return db.run(query)
model = ChatOpenAI(
model="gpt-3.5-turbo",
)
sql_response = (
RunnablePassthrough.assign(schema=get_schema)
| prompt
| model.bind(stop=["\nSQLResult:"])
| StrOutputParser()
)
message = sql_response.invoke({"question": "How many employees are there?"})
print(f"message: {message}")
运行结果
➜ python3 test08.py
message: SELECT COUNT(*) AS totalEmployees
FROM Employee;