dev-resources.site
for different kinds of informations.
Chat With Your DB via DBChat & Gemini (Part 4)
Hi there! I'm Shrijith Venkatrama, the founder of Hexmos. Right now, Iām building LiveAPI, a super-convenient tool that simplifies engineering workflows by generating awesome API docs from your code in minutes.
In this tutorial series, I am on a journey to build for myself DBChat - a simple tool for using AI chat to explore and evolve databases.
See previous posts to get more context:
- Building DBChat - Explore and Evolve Your DB with Simple Chat (Part 1)
- DBChat: Getting a Toy REPL Going in Golang (Part 2)
- DBChat Part 3 - Configure , Connect & Dump Databases
Crafting A Working LLM Prompt To Generate SQL
In the previous posts we could start up a shell, connect to you a database, and get the schema context for further use.
Now the problem can be split it into three subtasks:
- Taking in arbitrary user requests
- Get schema definition
- Combine (1) and (2) into a coherent prompt
To handle arbitrary user requests, we want the "default" case in our command handling to do its magic. Essentially what we say is:
- By default we try to figure out if the words input by the user are indeed a command (or command variation)
- If not, we assume it to be a genuine user query
So now the default
handler looks like this:
default:
if h.queryHandler == nil {
return "Query handler not available. Check Gemini API key configuration."
}
return h.queryHandler.HandleQuery(h.db, line)
And in dbchat/cmd/dbchat/query
we build up a prompt like this:
const promptTemplate = `Consider yourself as an SQL and databases expert. Your task is to carefully look at a user's request in the context of a database schema, and provide structured answer to the request.
The output structure can have two components of a valid JSON like so:
{
"suggested_queries": [{"sql": "query 1", "explanation": ...}, {"sql": "query 2", "explanation": ...} ...],
"explanation": "explanation"
}
At least one of the components must be there in the answer, and both the components can also be there.
Suggested_queries must be in the order of expected execution. And each suggested SQL must also come with an explanation.
You must follow the above instructions thoroughly. The next sections will follow up with the user's request or query, and the database schema/context:
User Query:
{{.Query}}
Database Schema/Context:
{{.Schema}}
`
Learning to Interact with Gemini (with TOML configuration support)
The next step is to send the prompt to Gemini. Head over to Google's AI Studio and get a free API key.
Then we use some generated code from Google (mostly) to create some basic LLM functions:
func (g *GeminiClient) GetResponse(prompt string) (string, error) {
ctx := context.Background()
// Add explicit instruction for JSON format
prompt = prompt + "\nPlease provide your response in valid JSON format only, without any additional text or markdown formatting."
resp, err := g.session.SendMessage(ctx, genai.Text(prompt))
if err != nil {
return "", fmt.Errorf("error sending message to Gemini: %v", err)
}
var response string
for _, part := range resp.Candidates[0].Content.Parts {
response += fmt.Sprintf("%v", part)
}
// Clean up the response
cleanedResponse := cleanJSONResponse(response)
return cleanedResponse, nil
}
I updated the configuration format to include an llm
section, with gemini_key
value:
# DBChat Sample Configuration File
# Copy this file to ~/.dbchat.toml and modify as needed
[connections]
# Format: name = "connection_string"
local = "postgresql://postgres:postgres@localhost:5432/postgres"
liveapi = "postgresql://user:pwd@ip:5432/db_name"
[llm]
gemini_key = "the_key"
Intermediate demo
At this point, when we submit a user query, we should get some suggested SQL to try along with some explanations. Let's see how that works:
Executing Suggested Queries with a New Command: exec <n>
As we can see from previous section - now Gemini comes with queries we can execute in an array.
But executing arbitrary SQL can be dangerous when dealing with databases with precious data.
So I've decided to start a command exec <n>
. Essentially - given the list of suggestions, one can run any of them with the exec
command.
We use the tablewriter
library to beautify the results.
The code looks something like this:
func ExecuteQuery(db *sql.DB, query string) (*QueryResult, error) {
rows, err := db.Query(query)
if err != nil {
return nil, fmt.Errorf("error executing query: %v", err)
}
defer rows.Close()
// Get column names
columns, err := rows.Columns()
if err != nil {
return nil, fmt.Errorf("error getting columns: %v", err)
}
// Prepare result container
result := &QueryResult{
Columns: columns,
Rows: make([][]string, 0),
}
// Prepare value containers
values := make([]interface{}, len(columns))
valuePtrs := make([]interface{}, len(columns))
for i := range columns {
valuePtrs[i] = &values[i]
}
// Fetch rows
for rows.Next() {
err := rows.Scan(valuePtrs...)
if err != nil {
return nil, fmt.Errorf("error scanning row: %v", err)
}
// Convert values to strings
rowStrings := make([]string, len(columns))
for i, val := range values {
if val == nil {
rowStrings[i] = "NULL"
} else {
rowStrings[i] = fmt.Sprintf("%v", val)
}
}
result.Rows = append(result.Rows, rowStrings)
}
if err = rows.Err(); err != nil {
return nil, fmt.Errorf("error iterating rows: %v", err)
}
return result, nil
}
The Whole Experience - A Small Demo
With that - now we have all the components of a basic interaction with DBChat. You can:
- Connect to database
- Make a query/request in simple English
- Let Gemini suggests SQL to try
- Pick and execute any of the suggested queries
- See results in a nicely structured table
Take a look at the whole interaction in the following demo:
Next Steps
Now that we have a prototype working for DBChat end to end, many new exciting opportunities open up.
In part 1, our goal was to build a VSCode extension, which requires a good LSP support.
But then - it may also be a good idea to setup a build pipeline, and
get the team at Hexmos to try this out first, and collect some feedback.
I believe there is some prioritization work in order, before I can focus
on execution tasks. We will explore how to take DBChat forward in the
next post.
Featured ones: