HTTP Handler
The Databend HTTP handler is a REST API that used to send query statement for execution on the server and to receive results back to the client.
The HTTP handler is hosted by databend-query, it can be specified by using --http_handler_host
and --http_handler_port
(This defaults to 8000).
HTTP Methods
Overview
This handler return results in pages
with long-polling.
- Start with A
POST
to/v1/query
with JSON of typeQueryRequest
which contains the SQL to execute, returns a JSON of typeQueryResponse
. - Use fields of
QueryResponse
for further processing:- A
GET
to thenext_uri
returns the nextpage
of query results. It returnsQueryResponse
too, processing it the same way recursively untilnext_uri
is nil. - A
GET
to thefinal_uri
finally after all results is fetched (next_uri = nil
) or the remaining is not needed. Return empty body. - (optional) A
GET
to thestats_uri
to get stats only at once (without long-polling), returnQueryResponse
with emptydata
field.
- A
Quick Example
curl -u root: --request POST '127.0.0.1:8001/v1/query/' --header 'Content-Type: application/json' --data-raw '{"sql": "SELECT avg(number) FROM numbers(100000000)"}'
the SQL will be run with default session and pagination settings, mainly:
- use a new one-off session with the
default
database. - each request wait for at most 1 second for results before return.
for more advanced configs, see the Reference below:
you are expected to get JSON like this (formatted):
{
"id": "3cd25ab7-c3a4-42ce-9e02-e1b354d91f06",
"session_id": "8d3a737d-2f6c-4df7-ba44-6dfc818255ce",
"session": {},
"schema": {
"fields": [
{
"name": "avg(number)",
"default_expr": null,
"data_type": {
"type": "Float64Type"
}
}
],
"metadata": {}
},
"data": [
[
49999999.5
]
],
"state": "Succeeded",
"error": null,
"stats": {
"scan_progress": {
"rows": 100000000,
"bytes": 800000000
},
"write_progress": {
"rows": 0,
"bytes": 0
},
"result_progress": {
"rows": 0,
"bytes": 0
},
"running_time_ms": 466.85395800000003
},
"stats_uri": "/v1/query/3cd25ab7-c3a4-42ce-9e02-e1b354d91f06",
"final_uri": "/v1/query/3cd25ab7-c3a4-42ce-9e02-e1b354d91f06/kill?delete=true",
"next_uri": null
"affect": null
}
Note:
- next_uri is null because all data is returned.
- client should call final_uri to tell the server the client has received the results and server can delete them.
Query Request
QueryRequest
field | type | Required | Default | description |
---|---|---|---|---|
sql | string | Yes | the sql to execute | |
session_id | string | No | used only when reuse server-side session | |
session | SessionState | No | ||
pagination | Pagination | No | a uniq query_id for this POST request | |
string_fields | bool | No | false | all field value in data is represented in string |
SessionState
field | type | Required | Default | description |
---|---|---|---|---|
database | string | No | "default" | set current_database |
keep_server_session_secs | int | No | 0 | secs the Session will be retain after the last query finished |
settings | map(string, string) | No | 0 |
OldSession
field | type | Required | Default | description |
---|---|---|---|---|
id | string | Yes | session_id from QueryResponse.session_id |
PaginationConf: critical conditions for each HTTP request to return (before all remaining result is ready to return)
field | type | Required | Default | description |
---|---|---|---|---|
wait_time_secs | i32 | No | 1 | long polling time |
Query Response
QueryResponse:
field | type | description |
---|---|---|
state | string | choices: "Running","Failed", "Succeeded" |
error | QueryError | error of the sql parsing or execution |
id | string | a uniq query_id for this POST request |
data | array | each item is a row of results |
schema | Schema | the schema of the results |
affect | Affect | the affect of some queries |
session_id | String | |
session | SessionState |
Schema:
field | type | description |
---|---|---|
fields | array | An ordered sequence of Field |
metadata | object | A map of key-value pairs containing additional meta data. |
Field:
field | type |
---|---|
name | string |
data_type | string |
nullable | bool |
Stats:
field | type | description |
---|---|---|
running_time_ms | float | million secs elapsed since query begin to execute internally, stop timing when query Finished (state != Running) |
scan_progress | QueryProgress | query scan progress |
Progress:
field | type |
---|---|
read_rows | int |
read_bytes | int |
Error:
field | type | description |
---|---|---|
stats | int | error code used inside databend |
message | string | error message |
backtrace | string |
Affect:
field | type | description |
---|---|---|
type | string | ChangeSetting/UseDB |
... | according to type |
Response Status Code
The usage of status code for different kinds of errors:
code | error |
---|---|
200 | if sql is invalid or failed, the detail is in the error field of the JSON |
404 | "query_id" or "page" not found |
400 | invalid request format |
Check the response body for error reason as a string when status code is not 200.
data format
json only support a few types, we commanded setting string_fields
to true
,
then all field value in data is represented in string,
client need to interpreter the values with the help of information in the schema filed.
session support (Optional)
client can config the session in the session
field
{
"sql": "select 1",
"session": {
"db": "db2",
"settings": {
"max_threads": "1"
}
}
}
all the values of settings should be string.
By default, each request has its own session inside server which is destroyed right after the SQL execution
finished, even before the result data is fetched by client.
To use SQLs like set
and use
which change context for the following SQLs, we need session support, there are 2 choices:
server-side session
For the first QueryRequest
which start a session: set QueryRequest.session.keep_server_session_secs
to positive int like 10.
remember the returned QueryResponse.session_id
.
{
"sql": "...;",
"session": {
"keep_server_session_secs": 100
}
}
For the following QueryRequest
which use the session:
use the QueryResponse.session_id
for QueryRequest.session.id
.
{
"sql": "...;",
"session_id": "<QueryResponse.session_id>"
}
client-side session
the handler will return info about changed setting or current db in the affect
field,
client can remember these changes and put them in the session field in the following requests.
when use client side session, response will contain session
field, which is session
field in request together with the affect applied to it.
Only use session
field in next_url, and when the state
field is "Succeeded"
set statement:
{
"sql": "set max_threads=1;",
"session": {
"database": "db1",
"settings": {
"max_threads": "6"
}
}
}
{
"affect": {
"type": "ChangeSetting",
"key": "max_threads",
"value": "1",
"is_global": false
},
"session": {
"database": "db1",
"settings": {
"max_threads": "1"
}
}
}
use statement:
{"sql": "use db2",
"session": {
"database": "db1",
"settings": {
"max_threads": "6"
}
}
}
{
"affect": {
"type": "UseDB",
"name": "db2"
},
"session": {
"database": "db2",
"settings": {
"max_threads": "1"
}
}
}