Haufe.ai | Quickstart

Get started with Copilot via API in minutes. This guide walks you through the basic flow of generating a response using Copilot via API.

Basic Flow

Conceptually, you will:

Retrieve available assistants.
Create a thread, which represents a conversation between a user and an assistant.
Attach one or more user messages to this thread.
Run the thread to generate an assistant message.
Optionally retrieve all messages in the thread.

For additional operations (e.g. deleting threads), refer to the API Reference.

Step 1: Retrieve Available Assistants

Copilot via API offers various assistants specialized in different domains such as Tax, HR, and Sustainability. Use this endpoint to retrieve all available assistants and their associated metadata structure.

List available assistants

curl --request GET \
  --url https://api.haufe.ai/agents/v1/assistants \
  --header 'api-key: <API_KEY>'

The response contains the list of available assistants and their metadata structures. For full data type definitions, refer to the API Reference.

Step 2: Create a Thread

To start a conversation with an assistant, create a new thread:

Create a thread

curl --request POST \
  --url https://api.haufe.ai/agents/v1/threads \
  --header 'content-type: application/json' \
  --header 'api-key: <API_KEY>' \
  --data '{
    "assistant_id": "<ASSISTANT_ID>",
    "user_id": "<USER_ID>",
    "title": "First Thread Example"
  }'

info

user_id is optional but useful if you need to track which end user created the thread. If provided, it must be a valid UUID.

The response contains the id of the created thread, referred to as thread_id in the following steps.

Step 3: Attach User Messages

After creating a thread, attach one or more user messages to it:

Create a message in the thread

curl --request POST \
  --url https://api.haufe.ai/agents/v1/threads/:thread_id/messages \
  --header 'content-type: application/json' \
  --header 'api-key: <API_KEY>' \
  --data '{
    "role": "user",
    "content": "<Your Message Content>"
  }'

Replace :thread_id with the id from the Create a thread response.
You can add multiple user messages by calling this endpoint multiple times.
Each message must include role (must be "user") and content (max 30,000 characters).
Only user messages are allowed. Attempting to add assistant messages manually will result in an error.

Step 4: Generate an Assistant Message (Run)

Trigger generation of an assistant response by running the thread:

Generate an answer

curl --request POST \
  --url https://api.haufe.ai/agents/v1/threads/:thread_id/run \
  --header 'content-type: application/json' \
  --header 'api-key: <API_KEY>' \
  --data '{
    "meta_data": {
      "user_data": {
        "licence": "<LicenseId>"
      }
    }
  }'

warning

The payload requires valid metadata, including a license ID under meta_data.user_data.licence. The value must be one of the allowed license IDs provided by your Haufe contact person. An invalid or mismatching license ID may degrade response quality but typically does not raise an error.

This endpoint shares rate limits with other answer-generation operations (/run, /run/stream). See Rate Limits.
To receive the response incrementally, use the streaming variant: POST /agents/v1/threads/:thread_id/run/stream.

The response contains an AssistantMessage with the generated answer and corresponding metadata.

Step 5: Retrieve Messages

You can retrieve all messages in a thread, including user and assistant messages:

Get all messages

curl --request GET \
  --url https://api.haufe.ai/agents/v1/threads/:thread_id/messages \
  --header 'api-key: <API_KEY>'

The endpoint is paginated and returns messages sorted by creation time (newest first).

Quickstart

Basic Flow

Step 1: Retrieve Available Assistants

Step 2: Create a Thread

Step 3: Attach User Messages

Step 4: Generate an Assistant Message (Run)

Step 5: Retrieve Messages

Next Steps

Threads vs Chat Completions

Error Handling

On this page