Data Integration

The Data Integration feature in Databend Cloud provides a visual, no-code interface for importing, synchronizing, or consuming data from external systems into Databend. The feature centers around two key concepts: data sources and integration tasks.

Key Concepts

Concept	Description
Data Sources	Reusable connection settings or credentials used to access external systems or send notifications, such as AWS Access Key / Secret Key, MySQL hostname / username / password, SQS (S3) queue URL, Kafka broker addresses, or a FeiShu bot webhook.
Integration Tasks	Executable tasks that define where data comes from, where it is written or how results are saved, which runtime parameters are used, and how the task is started and monitored.

Data sources do not move data by themselves. They only store the information required to access external systems. Integration tasks are the units that actually perform imports, snapshots, continuous synchronization, or message consumption.

info

Running Data Integration tasks incurs service hosting fees, billed per second based on the actual running time of the service. For details, see Service Hosting Pricing.

Not every data source corresponds to an ingestion task. For example, FeiShuBot is used for notifications rather than loading source data into Databend.

Supported Integration Task Types

Task Type	Description
Amazon S3	Imports CSV, Parquet, or NDJSON files from Amazon S3 with support for one-time or continuous ingestion.
Amazon SQS (S3) (Beta)	Consumes S3 object creation events from an SQS queue and writes the corresponding object data into Platform.
MySQL	Synchronizes table data from MySQL using `Snapshot`, `CDC Only`, or `Snapshot + CDC` modes.
PostgreSQL	Synchronizes table data from PostgreSQL using `Snapshot`, `CDC Only`, or `Snapshot + CDC` modes.
Kafka Consumer Integration Task (Beta)	Continuously consumes messages from Kafka topics and saves the message content to internal object storage.

Recommended Flow

Create and test reusable connection settings on the Data Sources page.
Review supported task types and their use cases on the Integration Tasks page.
Read the task-specific guide to configure the source, preview the data, and configure the result location or result viewing method.
Use the Task Management page to start tasks, check status, and troubleshoot execution issues.

Data Integration

Key Concepts

Supported Integration Task Types

Recommended Flow

Video Tour

Join our growing community

GitHub

Slack

X(Twitter)

YouTube

Or simply contact us directly

Contact Us

Explore Databend Cloud

Try Databend Cloud for FREE

Key Concepts​

Supported Integration Task Types​

Recommended Flow​

Video Tour​

Join our growing community

GitHub

Slack

X(Twitter)

YouTube

Or simply contact us directly

Contact Us

Explore Databend Cloud

Try Databend Cloud for FREE

Key Concepts

Supported Integration Task Types

Recommended Flow

Video Tour