Data Integration
The Data Integration feature in Databend Cloud provides a visual, no-code interface for importing or synchronizing data from external systems into Databend. It is built on two key concepts: data sources and integration tasks.
Key Concepts
| Concept | Description |
|---|---|
| Data Sources | Reusable connection settings or credentials used to access external systems or send notifications, such as an AWS access key and secret key, a MySQL hostname, username, and password, or a FeiShu bot webhook. |
| Integration Tasks | Executable tasks that define where data comes from, which Databend table it is written to, which runtime parameters are used, and how the task is started and monitored. |
Data sources do not move data by themselves. They only store the information required to access external systems. Integration tasks are the units that actually perform imports, snapshots, and continuous synchronization.
Not every data source backs an ingestion task. For example, a FeiShuBot data source is used for notifications rather than for loading source data into Databend.
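The relationship between the two concepts can be sketched as a small data model. This is only an illustration, not a Databend Cloud API: the class names, field names, the `INGESTION_KINDS` set, and the example hostnames and table names are all assumptions. Data Integration itself is configured through the visual interface.

```python
from dataclasses import dataclass, field
from enum import Enum


class SourceKind(Enum):
    """Hypothetical labels for the data source kinds mentioned above."""
    AWS_S3 = "aws_s3"
    MYSQL = "mysql"
    FEISHU_BOT = "feishu_bot"


# Assumption for this sketch: only these kinds can feed an ingestion task.
# FeiShuBot is notification-only, so it is deliberately left out.
INGESTION_KINDS = {SourceKind.AWS_S3, SourceKind.MYSQL}


@dataclass(frozen=True)
class DataSource:
    """Reusable connection settings. Holds access details, moves no data."""
    name: str
    kind: SourceKind
    settings: dict = field(default_factory=dict)


@dataclass
class IntegrationTask:
    """The unit that performs the load: source, target table, parameters."""
    name: str
    source: DataSource
    target_table: str
    params: dict = field(default_factory=dict)

    def __post_init__(self):
        # Reject sources that exist only for notifications.
        if self.source.kind not in INGESTION_KINDS:
            raise ValueError(
                f"{self.source.kind.value} cannot back an ingestion task"
            )


# One data source reused by two tasks (all values are placeholders).
mysql_source = DataSource(
    name="orders-db",
    kind=SourceKind.MYSQL,
    settings={"host": "db.example.internal", "username": "reader"},
)
orders_task = IntegrationTask("sync-orders", mysql_source, "analytics.orders")
customers_task = IntegrationTask("sync-customers", mysql_source, "analytics.customers")
```

The point of the split is reuse: credentials are entered and tested once, and any number of tasks can refer to the same data source.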
Supported Integration Task Types
| Task Type | Description |
|---|---|
| Amazon S3 | Imports CSV, Parquet, or NDJSON files from Amazon S3 with support for one-time or continuous ingestion. |
| MySQL | Synchronizes table data from MySQL using Snapshot, CDC Only, or Snapshot + CDC modes. |
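As a rough mental model, the three MySQL modes can be read as combinations of two phases: an initial full copy (snapshot) and ongoing change capture (CDC). The sketch below encodes that reading. It reflects the usual meaning of these terms rather than documented Databend Cloud behavior, and the enum and function names are hypothetical.

```python
from enum import Enum


class MySQLSyncMode(Enum):
    """Mode names as listed in the task type table."""
    SNAPSHOT = "Snapshot"
    CDC_ONLY = "CDC Only"
    SNAPSHOT_CDC = "Snapshot + CDC"


def phases(mode: MySQLSyncMode) -> list[str]:
    """Return the phases a mode is assumed to run, in order."""
    if mode is MySQLSyncMode.SNAPSHOT:
        return ["snapshot"]          # one-time full copy of existing rows
    if mode is MySQLSyncMode.CDC_ONLY:
        return ["cdc"]               # only changes made after the task starts
    return ["snapshot", "cdc"]       # full copy first, then ongoing changes


def is_continuous(mode: MySQLSyncMode) -> bool:
    """A mode keeps running if it includes a change-capture phase."""
    return "cdc" in phases(mode)
```

Under this reading, Snapshot suits a one-off load, while the two CDC modes suit tables that must stay in sync with the source.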
Recommended Flow
- Create and test reusable connection settings on the Data Sources page.
- Review supported task types and their use cases on the Integration Tasks page.
- Read the task-specific guide to configure the source, preview the data, and set the target table.
- Use the Task Management page to start tasks, check status, and troubleshoot execution issues.