Skip to main content

Data Integration

The Data Integration feature in Databend Cloud provides a visual, no-code interface for importing or synchronizing data from external systems into Databend. The feature centers around two key concepts: data sources and integration tasks.

Key Concepts

ConceptDescription
Data SourcesReusable connection settings or credentials used to access external systems or send notifications, such as AWS Access Key / Secret Key, MySQL hostname / username / password, or a FeiShu bot webhook.
Integration TasksExecutable tasks that define where data comes from, which Databend table it is written to, which runtime parameters are used, and how the task is started and monitored.

Data sources do not move data by themselves. They only store the information required to access external systems. Integration tasks are the units that actually perform imports, snapshots, and continuous synchronization.

Not every data source corresponds to an ingestion task. For example, FeiShuBot is used for notifications rather than loading source data into Databend.

Supported Integration Task Types

Task TypeDescription
Amazon S3Imports CSV, Parquet, or NDJSON files from Amazon S3 with support for one-time or continuous ingestion.
MySQLSynchronizes table data from MySQL using Snapshot, CDC Only, or Snapshot + CDC modes.
  1. Create and test reusable connection settings on the Data Sources page.
  2. Review supported task types and their use cases on the Integration Tasks page.
  3. Read the task-specific guide to configure the source, preview the data, and set the target table.
  4. Use the Task Management page to start tasks, check status, and troubleshoot execution issues.

Video Tour

Try Databend Cloud for FREE

Multimodal, object-storage-native warehouse for BI, vectors, search, and geo.

Snowflake-compatible SQL with automatic scaling.

Sign up and get $200 in credits.

Try it today