Skip to main content

Integration Tasks

An integration task in Databend Cloud defines how data flows from a source into a target table in Databend. Each task references an existing data source and specifies source settings, a target warehouse, a target database / table, and runtime parameters that are specific to the task type.

Unlike data sources, integration tasks are the executable units that actually perform data movement and synchronization. Data sources store access settings, while tasks handle scheduling, ingestion, synchronization, stopping, resuming, and monitoring.

Supported Task Types

Task TypeDescription
Amazon S3Imports CSV, Parquet, or NDJSON files from Amazon S3 with support for one-time or continuous ingestion.
Amazon SQS (S3)Consumes S3 object creation events from an SQS queue and writes the corresponding object data into Databend.
MySQLSynchronizes table data from MySQL using Snapshot, CDC Only, or Snapshot + CDC.
PostgreSQLSynchronizes table data from PostgreSQL using Snapshot, CDC Only, or Snapshot + CDC.

Reading Guide

Recommended reading order:

  1. Start with Task Management to understand the task creation flow, start / stop behavior, status, and run history.
  2. Then read the task-specific guide for the source type you want to configure.

Task Type Differences

  • S3 tasks are designed for file import scenarios and mainly focus on file path patterns, file formats, and ingestion behavior.
  • SQS (S3) tasks are designed for S3 event-driven data ingestion and mainly focus on the SQS queue, S3 event filters, IAM Role, and target table.
  • MySQL and PostgreSQL tasks are designed for table synchronization scenarios and mainly focus on sync modes, primary keys, incremental capture, and archive scheduling.
Try Databend Cloud for FREE

Multimodal, object-storage-native warehouse for BI, vectors, search, and geo.

Snowflake-compatible SQL with automatic scaling.

Sign up and get $200 in credits.

Try it today