Worker Examples

Introduced: v1.3.0

This page provides comprehensive examples of using WORKER commands to manage UDF execution environments in Databend Cloud.

Basic Worker Lifecycle

1. Create a Worker

Create a basic worker for a UDF named read_env:

CREATE WORKER read_env;

Create a worker with IF NOT EXISTS to avoid errors:

CREATE WORKER IF NOT EXISTS read_env;

Create a worker with custom configuration:

CREATE WORKER read_env
    WITH size = 'small',
         auto_suspend = '300',
         auto_resume = 'true',
         max_cluster_count = '3',
         min_cluster_count = '1';

2. List Workers

View all workers in the current tenant:

SHOW WORKERS;

3. Modify Worker Settings

Change worker size and auto-suspend settings:

ALTER WORKER read_env SET size = 'medium', auto_suspend = '600';

Reset specific options to defaults:

ALTER WORKER read_env UNSET size, auto_suspend;

4. Manage Worker Tags

Add tags to categorize workers:

ALTER WORKER read_env SET TAG purpose = 'sandbox', owner = 'ci';

Remove tags when no longer needed:

ALTER WORKER read_env UNSET TAG purpose, owner;

5. Control Worker State

Suspend a worker (stop its execution environment):

ALTER WORKER read_env SUSPEND;

Resume a suspended worker:

ALTER WORKER read_env RESUME;

6. Remove a Worker

Remove a worker when no longer needed:

DROP WORKER read_env;

Safely remove a worker (no error if it doesn't exist):

DROP WORKER IF EXISTS read_env;

Advanced Examples

Worker for Different Environments

Create workers with environment-specific configurations, then tag them separately:

-- Development worker
CREATE WORKER dev_processor WITH
    size = 'small',
    auto_suspend = '60',
    auto_resume = 'true',
    max_cluster_count = '1',
    min_cluster_count = '1';

ALTER WORKER dev_processor SET TAG environment = 'development', purpose = 'testing';

-- Production worker
CREATE WORKER prod_processor WITH
    size = 'large',
    auto_suspend = '1800',
    auto_resume = 'true',
    max_cluster_count = '5',
    min_cluster_count = '2';

ALTER WORKER prod_processor SET TAG environment = 'production', team = 'data-engineering';

Dynamic Worker Management

Script to ensure a worker exists with specific configuration:

-- Create worker if it doesn't exist
CREATE WORKER IF NOT EXISTS my_worker WITH
    size = 'small',
    auto_suspend = '300';

-- Update tags
ALTER WORKER my_worker SET TAG
    environment = 'staging',
    owner = 'ci';

-- Tune options later
ALTER WORKER my_worker SET auto_resume = 'true', max_cluster_count = '2';

-- Show current configuration
SHOW WORKERS;

Best Practices

1. Naming Conventions

Use descriptive names that indicate the UDF's purpose
Include environment suffix (e.g., _dev, _prod, _staging)
Consider team/project prefixes for multi-team environments

2. Resource Sizing

Start with size='small' for development and testing
Use auto_suspend to save costs for infrequently used workers
Set appropriate min_cluster_count based on expected load

3. Tag Strategy

Use tags for cost allocation and resource tracking
Include environment, team, and project information
Add creation date and owner for audit purposes

4. Lifecycle Management

Use IF NOT EXISTS and IF EXISTS for idempotent scripts
Monitor worker usage with SHOW WORKERS
Clean up unused workers to reduce costs

Common Use Cases

1. UDF Development

-- Create a worker for UDF development
CREATE WORKER dev_transform WITH
    size = 'small',
    auto_suspend = '60';

ALTER WORKER dev_transform SET TAG environment = 'development', purpose = 'testing';

-- After UDF is developed and tested
ALTER WORKER dev_transform SET
    size = 'medium',
    auto_suspend = '300';

ALTER WORKER dev_transform SET TAG purpose = 'production-ready';

2. Batch Processing

-- Worker for nightly batch jobs
CREATE WORKER nightly_etl WITH
    size = 'large',
    auto_suspend = '3600',  -- Suspend after 1 hour of inactivity
    auto_resume = 'false';  -- Don't auto-resume (manual control)

ALTER WORKER nightly_etl SET TAG
    schedule = 'nightly',
    job_type = 'etl',
    criticality = 'high';

3. Multi-tenant Environments

-- Workers for different teams
CREATE WORKER team_a_processor WITH
    size = 'medium';

ALTER WORKER team_a_processor SET TAG team = 'team-a', billing_code = 'TA-2024';

CREATE WORKER team_b_processor WITH
    size = 'small';

ALTER WORKER team_b_processor SET TAG team = 'team-b', billing_code = 'TB-2024';

Troubleshooting

Worker Not Starting

If a worker doesn't start as expected:

Check if the UDF exists and is properly configured
Verify environment variables are set in the cloud console
Review the current worker metadata and resume it if needed:

-- Inspect current worker metadata
SHOW WORKERS;

-- Resume the worker
ALTER WORKER my_worker RESUME;

Permission Issues

Ensure you have the necessary privileges:

-- Check your privileges
SHOW GRANTS;

Resource Constraints

If experiencing performance issues:

-- Increase worker size
ALTER WORKER my_worker SET size = 'large';

-- Adjust cluster counts
ALTER WORKER my_worker SET
    max_cluster_count = '5',
    min_cluster_count = '2';

User-Defined Functions (UDFs) - Learn about creating and using UDFs
Warehouse Management - Manage compute resources for query execution
Workload Groups - Control resource allocation and priorities

Worker Examples

Basic Worker Lifecycle

1. Create a Worker

2. List Workers

3. Modify Worker Settings

4. Manage Worker Tags

5. Control Worker State

6. Remove a Worker

Advanced Examples

Worker for Different Environments

Dynamic Worker Management

Best Practices

1. Naming Conventions

2. Resource Sizing

3. Tag Strategy

4. Lifecycle Management

Common Use Cases

1. UDF Development

2. Batch Processing

3. Multi-tenant Environments

Troubleshooting

Worker Not Starting

Permission Issues

Resource Constraints

Join our growing community

GitHub

Slack

X(Twitter)

YouTube

Or simply contact us directly

Contact Us

Explore Databend Cloud

Try Databend Cloud for FREE

Basic Worker Lifecycle​

1. Create a Worker​

2. List Workers​

3. Modify Worker Settings​

4. Manage Worker Tags​

5. Control Worker State​

6. Remove a Worker​

Advanced Examples​

Worker for Different Environments​

Dynamic Worker Management​

Best Practices​

1. Naming Conventions​

2. Resource Sizing​

3. Tag Strategy​

4. Lifecycle Management​

Common Use Cases​

1. UDF Development​

2. Batch Processing​

3. Multi-tenant Environments​

Troubleshooting​

Worker Not Starting​

Permission Issues​

Resource Constraints​

Related Topics​

Join our growing community

GitHub

Slack

X(Twitter)

YouTube

Or simply contact us directly

Contact Us

Explore Databend Cloud

Try Databend Cloud for FREE

Basic Worker Lifecycle

1. Create a Worker

2. List Workers

3. Modify Worker Settings

4. Manage Worker Tags

5. Control Worker State

6. Remove a Worker

Advanced Examples

Worker for Different Environments

Dynamic Worker Management

Best Practices

1. Naming Conventions

2. Resource Sizing

3. Tag Strategy

4. Lifecycle Management

Common Use Cases

1. UDF Development

2. Batch Processing

3. Multi-tenant Environments

Troubleshooting

Worker Not Starting

Permission Issues

Resource Constraints

Related Topics