Loading Semi-structured Formats
What is Semi-structured Data?
Semi-structured data contains tags or markers to separate semantic elements while not conforming to rigid database structures. Databend efficiently loads these formats using the COPY INTO
command, with optional on-the-fly data transformation.
Supported File Formats
File Format | Description | Guide |
---|---|---|
Parquet | Efficient columnar storage format | Loading Parquet |
CSV | Comma-separated values | Loading CSV |
TSV | Tab-separated values | Loading TSV |
NDJSON | Newline-delimited JSON | Loading NDJSON |
ORC | Optimized Row Columnar format | Loading ORC |
Avro | Row-based format with schema definition | Loading Avro |