Loading Semi-structured Formats
What is Semi-structured Data?
Semi-structured data uses tags or markers to separate semantic elements but does not conform to a rigid relational schema. Databend loads these formats with the COPY INTO command and can optionally transform the data on the fly during the load.
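As a rough illustration of the two loading styles mentioned above, the sketch below shows a plain COPY INTO load and an alternative load that transforms columns on the fly. The table `events`, its columns, and the stage path `@my_stage/events/` are hypothetical names chosen for this example; adjust the file format and options to match your own files.

```sql
-- Hypothetical target table; names and types are illustrative only.
CREATE TABLE events (
    id INT,
    name VARCHAR,
    created_at TIMESTAMP
);

-- Option 1: plain load. Columns in the staged Parquet files are
-- assumed to match the table's columns.
COPY INTO events
FROM @my_stage/events/
PATTERN = '.*[.]parquet'
FILE_FORMAT = (TYPE = PARQUET);

-- Option 2 (an alternative to option 1, not a follow-up step):
-- transform while loading by selecting from the stage.
-- The file column names (id, name, created_at) are assumptions.
COPY INTO events
FROM (
    SELECT t.id, upper(t.name), to_timestamp(t.created_at)
    FROM @my_stage/events/ t
)
PATTERN = '.*[.]parquet'
FILE_FORMAT = (TYPE = PARQUET);
```

The second form is useful when the file layout and the table layout differ, for example when a column needs a cast or only a subset of fields should be loaded. The per-format guides listed below cover the options specific to each file type.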
Supported File Formats
| File Format | Description | Guide |
|---|---|---|
| Parquet | Efficient columnar storage format | Loading Parquet |
| CSV | Comma-separated values | Loading CSV |
| TSV | Tab-separated values | Loading TSV |
| NDJSON | Newline-delimited JSON | Loading NDJSON |
| ORC | Optimized Row Columnar format | Loading ORC |
| Avro | Row-based format with schema definition | Loading Avro |