Hands-On Tutorial
In this tutorial, you will create a database and a data table in Databend Cloud, and import a sample data. Before getting started, please ensure that you have successfully registered and logged in to your Databend Cloud account. For detailed instructions, please refer to Activate Databend Cloud.
Step 1: Create a Database and a Table
Click on Worksheets in the sidebar, and then click on New Worksheet. You will see a page like this:
In the worksheet, we can write and execute SQL statements. All subsequent SQL operations in this tutorial will be executed through the workspace.
The default
next to the Run Script button refers to the default warehouse. You can also create more warehouses of different specifications on the Warehouses page according to your needs to adapt to different workloads.
Copy and paste the following SQL statement to the editor, then click Run Script to create a database:
CREATE DATABASE book_db;
Copy and paste the following SQL statement to the editor, then click Run Script to create a table:
CREATE TABLE books
(
title VARCHAR,
author VARCHAR,
date VARCHAR
);
After clicking Run Script, you can see a prompt indicating that the SQL statement was executed successfully. Afterwards, you can refresh the page to view the newly created database and table in the left-hand navigation pane.
Step 2: Load Data
You can choose any of the following methods to import data. Before starting, please click the link to download the sample file books.csv.
Load with Data Loading Wizard
- On the Home page, click Load Data:
- Click An existing table:
- Select the local sample file, then click Next:
- Select
book_db
as the target database andbooks
as the target table, then click Confirm.
Load with BendSQL
Databend Cloud offers a CLI tool called BendSQL. You can also use BendSQL in your data pipeline to automate data import. To install BendSQL, please refer to Using BendSQL.
- On the Home page, click Connect:
- You can obtain the connection information as shown in the following snapshot. If you do not see the password, click Reset DB Password to generate a new one:
- Open the command line console and execute the following command to log in to BendSQL:
bendsql connect -u cloudapp -p <password> -P443 --ssl -H <host-address> -d book_db
- Execute the following command to upload the sample file:
bendsql load csv -f ~/Downloads/books.csv -t book_db.books --skip-header 0
If you can see printed messages like this, it indicates that the data loading was successful:
-> Uploading ...
-> Inserting into table: `book_db`.`books` ...
-> Successfully loaded into table: `book_db`.`books` in 1.067s
Rows: 2, Bytes: 157
Load with COPY INTO
COPY INTO supports importing data from various data sources such as OSS, S3, Azure Blob, HTTPS URLs, etc.
Copy and paste the following SQL statement to the editor in your worksheet, then click Run Script:
COPY INTO book_db.books FROM 'https://datafuse-1253727613.cos.ap-hongkong.myqcloud.com/data/books.csv'
FILE_FORMAT = (type = CSV field_delimiter = ',' record_delimiter = '\n' skip_header = 0);
Step 3. Query Imported Data
Copy and paste the following SQL statement to the editor in your worksheet to get the number of books by different years:
SELECT count(*), date FROM book_db.books GROUP BY date;
You can also execute the same SQL statement with the BendSQL CLI tool:
╰─$ bendsql query
Connected with driver databend (DatabendQuery v0.9.44-nightly-72bde50(rust-1.68.0-nightly-2023-02-19T16:20:24.92751464Z))
Type "help" for help.
dd:cloudapp@tn3ftqihs--bl/book=> SELECT count(*), date FROM book_db.books GROUP BY date;
+----------+------+
| count(*) | date |
+----------+------+
| 1 | 1992 |
| 1 | 2004 |
+----------+------+
(2 rows)
Next Steps
Now that you have a basic understanding of how to use Databend Cloud, how can you use it to solve real-world data analysis needs? Databend Cloud provides a variety of solutions to address different scenario requirements. Please refer to the following documentation for more information: