Skip to main content

Databend Releases

This page provides information about recent features, enhancements, and bug fixes for Databend.

Jan 2, 2025 (v1.2.680)

What's Changed

Exciting New Features ✨

  • feat(query): optimize min/max aggregation use accurate column stats by @sundy-li in #16270
  • feat: new table option data_retention_period_in_hours by @dantengsky in #16266
  • feat(query): add config udf_server_allow_insecure by @everpcpc in #16290
  • feat(query): Inverted index search function support options by @b41sh in #16256
  • feat(geo): add geography data type by @forsaken628 in #16286
  • feat(query): Support create/drop dictionary ddl. by @Winnie-Hong0927 in #16280
  • feat: add endpoint /v1/session/logout and record client session ID in meta. by @youngsofun in #16281
  • feat(query): ST_GEOGRAPHYFROMEWKT by @forsaken628 in #16302
  • feat: support temporary table by @SkyFan2002 in #16250
  • feat(query): Add jq set returning function for processing Variant data with jq filters by @maxjustus in #16288
  • feat: Add KVPbApi::get_id_and_value() by @drmingdrmer in #16325
  • feat: Check essential permissions when creating table with external location by @dantengsky in #16315
  • feat(query): add is_local to admin /v1/tables info by @everpcpc in #16329
  • feat(query): add is_external to /v1/tables info by @everpcpc in #16336
  • feat(query): add plan cache by @sundy-li in #16333
  • feat: login and renew return session_token_validity_in_secs too. by @youngsofun in #16313
  • feat(query): add discovery nodes api by @zhang2014 in #16353
  • feat: hilbert clustering by @zhyass in #16296
  • feat: add error stack to ErrorCode by @andylokandy in #16343
  • feat: support vacuum temporary table by @SkyFan2002 in #16364
  • feat: http handler support temp table. by @youngsofun in #16375
  • feat: batch size hint of stream consumption by @zhyass in #16372
  • feat(query): make external server parallel by batch by @sundy-li in #16390
  • feat: estimate selectivity by table sample by @xudong963 in #16362
  • feat: introduce an intermediate version that is compatible with vacuum2 by @SkyFan2002 in #16354
  • feat(query): Procedure (Part1) by @TCeason in #16348
  • feat: add limit push down rule by @xudong963 in #16403
  • feat: discovery_nodes not need auth. by @youngsofun in #16392
  • feat(query): Support SHOW VARIABLES by @TCeason in #16409
  • feat: compact after merge sort by @zhyass in #16401
  • feat(query): Support access Redis data from dictionaries via the dict_get function. by @Winnie-Hong0927 in #16389
  • feat: continue vacuum drop table on per-table cleanup failures by @SkyFan2002 in #16424
  • feat: Load DATABEND_ENTERPRISE_LICENSE_EMBEDDED while building by @Xuanwo in #16440
  • feat(query): Support access Mysql data from dictionaries via the dict_get function. by @Winnie-Hong0927 in #16444
  • feat: Auto compact(re-cluster) for multiple table insertion statement by @SkyFan2002 in #16443
  • feat(query): Procedure Part2 support arguments by @TCeason in #16453
  • feat(query): refactor window spill by @Dousir9 in #16448
  • feat(query): push rank limit into aggregate partial node by @sundy-li in #16466
  • feat: add eliminate union optimizer rule by @xudong963 in #16478
  • feat(query): parallel k way merge sort by @forsaken628 in #16340
  • feat: add fast path for cross join to optimize tpcds q23 (~50x) by @xudong963 in #16504
  • feat(catalog): ignore error when listing databases by @BohuTANG in #16514
  • feat: limit stage name to plain_identifier by @youngsofun in #16522
  • feat(query): Initial implementation of postgres-compatible number to_char function by @forsaken628 in #16507
  • feat(query): Procedure support drop if exists and create or replace by @TCeason in #16500
  • feat(query): DATE_ADD Functions to support week as a unit by @TracyZYJ in #16530
  • feat: http handler add X-DATABEND-VERSION in each response. by @youngsofun in #16518
  • feat: short sql length setting by @arkzuse in #16502
  • feat: unset table options by @dantengsky in #16544
  • feat(query): window partition by spill to disk by @forsaken628 in #16441
  • feat(query): Simplified version Pattern S/L/D/G for number to_char by @forsaken628 in #16569
  • feat(query): add json array lambda functions by @sundy-li in #16573
  • feat(query): support describe user statement by @b41sh in #16579
  • feat: copy option 'pattern' support variable. by @youngsofun in #16525
  • feat(query): support json array functions by @b41sh in #16571
  • feat(query): add date_diff function by @chagelo in #16560
  • feat(query): Reduce serialisation memory usage when spilling to local disk by @forsaken628 in #16580
  • feat(query): Support show dictionaries DQL. by @Winnie-Hong0927 in #16602
  • feat: support sampling table with block and row level simultaneously by @xudong963 in #16613
  • feat: add unload options overwrite, include_query_id and use_raw_path. by @youngsofun in #16614
  • feat(function): Add mode Aggregate function. by @Freejww in #16627
  • feat(query): Support use parquet format when spilling by @forsaken628 in #16612
  • feat(query): support json_object_insert function by @b41sh in #16636
  • feat: http handler support request forwarding. by @youngsofun in #16637
  • feat: Implemented a new sql explain analyze graphical by @Maricaya in #16543
  • feat: add deterministic block level sampling for small datasets by @xudong963 in #16670
  • feat: support explain decorrelated plan by @xudong963 in #16681
  • feat: add ARRAY_SIZE alias for ARRAY_LENGTH by @ding-young in #16685
  • feat(query): support json_object_delete and json_object_pick function by @b41sh in #16682
  • feat: Bump OpenDAL to 0.50.1 by @Xuanwo in #16661
  • feat: support map lambda functions by @Dragonliu2018 in #16683
  • feat(query): add api /v1/verify by @everpcpc in #16665
  • feat: support calculating table storage size(including time travel) by @SkyFan2002 in #16698
  • feat(query): add table function udf_echo && expose more metrics of udf client by @sundy-li in #16696
  • feat(query): Settings clause by @TCeason in #16669
  • feat: implement is_not_null selectivity based on null count in stats by @xudong963 in #16730
  • feat: HTTP handler use cookie to pass session id to and from clients. by @youngsofun in #16735
  • feat: impl opendal::HttpFetch for StorageHttpClient by @everpcpc in #16738
  • feat(query): function about convert_timezone #16177 by @florann in #16181
  • feat(functions): add new function: map_pick by @hanxuanliang in #15573
  • feat(metrics): new storage_http_requests_count by @everpcpc in #16769
  • feat(functions): add map_insert function by @hanxuanliang in #15567
  • feat: filter null value before join by @xudong963 in #16722
  • feat(query): Support rename dictionary ddl. by @Winnie-Hong0927 in #16754
  • feat(query): add last_day, previous_day, next_day function by @Freejww in #16740
  • feat: implement StringColumn using StringViewArray by @andylokandy in #16610
  • feat: Implement WAL-based RaftLog storage by @drmingdrmer in #16776
  • feat(query): expand trim/ltrim/rtrim by @TCeason in #16802
  • feat: support pushdown predicate into iceberg engine by @sundy-li in #16650
  • feat(query): TopN window operator by @forsaken628 in #16726
  • feat(query): support SHOW DROP DATABASES by @TCeason in #16811
  • feat: uuid function from v4 to v7 by @BohuTANG in #16827
  • feat: refine the rule PushDownFilterWindowTopN in case top n is equal to 0. by @forsaken628 in #16830
  • feat: add aws glue as an iceberg connection type by @Rowlandev in #16824
  • feat(query): add distributed pruning settings by @zhang2014 in #16864
  • feat: databend-meta adds more metrics about raft-log by @drmingdrmer in #16884
  • feat(query): speed up fetching redis data from dictionaries via mget. by @Dragonliu2018 in #16766
  • feat(query): add new function TO_TIMESTAMP<int, scale> by @TCeason in #16924
  • feat: disable specifying copy options when create stage. by @youngsofun in #16925
  • feat(doc): Introducing Databend Guru on Gurubase.io by @kursataktas in #16773
  • feat(query): Support geometry relation functions by @b41sh in #16927
  • feat(query): virtual column allow cast to other type by @b41sh in #16903
  • feat: copy support option COLUMN_MATCH_MODE by @youngsofun in #16963
  • feat(query): speed up fetching mysql data from dictionaries via batch processing. by @Dragonliu2018 in #16948
  • feat: Support USE CATALOG syntax and current_catalog() function by @flashmouse in #16926
  • feat(query): Support st_collect aggregate function by @b41sh in #16996
  • feat(query): Support flight_connection_max_retry_times and flight_connection_retry_interval setting by @b41sh in #16856
  • feat: show dropped table and data_retention_time_in_hours in fuse_tim… by @SkyFan2002 in #17015
  • feat(rbac): add GrantObject::Warehouse by @TCeason in #17029
  • feat(query): partition sort spill by @forsaken628 in #16987
  • feat: support vacuum leaked table data by @SkyFan2002 in #17022
  • feat(query): to_binary function support variant, bitmap, geometry, geography types by @b41sh in #17026
  • feat(query): add interval type by @TCeason in #16990
  • feat: add key prefix count assertions to transactions by @drmingdrmer in #17053
  • feat(query): null type infer as Nullabel(String) by default in create as statement by @TCeason in #17052
  • feat(query): support global network policy by @everpcpc in #17050
  • feat(query): add config network_policy_whitelist by @everpcpc in #17065
  • feat(query): pre-check cast type by @andylokandy in #16836
  • feat(functions): add jaro_winkler string similarity function by @maxjustus in #16993
  • feat(query): add config jwks_refresh_interval & jwks_refresh_timeout by @everpcpc in #17087
  • feat: databend-meta transaction support generic bool-expression and else-if chain by @drmingdrmer in #17064
  • feat(query): refactor vacuum temp files by @sundy-li in #17089
  • refactor: Serialize iceberg table inside engine options to reduce catalog requests by @Xuanwo in #17049
  • feat(query): support interval function by @TCeason in #17093
  • feat: new settings stream_consume_batch_size_hint by @dantengsky in #17102
  • feat(query): support javascript/python script User Defined Aggregate Function by @forsaken628 in #17054
  • feat: databend-meta watch API provides initial_flush field by @drmingdrmer in #17098
  • feat(query): support javascript/python script User Defined Aggregate Function by @forsaken628 in #17108
  • feat(query): support interval 'string' by @TCeason in #17105
  • feat(query): Add external_block_rows metrics by @b41sh in #17116
  • feat: support vacuum leaked table data (#17022)-Revert by @dantengsky in #17123

Thoughtful Bug Fix 🔧

  • fix: max_wait_time in http handler not work when result set is large … by @youngsofun in #16267
  • fix(query): enable distribute query for system log table by @zhang2014 in #16300
  • fix(query): fix child_has_window by @sundy-li in #16301
  • fix: uri path used as both path and root. by @youngsofun in #16321
  • fix: vacuum dry run should carry on when target objects are missing by @dantengsky in #16322
  • fix(query): fix session leak in ctas by @zhang2014 in #16337
  • fix: When Replacing column mask, old name->id->value should be completely deleted by @drmingdrmer in #16328
  • fix: reduce sample possibility to avoid flaky test by @xudong963 in #16330
  • fix(query): fix IFNULL by @sundy-li in #16339
  • fix(query): fix wrong result for join system tables in cluster mode by @zhang2014 in #16342
  • fix(query): fix try cast by @sundy-li in #16341
  • fix(query): fix binary default string by @sundy-li in #16345
  • fix: push filter down join with materialized cte by @xudong963 in #16349
  • fix(query): fix aggregate panic in cluster mode by @zhang2014 in #16319
  • fix(query): fix incorrect alias of select items by @sundy-li in #16368
  • fix: separate sessions for temporary tables with memory engine by @SkyFan2002 in #16374
  • fix(executor): enable backpressure for union all by @zhang2014 in #16388
  • fix: hilbert_index function panic by @zhyass in #16393
  • fix(iceberg): Handling options like "s3.region" by @Xuanwo in #16404
  • fix: non-default catalog should not call get_table_function by @Xuanwo in #16410
  • fix(query): fix stack overflow for optimizer by @zhang2014 in #16420
  • fix: add custom (de)serialization methods for special float value by @dqhl76 in #16258
  • fix: table lock flaky test by @zhyass in #16429
  • fix(query): The columns permission of the system library is misjudged. by @TCeason in #16431
  • fix: restore parser backrtace when map_res failed by @andylokandy in #16434
  • fix(iceberg): Table desc should allow nested db_name by @Xuanwo in #16414
  • fix: split if block too big during append by @zhyass in #16435
  • fix: flaky test 09_0036_merge_into_without_dist by @dantengsky in #16457
  • fix: wrong order by result by @xudong963 in #16422
  • fix: Multi-thread recreate view/table gives error by @zhyass in #16465
  • fix: log query detail and profile with identical format by @everpcpc in #16469
  • fix: revert log format breakage from #16249 by @andylokandy in #16470
  • fix: typo for query & profile logging config by @everpcpc in #16476
  • chore: add extra bracket for and and or to make explain clear by @xudong963 in #16494
  • fix(query): fix planner cache cause modify async function does not take effect by @b41sh in #16510
  • fix: ignore_result doesn't have an effect on union by @xudong963 in #16515
  • fix(query): fix copy into table with disable distributed copy into by @zhang2014 in #16529
  • fix(query): fix column leaf_index by @Dousir9 in #16537
  • fix: Only when all tables are included, the db can be dropped by @drmingdrmer in #16539
  • fix: move TableInfo building out of SchemaApi by @drmingdrmer in #16548
  • fix: identify failed vacuum db by id by @drmingdrmer in #16553
  • fix(query): fix json object compare cause panic by @b41sh in #16555
  • fix: direct io syscall not available on mac by @forsaken628 in #16574
  • fix: window in scalar subquery returns wrong results by @xudong963 in #16567
  • fix: avoid eliminating the left child of union in RuleEliminateUnion by @xudong963 in #16584
  • fix(query): fix inverted index panic caused by null value by @b41sh in #16576
  • fix(query): fix cannot obtain stack trace when segmentation fault by @zhang2014 in #16566
  • fix: merge into panic by @zhyass in #16581
  • fix(query): list_dictionaries should not ignore db_id by @TCeason in #16587
  • fix(iceberg): Make sure iceberg table has been loaded by @Xuanwo in #16593
  • fix: Disable prewhere for iceberg table for now by @Xuanwo in #16597
  • fix(query): fix left outer join wrong result by @Dousir9 in #16601
  • fix: subquery returns wrong results under cluster by @xudong963 in #16606
  • fix(query): ntile frame should be unbounded by @TCeason in #16604
  • fix: lock mgr may failed to release lock in time by @dantengsky in #16611
  • fix(query): support subquery in pivot by @Dragonliu2018 in #16631
  • fix(query): check_timestamp/date -> clamp_timestamp/date by @TCeason in #16634
  • fix: deadlock in table lock by @zhyass in #16632
  • fix(query): sort spilling use arrow file format painc by @forsaken628 in #16658
  • fix(query): create table need fail if storage format is invalid by @TCeason in #16663
  • fix(query): forbid explain explain statement by @TCeason in #16654
  • fix(storage): fix refresh virtual column using async api by @b41sh in #16656
  • fix(query): insert with long sql parse error by @sundy-li in #16678
  • fix(ci): flaky test by @zhyass in #16664
  • fix: insert long sql again by @sundy-li in #16684
  • fix(query): sort spilling may hang by @forsaken628 in #16672
  • fix: JEAllocator should properly handle nullptr returned by @dantengsky in #16692
  • fix(metrics): use f64::MAX as histogram bound to display +Inf properly by @flaneur2020 in #16701
  • fix(query): use custom connector for udf client by @everpcpc in #16697
  • fix: MetaGrpcClient deadlock when drop by @drmingdrmer in #16727
  • fix: date_add/sub need respect timezone by @TCeason in #16721
  • fix(query): remove single quotes for system.columns comment by @TCeason in #16732
  • fix: HTTP handle lost rows when result set is wide and large. by @youngsofun in #16748
  • fix: keep column statistics of all NULL column by @dantengsky in #16753
  • fix: Files::remove_file_in_batch should not swallow errors by @dantengsky in #16761
  • fix(query): add tls config for udf client by @everpcpc in #16782
  • fix: token and cookie session id miss match. by @youngsofun in #16786
  • fix: clustering_information panic if string domain is none by @zhyass in #16792
  • fix(query): fix broken file log layout by @zhang2014 in #16803
  • fix: re-ensure dir before upgrading by @drmingdrmer in #16805
  • fix: purge may not work on tables after a flash back operation by @SkyFan2002 in #16812
  • fix(storage): recluster endless loop by @zhyass in #16831
  • fix: incorrect table data disk cache key by @dantengsky in #16837
  • fix(query): fix group by with alias column can't bind the column by @b41sh in #16804
  • fix(cluster): fix the possibility of connection leak when the cluster state is broken. by @zhang2014 in #16842
  • fix(query): fix aggregator_groups_builder to work with string view by @sundy-li in #16843
  • fix(query): check storage request method with arg list-type by @everpcpc in #16849
  • fix: replace risky dma_buffer_as_vec implementations by @forsaken628 in #16829
  • fix(query): fix and check total_buffer_len and total_bytes_len by @sundy-li in #16854
  • fix: raft-log wont write when exceeding 256M by @drmingdrmer in #16876
  • fix(query): fix distinct set-returning function by @b41sh in #16883
  • fix(query): fix incorrect total_bytes_len in string view by @sundy-li in #16877
  • fix: StreamVersionMismatched when using CTAS in multi statement trans… by @SkyFan2002 in #16889
  • fix(ci): flaky test by @zhyass in #16898
  • fix(storge): write progress in compact hook by @zhyass in #16901
  • fix: ensure atomicity in create_lock_revision by @zhyass in #16907
  • fix: dropped temporary table should not be shown in system.temporary_tables by @SkyFan2002 in #16911
  • fix(query): join predict use cast_expr_to_non_null_boolean by @sundy-li in #16937
  • fix(ci): flaky test by @zhyass in #16933
  • fix(query): add_hours function may panic if the argument is too big by @TCeason in #16929
  • fix: lock attach table's snapshot location by @dantengsky in #16952
  • fix(ci): fix release dbg upload path by @zhang2014 in #16958
  • fix(query): make memory engine as non-local table by @sundy-li in #16955
  • fix: solve some inconsistencies between code and comments by @YichiZhang0613 in #16974
  • fix: fix segfault in list_domain by @forsaken628 in #16979
  • fix(query): keep remaining_predicates when filtering grouping sets by @sundy-li in #16971
  • fix(query): fix bug while mysql external dictionary table contains null values by @Dragonliu2018 in #16978
  • fix(sqlparser): fix order by followed by scalar by @chagelo in #16967
  • fix(query): group by item allow set returning functions by @b41sh in #16986
  • fix: http handler set cookie only when cookie_enabled=true. by @youngsofun in #16992
  • fix: need_sticky field was incorrectly set to true. by @youngsofun in #16997
  • fix(ci): fix benchmark ci failure by @zhang2014 in #17005
  • fix: insert and mutation progress by @zhyass in #17014
  • fix(query): reuse connection to fix dictionary mysql flaky test by @b41sh in #17016
  • fix(ci): flaky test by @zhyass in #17030
  • fix: filter push down union in r cte by @xudong963 in #17031
  • fix: drop channel if partition receiver finished by @dqhl76 in #17037
  • fix: handle trailing zeros in Raft log WAL with EXT4 writeback mode by @drmingdrmer in #17042
  • fix(query): fix read small parquet files in cluster mode by @sundy-li in #17063
  • fix(query): fix thread leak by @zhang2014 in #17077
  • fix(query): fix thread leak and remove all usage by @dqhl76 in #17081
  • fix: change tracking stream failed by @zhyass in #17072
  • fix(query): fix join on not null column cause panic by @b41sh in #17069
  • fix: create as query respect ddl_column_type_nullable by @TCeason in #17071
  • fix: check if the table option is valid according to the engine by @SkyFan2002 in #17076
  • fix(ci): flaky test by @zhyass in #17090
  • fix(query): double panic if broken state in block builder by @zhang2014 in #17091
  • fix(query): Correct sign extension handling in months_days_micros struct by @TCeason in #17086
  • fix(query): union all panic in mysql client by @zhyass in #17095
  • fix: name resolution bug in case-sensitive mode by @zhyass in #17097
  • fix(query): fix a number of bugs in the spill config. by @forsaken628 in #17103
  • fix(query): uri path should not be percent-encoded by @youngsofun in #17109
  • fix(query): fix default values of SpillConfig may not take effect by @forsaken628 in #17113
  • fix(query): table field data type in show create table incorrect by @forsaken628 in #17112
  • fix(query): fix udf script return type is arrow DataType::LargeList caused failed by @b41sh in #17127
  • fix: ignore case when matching function name by @notauserx in #16912
  • fix: analyze table contain keyword bug by @zhyass in #17131
  • fix(query): Disabled the shrink_scalar optimization for cast expressions during the resolve procedure args by @TCeason in #17119
  • fix(log): prefix_filter not working. by @youngsofun in #17136

Code Refactor 🎉

  • refactor: replace absolute expiration time with relative TTL in schema API by @drmingdrmer in #16268
  • refactor(query): remove lru cache meter by @zhang2014 in #16260
  • refactor: Use global http client to share the connection pool by @Xuanwo in #16276
  • refactor(query): refactor hash join spill by @Dousir9 in #15746
  • refactor: change DatabaseInfo.meta to SeqV<DatabaseMeta> by @drmingdrmer in #16295
  • refactor: change value of meta-service key DatabaseNameIdent from DatabaseId to Id<DatabaseId> by @drmingdrmer in #16299
  • refactor: adopt get_pb() for get_background_task by @drmingdrmer in #16308
  • refactor: Reduce parallelism when vacuuming dropped tables by @dantengsky in #16312
  • refactor(query): add setting external_server_request_retry_times by @sundy-li in #16307
  • refactor(query): refactor bind set returning functions by @b41sh in #16316
  • refactor(query): add config enable_meta_data_upgrade_json_to_pb_from_v307 by @TCeason in #16306
  • refactor: add associated type kvapi::Value::KeyType by @drmingdrmer in #16320
  • refactor: kv-app-errors does not need serde by @drmingdrmer in #16324
  • refactor(query): refactor bind async functions by @b41sh in #16331
  • refactor: Catalog Manager API improvements for create and drop operations by @drmingdrmer in #16338
  • refactor: Add BaseApi to abstract usually used meta-service access pattern by @drmingdrmer in #16351
  • refactor: simplify SchemaApi::list_indexes(), get_index() list_background_jobs() by @drmingdrmer in #16357
  • chore(query): optimize window sort by @sundy-li in #16355
  • chore(query): reduce hash join memory usage by @Dousir9 in #16359
  • refactor: simplify drop_index() with NameIdValueApi by @drmingdrmer in #16366
  • refactor: simplify dictionary API by @drmingdrmer in #16371
  • refactor: Add item count check to KVPbApi::get_pb_stream() by @drmingdrmer in #16377
  • refactor(query): optimize show drop tables by @TCeason in #16370
  • chore(query): Simplified sort simple rows implementation by @forsaken628 in #16382
  • refactor(query): refactor some codes of parquet2 by @sundy-li in #16386
  • refactor: remove all share related functions by @drmingdrmer in #16383
  • refactor: replace get_pb_value() with get_pb() by @drmingdrmer in #16395
  • refactor: Add NameValueApi, simplify virtual column APIs by @drmingdrmer in #16398
  • refactor: simplify catalog API by @drmingdrmer in #16406
  • refactor: Simplify table lock API. by @drmingdrmer in #16413
  • chore(query): introduce EmptyResultScan plan by @sundy-li in #16411
  • refactor: remove unused TableInfo.tenant by @drmingdrmer in #16416
  • refactor: simplify least-visible-time schema API, add LeastVisibleTimeIdent by @drmingdrmer in #16418
  • refactor: add mget_id_value_compat() by @drmingdrmer in #16425
  • refactor(query): first check privilege in SystemEngine get_full_data by @TCeason in #16421
  • chore(query): limit max concurrency of spill io requests by @zhang2014 in #16442
  • chore(query): make string to int respect behavior like PG by @sundy-li in #16428
  • refactor: simplify vacuum drop table handling by @drmingdrmer in #16446
  • chore: use logforth to replace fern by @andylokandy in #16249
  • refactor(executor): use event cause to refactor shuffle processor by @zhang2014 in #16445
  • refactor: cleanup logic to get tables to vacuum by @drmingdrmer in #16450
  • refactor: enhancements of HTTP client session. by @youngsofun in #16452
  • refactor(query): sort kernel optimization by @sundy-li in #16458
  • refactor(storage): improve inverted index read fst file first to reduce load index by @b41sh in #16385
  • refactor: refactor vacuum by @drmingdrmer in #16454
  • chore: simplify get_sequence() and get_sequence_next_value() by @drmingdrmer in #16463
  • refactor: add DatabaseMeta.gc_in_progress by @drmingdrmer in #16464
  • refactor(query): use pipeline exchange to refactor window scatter by @zhang2014 in #16471
  • refactor(query): simplify expression kernel by @sundy-li in #16482
  • refactor: add KVPbCrudApi as abstraction layer for commonly used CRUD operations by @drmingdrmer in #16479
  • refactor: add KVPbCrudApi::crud_try_insert() and crud_try_upsert() by @drmingdrmer in #16484
  • refactor: SchemaApi::list_databases() by @drmingdrmer in #16485
  • refactor: SchemaApi::get_database_history() by @drmingdrmer in #16487
  • refactor: test SchemaApi::get_tenant_history_database() by @drmingdrmer in #16488
  • refactor(query): refactor license manager by @zhang2014 in #16489
  • refactor(query): refactor license manager by @zhang2014 in #16492
  • refactor: simplify DroppedId. Replace manual comparison with string comparison by @drmingdrmer in #16495
  • refactor: update dependency datafusion-orc. by @youngsofun in #16491
  • refactor: get_tableinfos_by_ids() by @drmingdrmer in #16503
  • chore(query): refactor like function by @sundy-li in #16493
  • chore: simplify SchemaApi::truncate_table() by @drmingdrmer in #16506
  • refactor: remove get_pb_value() by @drmingdrmer in #16516
  • refactor: Output whole error message with source while retry by @Xuanwo in #16519
  • refactor: unify path of write segment by @SkyFan2002 in #16517
  • refactor: Improve gc_dropped_table_by_id() method by @drmingdrmer in #16528
  • refactor: dropped table listing and GC by @drmingdrmer in #16531
  • refactor: gc_dropped_db_by_id() by @drmingdrmer in #16538
  • refactor: SchemaAPI::do_get_table_history by @drmingdrmer in #16540
  • refactor: DroppedId for listing db/tables for gc by @drmingdrmer in #16542
  • refactor: get_history_tables_for_gc() should not return TableInfo, but just table name, id and values by @drmingdrmer in #16545
  • refactor(storage): improve inverted index match phrase query by @b41sh in #16547
  • refactor: Automatically chunk large key sets in KVPbApi methods by @drmingdrmer in #16561
  • refactor: simplify get_tables_history() by @drmingdrmer in #16572
  • refactor(iceberg): Use iceberg file IO instead of our own operator by @Xuanwo in #16577
  • refactor: add SchemaApi::get_table_in_db by @drmingdrmer in #16582
  • refactor: SchemaApi::get_table_meta_history() by @drmingdrmer in #16586
  • refactor: fix error message. by @youngsofun in #16578
  • refactor(storage): inverted index use merge io to read data by @b41sh in #16589
  • refactor: isolate session_id and session states of diff users. by @youngsofun in #16592
  • refactor: Unify the set and unset structures by @TCeason in #16622
  • refactor(query): make delta table support partition prunning by @sundy-li in #16621
  • refactor(storage): inverted index support calculate the score by @b41sh in #16609
  • refactor: some http APIs do not need sticky. by @youngsofun in #16644
  • refactor: delete user_procedure and refactor procedure interpreter by @TCeason in #16532
  • refactor(query): use arrow-flight to exchange datas by @sundy-li in #16657
  • refactor: refine cast variant to map by @andylokandy in #16691
  • refactor: streamline system.temp_files table scan by @dantengsky in #16659
  • refactor: Align versions for arrow, opendal, hyper, tonic, prost by @Xuanwo in #16704
  • refactor: refine column statistic map serialization by @dantengsky in #16728
  • refactor: MetaGrpcClient needs to specify timeout to auto-reconnect to restarted server by @drmingdrmer in #16733
  • refactor: replace direct GitHub dependencies with published crates.io versions by @drmingdrmer in #16700
  • refactor: Enforce all deps must be declared at root by @Xuanwo in #16741
  • refactor: Remove not used async-std by @Xuanwo in #16745
  • refactor: change blocking pruning process to non-blocking channel by @dqhl76 in #16718
  • refactor: ensure last-purged log-id cursor in exported from databend-meta service by @drmingdrmer in #16759
  • refactor: Consolidate MetaStorageError variants by @drmingdrmer in #16760
  • refactor(storage): refactor read virtual columns by @b41sh in #16758
  • refactor: remove location_prefix, which is no longer supported by @dantengsky in #16757
  • refactor(base): add stacktrace to replace backtrace by @zhang2014 in #16643
  • refactor: when upgrading, do not import already purged garbage log entries by @drmingdrmer in #16809
  • refactor: align metrics field names with current behavior by @drmingdrmer in #16826
  • refactor(base): support stacktrace in cluster mode by @zhang2014 in #16821
  • refactor: refactor cte binder and fix materialized cte used in subquery by @xudong963 in #16785
  • refactor: remove EmbeddedMeta by @drmingdrmer in #16847
  • refactor: remove unused sled related codes by @drmingdrmer in #16857
  • refactor: make ListKV non-blocking by returning stream directly by @drmingdrmer in #16868
  • refactor: optimize query system.tables when query single table by @TCeason in #16869
  • refactor(query): refactor geometry functions by @b41sh in #16870
  • refactor: use jiff crate replace chrono in date/timestamp func by @TCeason in #16787
  • refactor: add new raft-log metrics to "metactl status" response by @drmingdrmer in #16899
  • refactor: remove obsolete gRPC read API by @drmingdrmer in #16909
  • refactor: deprecate: Operation::AsIs and will be removed by @drmingdrmer in #16913
  • refactor(sqlsmith): refactor sqlsmith using http client by @b41sh in #16890
  • refactor: remove ValueRef and use Value instead by @andylokandy in #16891
  • refactor: make test_watch_expired_events() less sensitive to inaccurate time by @drmingdrmer in #16966
  • refactor: use temp table to refactor materialized cte by @xudong963 in #16900
  • refactor: set path of cookie. by @youngsofun in #17008
  • refactor: replace RangeMap with span-map for better range lookup by @drmingdrmer in #17006
  • refactor: revert pr #16869 by @TCeason in #17012
  • refactor: auto split large mget_database_names_by_ids into chunks with KVPbApi::get_pb_values_vec() by @drmingdrmer in #17011
  • refactor: mget db names and table names remove duplicate ids to avoid too many parameters by @b41sh in #17013
  • refactor: integrate fuse table block pruning into pipeline by @dqhl76 in #16841
  • refactor: optimize query system.tables when query single table (#16869) by @TCeason in #17021
  • refactor(cluster): refactor cluster for dyn cluster [part 1] by @zhang2014 in #17023
  • refactor(query): refactor udf server batch rows by @b41sh in #17092
  • refactor: set name for all runtime worker threads. by @youngsofun in #17084
  • refactor(query): refactor udf server connection reuse endpoint by @b41sh in #17106
  • refactor: simplify alter and drop cluster key logic by @zhyass in #17128
  • refactor: integrate segment pruning into pipeline by @dqhl76 in #17126

Build/Testing/CI Infra Changes 🔌

  • ci: Add iceberg tpch sqllogictests by @Xuanwo in #16598
  • ci: Enable incremental back to see how it works by @Xuanwo in #16817
  • ci: fix flaky test by @youngsofun in #16945
  • ci: ignore RUSTSEC-2024-0402. by @youngsofun in #16998

Documentation 📔

  • docs: add stream to readme by @BohuTANG in #16810

Others 📒

  • chore(query): add more snapshot/segment serde tests by @sundy-li in #16273
  • chore: Use tagged iceberg crates by @Xuanwo in #16278
  • chore: add a setting to decide if table sample is deterministic by @xudong963 in #16275
  • chore(ci): tmp disable sharing test by @everpcpc in #16285
  • chore(ci): flaky test by @zhyass in #16207
  • chore: replace condition and if_then with txn in txn methods by @drmingdrmer in #16289
  • chore(query): bump ast 0.0.3 by @sundy-li in #16283
  • chore(query): avoid create query context each auth by @zhang2014 in #16279
  • chore(query): remove unused parquet2 codes by @sundy-li in #16282
  • chore: add setting auto_compaction_segments_limit by @zhyass in #16298
  • chore: remove the log about if histogram exists by @xudong963 in #16303
  • chore: simplify create_data_mask by @drmingdrmer in #16309
  • chore(ci): adjust release time by @everpcpc in #16334
  • chore(query): explain ast work like explain syntax by @sundy-li in #16335
  • chore: support temporary table with memory engine by @SkyFan2002 in #16332
  • chore(query): improve project set by @Dousir9 in #16326
  • chore: remove unused serde impl by @drmingdrmer in #16347
  • chore: remove empty reply struct by @drmingdrmer in #16350
  • chore(query): fix window index out of range by @Dousir9 in #16352
  • chore: separate test run for temp_table suite by @SkyFan2002 in #16358
  • chore(code): update the version of gix-path in Cargo.lock by @Dousir9 in #16365
  • chore(ci): add internal test by @TCeason in #16369
  • chore: improve Debug and Display for DataId and TIdent by @drmingdrmer in #16379
  • chore: add logging for embedded meta test by @drmingdrmer in #16384
  • chore(ci): fix cargo clippy by @TCeason in #16397
  • chore: add error stack to cmd entrence by @andylokandy in #16391
  • chore(query): add external server request profile by @zhang2014 in #16400
  • chore(query): refactor udf retry by @sundy-li in #16405
  • chore(query): add connect duration metric for udf by @zhang2014 in #16407
  • chore: add join type for explain analyze partial by @xudong963 in #16408
  • chore: bump quinn-proto to 0.11.8 for cargo audit pass by @dqhl76 in #16419
  • chore(planner): improve infer filter by @Dousir9 in #16361
  • chore: log remote addr when meta-service raft API recevies a message by @drmingdrmer in #16438
  • chore(query): add column: system.tables.total_columns by @TCeason in #16447
  • chore(ci): fix dataset domain by @everpcpc in #16455
  • chore(ci): release with musl by @everpcpc in #16460
  • chore(executor): support for data exchange between different processors within the pipeline by @zhang2014 in #16467
  • chore(executor): more edge detail for executing graph by @zhang2014 in #16468
  • chore(ci): remove hdfs release by @everpcpc in #16477
  • chore(query): null first behavior by @sundy-li in #16475
  • chore: update nom-rule by @andylokandy in #16483
  • chore: bump proj4rs 0.1.4 by @b41sh in #16474
  • chore: remove logs from block writer by @dantengsky in #16497
  • chore: simplify get_gc_table_info() by @drmingdrmer in #16499
  • chore(query): replace to datafuse-extras/dtparse by @TCeason in #16509
  • chore(storage): hook recluster avoid scan all segments by @zhyass in #16498
  • chore: update source repo for async-compression by @everpcpc in #16513
  • chore(query): move recursion & backtrace crate to datafuse-extras by @zhang2014 in #16520
  • chore(query): add more asserts by @sundy-li in #16536
  • chore: update repo for workflows by @everpcpc in #16546
  • chore(storage): refactor compact source by @zhyass in #16527
  • chore(query): SHOW/DESC is not DDL by @TCeason in #16583
  • chore: check lock revision before watch delete by @zhyass in #16590
  • chore(ci): flaky test by @zhyass in #16596
  • chore(query): improve variable parser by @sundy-li in #16599
  • chore: refine table sample tests by @xudong963 in #16600
  • chore(query): add more logs on aggregation by @sundy-li in #16552
  • chore: reset recluster block size by @zhyass in #16623
  • chore(query): add http address info to cluster node info by @zhang2014 in #16620
  • chore: drop guard when acquire lock failed by @zhyass in #16616
  • chore: revert "lock mgr may failed to release lock in time (#16611)" by @dantengsky in #16630
  • chore(query): reduce unsafe codes in kernels by @sundy-li in #16633
  • chore(query): add warn log if vacuum temporary file is error by @zhang2014 in #16646
  • chore: resolve todo in dynamic sample by @xudong963 in #16629
  • chore(query): enable loser tree merge sort default by @forsaken628 in #16649
  • chore(query): replace arrow2 with arrow-rs in serialize_column and deserialize_column by @forsaken628 in #16655
  • chore(query): remove hash join build detail status by @zhang2014 in #16674
  • chore: upon startup, meta-service output human-readable error if trying to open future version on-disk data by @drmingdrmer in #16679
  • chore(query): modify rbac test result display by @TCeason in #16686
  • chore: adjust default value of setting max_storage_io_requests by @SkyFan2002 in #16680
  • chore(query): returns nan for aggregate_kurtosis and aggregate_skewness by @sundy-li in #16688
  • chore: display port number on error from bind to port failure by @lyang24 in #16695
  • chore: remove SortColumnDescription.is_nullable by @forsaken628 in #16699
  • chore: remove transaction keyword by @SkyFan2002 in #16707
  • chore: add timeout for detect_region operation to prevent hanging request by @dantengsky in #16709
  • chore(query): add current_query_id and database id for admin api by @zhang2014 in #16703
  • chore(query): index scalar support variant by @sundy-li in #16719
  • chore: display log.file.limit and prefix_filter by @TCeason in #16713
  • chore: avoid returning error during computing binary comparison selectivity by @xudong963 in #16725
  • chore: Simplifying BlockMetaInfo Definitions by @forsaken628 in #16716
  • chore: tweak cargo feature "jemalloc" by @dantengsky in #16693
  • chore: remove flaky test by @TCeason in #16739
  • chore: refine vacuum by @dantengsky in #16743
  • chore: Remove not used dep config by @Xuanwo in #16746
  • chore(query): Introduce VecExt and SliceExt to check unsafe bounds in debug mode by @sundy-li in #16747
  • chore: make VACUUM TEMPORARY FILES killable by @dantengsky in #16751
  • chore: tweak batch deletion concurrency by @dantengsky in #16770
  • chore: refine raft-types import path by @drmingdrmer in #16764
  • chore: remove obsolete version by @everpcpc in #16789
  • chore: add snapshot location cache for attached table by @dantengsky in #16795
  • chore: update meta-service version info and compatibility doc by @drmingdrmer in #16800
  • chore(query): embedded public key of license by @zhang2014 in #16813
  • chore(query): refactor common/arrow removes arrow/io crates by @sundy-li in #16808
  • chore: purge dropped view metadata during vacuum by @dantengsky in #16819
  • chore(query): keep queries detail and profile log format by @zhang2014 in #16822
  • chore(ci): test with different license by @everpcpc in #16815
  • chore(base): rollback to kernel signal handler after dump backtrace by @zhang2014 in #16828
  • chore: remove unused codes by @zhyass in #16834
  • chore: fix typo in ColumnCacheKeyBuilder by @dantengsky in #16838
  • chore(cluster): disable parallel commit of cluster tasks by @zhang2014 in #16851
  • chore(query): remove SliceExt by @sundy-li in #16853
  • chore(ci): separate build profile by @everpcpc in #16855
  • chore(query): remove useless new line for query log by @zhang2014 in #16860
  • chore: add some log by @SkyFan2002 in #16863
  • chore(query): improve distributed runtime filter by @Dousir9 in #16862
  • chore: enable stderr logging for table_meta_inspector by @dantengsky in #16865
  • chore(query): remove common/arrow crate by @sundy-li in #16846
  • chore(ci): adjust runner size by @everpcpc in #16879
  • chore(settings): enable spill by @Dousir9 in #16888
  • chore: remove unused config_id from RaftConfig by @drmingdrmer in #16902
  • chore(ci): flaky test by @zhyass in #16917
  • chore(ci): enable overflow-checks in ci profile by @sundy-li in #16895
  • chore(ci): flaky test by @zhyass in #16919
  • chore(query): add snapshot logs in read partitions by @sundy-li in #16918
  • chore: bump rustls version 0.23.18 by @TCeason in #16936
  • chore(query): spilt binary symbol by @zhang2014 in #16861
  • chore: k way merge sort add boundary check to prevent process crash by @forsaken628 in #16934
  • chore(ci): remove aarch64 musl in release by @zhang2014 in #16946
  • chore(planner): improve cardinality estimation by @Dousir9 in #16938
  • chore(planner): refine project set cardinality by @Dousir9 in #16944
  • chore: support recursive cte with normal cte by @xudong963 in #16932
  • chore(binder): add enable_materialized_cte settings by @Dousir9 in #16950
  • chore(storage): hook compact avoid scan all segments by @zhyass in #16954
  • chore(storage): do compact before recluster during compact hook by @zhyass in #16949
  • chore(ci): upgrade bendsql by @everpcpc in #16965
  • chore(storage): refine error message for stream read offset snapshot by @zhyass in #16964
  • chore: bump ruzstd from 0.7.2 to 0.7.3 by @dqhl76 in #16969
  • chore(ci): install bendsql from packages by @everpcpc in #16970
  • chore(planner): improve physical join by @Dousir9 in #16962
  • chore(ci): fix some flaky test by @TCeason in #16980
  • chore(query): improve decimal aggregate state by @sundy-li in #16983
  • chore(planner): refine physical join by @Dousir9 in #16988
  • chore: adjust fuse_time_travel_size() by @SkyFan2002 in #16995
  • chore(ci): use new dsn for create database by @zhang2014 in #16968
  • chore(ci): rust toolchain 2024-11-28 build image by @sundy-li in #17001
  • chore(query): fix topk in native by @sundy-li in #17003
  • chore(query): bump up rust toolchain to 2024-11-28 by @sundy-li in #16981
  • chore: tweak attach table refreshing by @dantengsky in #17019
  • chore(query): remove old aggregate codes by @sundy-li in #17025
  • chore: add more test to meta-service txn value-matching by @drmingdrmer in #17033
  • chore(cluster): disable parallel commit for cluster task by @zhang2014 in #17034
  • chore: improve error handling in partition sending loop by @dqhl76 in #17035
  • chore(ci): remove musl release by @everpcpc in #17041
  • chore(query): improve comparison functions by @sundy-li in #17032
  • chore(binder): fix bind materialized cte by @Dousir9 in #17046
  • chore: refmt crash log message by @TCeason in #17059
  • chore: bump dtparse by @TCeason in #17073
  • chore: remove unused dev by @zhyass in #17088
  • chore(query): Revert "feat(query): support javascript/python script User Defined Aggregate Function" by @sundy-li in #17107
  • chore: add more log by @SkyFan2002 in #17110
  • chore(planner): fix except and intersect operation by @Dousir9 in #17111
  • chore(query): add fuzz tests by @sundy-li in #17100
  • chore(query): bump federated mysql version by @TCeason in #17118
  • chore(query): reduce-functions-code-expand by @sundy-li in #17114
  • chore(query): improve hook vacuum temp after query by @sundy-li in #17125
  • chore(query): bump enum-as-inner 0.6 by @forsaken628 in #17120
  • chore: rename default_cluster_key_id to cluster_key_seq by @zhyass in #17135
  • chore(query): fix udf metrics by @sundy-li in #17134

New Contributors

  • @arkzuse made their first contribution in #16502
  • @chagelo made their first contribution in #16560
  • @Dragonliu2018 made their first contribution in #16631
  • @Maricaya made their first contribution in #16543
  • @ding-young made their first contribution in #16685
  • @lyang24 made their first contribution in #16695
  • @florann made their first contribution in #16181
  • @Rowlandev made their first contribution in #16824
  • @kursataktas made their first contribution in #16773
  • @flashmouse made their first contribution in #16926
  • @notauserx made their first contribution in #16912

Full Changelog: https://github.com/databendlabs/databend/compare/v1.2.615...v1.2.680

Aug 19, 2024 (v1.2.615)

What's Changed

Exciting New Features ✨

  • feat: support sample for table scan by @xudong963 in #16216
  • feat(meta): Support create dictionary meta api by @Winnie-Hong0927 in #16178
  • feat: auth by refresh and session tokens. by @youngsofun in #16220
  • feat: http handler support set variable. by @youngsofun in #16239
  • feat(query): support json_object_agg function by @b41sh in #16238
  • feat: support block level table sample by @xudong963 in #16237
  • feat(query): add identifier function from session variable by @sundy-li in #16264

Thoughtful Bug Fix 🔧

  • fix(query): cume_dist() always return unbounded frame by @TCeason in #16242
  • fix(query): fix alter user set MUST_CHANGE_PASSWORD by @b41sh in #16243

Code Refactor 🎉

  • refactor(query): remove cache solt by @zhang2014 in #16233
  • refactor: optimize share table performance by @lichuang in #16218
  • refactor(query): don't retry transform on each block by @sundy-li in #16235
  • refactor(metactl): change to subcommands & support transfer-leader by @everpcpc in #16254
  • refactor(deps): Bump OpenDAL and other crates by @Xuanwo in #16251
  • refactor(query): remove named cache by @zhang2014 in #16245
  • refactor(query): decrease information_schema.tables call union by @TCeason in #16246
  • refactor: introduce new method accurate_columns_ranges to Table by @dantengsky in #16247

Others 📒

  • chore: remove admin api for testing for old version SM by @drmingdrmer in #16236
  • chore: disable table info refreshing for ai_to_sql by @dantengsky in #16240
  • chore: impl Display for LicenseInfo and JWTClaims<T> by @drmingdrmer in #16244
  • chore: update command for metactl in tests by @everpcpc in #16263
  • chore: more logs for attached table refreshing by @dantengsky in #16261
  • chore: refactoring some functions by @b41sh in #16265
  • chore: comment out fake_time test by @dantengsky in #16269
  • chore(query): revert #16192 by @Dousir9 in #16271

Full Changelog: https://github.com/databendlabs/databend/compare/v1.2.609...v1.2.615

Aug 13, 2024 (v1.2.609)

What's Changed

Exciting New Features ✨

  • feat: support recursive cte used in multiple places by @xudong963 in #15835
  • feat(query): support distinct from for hash join by @Dousir9 in #15838
  • feat: Implement Default for CatalogInfo by @Xuanwo in #15844
  • feat(query): enable udf in udf call by @sundy-li in #15839
  • feat(query): support row fetch for merge into by @Dousir9 in #15859
  • feat: support filling uuid when loading data and uuid is missing/empty in some rows. by @youngsofun in #15871
  • feat: stream consuming for copy into LOCATION from stmt by @dantengsky in #15882
  • feat: publish databend-common-ast to crates.io by @andylokandy in #15905
  • feat: Persist Meta-Service State Machine on Disk by @drmingdrmer in #15772
  • feat: failsafe tool fuse_amend by @dantengsky in #15929
  • feat: refactor create share endpoint ddl by @lichuang in #15937
  • feat: add built-in udfs in config by @BohuTANG in #15938
  • feat(planner): support pruning columns for merge into when insert only by @Dousir9 in #15948
  • feat: clickhouse handler no longer support insert with format. by @youngsofun in #15952
  • feat(query): Array lambda function support outer scope columns by @b41sh in #15957
  • feat: support real-time retrieval of profiles from admin API (part 1) by @dqhl76 in #15958
  • feat: refactor create database from share ddl by @lichuang in #15950
  • feat: pretty print backtrace by @andylokandy in #15913
  • feat: allow tuple field name to contain '_'. by @youngsofun in #15965
  • feat(query): alter user support modify current password by @TCeason in #15962
  • feat: update sort algorithm using loser tree for multi sort merge by @forsaken628 in #15869
  • feat: orc support missing fields. by @youngsofun in #15970
  • feat: orc add option missing_field_as. by @youngsofun in #15974
  • feat(executor): support error profiling for cluster mode by @zhang2014 in #15969
  • feat(query): implement HAVERSINE by @kkk25641463 in #15971
  • feat(query): implement st_length/st_distance by @kkk25641463 in #15982
  • feat: config option for syncing disk cache data by @dantengsky in #15984
  • feat: add tenant&queryid to udf client by @BohuTANG in #15987
  • feat: Reducing column.clone overhead in transform aggregate by introducing InputColumns by @forsaken628 in #15991
  • feat: support real-time retrieval of profiles from admin API (part 2) by @dqhl76 in #15975
  • feat: parquet add option missing_field_as. by @youngsofun in #15993
  • feat: http handler limit body size of each response to 10MB. by @youngsofun in #15960
  • feat: add NameResolutionSuggest to enhancement table name case sensitive error by @forsaken628 in #15889
  • feat: refactor share spec location and format by @lichuang in #15989
  • feat(query): add setting sort_spilling_batch_bytes by @sundy-li in #16019
  • feat: change nan/inf to NaN/Infinity in http handler results. by @youngsofun in #16017
  • feat: pretty print backtrace on panic by @andylokandy in #16024
  • feat(ast): add TokenKind::{Get, Put} by @andylokandy in #16020
  • feat(query): add new setting enable_dst_hour_fix by @TCeason in #16022
  • feat: Change 'inf' to 'Infinity' when unloading CSV/TSV by @youngsofun in #16028
  • feat(query): ST_CONTAINS by @kkk25641463 in #15994
  • refactor: Refactor aggregate hashtable, replace &[Column] with InputColumns by @forsaken628 in #16038
  • feat(query): ST_SETSRID/ST_NPOINTS by @kkk25641463 in #16035
  • feat: new table function set_cache_capacity by @dantengsky in #16016
  • feat: support must change password option for create user by @b41sh in #16031
  • feat: snapshots generated in multi txn has same prev_snapshot_id by @SkyFan2002 in #16044
  • feat(query): add option [ IGNORE | RESPECT NULLS ] functionality for window rank function(first_value, last_value, nth_value) by @TCeason in #15919
  • feat: add function name to udf client header by @everpcpc in #16053
  • feat: back compatibility of old share db by @lichuang in #16056
  • feat: log file with local time by @everpcpc in #16064
  • feat: add share integration test by @lichuang in #16051
  • feat(query): impl LocalShuffle by @Freejww in #16055
  • feat(query): add setting enable_strict_datetime_parser by @TCeason in #16067
  • feat: improve selectivity accuracy by @xudong963 in #16069
  • feat: optimize loser tree, disable peek top2 by @forsaken628 in #15979
  • feat(storage): add table function clustering_statistics by @zhyass in #16081
  • feat: limit auto cast when loading from column store. by @youngsofun in #16082
  • feat(query): PartitionsShuffleKind::ConsistentHash support by @forsaken628 in #16094
  • feat(http): expose queries queue length to /v1/status by @flaneur2020 in #16112
  • feat(storage): refactor recluster by @zhyass in #16070
  • feat(query): support json null for nullable response by @everpcpc in #16120
  • feat(query): hash partitioning in window by @Freejww in #16090
  • feat: generate accurate histogram by @xudong963 in #16018
  • feat(query): add sql variable support by @sundy-li in #16134
  • feat(query): support parse create dictionary stmt by @Winnie-Hong0927 in #16137
  • feat(storage): refactor compact by @zhyass in #16119
  • feat: add histogram info to fuse_statistic table by @xudong963 in #16141
  • feat(query): add user function admin api by @zhang2014 in #16146
  • feat: add is_attach column to system.tables and disable index creation for attache tables by @dantengsky in #16166
  • feat(planner): unify execution of DML statements (MERGE, UPDATE, DELETE) by @Dousir9 in #16060
  • feat(query): Support json_array_agg function by @b41sh in #16169
  • feat: add share catalog by @lichuang in #16172
  • feat: explain analyze output partial infos to debug cardinality estimator by @xudong963 in #16185
  • feat: transfer leader command for meta-service by @drmingdrmer in #16198
  • feat: meta-service transfer_leader: add response and default value by @drmingdrmer in #16201
  • feat: allow casting int to uint for string functions by @andylokandy in #16210
  • feat(query): Support materialized view match algorithm by @b41sh in #16023
  • feat: grant view to share by @lichuang in #16186
  • feat(query): ST_TRANSFORM by @kkk25641463 in #15992
  • feat: introduce new setting enable_last_snapshot_location_hint by @dantengsky in #16226

Thoughtful Bug Fix 🔧

  • fix(query): list user stage not check privilege by @TCeason in #15800
  • fix: fix sequence used in function calls by @lichuang in #15814
  • fix: multi-tbl insert with lateral flattern panics by @dantengsky in #15822
  • fix: uuid() as default value result in copy error. by @youngsofun in #15837
  • fix: function uuid() should be non_deterministic. by @youngsofun in #15840
  • fix: stream illegal after source rename by @zhyass in #15843
  • fix: parse comment in raw value by @andylokandy in #15834
  • fix(query): to_timestmap should always return err if parse err by @TCeason in #15850
  • fix(query): decimal div overflow by @sundy-li in #15856
  • fix(query): fix sum rewriter by @sundy-li in #15870
  • fix: merge into unresolved conflict by @zhyass in #15884
  • fix(query): window function out of bounds access panic by @shamb0 in #15860
  • fix: stream error after alter table by @zhyass in #15907
  • fix(query): fix expr stack overflow [part 1] by @zhang2014 in #15790
  • fix(jwks): refresh the jwks store on key id not found by @flaneur2020 in #15921
  • fix(query): fix cluster mode stack overflow by @zhang2014 in #15927
  • fix: remove VisitorWithParent, add more delete/update by subquery test by @lichuang in #15526
  • fix: no-recursive cte reference itself by @xudong963 in #15941
  • fix(query): fix stackoverflow for ci release mode by @zhang2014 in #15944
  • fix(query): window panic when max_block_size is small by @TCeason in #15949
  • fix: when upgrade snapshot v002 to v003, put kvs after expire index by @drmingdrmer in #15947
  • fix(meta): fix reserved number err by @TCeason in #15955
  • fix(storage): stream show columns maybe error by @zhyass in #15964
  • fix: update orc-rust to fix error when read columns of array<float>. by @youngsofun in #15968
  • fix(query): fix tuple inner fields name with quotes by @b41sh in #15973
  • fix(query): udf function support lambda arguments by @b41sh in #15981
  • fix: wrong arrow schema when fuse engine read parquet by @SkyFan2002 in #15997
  • fix(query): fix drop table column with quotes by @b41sh in #16006
  • fix(query): fix incorrect fast_memcmp function by @sundy-li in #16008
  • fix(query): coalesce continue loop when arg is null by @TCeason in #16002
  • fix(query): unify udf allow list validator by @sundy-li in #16012
  • fix: arrow schema in parquet reader by @SkyFan2002 in #16004
  • fix(query): fix create table as query with null type by @b41sh in #16041
  • fix(query): fix ambiguous time by @TCeason in #16046
  • fix: output format missing '-' for '-Infinity'. by @youngsofun in #16052
  • fix(query): if role not store ownership skip revoke ownership from role by @TCeason in #16047
  • fix(query): pad information_schema.views.view_definition by @TCeason in #16065
  • fix: bitmap with empty buffer stands for empty. by @youngsofun in #16066
  • fix(query): if timestamp is some, no need to init parsed by @TCeason in #16078
  • fix(query): fix read decimal by @sundy-li in #16080
  • fix(query): fix session manager dead lock if call instance status by @zhang2014 in #16088
  • fix: fix native decompress binary offsets out of bounds by @b41sh in #16085
  • fix(query): fix incorrect derive_stats for window and limit plan by @sundy-li in #16087
  • fix: wrong prev snapshot id by @SkyFan2002 in #16092
  • fix: error message in data. by @youngsofun in #16089
  • fix(storage): recluster maybe endless loop by @zhyass in #16096
  • fix(query): fix subquery with function failed by @b41sh in #16103
  • fix(query): cast string to decimal respect numeric_cast_option setting by @sundy-li in #16101
  • fix: tenant tables with stream by @zhyass in #16115
  • fix(query): fix stackoverflow in collect_statistics by @zhang2014 in #16123
  • fix(query): revert window hash partition by @Freejww in #16129
  • fix(query): fix window hash partition optimization by @Freejww in #16136
  • fix: deadlock with export() and build_snapshot() by @drmingdrmer in #16138
  • fix: vacuum dropped table in parallel by @dantengsky in #16139
  • fix: drop attached table that can no longer reach table data by @dantengsky in #16125
  • fix(query): revert find_eq_and_or_filter by @TCeason in #16148
  • fix(query): fix add column default value is indeterministic expression by @b41sh in #16153
  • fix(query): fix in operator convert to subquery cast failed by @b41sh in #16159
  • fix: explain analyze lack of statistics by @dqhl76 in #16162
  • fix: meta-service: data inconsistency risk by @drmingdrmer in #16175
  • fix(query): reduce get table meta call by @TCeason in #16170
  • fix: properly ignore column statistic from meta v2 if string contains non-utf8 by @andylokandy in #16180
  • fix(query): support optimize or filter in system.tables by @TCeason in #16165
  • fix(query): fix wasm udf runtime create with code by @b41sh in #16191
  • fix: skip column statistics entirely if deserialization failed by @andylokandy in #16192
  • fix(query): to_year/monday... should respect dst by @TCeason in #16195
  • fix(query): rounding decimal multiply results by @sundy-li in #16196
  • fix: backward compatibility of table statistics by @dantengsky in #16200
  • fix(query): remove deserialize_cluster_stats by @sundy-li in #16204
  • fix(query): fix index out of bounds with constant expr by @zhang2014 in #16206
  • fix(query): cannot use fully qualified names with views by @TCeason in #16223
  • fix(query): fix unexpected panic message by @zhang2014 in #16221
  • fix: block_entry.memory_size by @forsaken628 in #16230

Code Refactor 🎉

  • refactor: move databend-meta binaries to separate dir by @drmingdrmer in #15803
  • refactor: adaptive entries count for AppendEntriesRequest by @drmingdrmer in #15805
  • refactor: add meta-meta compat test by @drmingdrmer in #15823
  • refactor: Migrate catalog info into table info by @Xuanwo in #15857
  • refactor(query): refactor merge into pipeline by @Freejww in #15891
  • refactor(query): remove async for query binder by @zhang2014 in #15900
  • refactor: CatalogInfo has been stored in TableInfo by @Xuanwo in #15902
  • refactor: Make CatalogCreator Accept Arc<CatalogInfo> by @Xuanwo in #15906
  • refactor: simplify adding transform to pipeline by @dantengsky in #15873
  • refactor: improve modulo predicate selectivity by @xudong963 in #15917
  • refactor: optimize gen_columns_statistics() for scalar. by @youngsofun in #15909
  • refactor: optimize building bloom index for scalar. by @youngsofun in #15910
  • refactor: attach table read only by default by @dantengsky in #15922
  • refactor(cluster): refactor flight actions and add flight secret by @zhang2014 in #15930
  • refactor: speed up ColumnBuilder::repeat(Scalar::Null). by @youngsofun in #15939
  • refactor: Bump arrow to 52 by @Xuanwo in #15943
  • refactor: fuse statistics table function by @xudong963 in #15954
  • refactor: optimize OrcChunkReader. by @youngsofun in #15967
  • refactor: Use iceberg-rust to replace icelake by @Xuanwo in #15951
  • refactor: unify transaction related code by @SkyFan2002 in #15966
  • refactor: rename metrics & tweak system.caches table by @dantengsky in #15996
  • refactor: upgrade QuotaMgr to using protobuf by @forsaken628 in #15858
  • refactor(query): use shuffle on distributed merge into by @Freejww in #15946
  • refactor: add history record for queries_profiling table by @dqhl76 in #16014
  • refactor: optimize take_by_slices_limit_from_blocks by @forsaken628 in #15978
  • refactor(query): add crash handler for databend-query by @zhang2014 in #16054
  • refactor: Simplify HashMethod.build_keys_state function signature by replacing &[(Column, DataType)] with InputColumns by @forsaken628 in #16050
  • refactor: meta-service wait at most 1 second to shutdown by @drmingdrmer in #16104
  • refactor(query): Optimize aggregate function arg_min_max by @forsaken628 in #16109
  • refactor(query): If table_id is specified, use it directly by @TCeason in #16098
  • refactor: upgrade to Openraft-0.10.0-alpha.2 by @drmingdrmer in #16091
  • refactor: Bump OpenDAL to 0.48.0 by @Xuanwo in #16147
  • refactor: tweak table_statistics of trait Table by @dantengsky in #16152
  • refactor: add retry logic for flight service by @dqhl76 in #16097
  • refactor: refector ShareMeta struct by @lichuang in #16100
  • refactor(query): Optimize udf js runtime to avoid lock blocking by @b41sh in #16174
  • refactor(query): if projection not query stat/owner will not get table_stats/ownership when access system.tables by @TCeason in #16183
  • refactor: improve network error handling. by @drmingdrmer in #16194
  • refactor(query): remove useless meter by @zhang2014 in #16202
  • refactor(query): remove useless hash builder by @zhang2014 in #16203
  • chore: remove unused minmax by @zhyass in #16197
  • refactor: display unix timestamp for human by @drmingdrmer in #16212
  • refactor: support unset session|global by @TCeason in #16214
  • refactor: Use native-tls as default by @Xuanwo in #16199
  • refactor: fuse table funcs by @dantengsky in #16149
  • refactor(query): use bigint to handle the fallback of decimal op overflow by @sundy-li in #16215
  • refactor: CrudMgr::update should return only when the value is changed by @drmingdrmer in #16225

Build/Testing/CI Infra Changes 🔌

  • ci: code reuse and get raid of streaming load. by @youngsofun in #15811
  • ci: test load parquet unloaded by databend. by @youngsofun in #15813
  • ci: longer timeout for test_stateless_standalone. by @youngsofun in #15936
  • ci: Build runner for nightly-2024-07-02 by @Xuanwo in #16033

Others 📒

  • chore: recluster final ignore error by @zhyass in #15815
  • chore: reduce recluster depth threshold by @zhyass in #15819
  • chore(query): remove redundant code in merge into by @Freejww in #15812
  • chore: normalize compat test script names by @drmingdrmer in #15817
  • chore: fix scripts in meta-meta compatibility test by @drmingdrmer in #15826
  • chore(ci): add reporter for meta chaos test by @everpcpc in #15829
  • chore: bump arrow-udf by @sundy-li in #15836
  • chore(query): remove addRowNumber if source is physical table by @Freejww in #15828
  • chore(query): show view just display view name by @TCeason in #15841
  • chore: Fix build of arrow-udf by @Xuanwo in #15846
  • chore: change meta chaos io delay param by @lichuang in #15852
  • chore(ci): fix duplicate failure artifact name by @everpcpc in #15854
  • chore: disable recursive cte test for mysql by @xudong963 in #15853
  • chore(query): add retry logs by @sundy-li in #15862
  • chore: refactor name and match expression by @lichuang in #15863
  • chore: update logcall by @andylokandy in #15831
  • chore: install gdb for query service image by @everpcpc in #15875
  • chore(query): string_to_date/ts should not ignore error by @TCeason in #15878
  • chore: fix recursive cte hang by @xudong963 in #15883
  • chore(executor): avoid lock poisoning for pipeline executor by @zhang2014 in #15887
  • chore: add log to show if enable distributed optimization by @xudong963 in #15890
  • chore: print the select plan of insert query by @xudong963 in #15892
  • chore(query): improve error msg by @sundy-li in #15874
  • chore: replace patched codespan-reporting with rspack-codespan-reporting by @andylokandy in #15894
  • chore(query): udf allow list only judge host match by @sundy-li in #15896
  • chore: change meta chaos io delay param by @lichuang in #15897
  • chore: Bump minitrace for new opentelementry support by @Xuanwo in #15700
  • chore(ci): switch runner to aws by @everpcpc in #15903
  • chore: improve spiller buffer by @xudong963 in #15904
  • chore: copy into default value using RemoteExpr by @b41sh in #15893
  • chore: update domain for datasets & benchmark by @everpcpc in #15920
  • chore: remove open sharding binary and test by @lichuang in #15923
  • chore: add test case of issue 15791 by @lichuang in #15931
  • chore(storage): limit compact num when recluster by @zhyass in #15926
  • chore: fix param typo in function process_left_or_full_join_null_block by @lichuang in #15933
  • chore(planner): refactor join equi conditions by @Dousir9 in #15924
  • chore(planner): refine join_order_changed check in merge into by @Dousir9 in #15934
  • chore: remove streaming_load api. by @youngsofun in #15935
  • chore(query): readd some tests by @sundy-li in #15940
  • chore: add issue case for pr15941 by @xudong963 in #15945
  • chore(query): enable none lazy pruner in lazy read by @sundy-li in #15942
  • chore(docs): update domain for databend.rs by @everpcpc in #15959
  • chore: Bump hive_metastore to 0.1.0 by @Xuanwo in #15976
  • chore: move display_ident() to ast by @andylokandy in #15980
  • chore(query): show user functions should display built-in udfs by @TCeason in #15990
  • chore(query): fix cluster ci failure if set license by @zhang2014 in #15999
  • chore(ci): add license for cluster test by @everpcpc in #15995
  • chore: update nom-rule by @andylokandy in #16009
  • chore: orc and parquet use option missing_field_as. by @youngsofun in #16007
  • chore: add repo link in databend-common-ast lib by @lewiszlw in #16010
  • chore(query): disallow agg/window/udfcall in insert expr by @sundy-li in #16015
  • chore: remove unused ShareEndpointManager and ShareTableConfig by @lichuang in #16013
  • chore: add errors column and more tests for system.queries_profiling table by @dqhl76 in #16029
  • chore(query): add full expr in error message by @sundy-li in #16032
  • chore: temporarily disable a flaky test by @dqhl76 in #16042
  • chore: add privilege checking for fuse_amend by @dantengsky in #16045
  • chore: add node column to table system.malloc_stats_totals by @dantengsky in #16043
  • chore(query): enable distributed merge into by default by @Dousir9 in #16059
  • chore(ci): setup license with mask by @everpcpc in #16061
  • chore(query): information_schema.tables support display view engine by @TCeason in #16058
  • chore: fix share test integration missing query binary by @lichuang in #16071
  • chore: Bump jsonb version 0.4.1 by @b41sh in #16073
  • chore(query): better error msg in check_function by @sundy-li in #16075
  • chore(query): fix merge into virtual computed field by @Dousir9 in #16086
  • chore: update rust toolchain to nightly-2024-07-02 by @andylokandy in #16026
  • chore(code): update the version of object_store in Cargo.lock by @Dousir9 in #16110
  • chore(query): disable inner columns in prewhere by @Dousir9 in #16108
  • chore(query): add sigaltstack for signal handler by @zhang2014 in #16122
  • chore(query): limit frames size for capture backtrace by @zhang2014 in #16118
  • chore(query): remove useless box processor impl by @zhang2014 in #16128
  • chore: resolve lints by @andylokandy in #16127
  • chore: fake time test by @dantengsky in #16130
  • chore: resolve lints (part 2) by @andylokandy in #16133
  • chore(http): exposes max_running_query_executed_time in /v1/status by @flaneur2020 in #16131
  • chore(query): optimize tables query speed by @TCeason in #16144
  • chore(ci): add buildkitd config & fix typos by @everpcpc in #16151
  • chore: reset table_lock_expire_secs default value by @zhyass in #16154
  • chore(query): move udf admin api to management mode by @zhang2014 in #16155
  • chore: improve parser by @andylokandy in #16156
  • chore: reduce the sleep range during lock holder by @zhyass in #16158
  • chore(ci): split internal test sql by @TCeason in #16167
  • chore(query): revert 16097 by @zhang2014 in #16171
  • chore(ci): reset table lock expire in sqllogic by @zhyass in #16160
  • chore(query): skip empty block for udf script by @zhang2014 in #16182
  • chore(ci): disable musl build by @everpcpc in #16205
  • chore: refine TimedFutre wrapper by @drmingdrmer in #16211
  • chore(query): change stream table is_local to false by @Dousir9 in #16208
  • chore(ci): try fix main ci err by @TCeason in #16224
  • chore: refine meta-service by @drmingdrmer in #16229
  • chore(planner): fix distributed merge into by @Dousir9 in #16228

New Contributors

  • @lewiszlw made their first contribution in #16010

Full Changelog: https://github.com/databendlabs/databend/compare/v1.2.530...v1.2.609

Jun 16, 2024 (v1.2.530)

What's Changed

Exciting New Features ✨

  • feat(query): implement ST_XMAX by @kkk25641463 in #15406
  • feat(executor): record query profiling for standalone error query by @zhang2014 in #15425
  • feat(query): add external_server_request_batch_rows setting by @sundy-li in #15440
  • feat(function): Implement map_size function by @forsaken628 in #15444
  • feat(query): add created_time column to processes table by @TCeason in #15448
  • feat(function): Implement map_cat function by @shamb0 in #15348
  • feat: support adjust query's priority by @dqhl76 in #15352
  • feat(executor): record errors info into profiling log by @zhang2014 in #15472
  • feat: optimize vacuum drop table, execute in parallel by @lichuang in #15478
  • feat: http handler add endpoint /v1/login/. by @youngsofun in #15483
  • feat(functions): add map_contains_key scalar function by @hanxuanliang in #15465
  • feat(query): build image limit python 3.12.2 by @sundy-li in #15501
  • feat(query): ensure git is installed in pyenv by @sundy-li in #15505
  • feat(query): pyenv 3.12.2 by @sundy-li in #15512
  • feat(query): support unset session setting by @TCeason in #15510
  • feat: add version metrics for meta & query by @everpcpc in #15522
  • feat: support explain and profile multi table insert statemennt by @SkyFan2002 in #15518
  • feat(query): add query query_hash and query_parameterized_hash to query log by @TCeason in #15531
  • feat: add meta key_num metrics by @lichuang in #15536
  • feat(query): bump arrow-udf version to enhance types supported by JS UDFs by @maxjustus in #15529
  • feat: resolve conflict when multi table insert commit by @SkyFan2002 in #15503
  • feat(query): udf script support load from stage location by @sundy-li in #15542
  • feat(metasrv): add metrics for snapshot key numbers by @everpcpc in #15545
  • feat: select support consume stream by @zhyass in #15541
  • feat(query): show users support display roles by @TCeason in #15544
  • feat: remove XML input format. by @youngsofun in #15540
  • feat(query): create table support add inverted index by @b41sh in #15547
  • feat: fuzzy table data disk cache key reload by @dantengsky in #15566
  • feat(query): variant type as map value support bloom filter by @b41sh in #15562
  • feat(metasrv): add metrics for db_size by @everpcpc in #15595
  • feat: support millisecond level task scheduling by @ZhiHanZ in #15599
  • feat(query): show grants support where/limit option by @TCeason in #15607
  • feat: otlp logging support http endpoint by @everpcpc in #15636
  • feat(planner): support ExpressionScan and CacheScan by @Dousir9 in #15561
  • feat(query): implement ST_YMAX/ST_XMIN/ST_YMIN by @kkk25641463 in #15594
  • feat(query): add python udf script runtime by @sundy-li in #15494
  • feat: compact and recluster add table lock by @zhyass in #15632
  • feat(query): support disable vacuum temporary files after query by @zhang2014 in #15673
  • feat(query): Role/UserInfo add field create_on and udpate_on by @TCeason in #15661
  • feat: unload parquet with version as created_by. by @youngsofun in #15680
  • feat(metrics): add process collector by @flaneur2020 in #15591
  • feat: add meta-service config sled_max_cache_size_mb by @drmingdrmer in #15709
  • feat(query): add async retry transform by @sundy-li in #15647
  • feat: basic support input file format ORC. by @youngsofun in #15720
  • feat: recluster unclustered blocks by @zhyass in #15623
  • feat: add task_history function by @ZhiHanZ in #15731
  • feat(query): show indexes add inverted indexes by @b41sh in #15733
  • feat: Bump to opendal v0.47.0 rc by @Xuanwo in #15745
  • feat: Add http config for all storage services by @Xuanwo in #15757
  • feat(query): add inverted index size to block meta by @b41sh in #15752
  • feat: add meta verifier binary by @lichuang in #15756
  • feat(query): add shuffle before window plan by @sundy-li in #15726
  • feat: parquet support option null_if by @youngsofun in #15776
  • feat: rm CREDENTIALS as alias of CONNECTION. by @youngsofun in #15780
  • feat(function): Implement map_delete function by @shamb0 in #15480
  • feat(query): support try_to_timestmap(string, string), try_to_date(string, string) by @TCeason in #15775
  • feat: add io delay chaos meta test scripts by @lichuang in #15769
  • feat: support recursive cte by @xudong963 in #15675
  • feat(ci): add logging test for query by @everpcpc in #15792
  • feat: support dynamic enable of rust backtrace by @dqhl76 in #15749

Thoughtful Bug Fix 🔧

  • fix(query): fix cse projection by @sundy-li in #15409
  • fix(storage): change tracking with alias update row_version bug in merge into by @zhyass in #15412
  • fix(executor): fix broken merge into source profiling by @zhang2014 in #15415
  • fix(query): ensure all inputs pull all data that less than max partition count in new agg singleton by @Freejww in #15418
  • fix(query): fix array_concat function domain panic by @b41sh in #15424
  • fix: ExistsTable should has_result_set. by @youngsofun in #15431
  • fix: max_files not work for transform on ndjson/csv/tsv. by @youngsofun in #15441
  • fix(functions): improve array_prepend and array_append by @andylokandy in #15437
  • fix(executor): fix queries executor cannot kill problem and reactivate kill test assertion by @dqhl76 in #15443
  • fix(query): fix create table with cluster by get map type panic by @b41sh in #15475
  • fix(query): RoleInfo from_pb should not err by @TCeason in #15496
  • fix: fix group by cube with join panic assert by @lichuang in #15515
  • fix: Enable opendal backtrace output to help debug by @Xuanwo in #15521
  • fix: explain ast of multi table insert stmt by @dantengsky in #15527
  • fix: Prevent premature snapshot commit on unfinished writes by @drmingdrmer in #15534
  • fix(query): fix view flaky test with different dialect by @b41sh in #15533
  • fix(query): fix qualify display by @sundy-li in #15537
  • fix: FlightData must contain valid data header by @Xuanwo in #15559
  • fix: http handler kill not working. by @youngsofun in #15558
  • fix: Avoid duplicate temp file name for install-snapshot requests by @drmingdrmer in #15565
  • fix(query): fix shrink_d256 for negative value by @sundy-li in #15574
  • fix(query): fix bloom fitler using is_null and not function by @b41sh in #15577
  • fix(query): fix filter with nested not expr return wrong result by @b41sh in #15580
  • fix: fix normalize disjunctive filter by @xudong963 in #15587
  • fix: consume stream inside explicit txn by @dantengsky in #15582
  • fix: TxnManager with state AutoCommit should not be saved in http manager. by @youngsofun in #15602
  • fix(query): fix subquery writer in udf by @sundy-li in #15610
  • fix: analyze table error by @zhyass in #15614
  • fix: update derive-vistor to unblock rustc upgrage by @andylokandy in #15625
  • fix: allow http client retry when lost response with next_uri=/final. by @youngsofun in #15624
  • fix: subquery correlated column is derived column by @xudong963 in #15629
  • fix(query): unify optimize plan call in one place by @sundy-li in #15630
  • fix: missing service label for otlp logs by @everpcpc in #15639
  • fix(query): delete len check in mget_database/table_names_by_ids by @TCeason in #15640
  • fix(udf): use Debug to display transport error on UDF server by @flaneur2020 in #15646
  • fix(query): show grants also need to check user roles by @TCeason in #15650
  • fix(query): fix current date/time functions return constant value in a statement. by @b41sh in #15659
  • fix: merge into split by @dantengsky in #15644
  • fix: make sure lock guard is dropped before next recluster by @zhyass in #15662
  • fix(query): db owner role can access all tables under this db by @TCeason in #15634
  • fix(cluster): fix complex plan serde in cluster actions by @zhang2014 in #15672
  • fix: potential missing index creation in merge process by @dantengsky in #15674
  • fix(query): fix result cache in empty result by @sundy-li in #15683
  • fix: display expr considering associativity by @andylokandy in #15685
  • fix: fix create table error but table id uncommttied by @lichuang in #15665
  • fix: failed to select blocks for recluster by @zhyass in #15667
  • fix: add unit test of concurrent commit table meta data by @lichuang in #15703
  • fix(executor): fix lost pipeline on finished callback by @zhang2014 in #15693
  • fix(executor): fix query log lost for http kill by @zhang2014 in #15715
  • fix: fuse_snapshot only show the latest snapshot for external table by @dantengsky in #15721
  • fix: fix union and lazy materialization by @xudong963 in #15727
  • fix(ast): assert ast display and reparse by @andylokandy in #15666
  • fix: Handle Duplicate Tombstones in Meta-Service List Operation by @drmingdrmer in #15736
  • fix: rebuild missing bloom index while pruning by @dantengsky in #15738
  • fix(storage): Fix a potential data loss while concurrent write by @Xuanwo in #15759
  • fix: unload csv file too large. by @youngsofun in #15758
  • fix: off by one mistake in auto compaction by @dantengsky in #15761
  • fix(query): fix decimal cmp by @sundy-li in #15764
  • fix: lazy materialization does not support set operation by @xudong963 in #15754
  • fix: auto cast fail error in copy should show the related columns #15768 by @youngsofun in #15770
  • fix: http handler handle panic when starting query. by @youngsofun in #15785
  • fix(ci): meta chaos dockerfile path by @everpcpc in #15787
  • fix: add settings enable_streaming_load, default 0. by @youngsofun in #15794
  • fix(query): create ownership obejct only check current_role privilege by @TCeason in #15751

Code Refactor 🎉

  • refactor: list stage use stream to avoid OOM. by @youngsofun in #15405
  • refactor: Minimize lock duration during raft-log export by @drmingdrmer in #15417
  • refactor: move storage/fuse/io/files.rs to storage/common/io/files.rs by @BohuTANG in #15428
  • refactor: define CatalogIdIdent with TIdent by @drmingdrmer in #15429
  • refactor: define CatalogIdToNameIdent with TIdent by @drmingdrmer in #15432
  • refactor: set enable_experimental_rbac_check default 1 by @TCeason in #15436
  • refactor(query): enable little vacuum after query finish by @zhang2014 in #15434
  • refactor: remove MergeIntoSource operator by @xudong963 in #15435
  • refactor: ScalarVisitor refactor replace_predicate_column by @TCeason in #15439
  • refactor: use scalarvisitor replace predicate recursive by @TCeason in #15449
  • refactor(query): remove useless settings by @zhang2014 in #15461
  • refactor(query): add query id for query log by @zhang2014 in #15466
  • refactor: Bump OpenDAL 0.46, arrow 51, tonic 0.11, reqwest 0.12, hyper 1, http 1 by @Xuanwo in #15442
  • refactor: SessionContext doesn't require Arc by @drmingdrmer in #15491
  • refactor: remove project operator by @xudong963 in #15479
  • refactor(query): revert dev_setup by @sundy-li in #15516
  • refactor: remove databend-common-meta-app from databend-common-ast by @andylokandy in #15509
  • refactor: remove databend-common-base from databend-common-ast by @andylokandy in #15519
  • refactor(query): allow udf server in insert source by @sundy-li in #15523
  • refactor: remove redundant lifetime on fmt::Formatter by @andylokandy in #15528
  • refactor: SessionContext::current_tenant should not be changed at runtime by @drmingdrmer in #15504
  • refactor: migrate TSV input format to new framework. by @youngsofun in #15506
  • refactor(cluster): refactor flight service actions by @zhang2014 in #15419
  • refactor: Improve logging and snapshot management by @drmingdrmer in #15571
  • refactor: Embed key number into snapshot id by @drmingdrmer in #15588
  • refactor: Remove install_snapshot v0 meta-service API by @drmingdrmer in #15590
  • refactor: Enhance Meta-Service Cluster Management by @drmingdrmer in #15601
  • refactor: Decouple Snapshot Write and Commit Operations by @drmingdrmer in #15603
  • refactor: polish memory usage when copy into parquet. by @youngsofun in #15596
  • refactor(storage): bloom index filter use fold_with_domain instead fold by @b41sh in #15619
  • refactor: refine cast reloperator error info by @xudong963 in #15648
  • refactor: refactor merge into optimizer and fix unexpected distribution plan by @xudong963 in #15507
  • refactor(query): no need to deserialize all userinfo when create user by @TCeason in #15671
  • refactor: enable mem-profiling for meta-service by @drmingdrmer in #15705
  • refactor: delete raft log one by one. by @drmingdrmer in #15723
  • refactor: more explicit error confirmation for http query route error. by @youngsofun in #15621
  • refactor: split None and Owner/Grantobject by @TCeason in #15739
  • refactor(cluster): record query id for cluster mode log by @zhang2014 in #15609
  • refactor: GrantEntry not store ownership privilege by @TCeason in #15730
  • refactor: add workspace crates to Cargo.toml by @andylokandy in #15747
  • refactor: remove unsafe impl Send+Sync for GlobalRegistry by @drmingdrmer in #15782
  • refactor: remove meta-service V001 data version by @drmingdrmer in #15778
  • refactor(ast): eliminate clone when displaying expr by @zhang2014 in #15773
  • refactor: clustering_information support specify cluster keys by @zhyass in #15783
  • refactor: add non-async version of mete-client API request_sync by @drmingdrmer in #15786

Build/Testing/CI Infra Changes 🔌

  • ci: Bump version to 2024-02-08 (the same commit with 1.78) by @Xuanwo in #15455
  • ci: tcph/tpcds use copy instead of streaming load for prepare data. by @youngsofun in #15681

Documentation 📔

  • docs: delete repeat summary by @baerwang in #15744

Others 📒

  • chore: Vendor parquet2 instead by @Xuanwo in #15422
  • chore(query): refactor metrics by @sundy-li in #15421
  • chore: remove unused file by @drmingdrmer in #15438
  • chore(storage): allow append when change tracking was enabled by @zhyass in #15427
  • chore: Bump to rust 2024-02-08 (also 1.78) by @Xuanwo in #15456
  • chore: ignore stream column compare in block compact mutator by @zhyass in #15446
  • chore: polish log for copy. by @youngsofun in #15463
  • chore: improve show streams performance by @zhyass in #15460
  • chore(ci): update deprecated actions by @everpcpc in #15470
  • chore: change flood info log to debug by @BohuTANG in #15473
  • chore: remove breakline for log by @BohuTANG in #15476
  • chore: remove incorrect usage of Box::leak by @dantengsky in #15481
  • chore: fix copy/spill log message by @BohuTANG in #15482
  • chore: Upgrade opentelemetry and remove metrics crate by @Xuanwo in #15484
  • chore: add KVPbApi::get_pb_stream() but it is not yet ready for production use by @drmingdrmer in #15471
  • chore: log file_name name for short by @BohuTANG in #15487
  • chore: Bump poem to 3.0 by @Xuanwo in #15486
  • chore: fix some flood logs with batch progress by @BohuTANG in #15492
  • chore(meta): tracking query id for meta log by @zhang2014 in #15497
  • chore: Increase default io timeout for behavior changes by @Xuanwo in #15493
  • chore(query): tracking query log for mysql handler by @zhang2014 in #15495
  • chore: polish error message NDJSON. by @youngsofun in #15502
  • chore(query): pyenv to 3.9 by @sundy-li in #15511
  • chore: recluster disable sort spill by @zhyass in #15490
  • chore: change load_file_metadata_expire_hours default from 24*7 to 12 hours by @BohuTANG in #15514
  • chore(ci): use python3.12 as build tool base by @everpcpc in #15520
  • chore: adjust elapsed time of various log messages by @dantengsky in #15517
  • chore: Upgrade minitrace related crates to 0.6.5 by @Xuanwo in #15530
  • chore: reduce snapshot building progress logging by @drmingdrmer in #15532
  • chore: convert to arrow column remove ignore inside nullable by @b41sh in #15477
  • chore: leave all un-committed data to purge/vacuum by @dantengsky in #15535
  • chore: mask the config sensitive field for display by @BohuTANG in #15543
  • chore: Remove tokio-console since we don't need it by @Xuanwo in #15546
  • chore: Bump databend client and geozero to latest version by @Xuanwo in #15550
  • chore: Bump rustls to 0.22 by @Xuanwo in #15552
  • chore: remove unused micromarshal patch by @PsiACE in #15551
  • chore(executor): set plain_statistics to false for spill time profiling statistics by @zhang2014 in #15555
  • chore: change the short_sql from 64bytes to 10KB by @BohuTANG in #15554
  • chore: support aborting table navigation by @dantengsky in #15549
  • chore(query): add more heavy actions query type by @zhang2014 in #15556
  • chore(storage): remove abort operation by @zhyass in #15557
  • chore(query): all queryies should generate hash by @TCeason in #15560
  • chore: Remove sentry entirely and bump databend client by @Xuanwo in #15563
  • chore: Reduce the duplicated clap and gix by @Xuanwo in #15564
  • chore(ast): unify all of {escape,unescape,quote,unquote}\_{string/at_string/ident} by @andylokandy in #15548
  • chore: short_sql from 10KB to 30KB by @BohuTANG in #15575
  • chore: disk cache load refine and reduce the info flood logs by @BohuTANG in #15576
  • chore: Bump jsonb version 0.4.0 by @b41sh in #15589
  • chore: fix error capture of mulit statement transaction in mysql session by @SkyFan2002 in #15583
  • chore: udpate doc for meta-service compat change by @drmingdrmer in #15593
  • chore(deps): bump rustls from 0.22.2 to 0.22.4 by @dependabot in #15586
  • chore: add more logs for vacuum temp files by @BohuTANG in #15597
  • chore: Refine state machine v002 by @drmingdrmer in #15608
  • chore(deps): bump gix from 0.62.0 to 0.63.0 by @dependabot in #15615
  • chore: more commit logs by @dantengsky in #15617
  • chore: prepare for SMV003: move shared code to separate dir by @drmingdrmer in #15622
  • chore(query): add sort plan before window plan by @sundy-li in #15616
  • chore(query): fix filter reorder check by @Dousir9 in #15613
  • chore: reduce the short_sql to 128bytes by @BohuTANG in #15626
  • chore: remove databend-common-{io, exception} from databend-common-ast by @andylokandy in #15618
  • chore(query): add timeout for open telemetry logger by @zhang2014 in #15627
  • chore: spill out short_sql to common/base/src/string.rs by @BohuTANG in #15633
  • chore: add setting compact_max_block_selection by @zhyass in #15641
  • chore(executor): add some time tracking log by @zhang2014 in #15651
  • chore: minimize #[derive(skip)] by @andylokandy in #15654
  • chore(planner): remove z3 by @Dousir9 in #15663
  • chore: re-enable musl-g++ link for musl build image by @dantengsky in #15679
  • chore(query): fix replace into from distributed select plan by @sundy-li in #15657
  • chore: fix wrong print status by @zhyass in #15684
  • chore(query): add assert on explain parser by @sundy-li in #15658
  • chore(ci): add build for udf feature by @everpcpc in #15682
  • chore(ci): fix release workflow by @everpcpc in #15687
  • chore(ci): fix sqllogic test for release by @everpcpc in #15688
  • chore(ci): fix pack for release by @everpcpc in #15689
  • chore(ci): fix artifact upload/download for gcs by @everpcpc in #15691
  • chore: add unit test of vacuum orphan table meta data by @lichuang in #15690
  • chore(ci): fix checkout and version on release build by @everpcpc in #15692
  • chore(ci): fix distribution for release by @everpcpc in #15695
  • chore(ci): fix notify & publish release by @everpcpc in #15696
  • chore(ci): fix release notify template by @everpcpc in #15697
  • chore(query): add create query in admin tables api by @zhang2014 in #15694
  • chore(ci): fix publish service image by @everpcpc in #15698
  • chore(ci): fix run benchmark after release by @everpcpc in #15699
  • chore(ci): ci-cloud build with python-udf by @everpcpc in #15701
  • chore(ci): only provide debian based service image by @everpcpc in #15704
  • chore(ci): fix download artifacts for docker_all_in_one by @everpcpc in #15708
  • chore: more log for table history traversing by @dantengsky in #15710
  • chore(executor): revert #15693 by @zhang2014 in #15711
  • chore(ci): fix download artifacts for cloud build by @everpcpc in #15712
  • chore(ci): fix dockerfile for cloud service image build by @everpcpc in #15713
  • chore(ci): fix checking release status from jobs by @everpcpc in #15714
  • chore(planner): fix merge predicates with different data types by @Dousir9 in #15725
  • chore: add databend user to docker service images by @everpcpc in #15728
  • chore(query): fix pipeline display by @sundy-li in #15729
  • chore(ci): fix download artifacts for local benchmark by @everpcpc in #15735
  • chore(ci): setup python3.12 for local benchmark by @everpcpc in #15737
  • chore(ci): fix check release status by @everpcpc in #15748
  • chore(ci): fix release notify icon by @everpcpc in #15750
  • chore: output raft log index range for purge logging by @drmingdrmer in #15740
  • chore: add more test case about create table fail by @lichuang in #15765
  • chore: add error info when table name is misleading by @xudong963 in #15771
  • chore: Pin opendal to released version by @Xuanwo in #15766
  • chore: limit the count of set operation by @xudong963 in #15777
  • chore: replace manually incr/decr with couter guard by @drmingdrmer in #15784
  • chore(query): remove useless async keyword by @zhang2014 in #15781
  • chore: add more alias name tests for recursive cte by @xudong963 in #15788
  • chore: remove code about access share table into from meta service by @lichuang in #15793
  • chore: remove meta chaos from release action by @lichuang in #15796
  • chore(query): substr support function domain by @zhyass in #15795
  • chore: update doc about meta compatibility by @drmingdrmer in #15799

New Contributors

  • @forsaken628 made their first contribution in #15444
  • @hanxuanliang made their first contribution in #15465
  • @maxjustus made their first contribution in #15529
  • @baerwang made their first contribution in #15744

Full Changelog: https://github.com/databendlabs/databend/compare/v1.2.453...v1.2.530

May 6, 2024 (v1.2.452)

What's Changed

Exciting New Features ✨

  • feat(query): implement ST_GEOMPOINTFROMGEOHASH by @kkk25641463 in #15182
  • feat(rbac): drop role will transfer role owns object to account_admin by @TCeason in #15154
  • feat(query): implement ST_MAKEPOLYGON by @kkk25641463 in #15194
  • feat: distinguish early close from kill/cancel with new LogType::Closed. by @youngsofun in #15206
  • feat: add EXECUTE IMMEDIATE statement by @andylokandy in #15144
  • feat(http): get client ip from headers by @flaneur2020 in #15205
  • feat(query): support alter table comment by @TCeason in #15227
  • feat(query): match function support multiple fields with boost by @b41sh in #15196
  • feat: support insert with cte by @xudong963 in #15241
  • feat: add setting script_max_steps by @andylokandy in #15244
  • feat: atomic CTAS by @dantengsky in #15186
  • feat(query): integrate "like" to filter execution by @Dousir9 in #15233
  • feat(query): add user option: disabled by @TCeason in #15249
  • feat(query): Inverted index support set filters and index record options by @b41sh in #15254
  • feat: add sequence meta api by @lichuang in #15247
  • feat(query): add new setting parse_datetime_ignore_remainder. by @TCeason in #15260
  • feat: http handler response add field has_result_set and state Starting. by @youngsofun in #15258
  • feat: support cte in update selection by @xudong963 in #15276
  • feat: impl create dynamic table sql parser and planner by @zhyass in #15250
  • feat: support cte in delete by @xudong963 in #15284
  • feat(query): implement ST_ASGEOJSON by @kkk25641463 in #15214
  • feat(query): support map_keys and map_values function by @b41sh in #15291
  • feat(query): implement TO_GEOMETRY/TRY_TO_GEOMETRY function by @kkk25641463 in #15302
  • feat(query): 14925, support udf wasm by @shamb0 in #15107
  • feat(query): top k syntax support by @TCeason in #15318
  • feat(query): implement st_asewkb/st_aswkb/st_asewkt/st_aswkt/st_geohash by @kkk25641463 in #15325
  • feat(query): support with clause in copy into by @sundy-li in #15343
  • feat(query): inverted index support json type by @b41sh in #15267
  • feat(query): support compute quota feature by @zhang2014 in #15336
  • feat: Introduce enterprise storage quota feature by @Xuanwo in #15338
  • feat(query): inverted index use empty position data when query not contain phrase terms by @b41sh in #15362
  • feat(query): implement ST_DIMENSION function by @kkk25641463 in #15347
  • feat(query): implement ST_ENDPOINT by @kkk25641463 in #15366
  • feat(query): implement ST_POINTN by @kkk25641463 in #15373
  • feat(query): support explain insert with insert source by @TCeason in #15357
  • feat(query): implement ST_SRID by @kkk25641463 in #15377
  • feat(query): implement ST_X/ST_Y/ST_STARTPOINT by @kkk25641463 in #15384
  • feat: Add: stream based snapshot API install_snapshot_v2() to meta-service by @drmingdrmer in #15395

Thoughtful Bug Fix 🔧

  • fix(query): set setting case insensitive by @TCeason in #15199
  • fix(query): show grants panic when role/user has immute db/table privilege by @TCeason in #15188
  • fix: geos-sys compile error #15200 by @drmingdrmer in #15209
  • fix(query): alter modify col comment need modify when type not modify by @TCeason in #15217
  • fix(query): fix negative session_running_acquired_queries by @zhang2014 in #15223
  • fix: fix map get return double nested nullable by @ariesdevil in #15230
  • fix: show locks in current tenant by @zhyass in #15252
  • fix: solve some inconsistencies between code and comments by @YichiZhang0613 in #15229
  • fix: fix delete from subquery and one more condition bug by @lichuang in #15279
  • fix(query): fix udf name with upper case letters by @b41sh in #15300
  • fix: don't constant fold giant expr by @andylokandy in #15311
  • fix(query): fix decimal op loss precision by @sundy-li in #15313
  • fix(query): NDJSON copy into allow cast bool and number to string by @b41sh in #15308
  • fix: incorrect overlap checking (replace into) by @dantengsky in #15315
  • fix(typo): refering -> referring by @andylokandy in #15323
  • fix: insert Plan should not have schema. by @youngsofun in #15330
  • fix(executor): queries exeuctor points calc incorrectly by @dqhl76 in #15349
  • fix(query): show processlit host is null in HTTPQuery by @TCeason in #15363
  • fix: check all partitions for small files instead of the last only. by @youngsofun in #15360
  • fix: structlog should print non-zero-rooted tree by @andylokandy in #15370
  • fix: when listing-kv, limit the right bound if possible by @drmingdrmer in #15372
  • fix: list stage returns at most 15000 files. by @youngsofun in #15385
  • fix(query): spill block need consider scalar by @TCeason in #15387
  • fix: Respond with SnapshotMismatch error instead of internal error by @drmingdrmer in #15391
  • fix: solve some inconsistencies between code and comments by @YichiZhang0613 in #15394

Code Refactor 🎉

  • refactor(query): remove useless code (system.tracing) by @zhang2014 in #15185
  • refactor: explain merge into by @xudong963 in #15195
  • refactor: simplify DatabaseFactory, just static dispatch by @drmingdrmer in #15203
  • refactor(query): move api and metrics to servers mod by @zhang2014 in #15210
  • refactor: add view system tables by @TCeason in #15184
  • refactor: introduce UninitTenant and TIdentRaw by @drmingdrmer in #15207
  • refactor: define DatabaseNameIdent with TIdent by @drmingdrmer in #15235
  • refactor: ScalarVisitor refactor push_down_prewhere by @TCeason in #15215
  • refactor: replace String with Tenant in xxxPlan by @drmingdrmer in #15240
  • refactor: avoid using PREFIX directly. use method to build list-prefix by @drmingdrmer in #15243
  • refactor: remove unused list_all_tables() by @drmingdrmer in #15251
  • refactor: remove kvapi::Key impl for String by @drmingdrmer in #15253
  • refactor: grant_ownership retry logic by @TCeason in #15275
  • refactor: Extract kvapi::KeyCodec trait from kvapi::Key by @drmingdrmer in #15262
  • refactor: Introduce generic parameter N to TIdent by @drmingdrmer in #15271
  • refactor: define ShareConsumer with TIdent by @drmingdrmer in #15280
  • refactor: define VirtualColumnNameIdent with TIdent by @drmingdrmer in #15285
  • refactor: define DbIdListKey with TIdent, and rename it to DatabaseIdHistoryIdent by @drmingdrmer in #15287
  • refactor: define TenantQuotaIdent with TIdent by @drmingdrmer in #15289
  • refactor: Define TableLockKey with TIdent by @drmingdrmer in #15319
  • refactor: TIdent add a flag to indicate if the tenant should be encoded into key by @drmingdrmer in #15321
  • refactor: define BackgroundJobIdIdent with TIdent, with HAS_TENANT disabled by @drmingdrmer in #15327
  • refactor: http handler check has_result_set() before get schema of plan. by @youngsofun in #15329
  • refactor: Define DataMaskIdIdent with TIdent by @drmingdrmer in #15331
  • refactor: add right bound when listing expired keys to avoid unnecessary data copy by @drmingdrmer in #15334
  • refactor: Upgrade Openraft to 0.9.7 by @drmingdrmer in #15339
  • refactor: improve install-snapshot performance by @drmingdrmer in #15345
  • refactor: move snapshot building io to other task. by @drmingdrmer in #15351
  • refactor: only has password policy update user login res by @TCeason in #15356
  • refactor: Define TenantOwnershiptObject with TIdent by @drmingdrmer in #15358
  • refactor: define TenantUserIdent with TIdent by @drmingdrmer in #15361
  • refactor: session_ctx.client_ip refactor to Option<String> by @TCeason in #15368
  • refactor: move DirName to separate file; add peek() to KeyParser by @drmingdrmer in #15365
  • refactor: Define StageFileIdent with TIdent by @drmingdrmer in #15371
  • refactor: remove stage use stream to avoid OOM. by @youngsofun in #15378
  • refactor: sync create inverted index by @b41sh in #15379
  • refactor: dispatch load to 16 threads when installing snapshot by @drmingdrmer in #15381
  • refactor: get_table_by_id() returns an Option<Seqv<TableMeta>> by @drmingdrmer in #15388
  • refactor: remove get_table_name_by_id() by @drmingdrmer in #15389

Others 📒

  • chore(executor): add time limit daemon for queries executor by @dqhl76 in #15174
  • chore(query): max_running_queries from 0 to 8, enable the max running queries by @zhang2014 in #15190
  • chore(query): add session_running_acquired_queries metric by @zhang2014 in #15193
  • chore(executor): add end_of_stream flag for executor stream by @zhang2014 in #15202
  • chore(planner): refine explain read size by @Dousir9 in #15204
  • chore: change parquet_fast_read_bytes setting from 0 to 16MB by @BohuTANG in #15212
  • chore(planner): fix decorrelate exists subquery by @Dousir9 in #15208
  • chore: old op LogicalGet modify to new op name Scan by @TracyZYJ in #15222
  • chore(query): parser error need to log error by @sundy-li in #15219
  • chore: vacuum drop table for external location table by @BohuTANG in #15220
  • chore(query): fix nullable array domain by @sundy-li in #15224
  • chore(query): add query queued duration for query log by @zhang2014 in #15225
  • chore(query): remove useless function map:get by @sundy-li in #15245
  • chore: update sqllogictests crate to 0.19.1 by @finchxx in #15242
  • chore(query): support like push down not by @Dousir9 in #15268
  • chore(executor): add node level error tracking by @dqhl76 in #15239
  • chore(query): refine set sql_dialect error message by @Dousir9 in #15282
  • chore(query): bump arrow-udf to extra lib by @sundy-li in #15283
  • chore(planner): improve delete explain by @Dousir9 in #15290
  • chore: Enhance Flexibility of Dynamic Table Creation by Allowing Options in Any Order by @zhyass in #15297
  • chore: add license to cargo.toml by @BohuTANG in #15299
  • chore(planner): fix can_filter_null by @Dousir9 in #15298
  • chore: improve distributed plan by @xudong963 in #15296
  • chore(query): simplify stage list by @sundy-li in #15303
  • chore: enable distributed optimization for subquery in delete by @xudong963 in #15288
  • chore: correct stream error message by @zhyass in #15309
  • chore: tune auto-compact parameters triggered by insertions by @dantengsky in #15261
  • chore(query): improve delete pruner by @sundy-li in #15316
  • chore(planner): fix merge into statistics by @Dousir9 in #15320
  • chore(planner): add disable merge into join reorder settings by @Dousir9 in #15322
  • chore: fix RulePushDownLimitWindow id by @xudong963 in #15326
  • chore(query): add merge ratio in sort spill by @sundy-li in #15332
  • chore: refine runtime filter log by @xudong963 in #15335
  • chore: add err message to help capture new agg ht hang by @Freejww in #15344
  • chore: enhance runtime filter log by @xudong963 in #15350
  • chore(planner): support fast return for hash join spill by @Dousir9 in #15364
  • chore(ci): fix flaky test; by @TCeason in #15369
  • chore: add more thest to prefix_right_bound() by @drmingdrmer in #15374
  • chore: purge inverted index by @dantengsky in #15354
  • chore(query): reuse the script runtime during pipeline builder by @sundy-li in #15375
  • chore: adjust row count assertion in table compaction by @dantengsky in #15380
  • chore: if a txn log is too long, display only the first 5 items by @drmingdrmer in #15392
  • chore: Fix opendal upstream commit missing by @Xuanwo in #15393

New Contributors

  • @finchxx made their first contribution in #15242
  • @YichiZhang0613 made their first contribution in #15229
  • @shamb0 made their first contribution in #15107

Full Changelog: https://github.com/databendlabs/databend/compare/v1.2.410...v1.2.452

Apr 8, 2024 (v1.2.410)

What's Changed

Exciting New Features ✨

  • feat(query): support settings admin api for global level settings by @zhang2014 in #14901
  • feat: add a new ratio config instead of table_data_deserialized_data_bytes by @chienguo in #14896
  • feat(ast): add parser for script by @andylokandy in #14891
  • feat: support cross join spilling by @xudong963 in #14887
  • feat: add more function alias names by @sundy-li in #14924
  • feat(query): support spill for new agg hashtable by @Freejww in #14905
  • feat(query): Support refresh inverted index by @b41sh in #14912
  • feat: ndjson support option null_if. by @youngsofun in #14938
  • feat(query): add statement_queued_timeout_in_seconds setting for queries queue by @zhang2014 in #14945
  • feat: change tracking enabled table support merge into by @zhyass in #14900
  • feat: histogram aggregate function by @suimenno3002 in #14839
  • feat(query): add metrics for session queue manager by @everpcpc in #14966
  • feat(query): add error_or function by @sundy-li in #14980
  • feat: purge duplicated files (copy into table) by @dantengsky in #14962
  • feat(query): support parse mulity modify column? ty by @TracyZYJ in #14979
  • feat: support prql by @ncuwaln in #14922
  • chore(query): support domain contains in string type by @sundy-li in #15023
  • feat(query): check view's inner table privilege when create or alter view by @TCeason in #14986
  • feat: add buffer for spiller by @xudong963 in #15021
  • feat: add show views & desc view by @blackstar-baba in #14926
  • feat: support pagination on task_history by @ZhiHanZ in #15047
  • feat(ast): add parser for template sql by @andylokandy in #15026
  • refactor: SHOW GRANTS should not display drop on table/database by @TCeason in #14931
  • feat(query): Support query match inverted index by @b41sh in #14997
  • feat: Implement concurrent stat by @Xuanwo in #15083
  • feat(metasrv): add network metrics rpc_delay_ms by @everpcpc in #15084
  • feat: create stream at navigation point by @zhyass in #15062
  • feat(query): implement st_makeline() function by @kkk25641463 in #15059
  • feat(planner): improve push down filter join by @Dousir9 in #14872
  • feat: support otel context propagation for databend-query by @ZhiHanZ in #15096
  • feat(query): add query_result_cache_min_execute_secs by @sundy-li in #15101
  • feat(query): Support hook refresh inverted index after insert by @b41sh in #15094
  • feat: time travel support at stream by @zhyass in #15116
  • feat(query): implement st_geometryfromewkb function by @kkk25641463 in #15118
  • feat: refactor ndjson input format. by @youngsofun in #14943
  • feat(script): add script compiler by @andylokandy in #15113
  • feat: support multi-table insert by @SkyFan2002 in #15002
  • feat(query): to_timestamp support parser string to ts without timezone by @TCeason in #15131
  • feat: Enhance show views by @blackstar-baba in #15109
  • feat: support expr in values clause of multi table insert by @SkyFan2002 in #15147
  • feat(query): implement ST_GEOMFROMGEOHASH by @kkk25641463 in #15156
  • feat(query): show create table support display inverted index by @b41sh in #15169
  • feat(query): support CHANGES clause by @zhyass in #15163
  • feat: add inverted index pruning status by @BohuTANG in #15175
  • feat: Support to add random seed on random engine by @blackstar-baba in #15167
  • feat(planner): push down not in filter expression by @Dousir9 in #15168
  • feat: time travel support at offset by @zhyass in #15177

Thoughtful Bug Fix 🔧

  • fix(ci): fix internal benchmark by @TCeason in #14897
  • fix: Display of merge into .. insert * by @dantengsky in #14892
  • fix: check_and_upgrade_to_pb() should return MetaError, add quota by @drmingdrmer in #14899
  • fix: Display of compact AST by @SkyFan2002 in #14914
  • fix(query): fix native topk_sorter by @sundy-li in #14939
  • fix: stream source can't get the same tablemeta by @zhyass in #14935
  • fix: cluster table with change tracking enabled append panic by @zhyass in #14956
  • fix: enable runtime filter when join_spilling_memory_ratio !=0 by @xudong963 in #14959
  • fix: fix stream explain bug for merge into by @JackTan25 in #14969
  • fix(query): Need return error udf_name when derialize from pb failed. by @TCeason in #14964
  • fix: try fix compute stddev for decimal type overflow by @ariesdevil in #14952
  • fix: csv read error with skip_header. by @youngsofun in #14981
  • fix: fix incorrect agg spill in new agg hashtable by @Freejww in #14995
  • fix: wrong row id by @youngsofun in #15018
  • fix: trim leading '/'s for files specified in copy opt by @dantengsky in #15056
  • fix: stream get wrong result if change tracking is changed by @zhyass in #15072
  • fix(ast): fix incorrect display for ast (part 1) by @andylokandy in #15069
  • fix: privilege check for insert statement by @SkyFan2002 in #15075
  • fix(query): fix attach_query_str by @sundy-li in #15066
  • fix: fix plan_sql error can not write query_log by @BohuTANG in #15077
  • fix: improve error hints when select * from xx.yy by @ZhengLin-Li in #14663
  • fix: Support udf for copy table by @JackTan25 in #15078
  • fix(ast): fix incorrect display for ast (part 2) by @andylokandy in #15076
  • fix: make block data type be consistent with hash method by @xudong963 in #15089
  • fix: add prql description by @ncuwaln in #15091
  • fix: sqllogicaltest datatype mismatch in expr_in_window.test by @c8ef in #15090
  • fix: schema mismatch in named_window_basic.test by @c8ef in #15099
  • fix: test duplicate in window_basic.test by @c8ef in #15100
  • fix(query): verify_privilege need check privilege one by one by @TCeason in #15105
  • fix(query):user option DEFAULT_ROLE ignore ascii_case by @TCeason in #15137
  • fix(query): tuple field name can not contain special char by @TCeason in #15126
  • fix(query): new agg hashtable hung with little data on singleton by @Freejww in #15140
  • fix(query): show query with name consider quote by @TCeason in #15138
  • fix: forbid end at stream syntax by @zhyass in #15173
  • fix(storage): Don't return empty credential while loading failed by @Xuanwo in #15181

Code Refactor 🎉

  • refactor: upgrade to openraft 0.9.0 by @drmingdrmer in #14907
  • refactor: use kvapi::Key to define meta-service key for QuotaMgr by @drmingdrmer in #14915
  • refactor: reduce hash join build spilling files by @xudong963 in #14919
  • refactor: define kvapi::Key for PasswordPolicyMgr by @drmingdrmer in #14920
  • refactor: reduce meta call by @TCeason in #14917
  • refactor: add NetworkPolicyIdent as NetworkPolicy meta-service key by @drmingdrmer in #14933
  • refactor: http handler plan sql async. by @youngsofun in #14908
  • refactor: Add TIdent for all meta-service key that share the same form of <PREFIX>/<TENANT>/<NAME> by @drmingdrmer in #14940
  • refactor: add generic CrudMgr to implement various simple CRUD manager by @drmingdrmer in #14988
  • refactor(query): use project plan for copy into select by @zhang2014 in #14977
  • refactor: use CrudMgr to impl PasswordPolicyMgr by @drmingdrmer in #14994
  • refactor: Add ExistError and UnknownError, TenantResource does not need to define error type any more by @drmingdrmer in #14996
  • refactor: Use CrudMgr to impl FileFormatMgr by @drmingdrmer in #14999
  • refactor: Use CrudMgr to impl ConnectionMgr by @drmingdrmer in #15003
  • refactor(query): add copy into location physical plan by @zhang2014 in #15010
  • refactor(query): csv reader support prefetch by @youngsofun in #14983
  • refactor: alter view only need to modify table option by @TCeason in #15012
  • refactor: truncate support build pipeline by @zhyass in #15027
  • refactor: move id-generator from crate meta-api to meta-app by @drmingdrmer in #15036
  • refactor: remove innecessary serde by @drmingdrmer in #15037
  • refactor(query): use scan table physical plan for copy into table from stage by @zhang2014 in #15016
  • refactor: remove innecessary serde by @drmingdrmer in #15040
  • refactor: remove innecessary serde by @drmingdrmer in #15042
  • refactor: spead up csv reader. by @youngsofun in #15043
  • refactor(query): remove metrics if moved to the profile by @zhang2014 in #15054
  • refactor: reduce hash join probe spilled files by @xudong963 in #14942
  • refactor: separate key and value: add BackgroundTaskCreator by @drmingdrmer in #15063
  • refactor: remove serde from CatalogId and CatalogNameIdent by @drmingdrmer in #15068
  • refactor: remove serde from kvapi::Key types by @drmingdrmer in #15070
  • refactor: use type Tenant for BackgroundJobIdent.tenant by @drmingdrmer in #15074
  • refactor: BackgroundTaskIdent use Tenant by @drmingdrmer in #15080
  • refactor: make the block pruning more clear by @BohuTANG in #15079
  • refactor: change PartInfo is_lazy to LazyLevel by @BohuTANG in #15081
  • refactor: QueryFragmentsPlanPacket should include just setting changes by @drmingdrmer in #15095
  • refactor(executor): refactor queries executor to prepare by @dqhl76 in #14965
  • refactor: show grants support display ObjectId by @TCeason in #15064
  • refactor: replace NonEmptyString with Tenant by @drmingdrmer in #15103
  • refactor: replace String with Tenant by @drmingdrmer in #15119
  • refactor: use TIdent to unify kvapi key def for MaskPolicyTableIdListIdent and DataMaskNameIdent by @drmingdrmer in #15123
  • refactor: remove count-table for tenant. It is not used at all by @drmingdrmer in #15134
  • refactor: use Tenat in VirtualColumnNameIdent by @drmingdrmer in #15135
  • refactor(query): record physical plan level metrics into profiling log by @zhang2014 in #15092
  • refactor: define ShareEndpointIdent with TIdent by @drmingdrmer in #15136
  • refactor: add peer info to network error context by @drmingdrmer in #15139
  • refactor: add Tenant to ShareConsumer by @drmingdrmer in #15142
  • chore: client should not depend on the final response. by @youngsofun in #15146
  • refactor(executor): globalize QueriesPipelineExecutor creation method by @dqhl76 in #15129
  • refactor(planner): remove statistics collection from bind phase by @Dousir9 in #15112
  • refactor(query): only intercept dml and queries in queries queue by @zhang2014 in #15143
  • refactor: user can execute show grants query itself by @TCeason in #15149
  • refactor: generate inverted indexs for each blocks by @b41sh in #15150
  • refactor: remove dependency on parquet2: Part I by @SkyFan2002 in #15158
  • refactor: use Tenant for ListBackgroundTasksReq by @drmingdrmer in #15176
  • refactor: move TIdent into separate files by @drmingdrmer in #15179

Others 📒

  • chore(storage): split mutation_log file by @zhyass in #14903
  • chore(query): create user/role name can not contain single/double quote by @TCeason in #14902
  • chore(ci): benchmark local need set meta client timeout by @TCeason in #14910
  • chore: extend time tolerance for meta-service test by @drmingdrmer in #14927
  • chore: add step logs for hash join build side by @xudong963 in #14930
  • chore: use shorter name for management API by @drmingdrmer in #14937
  • chore(ci): add pr summary check by @everpcpc in #14944
  • chore(ci): fix set failure for pr check by @everpcpc in #14946
  • chore(ci): fix description check by @everpcpc in #14947
  • chore(ci): fix event for pr assistant by @everpcpc in #14948
  • chore(query): improve push_column_with_selection by @Dousir9 in #14954
  • chore(query): add disable system tables by @zhang2014 in #14958
  • chore(planner): fix distributed query plan by @Dousir9 in #14951
  • chore: tweak txn syntax by @SkyFan2002 in #14960
  • chore(query): add settings level for admin api by @zhang2014 in #14967
  • chore(ast): CreatOption::None -> CreateOption::Create by @andylokandy in #14941
  • chore: improve histogram‘s implement and comments by @suimenno3002 in #14976
  • chore: number range for settings by @BohuTANG in #14985
  • chore: group limit optimization in new hashtable by @sundy-li in #14989
  • chore: add integration test for runtime filter by @xudong963 in #14992
  • chore: upgrade openraft to 0.9.1 by @drmingdrmer in #15006
  • chore: max_running_queries from 8 to 0, disable the max running queries by @BohuTANG in #15013
  • chore: tweak deletion batch size & log messages by @dantengsky in #15005
  • chore: rollback partial tests by @JackTan25 in #15015
  • chore: add benchmark for loading data on cloud by @everpcpc in #15017
  • chore: add a setting for inlist to join by @xudong963 in #15024
  • chore: unify TIdent key format by @drmingdrmer in #15025
  • chore: the fist http resp wait longer to tolerant old bendsql clients. by @youngsofun in #15022
  • chore: add log about presign elapsed by @flaneur2020 in #14990
  • chore: fix sql seq for benchmark report by @everpcpc in #15030
  • chore: print query detail for bench & fix prepare database for load bench by @everpcpc in #15032
  • chore: default enalbe merge into by @BohuTANG in #15034
  • chore: run load test with new label by @everpcpc in #15035
  • chore(query): add spill profile for join by @zhang2014 in #15044
  • chore(http): tune log on getting /v1/query/:id/kill by @flaneur2020 in #14987
  • chore: default enable aggregate/sort/join spill by @BohuTANG in #15038
  • chore: do not drop database first for bench by @everpcpc in #15041
  • chore(query): improve group by @sundy-li in #15031
  • chore(planner): fix invalid aggregate physical plan by @Dousir9 in #15046
  • chore: adjust the default value for table lock timeout by @zhyass in #15055
  • chore: use Tenant::new_literal() and new_nonempty() to avoid creating empty Tenant by @drmingdrmer in #15061
  • chore(ci): fix benchmark job dependency by @everpcpc in #15087
  • chore: remove rebinding merge into for insert-only by @xudong963 in #15086
  • chore: clean z3 and meta dir by @zhyass in #15098
  • chore(executor): remove duplicate profile statistics by @zhang2014 in #15115
  • chore(query): errorcode use snake shape format by @TCeason in #15122
  • chore: treat seq as place holder when navigate by @zhyass in #15124
  • chore(planner): make optimize function async by @Dousir9 in #15121
  • chore: adjust stack size of flaky ut by @SkyFan2002 in #15130
  • chore: to avoid UT stackoverflow, disable async_backtrace of TypeChecker::resolve by @dantengsky in #15133
  • chore(query): refactor new transform partition bucket for new aggregation hashtable by @Freejww in #15093
  • chore: enable inlist to join in cluster by @xudong963 in #15108
  • chore: remove no longer used message encoding benchmark by @dantengsky in #15148
  • chore(query): fix order for scalar cmp with null by @sundy-li in #15145
  • chore: fix typos by @Dousir9 in #15153
  • chore(query): use i64 min max value as histogram bucket min max limit by @zhang2014 in #15152
  • chore(query): turn on new agg hashtable by @Freejww in #15155
  • chore(query): row-fetcher-cache by @sundy-li in #15159
  • chore: add CopyInto to query queue by @BohuTANG in #15160
  • chore: refine the query/02_function to query/functions by @BohuTANG in #15161
  • chore: refine the query queue is_heavy_action func by @BohuTANG in #15166
  • chore: fix window plan in lazy topn by @sundy-li in #15165
  • chore(deps): bump h2 from 0.3.24 to 0.3.26 by @dependabot in #15172
  • chore(query): fix boolean type comparison by @Dousir9 in #15171
  • chore: return inserted rows for each table in multi table insert by @SkyFan2002 in #15178

New Contributors

  • @suimenno3002 made their first contribution in #14839
  • @ncuwaln made their first contribution in #14922
  • @blackstar-baba made their first contribution in #14926
  • @kkk25641463 made their first contribution in #15059
  • @c8ef made their first contribution in #15090

Full Changelog: https://github.com/databendlabs/databend/compare/v1.2.371...v1.2.410

Mar 11, 2024 (v1.2.371)

What's Changed

Exciting New Features ✨

  • feat: add div0 divnull method by @sundy-li in #14693
  • feat(query): grant object visibility check by @TCeason in #14458
  • feat: add a new kvapi::Key type: Tenant by @drmingdrmer in #14704
  • feat(query): privilege access check need consider if exists by @TCeason in #14700
  • feat: add kvapi::Value to define meta-service value behavior by @drmingdrmer in #14708
  • feat(geo): support geometry type for create table by @ariesdevil in #14615
  • feat(query): support vacuum temporary files by @zhang2014 in #14690
  • feat: add parent() method to kvapi::Key to describe the hierarchical structure of meta-service data by @drmingdrmer in #14716
  • feat(query): integrate hash join with new filter framework by @Dousir9 in #14689
  • feat: add kvapi::ValueType::dependency_keys() to retrieve relation such as table-name-to-table-id and table-id-to-table-meta by @drmingdrmer in #14735
  • feat: support multi-statement transactions by @SkyFan2002 in #14562
  • feat: DESC[RIBE] [TABLE] name by @dracoooooo in #14736
  • feat(query): new implementation of analyze table by @sundy-li in #14725
  • feat: add Analyze for merge into by @JackTan25 in #14763
  • feat(executor): implement time slicing for shared executor scheduling across queries by @dqhl76 in #14770
  • feat(sqlsmith): Support generating binary and geometry values by @b41sh in #14758
  • feat: get_client_info() returns the server time in milliseconds by @drmingdrmer in #14805
  • feat: http handler txn support. by @youngsofun in #14802
  • feat(planner): implement explain query in verbose mode by @leiysky in #14801
  • feat: add new string view type [part 1] by @ariesdevil in #14662
  • feat: upgrade arrow to version 50 by @sundy-li in #14784
  • feat(query): introduce arrow-udf-js by @sundy-li in #14799
  • feat: support left-related join spilling by @xudong963 in #14828
  • feat: impl like selectivity compution by probability predication by @xudong963 in #14846
  • feat: add vacuum table result table by @lichuang in #14830
  • feat(query): read write inverted index by @b41sh in #14827
  • feat: change tracking enabled table support replace into by @zhyass in #14831
  • feat: support notification syntax with rpc, and support task error notification integration by @ZhiHanZ in #14845
  • feat: support right-related join spilling by @xudong963 in #14853
  • feat: add vacuum table dry run result table by @lichuang in #14851
  • feat: add transaction and mutli-line SQL script support for task by @ZhiHanZ in #14865
  • feat: add vacuum drop table dry run result table by @lichuang in #14870
  • feat(query): support variant is_type functions by @b41sh in #14876
  • feat(query): support queries queue by @zhang2014 in #14878
  • feat(query): create drop inverted index by @b41sh in #14859
  • feat(query): show grants support mget table/db name by table/db id. by @TCeason in #14862

Thoughtful Bug Fix 🔧

  • fix: do compact first , then do sort for cluster table by @JackTan25 in #14707
  • fix(query): filter response session settings with ScopeLevel::Session by @everpcpc in #14702
  • fix: fix create or replace table as select bug by @lichuang in #14730
  • fix: check if global memory is negative by @xudong963 in #14733
  • fix: Don't return empty credentail if load failed by @Xuanwo in #14734
  • fix: wrong result for CSV EMPTY_FIELD_AS = STRING. by @youngsofun in #14744
  • fix: fix create or replace stage bug by @lichuang in #14745
  • fix: force attachment to use EmptyFieldAs::FieldDefault. by @youngsofun in #14767
  • fix: fix create or replace stage bug, only remove dir when success by @lichuang in #14766
  • fix(planner): use correct alias for column reference in select clause by @leiysky in #14775
  • fix(executor): queries pipeline executor schedule incorrectly by @dqhl76 in #14787
  • fix: fix hash join spilling hang and add more tests by @xudong963 in #14794
  • fix(ast): add missing #[drive(skip)] by @andylokandy in #14823
  • fix: Transaction state is not changed correctly when returning error in commit statement by @SkyFan2002 in #14824
  • fix(query): today, yesterday, tomorrow need consider tz by @TCeason in #14841
  • fix: flaky sqllogic test by @zhyass in #14848
  • fix: fix wrong results for left join spilling by @xudong963 in #14860
  • fix: allow insert in multiline statment by @andylokandy in #14882
  • fix: rewrite ifnull and nvl as coalesce by @PsiACE in #14877
  • fix: fix incorrect schema in window expr in subquery with lazy_materialization by @ariesdevil in #14895

Code Refactor 🎉

  • refactor: new impl for loading CSV. by @youngsofun in #14645
  • refactor: add UdfError::Exist by @drmingdrmer in #14691
  • refactor: remove kvapi::Key prefix const by @drmingdrmer in #14701
  • refactor: add Humanizer traits for operator display by @leiysky in #14718
  • refactor: schema_api::update_mask_policy by @drmingdrmer in #14724
  • refactor: rename ShareAccountNameIdent::acount to tenant; remove unused serde derive by @drmingdrmer in #14722
  • refactor: support decimal for quantile_cont by @ariesdevil in #14726
  • refactor(query): refactor runtime part 4 by @zhang2014 in #14728
  • refactor(query): migrate profile to common base by @zhang2014 in #14739
  • refactor: refactor hash join spill probe side by @xudong963 in #14743
  • refactor(storage): treat the standard stream as append only when only insertion by @zhyass in #14748
  • refactor: Add RoleGrantee to provide meta-service key for RoleMgr by @drmingdrmer in #14752
  • refactor: upgrade Openraft v0.9.0-alpha.5..v0.9.0-alpha.6 by @drmingdrmer in #14769
  • refactor: hash join build spilling by @xudong963 in #14768
  • refactor: Add RoleIdent to define kvapi::Key to access roles in meta-service by @drmingdrmer in #14772
  • refactor: make IndexNameIdent.tenant a Tenant struct by @drmingdrmer in #14776
  • refactor: refactor join spilling settings by @xudong963 in #14781
  • refactor: add TenantUserIdent as meta-service key to access user data by @drmingdrmer in #14793
  • refactor(executor): tracking memory for plan level by @zhang2014 in #14747
  • refactor(ast): rename CreateOption::CreateIfNotExists(false) -> CreateOption::None by @andylokandy in #14795
  • refactor(ast): refine crate structure by @andylokandy in #14803
  • refactor: upgrade Openraft to v0.9.0-alpha.7 by @drmingdrmer in #14809
  • refactor: tenant is not allowed to be empty by @drmingdrmer in #14815
  • refactor(ast): add derive-visitor for expr and query by @andylokandy in #14814
  • refactor(ast): add derive-visitor for Statement by @andylokandy in #14822
  • refactor: remove backward compatibility for data version V0 by @drmingdrmer in #14818
  • refactor: use relative expire time ttl to replace expire_at, tolerate query-meta time drift by @drmingdrmer in #14807
  • refactor: upgrade openraft v0.9.0-alpha.7 to v0.9.0-alpha.8 by @drmingdrmer in #14832
  • refactor: enable feature flag openraft:storage-v2 by @drmingdrmer in #14838
  • refactor: use NonEmptyString to access UDFMgr. by @drmingdrmer in #14844
  • refactor: use NonEmptyString for creating StageMgr by @drmingdrmer in #14850
  • refactor: add StageIdent and StageFileIdent by @drmingdrmer in #14863
  • refactor: use KVPbApi to simplify StageMgr by @drmingdrmer in #14871
  • refactor: use NonEmptyString to create Settings by @drmingdrmer in #14875
  • refactor: enable openraft feature flag generic-snapshot-data by @drmingdrmer in #14881
  • refactor: use SettingIdent to build string key to access settings by @drmingdrmer in #14883
  • refactor(ast): migrate to derive-visitor by @andylokandy in #14829
  • refactor(ast): remove databend-common-functions dependency by @andylokandy in #14893

Build/Testing/CI Infra Changes 🔌

  • ci: add tests for transaction. by @youngsofun in #14826
  • ci: fix clippy tests. by @youngsofun in #14855

Others 📒

  • chore: add plan id for mutations by @JackTan25 in #14685
  • chore: install sqlite3 for build-tool by @ariesdevil in #14695
  • chore: install libsqlite3 for build-tool by @ariesdevil in #14696
  • chore: minor refactor of adjust_bloom_runtime_filter by @xudong963 in #14697
  • chore: extract common functions from metactl by @drmingdrmer in #14698
  • chore(query): support custom query graceful shutdown timeout by @zhang2014 in #14712
  • chore(query): improve new filter execution by @Dousir9 in #14715
  • chore: add metrics for compact and recluster by @zhyass in #14717
  • chore: static features for async-compression deps by @everpcpc in #14711
  • chore: remove stream pruner by @zhyass in #14723
  • chore(query): improve aggregate unary function by @sundy-li in #14729
  • chore: polish error message. by @youngsofun in #14737
  • chore: rename by @JackTan25 in #14740
  • chore: refactor RoleMgr by @drmingdrmer in #14741
  • chore: build-tool base for both arm64 & amd64 by @everpcpc in #14751
  • chore: fix setup protoc aarch64 by @everpcpc in #14754
  • chore: fix compile error by @SkyFan2002 in #14753
  • chore: change build image to debian/alpine by @everpcpc in #14755
  • chore(ci): fix LIBCLANG_PATH for alpine build-tool by @everpcpc in #14764
  • chore(query): fix location ver by @sundy-li in #14762
  • chore(query): add max_running_query_execute_time for http status by @zhang2014 in #14750
  • chore(query): support string eq int by @sundy-li in #14759
  • chore: add more transaction test by @SkyFan2002 in #14746
  • chore(ci): change build runner with target by @everpcpc in #14761
  • chore: fix build for cloud by @everpcpc in #14773
  • chore(query): use create_time of process info for http status by @zhang2014 in #14777
  • chore: show create table with location by @ariesdevil in #14778
  • chore(query): add drop guard for impl drop trait by @zhang2014 in #14783
  • chore(ci): fix target for release by @everpcpc in #14785
  • chore(query): remove codes of storage/parquet2 by @sundy-li in #14305
  • chore: patch z3-sys & add musl build to ci by @everpcpc in #14786
  • chore: fix musl build dockerfile by @everpcpc in #14790
  • chore: release add musl targets by @everpcpc in #14791
  • chore(test): reduce the number of tables in large join test by @Dousir9 in #14797
  • chore: add Date Type for runtime filter by @JackTan25 in #14780
  • chore: rename update_star_columns by @JackTan25 in #14806
  • chore(query): add unit test for mem stat by @zhang2014 in #14812
  • chore(ci): include nohup.out in failure artifact by @everpcpc in #14820
  • chore: remove unused package patch by @everpcpc in #14821
  • chore(ci): fix upload/download sha256sums artifacts by @everpcpc in #14833
  • chore(ci): fix release artifacts conflict by @everpcpc in #14834
  • chore(ci): fix artifacts for benchmark by @everpcpc in #14837
  • chore: tolerate inaccuracy of expire time for test by @drmingdrmer in #14836
  • chore(query): modify new_agg_hashtable payload transmission method on singleton and support new_agg_hashtable on cluster by @Freejww in #14798
  • chore: Enable stage table distributed by @JackTan25 in #14847
  • chore: disable inlist to subquery under cluster by @xudong963 in #14864
  • chore(query): enable fast return for cluster hash join by @Dousir9 in #14854
  • chore: log txn id for multi stmt transaction by @SkyFan2002 in #14861
  • chore: improve tolerance for expire test test_kv_read_v1_on_follower by @drmingdrmer in #14868
  • chore: open join spilling by default by @xudong963 in #14843
  • chore: increase tolerance for expiration test by @drmingdrmer in #14880

New Contributors

  • @dracoooooo made their first contribution in #14736

Full Changelog: https://github.com/databendlabs/databend/compare/v1.2.344...v1.2.371

Feb 22, 2024 (v1.2.344)

What's Changed

Exciting New Features ✨

  • feat: Enable async backtrace for opendal by @Xuanwo in #14384
  • feat: Allow users to configure tcp keepalive for opendal by @Xuanwo in #14385
  • feat: Remove the need of setting allow_anonymous by @Xuanwo in #14398
  • feat: support outer to inner join by @xudong963 in #14401
  • feat: more constrain for file format string options. by @youngsofun in #14374
  • feat: support functions nvl and nvl2 by @PsiACE in #14421
  • feat: add json delete by name and index by @akoshchiy in #14397
  • feat: Add COS location support by @Xuanwo in #14435
  • feat(query): support session setting scope LOCAL by @everpcpc in #14441
  • feat: allow to config task session parameters. by @ZhiHanZ in #14446
  • feat(query): copy into allow disable_variant_check for parquet file by @b41sh in #14444
  • feat: add create or replace database support by @lichuang in #14449
  • feat: add basic support for show user functions by @PsiACE in #14440
  • feat: meta-service: customize chunk size when export by @drmingdrmer in #14467
  • feat: pushdown limit to single window by @ariesdevil in #14460
  • feat: Improve merge into performance, Update Column Only Optimization by @JackTan25 in #14429
  • feat(planner): support predicates move around by @Dousir9 in #14338
  • feat(rbac): ensure the builtin roles on auth by @flaneur2020 in #14451
  • feat: treat path in uri as prefix. by @youngsofun in #14420
  • feat: add structlog by @andylokandy in #14495
  • feat: add create or replace table support by @lichuang in #14508
  • feat: use parquet-rs as an alternative in fuse engine by @SkyFan2002 in #14268
  • feat: allow to config cloud control plane grpc timeout by @ZhiHanZ in #14522
  • feat: support window function in subquery by @xudong963 in #14518
  • chore(query): add system.tables query speed test by @TCeason in #14517
  • feat(query): agg-hashtable-singleton by @Freejww in #14524
  • feat: system table add query_log.log_type_name and cluster.cluster_id by @BohuTANG in #14545
  • feat: add column create_on for user functions table by @PsiACE in #14540
  • feat: support single join to inner join by @xudong963 in #14442
  • feat: add create or replace stage support by @lichuang in #14548
  • feat(geo): add geometry data type by @ariesdevil in #14470
  • feat(query): support early filtering for more join types by @Dousir9 in #14525
  • feat: add runtime filter stats info to profile by @xudong963 in #14569
  • feat: add create or replace connection support by @lichuang in #14580
  • feat: add create or replace user support by @lichuang in #14583
  • feat: add #- json operator by @akoshchiy in #14584
  • feat: add support for create or replace function by @PsiACE in #14581
  • feat: add create or replace stream support by @lichuang in #14596
  • feat: add create or replace view support by @lichuang in #14599
  • feat: Add KVPbApi: protobuf support for meta-service by @drmingdrmer in #14605
  • feat: use materialized cte for standard stream by @zhyass in #14561
  • feat: add create or replace file format support by @lichuang in #14611
  • feat: add create or replace agg index support by @lichuang in #14636
  • feat: add create or replace virtual column support by @lichuang in #14649
  • feat: add create or replace password policy support by @lichuang in #14654
  • feat: add create or replace masking policy support by @lichuang in #14656
  • feat: add create or replace network policy support by @lichuang in #14658
  • feat: add create or replace share endpoint support by @lichuang in #14660
  • refactor: enables the bloom runtime filter to be turned on adaptively by @xudong963 in #14686

Thoughtful Bug Fix 🔧

  • fix(sqlsmith): fix generate nested not null types caused panic by @b41sh in #14392
  • fix: optimize vacuum drop table call schema gc_drop_tables API in batch mode by @lichuang in #14395
  • fix: min/max of default col values in snapshot should adhere to rules by @dantengsky in #14368
  • fix: fix row number concat by @JackTan25 in #14416
  • fix(query): onwership/role info serialize to pb by @TCeason in #14393
  • fix: incorrect column id in inner_project schema by @SkyFan2002 in #14419
  • fix(query): fix pow and add strcmp domain by @Freejww in #14418
  • fix: stream cannot be accessed correctly by @zhyass in #14433
  • fix(query): global contain stage and udf priv by @TCeason in #14344
  • fix(query): add utf8 validation in deser string by @sundy-li in #14443
  • fix: flaky test by @zhyass in #14454
  • fix(query): config value for session settings by @everpcpc in #14475
  • fix: throw err when cast Decimal overflow by @guojidan in #14489
  • fix(query): replace mock_s3 with mock_aws by @zhang2014 in #14498
  • fix: should use is_nested while populating in-memory cache by @dantengsky in #14503
  • fix: wrong result with special stream alias by @zhyass in #14494
  • fix: fix update/delete using subquery only support boolean data type by @lichuang in #14413
  • fix(query): privilege type forward compat by @TCeason in #14501
  • fix(query): add range for memory ratio settings by @everpcpc in #14513
  • fix: refactor merge into insert spectify default values by @JackTan25 in #14477
  • fix(query): fix variant function with invalid jsonb panic by @b41sh in #14520
  • fix: Smaller random data by @JackTan25 in #14521
  • fix: Remove test by @JackTan25 in #14526
  • fix: table maybe locked when recluster by @zhyass in #14510
  • fix: fix ranking function out of bounds when calculate partition end by @ariesdevil in #14535
  • fix(query): do not check exists for unsetting by @everpcpc in #14536
  • fix(query): add warning for unset unknown setting by @everpcpc in #14537
  • fix(executor): fix some display name for profile plans by @zhang2014 in #14523
  • fix(ddl): fix create table bitmap and variant with default value by @b41sh in #14472
  • fix(executor): fix broken profile graph for cluster mode by @zhang2014 in #14554
  • fix: runtime filters add to wrong table by @xudong963 in #14552
  • fix(query): revoke need consider GrantObject::Database/Table by @TCeason in #14567
  • fix(query): fix double panic if enable_experimental_aggregate_hashtable by @zhang2014 in #14576
  • fix: fix comment for observe_append_send_spent by @domyway in #14586
  • fix: insert binary by @andylokandy in #14556
  • fix(query): fix parse unsigned integer with number by @b41sh in #14590
  • fix: incorrect schema may be used while executing replace-into by @sundy-li in #14592
  • fix(ast): unexpected cut in not null by @andylokandy in #14604
  • fix(query): try fix oom with large sort limit by @zhang2014 in #14629
  • fix: otlp logger runtime for graceful shutdown by @everpcpc in #14539
  • fix(executor): fix broken profile graph for cluster mode by @zhang2014 in #14666
  • fix: Use stat instead of list for checking storage by @Xuanwo in #14680
  • fix: fix get by keypath with null value by @b41sh in #14684
  • fix(query): account_admin should support grant ownership to new role after drop an created role by @TCeason in #14597
  • fix(query): fix unnest variant array panic by @b41sh in #14688
  • fix(query): Pass ci Test when enable enable_experimental_aggregate_hashtable . by @sundy-li in #14544

Code Refactor 🎉

  • refactor: StringType::Scalar: Vec<u8> -> String by @andylokandy in #14349
  • refactor: Add trait FromToProtoEnum by @drmingdrmer in #14423
  • refactor: add kv-api::Key::ValueType by @drmingdrmer in #14425
  • refactor: drop unused blocks when using ranking window function by @ariesdevil in #14428
  • refactor(executor): refactor processor profiling by @zhang2014 in #14377
  • refactor(executor): remove old profile framework by @zhang2014 in #14468
  • refactor: replace OneTable with DummyTableScan by @xudong963 in #14461
  • refactor: remove udf from system functions table by @PsiACE in #14471
  • refactor: Using reqwest-hickory-resolver as global dns client by @Xuanwo in #14476
  • refactor: meta-service: purge log in another task by @drmingdrmer in #14519
  • refactor: meta-service purge log in small chunks to avoid blocking by @drmingdrmer in #14527
  • refactor(query): account_admin role generate in query by @TCeason in #14514
  • refactor(query): disallowed tokio::spawn method by @zhang2014 in #14563
  • refactor: refine UdfMgr by @drmingdrmer in #14589
  • refactor: get_udf() does not need to specify Seq by @drmingdrmer in #14613
  • refactor: Add KVPbApi::list_pb(), simplify UdfMgr::get_udfs() by @drmingdrmer in #14623
  • refactor: Simplify UdfMgr::add_udf() by @drmingdrmer in #14631
  • refactor: UdfApi::drop_udf() return the dropped record instead of returning ErrorCode by @drmingdrmer in #14640
  • refactor(query): support additional param to set the error rate in approx_count_distinct function by @sundy-li in #14609
  • refactor: Remove UdfApi, use UdfMgr directly by @drmingdrmer in #14642
  • refactor(query): disallowed tokio spawn and block_on by @zhang2014 in #14639
  • refactor(executor): add graph information to prepare adding points by @dqhl76 in #14681

Build/Testing/CI Infra Changes 🔌

  • ci: fix previous tag for nightly release by @everpcpc in #14478

Others 📒

  • chore: upgrade crate shlex to version 1.3.0 by @dantengsky in #14404
  • chore: move meta-data type Id from crate meta-api to meta-app by @drmingdrmer in #14408
  • chore: Add debug log for read parts numbers by @Xuanwo in #14396
  • chore(planner): fix push down filter scan by @Dousir9 in #14415
  • chore: modify the max block count in compact by @zhyass in #14424
  • chore: enable new setting data_retention_time_in_days by @dantengsky in #14402
  • chore: fix cache variable name by @xx01cyx in #14430
  • chore: remove binary collatoin by @andylokandy in #14409
  • chore: bump rust toolchain for dev container by @leiysky in #14436
  • chore: bump rust version of build container by @leiysky in #14437
  • chore: fix build tool installation by @leiysky in #14438
  • chore: bump rust toolchain to nightly-2024-01-23 by @leiysky in #14434
  • chore: Forbidden non equal merge into distributed by @JackTan25 in #14426
  • chore(query): reduce get ownership by @TCeason in #14427
  • chore: simplify IndexScalar by @andylokandy in #14432
  • chore: make cost factor configurable by @leiysky in #14445
  • chore: set peer_start to current row when do ranking functions by @ariesdevil in #14450
  • chore: only wrap logical get with distinct to eliminate duplicates rows for subquery in single mode by @xudong963 in #14452
  • chore: support mark join commutation and fix left mark join by @xudong963 in #14455
  • chore(query): add buffer size in spill write by @sundy-li in #14466
  • chore: avoid wildcard match on data type by @andylokandy in #14457
  • chore: fix cyclic deps by @SkyFan2002 in #14483
  • chore(query): fix hash join spill block size by @Dousir9 in #14484
  • chore(query): bump backtrace-rs version by @zhang2014 in #14485
  • chore: correct ci-benchmark TPC-H Q5 by @Dousir9 in #14491
  • chore: update Domain::Map by @andylokandy in #14314
  • chore(query): update opendal version and add retry io timeout by @zhang2014 in #14496
  • chore(executor): add unit for profile statistics by @zhang2014 in #14499
  • chore: clean license message by @ZhiHanZ in #14509
  • chore: support row fetcher with window function by @ariesdevil in #14492
  • chore(query): mark obsoleted session settings in config by @everpcpc in #14512
  • chore: remove unused codes by @zhyass in #14511
  • chore: sleep 5ms before next batch purge by @drmingdrmer in #14530
  • chore(query): make column as an optional keyword in add/modify/drop by @sundy-li in #14533
  • chore: fix openraft version: use v0.9.0-alpha.5 instead of v0.9.0-alpha.4 by @drmingdrmer in #14538
  • chore(query): need refactor system table test into benchmark by @TCeason in #14546
  • chore: refactor pattern plan with Matcher by @leiysky in #14549
  • chore: remove get_table_index method by @xudong963 in #14557
  • chore: make runtime filter for broadcast rework by @xudong963 in #14559
  • chore(ci): add system tables/columns benchmark cloud by @TCeason in #14555
  • chore: format reuse.benchmark_query_meta.yml by @dantengsky in #14575
  • chore: improve parallel run for merge into by @JackTan25 in #14574
  • chore(executor): using human format for explain analyze profile by @zhang2014 in #14578
  • chore(ci): fix benchmark labels for local by @everpcpc in #14603
  • chore: add error message on refresh table info by @flaneur2020 in #14606
  • chore: refine hash join code by @Dousir9 in #14571
  • chore(ci): fix benchmark scripts by @everpcpc in #14608
  • chore: make materialized cte scan_progress accurate by @xudong963 in #14602
  • chore: Bump OpenDAL to pinned tag version by @Xuanwo in #14601
  • chore(ci): benchmark system table query speed with a large test data by @TCeason in #14612
  • chore(ci): load benchmark data from hits_100m_obfuscated_v1 by @everpcpc in #14619
  • chore(ci): remove dummy dataset for benchmark by @everpcpc in #14620
  • chore(ci): fix hits benchmark schema by @everpcpc in #14621
  • chore(ci): tmp disable local hits benchmark by @everpcpc in #14625
  • chore(ci): fix internal test reduce bendsql warn by @TCeason in #14627
  • chore(ci): fix download benchmark results by @everpcpc in #14628
  • chore(ci): fix get dataset from benchmark result file by @everpcpc in #14630
  • chore: add table index to explain join by @xudong963 in #14632
  • chore(ci): fix download benchmark result by @everpcpc in #14633
  • chore(ci): fix new line in benchmark comment by @everpcpc in #14638
  • chore(query): improve hash join code by @Dousir9 in #14614
  • chore: reduce small block writes (improve merge into write performance) by @JackTan25 in #14637
  • chore: move KVPbApi impl to seperate files by @drmingdrmer in #14634
  • chore(ci): fix comment benchmark result by @everpcpc in #14643
  • chore: refine plan id generation by @leiysky in #14644
  • chore(ci): fix runner provider for release benchmark by @everpcpc in #14646
  • chore(meta): remove tenant for table info by @zhang2014 in #14647
  • chore(ci): fix artifacts for benchmark archive by @everpcpc in #14652
  • chore: remove spawn_blocking for merge into and replace into by @JackTan25 in #14664
  • chore(query): update async trait version by @zhang2014 in #14667
  • chore: add dev deps for proj crate by @ariesdevil in #14670
  • chore: add dev dep sqlite binary for proj crate by @ariesdevil in #14673
  • chore(executor): remove useless warn log by @zhang2014 in #14672
  • chore: install liblzma-dev for build-tool by @everpcpc in #14678
  • chore: upgrade arrow-format by @ariesdevil in #14675
  • chore: optimize q17 join order by @xudong963 in #14624
  • chore(query): max_decoding_message_size by @sundy-li in #14687

New Contributors

  • @xx01cyx made their first contribution in #14430
  • @domyway made their first contribution in #14586

Full Changelog: https://github.com/databendlabs/databend/compare/v1.2.307...v1.2.344

Jan 25, 2024 (v1.2.307)

What's Changed

Exciting New Features ✨

  • feat: Add huggingface support by @Xuanwo in #14304
  • feat(query): add float64 version of distance function overload by @sundy-li in #14311
  • feat: CopyIntoLocation support output. by @youngsofun in #14315
  • feat: query level data cache metrics by @dantengsky in #14313
  • feat: CSV format add option binary_format and output_header. by @youngsofun in #14329
  • feat(planner): implement cost-based optimization for distributed query by @leiysky in #14318
  • feat(query): add function MONTHS_BETWEEN by @sundy-li in #14352
  • feat: support runtime filter for right join and string join key by @xudong963 in #14359
  • feat(query): admin api add system info by @everpcpc in #14360
  • feat: target build optimization for merge into by @JackTan25 in #14066
  • feat: support min_max runtime filter for string type by @xudong963 in #14372
  • feat: add task_depents and task_dependents_enable function by @ZhiHanZ in #14383
  • feat: add data_retention_time_in_days_max config and data_retention_time_in_days setting by @BohuTANG in #14388
  • chore(query): add index scalar by @sundy-li in #14362
  • feat: standard stream by @zhyass in #14272

Thoughtful Bug Fix 🔧

  • fix(query): add comment in table meta by @sundy-li in #14321
  • fix(query): grant all to role_name need succ by @TCeason in #14333
  • fix(query): missing push binary column by @everpcpc in #14342
  • fix: fix full outer join bug by @lichuang in #14347
  • fix: clear the derived columns after flattening by @xudong963 in #14353
  • fix: flaky test by @zhyass in #14366
  • fix: keys in stage_attachment.file_format_options should be case-insensitive by @youngsofun in #14357
  • fix: unexpected '/' in unload path. by @youngsofun in #14375
  • fix: fix update using using constant in predicate from subquery panic by @lichuang in #14367
  • fix(query): fix inner type of nested type can not be set as not null by @b41sh in #14379
  • fix(query): fix json cast to string with quote by @b41sh in #14389
  • fix(query): use serde_json maybe cause old query can not access data that generate by new query by @TCeason in #14380

Code Refactor 🎉

  • refactor: add cardinality to CteScan explain by @xudong963 in #14310
  • refactor: add BinaryColumn, BinaryColumnBuilder by @andylokandy in #14306
  • refactor(query): delete owner in table/db meta by @TCeason in #14307
  • refactor(query): built-in roles inside meta by @TCeason in #14320
  • refactor(query): built-in roles inside meta by @TCeason in #14339
  • refactor: optimize SchemaApi::create_table() by @drmingdrmer in #14341
  • refactor: meta-client should not block by RPC by @drmingdrmer in #14355
  • refactor: create-table fetch latest state after backoff sleep by @drmingdrmer in #14364
  • refactor: do not use TxnPutRequest|TxnDeleteRequest.prev_value, always return previous value by @drmingdrmer in #14371
  • refactor: adjsut file format options. by @youngsofun in #14335

Build/Testing/CI Infra Changes 🔌

  • ci: ci-cloud build both amd64 & arm64 by @everpcpc in #14317

Others 📒

  • chore: update testing parquet assets for stage test by @andylokandy in #14303
  • chore(query): forbiden revoke ownership statement by @TCeason in #14301
  • chore: fix system table ctx warning and log more warnings to log by @BohuTANG in #14309
  • chore: fix mark scan, test performance. by @JackTan25 in #14319
  • chore(ci): do not release macos build by @everpcpc in #14326
  • chore(query): add log prefix filter config by @zhang2014 in #14330
  • chore: tweak fuse compat test by @dantengsky in #14322
  • chore(ci): fix bendsql version for macos by @everpcpc in #14351
  • chore: test binary with udf by @everpcpc in #14354
  • chore: add a setting to enforce broadcast join by @xudong963 in #14361
  • chore(planner): refine cascades optimizer by @leiysky in #14350
  • chore(query): improve agg index range comparison by @Dousir9 in #14378
  • chore(planner): improve selectivity estimator by @Dousir9 in #14386
  • chore: set auto_compaction_imperfect_blocks_threshold to 50 by @zhyass in #14390

Full Changelog: https://github.com/databendlabs/databend/compare/v1.2.292...v1.2.307

Jan 11, 2024 (v1.2.292)

What's Changed

Exciting New Features ✨

  • feat: return latest leader addr if request is forwarded by @drmingdrmer in #14288
  • feat: add task sql check by @ZhiHanZ in #14292
  • feat(query): support adaptive filter reorder by @Dousir9 in #14295
  • feat: support semi join to inner join and optimize tpch q18 by @xudong963 in #14221

Thoughtful Bug Fix 🔧

  • fix(query): remove topk optimization in parquet2 by @sundy-li in #14297

Code Refactor 🎉

  • refactor: use binary for non-utf8 string type by @andylokandy in #14235

Others 📒

  • chore: add select tests for agg index with order by by @ariesdevil in #14289
  • chore(flaky): fix flaky 18_0002_ownership_cover by @TCeason in #14300

Full Changelog: https://github.com/databendlabs/databend/compare/v1.2.290...v1.2.292

Jan 10, 2024 (v1.2.290)

What's Changed

Exciting New Features ✨

  • feat: show full databases add owner field by @guojidan in #14184
  • feat(query): Add array_reduce function with lambda expression by @Freejww in #14143
  • feat(query): ownership cover stage/udf by @TCeason in #14217
  • feat: distributed execution of update statement by @SkyFan2002 in #13971
  • feat(query): Support check password policy when login by @b41sh in #14124
  • feat: add owner field into show full tables by @guojidan in #14219
  • feat: push filter down below the window by @lichuang in #14108
  • feat: support explain fragments for update by @SkyFan2002 in #14227
  • feat: cast between string and binary by @andylokandy in #14247
  • feat(query): string to ts support parse ISO 8601 tz format by @TCeason in #14253
  • feat: add json concat by @akoshchiy in #14251
  • feat: CSV add option empty_field_as. by @youngsofun in #14269
  • feat(planner): support greedy join order algorithm by @Dousir9 in #14240
  • feat: auto refresh ReadOnlyAttach-ed table schema by @dantengsky in #14241
  • feat(query): add basic sort support for agg index rewrite by @ariesdevil in #14270
  • feat(query): support transfer grant by @TCeason in #14238

Thoughtful Bug Fix 🔧

  • fix: load parquet with missing_field_as get wrong results. by @youngsofun in #14207
  • fix: disable distributed plan after disabling join reorder by @xudong963 in #14218
  • fix: fix EmptyMap domain as undefine by @lichuang in #14226
  • fix: aggregate and window function as the argument of unnest return unable to get field named by @Freejww in #14233
  • fix(query): fix missing hash fields for NonZeroU64 by @sundy-li in #14245
  • fix: fix count rewrite in agg index by @ariesdevil in #14256
  • fix(query): fix copy into float value cast to integer by @b41sh in #14257
  • fix(query): Add LegacyScalar & LegacyColumn by @sundy-li in #14264
  • fix(rbac): revoke privilege by @flaneur2020 in #14278

Code Refactor 🎉

  • refactor: Bump OpenDAL to 0.44 by @Xuanwo in #14202
  • refactor: Define features provided by raft server and features required by raft client by @drmingdrmer in #14214
  • refactor(query): Determine whether to check the ownership based on the object type by @TCeason in #14210
  • refactor: refactor NativeDeserializeDataTransform. by @RinChanNOWWW in #14213
  • refactor(schema): re-arrange conversion between arrow2 by @andylokandy in #14175
  • refactor: Replace KVApi::get() and KVApi::mget_kv() with get_kv_stream() by @drmingdrmer in #14255
  • refactor: remove from message::Request: Get(r), MGet(r), List(r) and StreamGet(r). by @drmingdrmer in #14258
  • refactor: Endpoint: add parse(); make fields private by @drmingdrmer in #14265
  • refactor: common-exception crate should be independent of meta-types by @drmingdrmer in #14275
  • refactor: move GrpcHelper to crate common-types by @drmingdrmer in #14277

Build/Testing/CI Infra Changes 🔌

  • ci: docker image tag latest only on stable release by @everpcpc in #14266

Others 📒

  • chore: Pin hive_metastore to tagged version 0.0.2 by @Xuanwo in #14201
  • chore: disable distributed execution of merge-into when spilling is enabled by @dantengsky in #14178
  • chore: modify the max block count in compact by @zhyass in #14222
  • chore(query): support selection for render_error by @Dousir9 in #14220
  • chore(executor): add more tests for executor schedule by @dqhl76 in #14225
  • chore(query): fix first error row by @Dousir9 in #14228
  • chore: test Mergeinto runtime filter performance and wizard by @JackTan25 in #14212
  • chore: refine the error message for management api by @BohuTANG in #14242
  • chore(query): fix password policy typos by @b41sh in #14246
  • chore: refactor the hook by @BohuTANG in #14250
  • chore: update doc about compatibility changes by @drmingdrmer in #14260
  • chore(planner): add TPC-DS join order sqllogictest by @Dousir9 in #14244
  • chore(ci): fix docker push tags by @everpcpc in #14273
  • chore: use raw sql as task SQL instead of pretty one by @ZhiHanZ in #14281
  • chore(query): AlterUdf and AlterShareTenants needs alter global priv by @TCeason in #14284

New Contributors

  • @Freejww made their first contribution in #14143

Full Changelog: https://github.com/databendlabs/databend/compare/v1.2.279...v1.2.290

Jan 2, 2024 (v1.2.279)

What's Changed

Exciting New Features ✨

  • feat: support abs for decimal types by @sundy-li in #14195

Thoughtful Bug Fix 🔧

  • chore(query): refresh hook does not return an error by @b41sh in #14194

Code Refactor 🎉

  • refactor: use Binaryfuse16 and disable feature uniform-random by @xudong963 in #14187
  • refactor: Implement exponential backoff in meta-service txn retries by @drmingdrmer in #14196
  • refactor: Remove Compatible error by @drmingdrmer in #14198

Build/Testing/CI Infra Changes 🔌

  • ci: add ee management mode test by @zhyass in #14188

Others 📒

  • chore(query): better handling of nullable columns during filter execution by @Dousir9 in #14197

Full Changelog: https://github.com/databendlabs/databend/compare/v1.2.275...v1.2.279

Dec 30, 2023 (v1.2.275)

What's Changed

Exciting New Features ✨

  • feat(query): Support create/alter/drop/desc/show password policy by @b41sh in #14012
  • feat: tweak auto compaction strategy by @zhyass in #13577
  • feat: support reading partitioned Delta table. by @youngsofun in #14084
  • feat: create function support lambda by @guojidan in #14104
  • feat: make Top-N sort can be spilled. by @RinChanNOWWW in #14131
  • feat: support to build DAG on task DDL by @ZhiHanZ in #14141
  • feat: copy from parquet support missing_field_as. by @youngsofun in #14149
  • feat: Bump OpenDAL to v0.43.0 by @Xuanwo in #13978
  • feat(expr): add Binary type by @andylokandy in #14142
  • feat(rbac): treat the empty ownership as owned by ACCOUNT_ADMIN instead of PUBLIC by @flaneur2020 in #14112
  • feat: support more runtime filter types by @xudong963 in #13993
  • feat: add stream_status http api by @zhyass in #14185
  • feat(query): new filter execution framework by @Dousir9 in #13846

Thoughtful Bug Fix 🔧

  • fix(executor): fix broken profile graph for union all by @zhang2014 in #14107
  • fix(query): add udf name(in meta) into UdfServerCall by @TCeason in #14105
  • fix: select stream limit get error by @zhyass in #14086
  • fix(query): grant object use table/db id by @TCeason in #14016
  • fix: hash join hang by @xudong963 in #14157
  • fix(query): ownership only support grant or revoke to role by @TCeason in #14160
  • fix: trim block metas pruned by runtime filter by @dantengsky in #14166
  • fix(expr): fix cast map column validity during evaluation by @andylokandy in #14113
  • fix(query): fix parse string to JSON value by @b41sh in #14177
  • fix(executor): fix broken query profiling graph by @zhang2014 in #14183
  • fix(query): fix cast domain values did not perform rounding operation by @b41sh in #14158

Code Refactor 🎉

  • refactor: refactor error message return for drop table and set options by @JackTan25 in #14078
  • refactor: refactor create agg index by @ariesdevil in #14085
  • refactor: refine if enable join spill by @xudong963 in #14130
  • refactor: integrate HeapMerger to TransformMultiSortMerge. by @RinChanNOWWW in #14065
  • refactor: make sync agg index default by @ariesdevil in #14140
  • refactor: delegate backoff handling to Openraft by @drmingdrmer in #14154
  • refactor: move to meta-service handshake into pool by @drmingdrmer in #14161
  • refactor: build_block method by @xudong963 in #14164
  • refactor(planner): Correct usage of exchange operator by @leiysky in #14163

Build/Testing/CI Infra Changes 🔌

  • ci: add lychee links checker with cache by @PsiACE in #14100

Documentation 📔

  • docs: replace the wrong links in README by @dqhl76 in #14090
  • docs: add compatibility to meta/README.md by @drmingdrmer in #14128

Others 📒

  • chore(query): desugar round/truncate by @sundy-li in #14087
  • chore(query): Rename table not allow modify catalog or database by @TCeason in #14098
  • chore: add more comments for auto compact and remove the settings table goldenfile test by @BohuTANG in #14101
  • chore: handle error returned from resolve_data_source() by @lichuang in #14097
  • chore(query): add log on slow http query request by @flaneur2020 in #14082
  • chore: the final merge sort run outputs blocks directly. by @RinChanNOWWW in #14091
  • chore(query): improve concat boolean types by @Dousir9 in #14096
  • chore(query): round the results for decimal division by @sundy-li in #14109
  • chore: refine enable_compact_after_write, enable_recluster_after_write need deprecated by @BohuTANG in #14114
  • chore: rename CompactOptions::limit to num_segment_limit by @dantengsky in #14116
  • chore: add limit for compact hook by @zhyass in #14115
  • chore: remove the compact hook warn message and rename auto_compaction_threshold to auto_compaction_imperfect_blocks_threshold by @BohuTANG in #14117
  • chore: remove flaky test(cluster) in 01_0013_system_metrics.test by @BohuTANG in #14123
  • chore(planner): support enumerating required properties by @leiysky in #14121
  • chore(ast): support expr in the position of function parameter by @andylokandy in #14122
  • chore: improve the error message by GPT4 by @BohuTANG in #14133
  • chore(expr): cleanup code for kernels/group_by by @andylokandy in #14136
  • chore: remove the unused setting:enable_recluster_after_write by @BohuTANG in #14146
  • chore(query): virtual columns not pushdown if not created by @b41sh in #14147
  • chore: remove onlyif mysql in tests by @xudong963 in #14170
  • chore(query): remove some onlyif mysql in test by @TCeason in #14168
  • chore(executor): add error when dump running graph by @zhang2014 in #14173
  • chore(executor): add tests for schedule queue by @dqhl76 in #14180
  • chore: rename hash join probe state name by @xudong963 in #14186

New Contributors

  • @dqhl76 made their first contribution in #14090

Full Changelog: https://github.com/databendlabs/databend/compare/v1.2.262...v1.2.275

Dec 20, 2023 (v1.2.262)

What's Changed

Exciting New Features ✨

  • feat: auto enable change tracking when create stream by @zhyass in #13858
  • feat: support qualify clause by @ariesdevil in #13829
  • feat: matched only and insert only pipeline separate by @JackTan25 in #13680
  • feat: Load credential while role_arn has been specified by @Xuanwo in #13875
  • feat(query): support take_ranges by @Dousir9 in #13878
  • feat: add json_path_match & @?,@@ operators by @akoshchiy in #13906
  • feat: add merge status by @JackTan25 in #13903
  • feat: support connection in stage related infer_schema by @guojidan in #13890
  • feat: iceberg table engine. by @youngsofun in #13835
  • feat(query): add TransformFilter by @Dousir9 in #13922
  • feat: mask admon_key_id && secret_access_key by @guojidan in #13916
  • feat(query): Add warnings in the query response by @flaneur2020 in #13918
  • feat: Try to use volo thrift for async support by @Xuanwo in #13921
  • feat(cluster): support compression for query flight data by @zhang2014 in #13934
  • feat(planner): support timeout termination for query optimization by @leiysky in #13955
  • feat: add internal stream columns by @zhyass in #13960
  • feat: add table engine delta. by @youngsofun in #13949
  • feat: add new processor TransformSortSpill to handle external sort. by @RinChanNOWWW in #13938
  • feat: add kill query when extend lock failure by @zhyass in #13980
  • feat(executor): support output query plan profiles to logs by @zhang2014 in #13945
  • feat(query): add config for query profile log by @everpcpc in #13987
  • feat: enable sort spilling when memory reach the limit. by @RinChanNOWWW in #13989
  • feat: support maximum check by @guojidan in #13992
  • feat(meta): support use id get table/db_name by @TCeason in #14006
  • feat: show locks by @zhyass in #14004
  • feat(query): Support enable hook to refresh virtual column after insert by @b41sh in #13983
  • feat: add setting to disable clickhouse handler by @guojidan in #13965
  • feat: rotating logs by @yufan022 in #14008
  • feat(planner): Eliminate unnecessary sort by @leiysky in #14023
  • feat: add column create_on for stages table by @everpcpc in #14017
  • feat: Introduced MetaSpec for setting relative expiration by @drmingdrmer in #14041
  • feat(query): add decimal round/truncate/ceil/floor function by @sundy-li in #14040
  • feat: add range for show settings by @guojidan in #14049
  • feat(query): Support show virtual columns by @b41sh in #14038

Thoughtful Bug Fix 🔧

  • fix: filter logs from other modules by @everpcpc in #13861
  • fix: fix lower_case by @JackTan25 in #13854
  • fix(cluster): close output port when bucket work finished by @zhang2014 in #13870
  • fix: fix avg for decimal type by @ariesdevil in #13877
  • fix(query): Skip checking the privilege about the user stage @~ by @flaneur2020 in #13823
  • fix: resolve flight connection leak by @dantengsky in #13866
  • fix: remove merge into lock by @JackTan25 in #13897
  • fix: add visit for merge into/update/replace/insert/delete/copy statement by @JackTan25 in #13848
  • fix(query): fix check join condition with aggregate and window functions by @b41sh in #13879
  • fix: Fix source optimized without enable distributed but in clusters environment by @JackTan25 in #13910
  • fix(query): add some dml prvilege check by @TCeason in #13845
  • fix: max_active_session does not work for http handler. by @youngsofun in #13931
  • fix: wrong use size_hint in concat_columns by @xudong963 in #13936
  • fix: make build and probe hash method consistent in spill by @xudong963 in #13942
  • fix(query): fix create table tuple default value painc by @b41sh in #13946
  • fix: fix non-null constraint by @JackTan25 in #13939
  • fix: build release warning by @xudong963 in #13954
  • fix(query): ignore ascii case for flight compression setting by @zhang2014 in #13957
  • fix(query): avoid SHOW TABLES failure by @flaneur2020 in #13982
  • fix(query): fix count all in lambda function by @b41sh in #13991
  • fix(query): fix incorrect function name of decimal, neq --> noteq by @sundy-li in #14001
  • fix: add better explain for merge into by @JackTan25 in #14005
  • fix: query flight rpc connection leaking by @dantengsky in #13956
  • fix: fix enable_refresh_virtual_column_after_write range by @BohuTANG in #14009
  • fix: remove update expr cast early, use cast after all macthed operations by @JackTan25 in #14037
  • fix: incorrect stats when auto-compaction is hooked by @dantengsky in #14042
  • fix: multiple otlp loggers conflict by @everpcpc in #14050
  • fix: abort vacuum if there are io execption by @dantengsky in #14052
  • fix: treat change$* as internal columns by @dantengsky in #14061
  • fix: fix get db without DbIdList key by @lichuang in #14059
  • fix(query): add missing server_version for query log by @everpcpc in #14074
  • fix(query): fix panic when drop UncompressedBuffer by @zhang2014 in #14068

Code Refactor 🎉

  • refactor: auto split large append-entries RPC by @drmingdrmer in #13857
  • refactor: abstract TransformSortMergeBase to unify merge sort logic. by @RinChanNOWWW in #13927
  • refactor: optimize read of small row group in parquet by @zenus in #13530
  • refactor: use LazyLock to replace lazy_static crate by @PsiACE in #13943
  • refactor(query): prune unused flatten result columns by @b41sh in #13935
  • refactor: use LazyLock to replace once_cell::sync::Lazy by @PsiACE in #13952
  • refactor: runtime filter by @xudong963 in #13842
  • refactor: access KVMeta with methods instead of field by @drmingdrmer in #14013
  • refactor: normalize internal crate names by @everpcpc in #14039
  • refactor: using borsh instead of bincode for serde agg function state by @ariesdevil in #13997
  • refactor: abstract a common merger to do merge sort. by @RinChanNOWWW in #14020

Build/Testing/CI Infra Changes 🔌

  • ci: format test not dep on clickhouse handler. by @youngsofun in #14036

Others 📒

  • chore: utilize Visitor for ScalarExpr take 2 by @andylokandy in #13833
  • chore(query): add columns expression which expand star column select target by @sundy-li in #13850
  • chore(query): support from first query by @sundy-li in #13859
  • chore: bump h3 from 0.3.0 to 0.4.0 by @ariesdevil in #13874
  • chore(cluster): only dump once graph for slow dump by @zhang2014 in #13880
  • chore: add decimal type agg stddev support by @lichuang in #13864
  • chore: improve explain insert into with select by @xudong963 in #13876
  • chore: remove ck client in sqllogictest by @xudong963 in #13882
  • chore: change stream logic by @zhyass in #13808
  • chore(query): rename enable_stage_udf_priv_check by @TCeason in #13863
  • chore(planner): Refine explain output of statements by @leiysky in #13888
  • chore(query): add probe state for slow dump by @zhang2014 in #13892
  • chore(ci): adjust release changelog by @everpcpc in #13895
  • chore(query): add python style list comprehension by @sundy-li in #13887
  • chore: remove old runtime filter by @xudong963 in #13896
  • chore: new setting create_rpc_client_with_current_rt by @dantengsky in #13898
  • chore: change Result to Option to avoid bring error stack by @BohuTANG in #13901
  • chore: refine the unit test framework by @BohuTANG in #13902
  • chore: refactor unit test framework by @BohuTANG in #13905
  • chore(benchmark): insert result of Q0 for tpch by @everpcpc in #13911
  • chore(ci): install zig for dev setup by @everpcpc in #13913
  • chore(ci): fix install ziglang for debian by @everpcpc in #13914
  • chore(ci): install zig on centos by @everpcpc in #13915
  • chore(cluster): add allow_adjust_parallelism flag for cluster by @zhang2014 in #13909
  • chore: fix array() function parse error by @lichuang in #13912
  • chore: push down sort pipeline to cluster nodes. by @RinChanNOWWW in #13881
  • chore(ast): improve experiment dialect by @andylokandy in #13923
  • chore(query): remove extra error code construction by @sundy-li in #13926
  • chore(ci): fix install zig by @everpcpc in #13933
  • chore: fix generate_series to_timestamp step unit bug by @lichuang in #13917
  • chore: fix lag default arg bug by @lichuang in #13937
  • chore(ci): fix benchmark result for tpch by @everpcpc in #13941
  • chore(planner): merge into prune row_id if insert_only by @Dousir9 in #13951
  • chore: assert that pushdown columns must exist by @andylokandy in #13228
  • chore(ci): fix benchmark tpch result by @everpcpc in #13958
  • chore(planner): add tips for wrong usage of semi or anti join by @Dousir9 in #13948
  • chore: fix runtime ut to tokio::time::sleep in async by @BohuTANG in #13963
  • chore: bump tokio from 1.26.0 to 1.35.0 by @BohuTANG in #13967
  • chore: move crates deps to the root Cargo.toml by @BohuTANG in #13969
  • chore(query): refactor binder virtual column by @b41sh in #13920
  • chore: enable runtime filter for native datasource by @xudong963 in #13976
  • chore: bincode api to common/io by @BohuTANG in #13973
  • chore(planner): fix unexpected anti semi wrong table_index by @Dousir9 in #13977
  • chore(ci): workflow for stable release by @everpcpc in #14003
  • chore(ci): fix note for stable release by @everpcpc in #14007
  • chore(ci): only publish deb & python binding on stable release by @everpcpc in #14015
  • chore: fix drop_table_by_id fail when TableIdToName cannot found by @lichuang in #14002
  • chore(planner): minor refactor heuristic optimizer by @leiysky in #14021
  • chore: merge settings possible_values to range by @BohuTANG in #14010
  • chore: tmp remove clickhouse http handler test by @everpcpc in #14032
  • chore(deps): bump zerocopy from 0.6.1 to 0.6.6 by @dependabot in #14031
  • chore: fix test privilege access by @everpcpc in #14034
  • chore: move interpreter/*_hook.rs to hook/ by @BohuTANG in #14044
  • chore: simplify txn-op by @drmingdrmer in #14048
  • chore(executor): use null as root profile parent id by @zhang2014 in #14051
  • chore: reduce tpcds Q2 result set by @xudong963 in #14055
  • chore: update doc: since 1.2.258 ttl is supported by meta-service by @drmingdrmer in #14058
  • chore: fix license by @drmingdrmer in #14060
  • chore(tests): add explain for sync agg index by @ariesdevil in #14053
  • chore(query): refactor decimal scalar functions by @sundy-li in #14057
  • chore(executor): add exchange row and bytes profile by @zhang2014 in #14067
  • chore(query): refactor function params to be scalar by @sundy-li in #14079
  • chore: fix typo and refine the merge into status by @BohuTANG in #14081

Full Changelog: https://github.com/databendlabs/databend/compare/v1.2.233...v1.2.262

Dec 13, 2023 (v1.2.233)

What's Changed

Exciting New Features ✨

  • feat: add labels config for otlp logging by @everpcpc in #13820
  • feat: http handler return reason for 404. by @youngsofun in #13831
  • feat(query): add chain function support by @sundy-li in #13799
  • feat: limit max copied files to 15000. by @youngsofun in #13770

Thoughtful Bug Fix 🔧

  • fix: file not purged when copy into table. by @youngsofun in #13812
  • fix(query): remove unexpected strong ref for http query by @zhang2014 in #13807
  • fix: check unsupported exprs in update, delete and merge by @JackTan25 in #13803
  • fix(query): external location no need to check priv by @TCeason in #13821
  • fix(query): use warnings in system.tables on bad remote databases by @flaneur2020 in #13834
  • fix: endpoint for otlp exporter config by @everpcpc in #13839
  • fix: killing in-progress tasks causes query node to crash (Segment Fault) by @dantengsky in #13847

Code Refactor 🎉

  • refactor: refactor other unary functions by @ariesdevil in #13768
  • refactor(query): refactor settings by @zhang2014 in #13837

Others 📒

  • chore(cluster): add details status for slow dump by @zhang2014 in #13816
  • chore(cluster): add ports status for slow dump by @zhang2014 in #13822
  • chore(query): rewrite binder lambda and udf by @b41sh in #13818
  • chore: utilize Visitor for ScalarExpr by @andylokandy in #13793
  • chore: add arrow extention data type support by @lichuang in #13796
  • chore: split the output of SRF by max_block_size. by @RinChanNOWWW in #13817
  • chore(query): add Expreimental dialect by @sundy-li in #13832
  • chore(query): refactor lambda fucntion as expression evaluator by @b41sh in #13836
  • chore(query): rewrite binder lateral table function by @b41sh in #13789
  • chore(query): add settings to control enable stage/udf privilege check by @TCeason in #13852
  • chore(ci): remove test for clickhouse handler by @everpcpc in #13851
  • chore(ast): make run_parser!() a function by @andylokandy in #13849
  • chore(ci): fix build python binding by @everpcpc in #13856

Full Changelog: https://github.com/databendlabs/databend/compare/v1.2.226-nightly...v1.2.233

Explore Databend Cloud for FREE
Low-cost
Fast Analytics
Easy Data Ingestion
Elastic Scaling
Try it today