Release 1.3.0 (30 Jun 2021)
Key Features
Area | Feature | PR #s |
---|---|---|
Task Recovery | Fixed a few important bugs and further improved the stability of this function. It now can work with spill-to-disk feature. | 812,813,837, 838,842,843, 847,863,868, 874,875,885, 889,891,901, 906,917,930, 932 |
CTE (Common Table Expressions) | Additional optimization on top of 1.2.0 CTE optimization. Added cost based decision to decide whether to enable CTE or not. Added support for pushdown of dynamic filters and predicates into CTE nodes. | 722,811,815, 876,921,927 |
DM (Data Management) | Further improved the performance of Data Management Operations. Exposed performance tuning parameters as: - metastore -client-service-threads: Parallelize operations to Hive metastore by using multiple clients to send/receive requests - metastore -write-bach-size: Reduce round trip to hive metastore by packing multiple operation objects per call | 888 |
Star Tree Index | 1. Star Tree Cube now supports up to 10 billion cardinality. 2. openLooKeng CLI updates to improve cube management experience. User now has to issue a single sql statement to both create and populate data in the cube instead of multiple sql statements. The CLI changes help avoid query exceeded cluster memory limit issue. 3. Bug fixes a. Merge continuous ranges into single range so cube can be utilized b. Count distinct issue: Filter source data during cube insertion | 834,867,890, 902,907 |
CBO | Support Sorted Source Aggregator Added support for sort based aggregator in cases where input source is pre-ordered. This greatly reduces the amount of memory used for hashes and can finalize the majority of the results at the partial aggregation stage itself, thereby reducing the final aggregation load at a higher plan stage. The openLooKeng optimizer makes choices between Sort Aggregator and Hash Aggregator based on the estimated cost of operation for the given memory. | 855,905,906 |
Hudi Connector | Snapshot queries for Hudi COW table is supported; snapshot queries and read optimized queries for HUDI MOR table are supported. | 881,900 |
GreenPlum Connector | Support read and write operations on the GreenPlum datasource. But update and delete operations are not yet supported. | 689 |
Oracle Connector | Add new capability to support update and delete operation within Oracle datasource. | 897 |
ClickHouse Connector | Support read and write operations to the ClickHouse datasource. Also add support for SQL query pushdown, and registration & pushdown of external functions. | 920 |
JDBC Connector | Enhance JDBC to support the multiple splits so that it can improve the performance of high concurrency scenarios. | 939 |
Hive Connector | Upgrade the Hive dependency from 3.0.0 to 3.1.2 and fixed the compatibility issue of timestamp caused by the upgrade. | 903 |
Memory Connector | Memory Connector Optimizations - HetuMetaStore integration to persist table info - New data formation (LogicalParts) to support sorting and indexing - Predicate pushdown - Automatic spill-to-disk management | 914 |
Resource | Enhanced resource group to throttle scheduling or kill query based on resource usage and user configurations. | 779,821,822, 836 |
Known Issues
Category | Description | Gitee issue |
---|---|---|
Task Recovery | An error message: “Unsuccessful query retry”, is shown when CTA creates a transaction table and inserts data. | I3YF45 |
A query can hang when there is insufficient memory for a node. | I3YF4O | |
When an exception is thrown during stage 1, the value is doubled. | I3YF4V |
Obtaining the Document
For details, see https://gitee.com/openlookeng/hetu-core/tree/1.3.0/hetu-docs/en