Release 0.290¶
Highlights¶
Fix to reduce drop time for Iceberg tables with deleted metadata in S3 storage. #23510
Fix a data corruption in uncompressed ORC/DWRF files with large values in string/binary columns. #23760
Improve JoinPrefilter optimizer for wide join keys and multiple join keys. #23858
Add UUID type support to the Parquet reader and writer. #23627
Add a configurable-sized cache for Iceberg table puffin files to improve query planning time controlled by the
iceberg.max-statistics-file-cache-size
configuration property. #23177Add support of UUID-typed columns. #23627
Add support to query Iceberg table by branch/tag name. #23539
Add support for procedure
fast_forward
for Iceberg. #23589Add support for using named arguments in procedures
register_table
andunregister_table
. #23592
Details¶
General Changes¶
Fix
array_intersect()
for single parameter array<array<T>> to be deterministic regardless of the order of null input. #23890Fix bug in local property calculation when spill is enabled. #23922
Fix bug to unescape like pattern and validate escape string with no unresolved value. #23456
Fix to query and filter using Iceberg metadata columns
$path
and$data_sequence_number
. #23472Fix nullability of columns in information schema. #23577
Fix distinct operator for UUID type. #23732
Improve
element_at
by avoiding pushdown of negative position forelement_at
for array. #23479Improve
GET /v1/info/state
to return INACTIVE state until the resource group configuration manager is fully initialized. #23585Improve JoinPrefilter optimizer for wide join keys and multiple join keys. #23858
Improve writer scaling in skewed conditions by setting
optimized_scale_writer_producer_buffer
toon
by default. #23774Add UUID type support to the Parquet reader and writer. #23627
Add a configurable-sized cache for Iceberg table puffin files to improve query planning time controlled by the
iceberg.max-statistics-file-cache-size
configuration property. #23177Add a flag to the Presto CLI which allows skipping SSL certificate verification. #23780
Add a session property
native_max_extended_partial_aggregation_memory
which specifies Presto native max partial aggregation memory when data reduction is optimal. #23527Add a session property
native_max_partial_aggregation_memory
which specifies Presto native max partial aggregation memory when data reduction is not optimal. #23527Add a session property
native_max_spill_bytes
which specifies Presto native max allowed spill bytes. #23527Add function
is_private_ip()
that returns true when the input IP address is private or a reserved IP address. #23520Add function
ip_prefix_subnets()
that splits the input prefix into subnets the size of the new prefix length. #23656Add new configuration property
eager-plan-validation-enabled
for eager building of validation of a logical plan before queuing. #23649Add session property
inline_projections_on_values
and configuration propertyoptimizer.inline-projections-on-values
to evaluate project node on values node. #23245Add support in QueuedStatement protocol to accept pre-minted query id and slug. #23407
Add support to proxy AuthorizedIdentity using JWT. #23546
Add support for casting
char
datatype to various numeric datatypes. #23792Replace configuration property
async-cache-full-persistence-interval
withasync-cache-persistence-interval
. #23626Remove
array_dupes
andarray_has_dupes
alias names from functionsarray_duplicates()
andarray_has_duplicates()
. #23762
Presto C++ Changes¶
Fix
task.writer-count
andtask.partitioned-writer-count
configuration properties in Presto C++ for consistency with Presto. #23902Fix a bug where users weren’t able to set the
native_expression.max_array_size_in_reduce
session property. #23856Fix plan validation failures for some join queries running with spill enabled when using Presto C++. #23595
Fix bug so that proper logical type parameters are now read and written to Parquet files. #23388
Fix a data corruption in uncompressed ORC/DWRF files with large values in string/binary columns. #23760
Improve arbitrator configs to use the new string-based format. #23496
Add
$path
and$bucket
to split info, and fixed the split counts in the coordinator UI. #23755Add a metric
presto_cpp.memory_pushback_expected_reduction_bytes
to track expected reduction in memory after a pushback attempt. #23872Add a new counter,
presto_cpp.memory_pushback_reduction_bytes
, to monitor the actual memory reduction achieved with each memory pushback attempt. #23813Add
native_max_local_exchange_partition_count
session property which maps to themax_local_exchange_partition_count
velox query property to limit the number of partitions created by a local exchange. #23910Add session property:
native_writer_flush_threshold_bytes
which specifies the minimum memory footprint size required to reclaim memory from a file writer by flushing its buffered data to disk. #23891Add session property:
native_max_page_partitioning_buffer_size
which specifies the maximum bytes to buffer per PartitionedOutput operator to avoid creating tiny SerializedPages. #23853Add session property:
native_max_output_buffer_size
which specifies the maximum size in bytes for the task’s buffered output. The buffer is shared among all drivers. #23853Add incremental periodic cache persistence for Presto C++ worker. #23626
Add native system session property provider. #23045
Remove session property
native_join_spiller_partition_bits
. #23906Revert merging of
FilterNode
intoTableScanNode
done in #23755. #23855
Security Changes¶
Upgrade Postgres JDBC Driver to 42.6.1 in response to CVE-2024-1597. #23710
Upgrade the logback-core version to 1.2.13 in response to CVE-2023-6378. #23735
Hive Connector Changes¶
Fix interpretation of ambiguous timestamps inside array, map, or row types for tables using
TEXTFILE
format to interpret the timestamps as the earliest possible unixtime for consistency with the rest of Presto. #23593Fix timestamps inside array, map, or row types for tables using
TEXTFILE
format to respect thehive.time-zone property
. #23593
Iceberg Connector Changes¶
Fix time-type columns to return properly when
iceberg.parquet-batch-read-optimization-enabled
is set toTRUE
. #23542Fix to reduce drop time for Iceberg tables with deleted metadata in S3 storage. #23510
Fix bug so that proper logical type parameters are now read and written to Parquet files. #23388
Fix a data corruption in uncompressed ORC/DWRF files with large values in string/binary columns. #23760
Add Iceberg metadata table
$ref
. #23503Add configuration property
iceberg.rest.auth.oauth2.scope
for OAUTH2 authentication in Iceberg’s REST catalog. #23884Add configuration property
iceberg.rest.auth.oauth2.uri
. #23739Add procedure
rollback_to_timestamp
to rollback an Iceberg table to a given point in time. #23559Add support of UUID-typed columns. #23627
Add support to query Iceberg table by branch/tag name. #23539
Add table property
metrics_max_inferred_column
to configure the max columns number for which metrics are collected, and supportmetrics_max_inferred_column
for Iceberg tables with PARQUET format. #23468Add support for procedure
fast_forward
for Iceberg. #23589Add support for using named arguments in procedures
register_table
andunregister_table
. #23592Support new procedure
set_current_snapshot
for Iceberg. #23567Support timestamp without timezone in time travel expressions. #23714
MongoDB Connector Changes¶
Add support for
varbinary
data type in MongoDB. #23386Add support for MongoDB
ALTER TABLE
statement. #23266
Cassandra Connector Changes¶
Upgrade cassandra-driver-core to 3.11.5 for SSL support. #23493
Elasticsearch Connector Changes¶
Improve handling of exceptions for empty tables in Elasticsearch. #23850
SPI Changes¶
Add
Partitioning
,PartitioningScheme
,PartitioningHandle
,PlanFragmentId
,StageExecutionDescriptor
andSimplePlanFragment
to the SPI. #23601
Credits¶
Abhisek Saikia, Amit Dutta, Anant Aneja, Ananthu-Nair, Andrii Rosa, Bikramjeet Vig, Bryan Cutler, Chen Yang, Christian Zentgraf, David Tolnay, Deepa-George, Deepak Majeti, Denodo Research Labs, Elbin Pallimalil, Elliotte Rusty Harold, Feilong Liu, Ge Gao, Hazmi, Jalpreet Singh Nanda (:imjalpreet), Jayaprakash Sivaprasad, Jialiang Tan, Jimmy Lu, Joe Abraham, Karnati-Naga-Vivek, Ke, Konjac Huang, Krishna Pai, Linsong Wang, Mahadevuni Naveen Kumar, Matt Calder, Naveen Nitturu, Nikhil Collooru, Pramod, Pratik Joseph Dabre, Rebecca Schlussel, Reetika Agrawal, Richard Barnes, Rohan Pal Sidhu, Sam Partington, Serge Druzkin, Sergey Pershin, Steve Burnett, SthuthiGhosh9400, Swapnil Tailor, Timothy Meehan, Xiaoxuan Meng, Yihong Wang, Ying, Zac Blanco, Zac Wen, Zuyu ZHANG, abhibongale, aditi-pandit, ajay-kharat, auden-woolfson, exxiang, jackychen718, jaystarshot, kiersten-stokes, lingbin, lithinpurushothaman, lukmanulhakkeem, misterjpapa, mohsaka, namya28, oyeliseiev-ua, pratyakshsharma, prithvip, wangd