Changelog for azure.qubole.com

Find in the following table the changelog of the azure.qubole.com environment.

Date and time of release Version Change type Change
16th Apr, 2021 (11:59 CST) 59.5 Bug fix JUPY-630 Fixed the error in the branch details of the gitlab server, configuring the bastion with the account public SSH key.
29th Sep, 2020 (9:08 AM PST) 59.0.1045 Enhancement

PRES-3435: The QueryTracker link is now available in the Workbench/Analyze UI’s Logs tab for queries run through the third-generation drivers.

PRES-3722: Optimization is added to push null filters to table scans by inferring them from the JOIN criteria of equi-joins and semi-joins in Presto version 317 and later. You can enable it through optimize-nulls-in-joins as a Presto cluster override or optimize_nulls_in_join as a session property. Use this enhancement to reduce the cost of performing JOIN operations when JOIN columns contain a significant number of NULLs.

PRES-3724: Backported hive.ignore-corrupted-statistics into Presto version 0.208 to avoid query failures in case of corrupted Hive metastore statistics and it is enabled by default. Presto version 317 supported this property, which is now enabled by default.

PRES-3748: Presto query retries for memory exceeded exceptions are triggered in a graded manner. Qubole retries the failed query in three steps. First two steps occur on the cluster size that lies between the minimum and the maximum cluster size. The last step of occurs at the maximum cluster size. To know more, see Graded Presto Query Retries. This enhancement is part of Gradual Rollout.

PRES-3761: The Presto Mongo Connector now supports querying Cosmos DB using Mongo APIs.

PRES-3788: You can now add a comma-separated list of endpoints and pass them as cluster override values of qubole.bypass-authentication-endpoints if you want to skip authentication of such endpoints. For example, if qubole.bypass-authentication-endpoints= /,query,node, then only endpoints that matches with these are skipped for authentication. Contact Qubole Support to enable this enhancement at the account level.

Bug fix

PRES-3632: Fixed the File '000000' does not match the standard naming pattern error that Presto threw when trying to read bucketed Hive tables. Qubole Hive INSERT commands had written bucketed Hive tables.

PRES-3787: Fixed the Ranger access control for Presto views in Presto version 0.208.

PRES-3790: Fixed the issue that failed queries when there was no space before or after a single line comment in Presto queries.

RUB-239: Fixed the issue in RubiX that sometimes caused query failures around the cache data invalidation.

PRES-3660: Fixed the Presto query failure with Error opening Hive split : <FILENAME>: null when Rubix was enabled.

PRES-3701: Fixed the connection leak in RetryingPooledThriftClient of RubiX, which caused the slowness in source stages that slowed down the query.

PRES-3708: Fixed the possible deadlock between Hive loadPartitionByName and getTable when the Hive metastore caching is enabled with refresh TTL (time-to-live).

PRES-3543: Fixed the issue where the aggregation node in case of a UNION query (when union sources are tablescan node and values node) did not use the distributed partitioning and caused an OOM exception. It is fixed by disregarding the SINGLE distribution for the UNION query.

PRES-3588: Fixed issues related to updating the table statistics performance. As a result, the bug fix has improved the performance of updating table statistics. In addition, a new configuration property, hive.table-statistics-enabled with its default value set to true is added that you can use to disable updating table statistics.

PRES-3602: Fixed the issue in reading the TEXT file collection delimiter configured in the Hive versions (earlier to Hive 3.0) in Presto version 317.

PRES-3604: Fixed the Ranger access control for Presto views that had earlier failed.

PRES-3618: The Presto catalog configuration for external data sources that skipped validation in Qubole was not added to the cluster earlier. Fixed this issue and now such configuration is added to the cluster.

PRES-3641: Fixed the failure in planning for spatial JOINs with dynamic filtering enabled.

PRES-3662: Fixed the issue where pushing configuration to a cluster corrupted the Presto configuration and failed the Presto server restart.

PRES-3672: Fixed query failures that occurred as too many partitions’ metadata were requested from the metastore in Presto versions 0.208 and 317.

PRES-3673: Fixed the issue where the Presto cluster start failed when resource-groups.user-scaling-limits-enable was turned on and resource groups were configured by a user.

PRES-3677: Fixed the issue where the default location (DefLoc) was picked as the DB location of non-default schemas in the Presto version 317. The correct behavior is that DefLoc should be the DB location of only the default schema.

25th Aug, 2020 (04:39 AM PST) 59.0.1040 Bug fix JUPY-929: Dependencies that were installed through the Environments page were not accessible for the scheduled and API runs of Python notebooks. This issue is fixed.
21st Aug, 2020 (08:44 AM PST) 59.0.1037 Bug fix

ZEP-4789: The paragraph status was not getting updated after the web socket reconnect. This issue is fixed.

ZEP-4590: Interpreter settings were getting lost because of _COPYING_ file present in defloc. This issue is fixed.

ZEP-4130: Notebook commands were failing when the status was NOT_STARTED_RUNNING_NOTEBOOK. To fix this issue, 20 retries at an interval of 10 seconds upto 3 min is added when the notebook command status fetched is NOT_STARTED_RUNNING_NOTEBOOK.

ZEP-4642: Notebook rendering was delayed due to extra web socket calls made for each paragraph to fetch editor settings. This issue is fixed.

2nd Jun, 2020 (11:09 AM PST) 59.0.1015 Bug fix

PRES-3543: Fixed the issue where the aggregation node in case of a UNION query (when union sources are tablescan node and values node) did not use the distributed partitioning and caused an OOM exception. It is fixed by disregarding the SINGLE distribution for the UNION query.

PRES-3588: Fixed issues related to updating the table statistics performance. As a result, the bug fix has improved the performance of updating table statistics. In addition, a new configuration property, hive.table-statistics-enabled with its default value set to true is added that you can use to disable updating table statistics.

PRES-3602: Fixed the issue in reading the TEXT file collection delimiter configured in the Hive versions (earlier to Hive 3.0) in Presto version 317.

PRES-3604: Fixed the Ranger access control for Presto views that had earlier failed.

PRES-3641: Fixed the failure in planning for spatial JOINs with dynamic filtering enabled.

PRES-3662: Fixed the issue where pushing configuration to a cluster corrupted the Presto configuration and failed the Presto server restart.

PRES-3672: Fixed query failures that occurred as too many partitions’ metadata were requested from the metastore in Presto versions 0.208 and 317.

PRES-3673: Fixed the issue where the Presto cluster start failed when resource-groups.user-scaling-limits-enable was turned on and resource groups were configured by a user.

PRES-3677: Fixed the issue where the default location (DefLoc) was picked as the DB location of non-default schemas in the Presto version 317. The correct behavior is that DefLoc should be the DB location of only the default schema.

PRES-3694: Fixed an issue to allow overriding of the -Xmx configuration value to less than 80% in the cluster’s Presto overrides.

PRES-3708: Fixed the possible deadlock between Hive loadPartitionByName and getTable when the Hive metastore caching is enabled with refresh TTL (time-to-live).

24th June 2020 (1:57 AM PST) 59.0.1009 Major release R59 release
20th Apr 2020 (8:10 AM PST) 58.0.1088 Bug fix Airflow and other bug fixes.
6th Apr 2020 (9:30 PM PST) 58.0.1005 Major release R58 release
24th Mar 2020 (2:12 AM PST) 57.0.1055 Bug fix Various bug fixes.
20th Feb 2020 (1:32 AM PST) 57.0.1051 Bug Fix

PRES-3249: Fixes UnsupportedOperationException occurring in some complex outer join queries when Dynamic Filtering was enabled.

PRES-3282: Adds support for lambdas in ExpressionEquivalence.

PRES-3051: Fixes “Invalid partition value” exception and intermittent ArrayIndexOutOfBoundsException in queries with Dynamic Filtering enabled.

PRES-3112: Enables dynamic partition pruning on Hive tables at the account level.

PRES-3113: Improves autoscaling through better accounting of queued work.

SQOOP-242: For new accounts, QDS does not provide a Pixie cluster by default to run DB import and export commands.

    Enhancement PRES-2990: Improves efficiency of dynamic partition pruning by preventing listing and creation of Hive splits from partitions, which are pruned at runtime.
27th Jan 2020 (11:34 PM PST) 57.0.1044 Bug fix ZEP-4275: QDS now uses HTTPS for Apache Maven access because Maven no longer supports HTTP.
10th Dec 2019 (2:30 AM PST) 57.0.1034 Bug fix ACM-5996: Accelerated networking is now enabled by default for supported Azure instance types.
29th Nov 2019 (1:17 AM PST) 57.0.1030 Bug fix ACM-6095: Fixes a problem that caused a push of a cluster configuration to fail if a network interface card (NIC) was attached.
20th Nov 2019 (12:31 AM PST) 57.0.1027 Major release R57 release
7th Oct 2019 (3:01 PM PST) 56.0.1078 Bug FIx JUPY-318: Fixes a problem that caused notebooks not to run after a package upgrade.
23rd Sep 2019 (11:07 PM PST) 56.0.1070 Bug Fix AN-2233: Allows ADLS Gen 2 filesystems to be seen from the Explore page of the QDS UI.
12th Sep 2019 (3:46 AM PST) 56.0.1062 Enhancement JUPY-1: Provides support for the JupyterLab interface, which is the next-generation UI for Jupyter notebooks. Jupyter notebooks are supported on Spark 2.2 and later versions. Beta, Via Support. See Jupyter Notebooks.
30th Jul 2019 (07:11 AM PST) 56.0.1040 Bug fix

PRES-2889: Fixes an issue that caused Presto queries to fail when the QDS account was configured to use ADLS Gen 2 with authorization via per-user tokens.

PRES-2871: Fixes an issue that prevented the tmp directory from being created in Presto 0.208.

PRES-2775: Fixes a problem which could prevent an upscaled Presto cluster from downscaling if short-running queries were regularly scheduled.

PRES-2727: Fixes a problem that caused queries to fail when a table contained sub-columns of type struct with names starting with a number.

PRES-2915: Fixes an issue which caused an ArrayIndexOutOfBoundsException for some queries using dynamic filtering.

HIVE-4681: Fixes a race condition which caused Thrift Metastore Server to fail to start.

ACM-5167: Fixes a problem that caused data disks to persist after a cluster had been terminated.

    Enhancement

PRES-2600:

  • You can now set session properties in Presto notebooks. They will persist across paragraphs in a notebook.
  • To improve debugging, a source field is identified as notebook_<notename>_<note_id> for notebooks and dashboard_<dashboard_name>_<dashboard_id>_<source_note_id> for dashboards. A source field is directly searchable in the Presto UI; you can search a notebook by its name or ID to quickly filter queries when debugging.

PRES-2254: You can now set the zeppelin.presto.stacktrace interpreter property to see the stack trace for some errors.

JDBC-124: Adds support for multiple statements launched together in Fastpath.

30th May 2019 (2:34 AM PST) 56.0.1013 Major release R56 release
2nd May 2019 (2:06 AM PST) 50.0.1006 Bug fix

AN-1974: Column headers now appear on the Analyze page for large Hive queries (approximately 65KB and greater).

AN-1667: Provides faster data preview in the Analyze preview pane.

Enhancement AN-1751: Adds Examples tab on the Analyze page.
27th Mar 2019 (01:38 AM PST) 55.0.1001 Bug fix PRES-2635: An earlier fix to improve the planning time of queries which use the CONCAT function caused a degradation in execution in some cases. The current fix disables the planning optimization by default; to enable it, enable qubole.skip-concat-partition-pruning as a Presto cluster override. You can also enable it for a session by setting qubole.skip-concat-partition-pruning as a session property.
14th March 2019 (2:18 AM PST) 54.0.1000 Major release R55 release
14th January 2019 (6:02 AM PST) 54.0.1008 Bug fix PRES-2443: Fixes an issue that caused a Presto query to fail if it needed to launch a cluster (though the cluster itself eventually came up).
28th November 2018 (1:35 PM PST) 54.0.1002 Major release R54 release