Changelog for us.qubole.com

Date and time of release Version Change type Change
3rd Oct, 2019 (03:31 AM PST) 56.0.160 Enhancement INFRA-2441: The default Account Level Concurrent Command Limit on the Account Settings tab has increased from 20 to 100.
26th Sep, 2019 (09:43 AM PST) 56.0.158 Enhancement PRES-2791: Qubole has ported open-source changes that are related to improvements in S3 reads to Qubole Presto 0.208 version. For more information, see Faster S3 reads.
Bug fix

PRES-2961: In an IAM-Role-based account, the issue where incorrect IAM-Role info had been sent to Presto has been resolved now.

PRES-2999: The NullPointerException when local memory limits are exceeded and a leak in operator peak memory computations in Presto version 0.208 queries have been resolved now.

PRES-3009: The issue where the Presto coordinator disk was filling up due to the presence of RubiX logs in the autoscaling log file has been resolved. rubix.log is now excluded from autoscaling logs.

Bug fix ACM-5664: Fixed the caching issue of the cluster list in the Analyze (Workbench (beta)) UI page.
Bug fix ACM-5616: Status checking of Spot requests has been made more robust by retrying DescribeSpotInstanceRequests in case of the RequestResourceCountExceeded error.
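The retry behavior described in ACM-5616 can be illustrated with a minimal sketch. It is not Qubole's implementation: the exception class, the `describe` callable, the attempt count, and the exponential backoff are all assumptions made for illustration.

```python
import time


class RequestResourceCountExceeded(Exception):
    """Hypothetical stand-in for the throttling error named in ACM-5616."""


def describe_with_retries(describe, max_attempts=5, base_delay_s=1.0, sleep=time.sleep):
    """Call describe() and retry with exponential backoff on the throttling error.

    `describe` is any zero-argument callable that performs the status check,
    for example a wrapper around a DescribeSpotInstanceRequests call.
    """
    for attempt in range(1, max_attempts + 1):
        try:
            return describe()
        except RequestResourceCountExceeded:
            if attempt == max_attempts:
                # Out of attempts: surface the error to the caller.
                raise
            # Wait 1x, 2x, 4x, ... the base delay before the next attempt.
            sleep(base_delay_s * 2 ** (attempt - 1))
```

Passing `sleep` as a parameter keeps the sketch easy to exercise without real waiting.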
13th Sep, 2019 (09:53 AM PST) 56.0.143 Bug fix QHIVE-4798: Fixed an issue which can lead to a memory leak in HiveServer2 JVM when a large number of concurrent applications are running on a given cluster.
09th Sep, 2019 (04:33 PM PST) 56.6.11 Bug fix ACM-5689: Fixed an issue where the default Hadoop cluster did not start during the AWS test drive signup.
03 Sep, 2019 (05:55 AM PST) 56.1 Enhancement

ACM-5266: Qubole supports i3en.large, i3en.xlarge, i3en.2xlarge, i3en.3xlarge, i3en.6xlarge, i3en.12xlarge, and i3en.24xlarge instances.

ACM-5413: Qubole supports m5.8xlarge, m5.16xlarge, m5a.8xlarge, m5a.16xlarge, r5.8xlarge, r5a.8xlarge, r5a.16xlarge, and r5.16xlarge instances.

ACM-5558: During provisioning of nodes on a running heterogeneous cluster, Qubole tries providing instance types set in the heterogeneous configuration before falling back to On-Demand instances. For more information, see Additional Permissions.

Bug fix

ACM-5532: During the scrub run that removes dead nodes from a cluster, some nodes were unreachable, which caused performance issues. To resolve this issue, Qubole now terminates nodes that it cannot connect to. Via Support.

HADTWO-2098: In some cases, the scrub run did not remove dead nodes from a cluster. This happened because the scrub run also connected to dead nodes through SSH to check whether the node bootstrap had completed. To resolve this issue, Qubole has improved the scrub run process so that it does not connect to a dead node registered in the ResourceManager. This prevents dead nodes from remaining in the cluster.

22 Aug, 2019 (10:52 PM PST) 56.0.120 Enhancement

JUPY-1: Qubole provides the JupyterLab interface, which is the next-generation user interface for Jupyter, to create and manage Jupyter notebooks. Jupyter notebooks are supported on Spark 2.2 and later versions. Beta, Via Support. Learn more.

AN-2168: New Analyze is now Workbench. Beta.

AN-1814: You can resize the command query composer in Workbench for Hive, Presto, Quantum, and DB Query commands. Beta.

AN-1324: Cluster live health metrics are now available as part of the Clusters drop-down list in Workbench. Via Support. Beta.

AN-1327: To make debugging easier, Qubole now displays the Cluster Instance ID under the Processing tab of the Status pane. This enables you to collect logs of the particular command by cluster instance. Beta.

AN-2210: You can now tag commands on the History tab in Workbench. You can later use these tags to filter out commands using the Tags field (in the history filter). Beta.

Bug fix AN-2219: Resource links are now clickable in the Logs pane in Workbench. Clicking the link redirects the user to the corresponding cluster dashboard.
09 Aug, 2019 (7:49 AM PST) 56.0.112 Bug fix AN-2240: The cluster selection drop-down list in Workbench now displays Hadoop2 clusters.
06 Aug, 2019 (04:13 AM PST) 56.0.108 Enhancement AIR-390: Now Airflow is supported on New Package Management. New Package Management brings in features like Python 3.7, a new version of Conda, and a lot of fixes on package installation such as support for no-arch packages, and so on (Cluster Restart Required, Disabled, Via Support).
21st Aug, 2019 (4:29 AM PST) 56.0.117 Enhancement ZEP-3373: Users can now edit the notebooks even when the clusters are offline. Via Support
Bug fix ACM-5344: Qubole supports configuring proxy setting of Internet proxy server and no_proxy settings for cluster nodes. To override them, contact Qubole Support. Via Support
9th August, 2019 (7:39 AM PST) 56.0.112 Bug fix ACM-5171: Fixed the issue where, on a multi-instance HS2-enabled Hadoop (Hive) cluster, the multi-instance HS2 intermittently failed to start at the first attempt while waiting for the Hadoop (Hive) cluster.
06 Aug, 2019 (11:56 AM PST) 56.0.108 Enhancement

ZEP-2717 and ZEP-3602: The Environments UI is now available in the Control Panel by default for the new users.

Limitation: The packages that are installed by default cannot be uninstalled in the new version of Package Management.

1st August, 2019 (01:05 AM PST) 56.0.102 Enhancement

SCHED-376: SQL Command type has been renamed as Quantum in the Scheduler UI’s command type drop-down list.

AD-2476: To mitigate risk while rolling out a new feature, Qubole groups users and accounts into different pods. When a change is rolled out to a pod, Qubole monitors the feature’s performance before rolling it out to subsequent pods. You can view the pod you belong to in the Account Details section of the Account Settings tab.

29th July, 2019 (10:28 AM PST) 56.0.99 Bug fix

QHIVE-4662: Fixed an issue which caused recursive listing while dropping partition(s) on a managed Hive table. Related OSS Jira: HIVE-22054.

QTEZ-443: Fixed an issue in which the Tez UI was unable to download the LevelDB file (which contains timeline data/log) for an application when the IP address of a cluster node in a private subnet was repeated.

QTEZ-450: Fixed the issue in which Reducer was not visible in Tez UI when the configured Tez version was 0.8.4.

19th July, 2019 (6:04 AM PST) 56.0.92 Enhancement PRES-2918: A new experimental configuration property called experimental.reserved-pool-enabled is added to Presto version 0.208 to allow disabling Reserved Pool, which is used to prevent deadlocks when memory is exhausted in the General Pool by promoting the biggest query to Reserved Pool. However, only one query gets promoted to Reserved Pool and queries in General Pool get into the blocked state whenever it becomes full. To avoid this scenario, you can set experimental.reserved-pool-enabled to false for disabling Reserved Pool. For more information, see Disabling Reserved Pool.
Bug fix

PRES-2746: S3 buckets were inaccessible when Dual IAM Roles were configured on the Qubole account. To resolve this issue, Presto has added hive.s3-secondary-role-arn and hive.s3-secondary-role-extid. You can add the ARN and External ID of the secondary IAM Role (which is inaccessible to Qubole) in Hive catalog properties. For more information, see catalog/hive.properties.

PRES-2797: Fixed the issue in the generated Presto Query Tracker when Presto version was changed on an active Presto cluster.

PRES-2856: Fixed the issue in which command results were displayed without column headers for Presto queries when Qubole drivers executed such queries in Presto FastPath.

PRES-2915: Fixed the issue in which a Presto cluster with idle cluster timeout configuration did not automatically terminate even when it was idle for a longer time.

8th July, 2019 (6:16 AM PST) 56.0.86 Enhancement Hive version 2.3 is generally available.
Bug fix

QHIVE-4385: To resolve FileNotFoundException while calculating FileSplits for the ORC file format, Qubole has added retries in the Tez AM configuration. You can configure the number of retries by using hive.qubole.handle.s3.stale.listing.retries.split.generation, which defaults to 10. This configuration minimizes query failures due to inconsistency in S3 listing.

This is an enhancement over QHIVE-3675, which handled FileNotFoundException that occurred while processing a specific FileSplit. Via Support

20th June, 2019 (4:39 AM PST) 56.2.1 Enhancement ACM-4221 and ACM-5016: Qubole supports Hive 3.1.1 (beta) on a Hive cluster. Starting cluster API v2.1, Hadoop 2 (Hive) clusters are renamed as Hive clusters. You can set Hive 3.1.1 (beta) version while creating/editing a cluster. Qubole supports creating Hive clusters only from cluster API v2.1 onwards. Via Support
13th June, 2019 (11:56 PM PST) 56.0.76 Enhancement

AN-1639: The Status pane on the new Analyze page is now available for Quantum commands.

AN-1708: Permalinks for Hive tables on the new Analyze page now contain account IDs. When you navigate to a Hive table from a different account, a confirmation dialog box appears.

QUEST-321: Users can now use custom code in Python language for creating a streaming pipeline.

QUEST-332: Users can select AVRO as one of the input formats when creating a streaming pipeline in assisted mode with Kafka as source.

QUEST-341: Users can add additional configurations such as user defined metadata for data written to S3 sink.

SPAR-3514: The Test Run option in the QuEST UI processes a limited number of records and maintains a separate temporary checkpoint location to prevent any corruption of the runtime production checkpoint.

SPAR-3591: Users can pass x-amz-meta-metadata(key1=value1,key2=val2) while creating a new streaming job with S3 as the sink by setting the option fs.s3a.user.metadata as key1=val1,key2=val2. The streaming application creates new files with this metadata. This is supported on Spark 2.3.2 and later versions.
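As a rough illustration of the key1=val1,key2=val2 format used by fs.s3a.user.metadata, the sketch below maps such a string to S3 user-metadata header names, which carry the x-amz-meta- prefix. The function name and the validation rules are assumptions; how the streaming application actually parses the option is not documented here.

```python
def user_metadata_headers(option_value):
    """Turn 'key1=val1,key2=val2' into a dict of x-amz-meta-* headers."""
    headers = {}
    for pair in option_value.split(","):
        pair = pair.strip()
        if not pair:
            # Tolerate stray commas such as a trailing one.
            continue
        key, sep, value = pair.partition("=")
        if not sep or not key.strip():
            raise ValueError(f"expected key=value, got {pair!r}")
        # S3 stores user-defined metadata keys in lowercase.
        headers[f"x-amz-meta-{key.strip().lower()}"] = value.strip()
    return headers


print(user_metadata_headers("key1=val1,key2=val2"))
```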

TOOLS-1440: The `s3cmd` version is upgraded from 1.5.2 to 2.0.2.

Bug fix

AN-1974: Column headers now appear on the Analyze page for large Hive queries (approximately 65KB and more).

QUEST-340: When users performed a test run with Kinesis as the source and an IAM role enabled in the account, the test run failed. This issue is now fixed.

QUEST-324: If the checkpoint location was not unique, the QuEST UI displayed an inaccurate notification when running the streaming pipeline in assisted mode. Now, the accurate notification is displayed in this case.

10th June, 2019 (9:04 AM PST) 56.0.67 Enhancement

QHIVE-4527: Qubole will deprecate Hive notebooks in the near future.

AD-1629: Qubole now supports IAM role-based account creation via API for AWS. Learn more.

Bug fix

AD-2554: This bug fix resolves the intermittent "provided token has expired" error for AWS.

AD-2441: The Usage Status Dashboard has been revamped.

QTEZ-440: Fixed the issue in which a Tez DAG hung when exceptions were uncaught during a DAG transition.

4th June, 2019 (11:07 PM PST) 56.0.63 Enhancement AN-2100: All file downloads on the Explore page are now handled through AWS Signature Version 4.
3rd June, 2019 (9:45 AM PST) 56.0.61 Enhancement

JDBC-124: Qubole now supports concurrency of multiple statements in Presto FastPath.

PRES-2254: In a Presto notebook, you can now set zeppelin.presto.stacktrace as an interpreter property for displaying stacktrace for certain errors.

PRES-2600: These are the new enhancements in the Presto notebooks:

  • You can now set session properties in Presto notebooks in a paragraph and run it. When set, these session properties are applicable to paragraphs in the notebook’s current session.
  • In Presto notebooks, for improving debugging experience, the source field is set as notebook_<notebook-name>_<notebook-id> and in the dashboards, the source field is set as dashboard_<dashboard-name>_<dashboard-id>_<source-note-id>. A source field is directly searchable in the Presto UI. For example, in the Presto UI, you can search a notebook by its name or ID to quickly filter queries, which are run from that specific notebook while debugging an error.
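The source field formats described in PRES-2600 can be sketched as follows. The helper names and the shape of the query records (dictionaries with a "source" key) are assumptions for illustration; only the two format patterns come from the release note.

```python
def notebook_source(notebook_name, notebook_id):
    """Build the source value for queries run from a notebook."""
    return f"notebook_{notebook_name}_{notebook_id}"


def dashboard_source(dashboard_name, dashboard_id, source_note_id):
    """Build the source value for queries run from a dashboard."""
    return f"dashboard_{dashboard_name}_{dashboard_id}_{source_note_id}"


def queries_from_notebook(queries, notebook_id):
    """Keep only the query records whose source points at the given notebook ID."""
    suffix = f"_{notebook_id}"
    return [
        q for q in queries
        if q["source"].startswith("notebook_") and q["source"].endswith(suffix)
    ]
```

For example, filtering a list of query records by notebook ID mirrors searching the Presto UI for that ID while debugging an error.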
Bug fix

PRES-2515: The issue which caused ArrayIndexOutOfBoundsException for some queries when using Dynamic Filtering is resolved now.

PRES-2727: This fixes the issue where queries with a table containing sub column names of a struct column type starting with numbers failed. Sub-column names of a struct column type can now start with numbers.

PRES-2775: This fixes a bug which could prevent an upscaled Presto cluster from downscaling if short running queries are regularly scheduled on the cluster.

24th May, 2019 (2:20 PM PST) 56.0.57 Bug fix ZEP-3659: The newly added paragraphs were displayed only after refreshing the page. This issue is fixed.
23rd May, 2019 (1:37 AM PST) 56.0.54 Major Release R56
21st May, 2019 (7:48 AM PST) 55.0.65 Enhancement AN-1382: Click the preview icon to preview a query. You can also dock the query preview while you work on another query.
15th May, 2019 (7:38 AM PST) 55.0.61 Enhancement SQOOP-242: A new enhancement provides a better focus on a functional add-on for SQOOP rather than moving prematurely to serverless model. If Pixie (Qubole Cluster) is not available for the account (due to a configuration), the user must use their own cluster. If Pixie is present on the environment, it is a default option for all the new customers to select their own cluster.
25th April, 2019 (9:33 AM PST) 55.0.50 Bug fix AN-1974: The issue of the Analyze page not displaying results for large Hive queries (approximately 65KB and more) was resolved by falling back to object storage to retrieve query metadata.
New feature ACM-4765: Qubole now supports m5ad and r5ad family instance types in US East (N. Virginia), US West (Oregon), US East (Ohio), and Asia Pacific (Singapore) AWS regions.
22nd April, 2019 (8:13 AM PST) 55.0.48 Bug fix AN-2056: Uploading objects or files through the Qubole UI is now supported for all AWS regions. KMS-enabled buckets are also supported.
02nd April, 2019 (6:31 PM PST) 55.0.37 Enhancement

PRES-1350: Qubole supports configuring the required number of worker nodes during autoscaling. It is a cluster configuration override, query-manager.required-workers. You can set it to denote the number of worker nodes that must be in the cluster before a query is scheduled to be run on the cluster. This enhancement is only supported with Presto 0.193 and later versions. For more information, see Configuring the Required Number of Worker Nodes.
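The gating idea behind query-manager.required-workers can be sketched as a simple wait loop: a query is held back until the cluster reports at least the required number of workers. This is not Presto's or Qubole's scheduler code. The function name, the polling interval, and the timeout are hypothetical parameters added only so the sketch terminates.

```python
import time


def wait_for_required_workers(get_worker_count, required_workers,
                              timeout_s=300.0, poll_s=5.0,
                              sleep=time.sleep, clock=time.monotonic):
    """Return True once enough workers are present, or False on timeout.

    `get_worker_count` is any zero-argument callable that reports the
    current number of worker nodes in the cluster.
    """
    deadline = clock() + timeout_s
    while True:
        if get_worker_count() >= required_workers:
            return True
        if clock() >= deadline:
            return False
        sleep(poll_s)
```

The `sleep` and `clock` parameters exist so the loop can be driven by a fake clock in tests.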

PRES-2397: Qubole supports escaping newline (\n) and carriage return (\r) characters in data so that it is parsed correctly on the QDS UI. This enhancement is not available by default and it is only supported with Presto 0.193 and later versions. Create a ticket with Qubole Support to enable it on the QDS account.

PRES-2417: Presto clusters do not terminate while actively running Presto notebook paragraphs. The enhancement is not available by default. Create a ticket with Qubole Support to use it.

PRES-2474: Qubole has added an optimization to speed up queries on system.jdbc.tables with a filter on a single table name. This speeds up the extract operation in Business Intelligence tools such as DBeaver, which query system.jdbc.tables with a filter on a single table name.

PRES-2521: Qubole has added a cluster configuration property, ascm.downscaling.staggered, for controlling how aggressively Presto clusters downscale. Enabling this configuration results in a linear downscaling profile, which can be a better choice for unpredictable workloads without well-defined peak and lean periods. For more information, see Controlling the Nodes’ Downscaling Velocity.

Bug fix

PRES-1373: This is a fix for an issue where the retry operation was not working on the failed Presto queries. Retries work on the failed Presto queries now. As part of the fix, retries are configurable for Presto queries that run on the QDS platform.

Caution

Configuring retries will just do a blind retry of a Presto query. This may lead to data corruption for non-Insert Overwrite Directory (IOD) queries.

PRES-2306: Fixed a bug where the Usage page on the QDS UI did not show the Presto query’s bytes read statistics.

PRES-2493: Fixed the data loss issue in the Hive connector when writing bucketed sorted table in Qubole Presto 0.208 queries.

PRES-2567: Fixed a bug in the user-overridden IAM Role feature where stale credentials were being used in a long-running cluster.

PRES-2571: Fixed a concurrency-related issue in autoscaling on Qubole Presto 0.193 and 0.208 versions due to which the cluster’s size permanently exceeded its maximum size.

Bug fix

QHIVE-3740: The Hive metastore API call has been replaced with a less expensive API call to resolve the issue of false alerts that were triggered on Datadog.

QHIVE-4040: Fixed an issue in DelimitedJSONSerde that caused NullPointerException when hive.qubole.escape.crlf.in.result was enabled.

QHIVE-4053: Fixed an issue where the hive.qubole.drop.table.metastore.client.socket.timeout configuration was not honored when a Hive query containing the DROP TABLE statement was run.

QHIVE-4280: Hive jobs were failing with a Metaexception that included a timeout message. To resolve this issue, the SSH configuration parameters’ maximum value is increased on QDS servers.

QTEZ-335: Fixed an issue where the Tez Offline UI was inaccessible when yarn.timeline-service.store-class was set to RollingLevelDBTimelineStore in the cluster’s Hadoop overrides.

27th March, 2019 (2:01 PM PST) 55.0.32 Bug fix SPAR-3391: Some paragraphs ran successfully but with a warning message during the first run. However, those paragraphs ran successfully without the warning on a re-run. This issue is fixed.
22nd March, 2019 (6:05 AM PST) 55.0.29 Enhancement

ACM-4112: The Spot Request Timeout option in the Cluster Composition UI tab and timeout_for_request under stable_spot_instance_setting of a cluster API now accept 1 and 2 minutes as values.

ACM-4259: Qubole supports the r5 instance type in Asia Pacific (Mumbai), Asia Pacific (Singapore), Asia Pacific (Tokyo), Asia Pacific (Seoul), Canada (Central), EU (London), US West (N. California), and EU (Frankfurt) AWS regions.

ACM-4426: Qubole supports c5d and m5d instance types in Asia Pacific (Mumbai), Asia Pacific (Singapore), Asia Pacific (Tokyo), Asia Pacific (Seoul), Canada (Central), EU (Ireland), EU (London), US West (N. California), and EU (Frankfurt) AWS regions.

22nd March, 2019 (6:05 AM PST) 55.0.29 Bug fix TOOLS-1139: The previous docker version had security vulnerabilities, which could allow malicious containers to gain root-level privileges on the host. To resolve security vulnerabilities of the previous docker version, the latest patch version of docker-18.06.1ce is installed.
14th March, 2019 (9:33 PM PST) 55.0.23 Bug fix PRES-2635: The earlier fix for improving planning time of queries which use CONCAT function in the WHERE clause on a partitioned column introduced a degradation in execution time of such queries in some cases. This bug fix introduces the optimization which is disabled by default. Enable qubole.skip-concat-partition-pruning as a Presto cluster override at the cluster level. You can also enable it at the session level using the qubole_skip_concat_partition_pruning session property.
13th March, 2019 (4:45 AM PST) 55.0.22 Enhancement SPAR-3005: For the offline Spark clusters, only the event log files that are less than 400 MB are processed in the offline Spark History Server (SHS). This prevents high CPU utilization on the internal servers due to SHS.
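The size cutoff described in SPAR-3005 amounts to a filter over event log files. A minimal sketch follows, assuming that 1 MB means 2**20 bytes and that each log is represented as a (path, size_in_bytes) pair. Both are illustrative assumptions, not details taken from the Spark History Server.

```python
# Assumption: the 400 MB limit is interpreted as 400 * 2**20 bytes.
MAX_EVENT_LOG_BYTES = 400 * 1024 * 1024


def logs_to_process(event_logs):
    """Return the paths of event logs strictly smaller than the size limit.

    `event_logs` is an iterable of (path, size_in_bytes) pairs.
    """
    return [path for path, size in event_logs if size < MAX_EVENT_LOG_BYTES]
```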
Bug fix SPAR-3371: Dynamic filtering now works for join keys of type Int, String, Long, and Short.
13th March, 2019 (4:45 AM PST) 55.0.22 Bug fix

ZEP-3247: Zeppelin failed to start due to the interpreter.json._COPYING_ file in defloc. As a result, the notebooks failed to load. With this fix, Zeppelin starts successfully and the interpreter.json._COPYING_ file is not seen on defloc.

ZEP-3252: On a cluster restart, the latest dashboards are displayed after the scheduled runs.

ZEP-3230: Zeppelin server in Java 7 clusters was failing due to some ciphers that were disabled for security. Now, the Zeppelin server starts successfully as the required ciphers are added when starting the Zeppelin server.

ZEP-3243: A notebook with the ERROR status on a cluster failed to associate back to the cluster even after switching to another cluster. This issue is fixed.

ZEP-3234: Notebooks failed to load in case of higher latencies when fetching notebook permissions. This issue is fixed.

04th March, 2019 (9:30 PM PST) 55.0.14 R55 Upgrade
15th January, 2019 (7:18 AM PST) 54.0.53 Bug fix SCHED-321: In the Cron expression, the scheduler calculates the Next Materialized Time (NMT)/Start time considering the current time as the base time and Cron expression passed. Start time is not honored in the Cron expression. It resolves the issue where the first instance of the scheduler did not honor the scheduled time.
11th January, 2019 (12:30 PM PST) 54.0.52 Bug fix AN-1407: While filtering commands on the Analyze page, you must now provide the date range (start/end date) when searching for cluster labels.
7th January, 2019 (1:30 AM PST) 54.0.46 Enhancement AD-1834: The list of email IDs configured in Account Settings will now receive notifications related to releases, account configuration and feature changes.
21st December, 2018 (8:35 AM PST) 54.0.45 Bug fix SPAR-3207: The Spark shuffle cleanup feature is not supported in Spark 2.2.1. However, the Spark shuffle cleanup feature continues to be supported in Spark 2.3.1 and later versions.
17th December, 2018 (9:51 AM PST) 54.0.41 Bug fix ZEP-3109: Fixed the issue that caused notebooks to freeze frequently due to huge content, such as a large number of clusters, users, and schemas.
13th December, 2018 (3:45 PM PST) 54.0.40 Bug fix RUB-102: Fixed an issue where unnecessary HTTP GET calls to Ganglia for fetching the cluster metrics overloaded the master node.
12th December, 2018 (12:00 AM PST) 54.0.39 Enhancement

ACM-3436: QDS now supports c5n, m5a, and r5a instances.

ACM-3545: Qubole now provides a feature to prevent the cluster from starting whenever the master node’s Elastic IP address fails. This feature is not enabled by default. Contact Qubole Support to enable this feature.

6th December, 2018 (10:56 PM PST) 54.0.38 Bug fix EAM-1502: Fixed an issue in the Automatic Statistics Collection framework due to which the fresh statistics were not collected for a few tables.
29th November, 2018 (12:56 AM PST) 54.0.33

Major release

For the list of changes until 29th November, 2018, see Changelog for api.qubole.com.