Backported Issues from Open-source Hive

This table contains the list of issues backported from open-source Hive.

List of issues backported from open-source Hive
Open Source JIRA ID Description
HIVE-20660 Group by statistics estimation could be improved by bounding the total number of rows to source table (Vineet Garg, reviewed by Ashutosh Chauhan)
HIVE-20702 Account for overhead from datastructure aware estimations during mapjoin selection (Jesus Camacho Rodriguez, reviewed by Zoltan Haindrich)
HIVE-22243 Align Apache Thrift version to 0.9.3-1 in standalone-metastore as well (David Lavati via László Bodor)
HIVE-21173 Upgrade Apache Thrift to 0.9.3-1 (David Lavati via Laszlo Bodor)
HIVE-20953 Remove a function from function registry when it can not be added to the metastore when creating it. (Ashutosh Bapat, reviewed by Mahesh Kumar Behera)
HIVE-20762 NOTIFICATION_LOG cleanup interval is hardcoded as 60s and is too small (Sankar Hariappan, reviewed by Thejas Nair)
HIVE-20761 Select for update on notification_sequence table has retry interval and retries count too small (Sankar Hariappan, reviewed by Thejas M Nair)
HIVE-20646 Partition filter condition is not pushed down to metastore query if it has IS NOT NULL (Sankar Hariappan, reviewed by Daniel Dai)
HIVE-21924 Split text files even if header/footer exists (Mustafa Iman, reviewed by Sankar Hariappan)
HIVE-22134 HIVE-22129: Remove glassfish.jersey and mssql-jdbc classes from jdbc-standalone jar (Naveen Gangam, reviewed by Thejas Nair
HIVE-21915 Hive with TEZ UNION ALL and UDTF results in data loss (Wei Zhang, reviewed by Vineet Garg)
HIVE-21799 NullPointerException in DynamicPartitionPruningOptimization, when join key is on aggregation column (Jason Dere, reviewed by Vineet Garg)
HIVE-21182 Skip setting up hive scratch dir during planning (Vineet Garg, reviewed by Ashutosh Chauhan)
HIVE-20804 Further improvements to group by optimization with constraints (Vineet Garg, reviewed by Jesus Camacho Rodriguez)
HIVE-20703 Put dynamic sort partition optimization under cost based decision (Vineet Garg, reviewed by Prasanth Jayachandran, Sergey Shelukhin)
HIVE-20918 Flag to enable/disable pushdown of computation from Calcite into JDBC connection (Jesus Camacho Rodriguez, reviewed by Thejas M Nair)
HIVE-20744 Use SQL constraints to improve join reordering algorithm (Jesus Camacho Rodriguez, reviewed by Ashutosh Chauhan)
HIVE-20976 JDBC queries containing joins gives wrong results (Jesus Camacho Rodriguez, reviewed by Ashutosh Chauhan)
HIVE-20830 JdbcStorageHandler range query assertion failure in some cases (Daniel Dai, reviewed by Thejas Nair)
HIVE-20829 JdbcStorageHandler range split throws NPE (Daniel Dai, reviewed by Thejas Nair)
HIVE-20720 Add partition column option to JDBC handler (Daniel Dai, reviewed by Jesus Camacho Rodriguez)
HIVE-20651 JdbcStorageHandler password should be encrypted (Daniel Dai, reviewed by Sankar Hariappan)
HIVE-20652 JdbcStorageHandler push join of two different datasource to jdbc driver (Jesus Camacho Rodriguez, reviewed by Ashutosh Chauhan)
HIVE-20095 Fix feature to push computation to jdbc external tables (Jonathan Doron, reviewed by Jesus Camacho Rodriguez)
HIVE-20718 Add perf cli driver with constraints (Jesus Camacho Rodriguez, reviewed by Ashutosh Chauhan)
HIVE-18862 qfiles: prepare .q files for using datasets (Laszlo Bodor via Zoltan Haindrich) Signed-off-by: Zoltan Haindrich <kirk@rxd.hu> (cherry picked from commit 86b678f502bc6bb86c2279ef1288627bd8e7421a)
HIVE-17040 Join elimination in the presence of FK relationship (Jesus Camacho Rodriguez, reviewed by Vineet Garg)
HIVE-20213 Upgrade Calcite to 1.17.0 (Jesus Camacho Rodriguez, reviewed by Ashutosh Chauhan)
HIVE-22275 OperationManager.queryIdOperation does not properly clean up multiple queryIds (Jason Dere, reviewed by Prasanth Jayachandran)
HIVE-22221 Llap external client- Need to reduce LlapBaseInputFormat#getSplits() footprint (Shubham Chaurasia, reviewed by Jason Dere)
HIVE-21641 Llap external client returns decimal columns in different precision/scale as compared to beeline (Shubham Chaurasia, via Gopal V)
HIVE-21041 NPE, ParseException in getting schema from logical plan (Teddy Choi, reviewed by Jesus Camacho Rodriguez)
HIVE-21126 Allow session level queries in LlapBaseInputFormat#getSplits() before actual get_splits() call (Shubham Chaurasia, reviewed by Teddy Choi)
HIVE-20552 Get Schema from LogicalPlan faster (Teddy Choi, reviewed by Jesus Camacho Rodriguez)
HIVE-21270 A UDTF to show schema (column names and types) of given query (Shubham Chaurasia, reviewed by Mani M, Sankar Hariappan)
HIVE-21864 LlapBaseInputFormat#closeAll() throws ConcurrentModificationException (Shubham Chaurasia, reviewed by Jason Dere)
HIVE-22231 Hive query with big size via knox fails with Broken pipe Write failed (Denys Kuzmenko via Peter Vary)
HIVE-22219 Bringing a node manager down blocks restart of LLAP service (Jesus Camacho Rodriguez, reviewed by Slim Bouguerra)
HIVE-22204 Beeline option to show/not show execution report (Jesus Camacho Rodriguez, reviewed by Ashutosh Chauhan)
HIVE-21866 LLAP status service driver may get stuck with wrong Yarn app ID (Adam Szita, reviewed by Marta Kuczoram)
HIVE-21009 Adding ability for user to set bind user (David McGinnis reviewed by Prasanth Jayachandran) (addendum)
HIVE-21009 Adding ability for user to set bind user (David McGinnis reviewed by Prasanth Jayachandran)
HIVE-20767 Multiple project between join operators may affect join reordering using constraints (Jesus Camacho Rodriguez, reviewed by Ashutosh Chauhan)
HIVE-20320 Turn on hive.optimize.remove.sq_count_check flag (Vineet Garg, reviewed by Ashutosh Chauhan)
HIVE-20292 Bad join ordering in tpcds query93 with primary constraint defined (Vineet Garg, reviewed by Ashutosh Chauhan)
HIVE-22169 Tez: SplitGenerator tries to look for plan files which won’t exist for Tez (Gopal V via Vineet Garg)
HIVE-22121 Turning on hive.tez.bucket.pruning produce wrong results (Vineet Garg, reviewed by Gopal V)
HIVE-22115 Prevent the creation of query routing appender if property is set to false (Slim Bouguerra reviewed by Gopal V)
HIVE-21171 Skip creating scratch dirs for tez if RPC is on (Vineet Garg, reviewed by Ashutosh Chauhan)
HIVE-20940 Bridge cases in which Calcite’s type resolution is more stricter than Hive. (Zoltan Haindrich reviewed by Ashutosh Chauhan) (addendum)
HIVE-20940 Bridge cases in which Calcite’s type resolution is more stricter than Hive. (Zoltan Haindrich reviewed by Ashutosh Chauhan)
HIVE-22241 Implement UDF to interpret date/timestamp using its internal representation and Gregorian-Julian hybrid calendar (Jesus Camacho Rodriguez, reviewed by Adam Szita)
HIVE-22106 Remove cross-query synchronization for the partition-eval (Slim B via Gopal V)
HIVE-22168 Remove very expensive logging from the llap cache hotpath (Slim B via Jesus Camacho Rodriguez)
HIVE-22161 UDF: FunctionRegistry synchronizes on org.apache.hadoop.hive.ql.udf.UDFType class (Gopal V, reviewed by Ashutosh Chauhan)
HIVE-22120 Fix wrong results/ArrayOutOfBound exception in left outer map joins on specific boundary conditions (Ramesh Kumar Thangarajan, reviewed by Jason Dere)
HIVE-19661 switch Hive UDFs to use Re2J regex engine (Rajkumar Singh via Ashutosh Chauhan)
HIVE-20873 Use Murmur hash for VectorHashKeyWrapperTwoLong to reduce hash collision (Teddy Choi, reviewed by Gopal V)
HIVE-20617 Fix type of constants in IN expressions to have correct type (Zoltan Haindrich via Jesus Camacho Rodriguez)
HIVE-20193 cboInfo is not present in the explain plan json (Anishek Agarwal, reviewed by Ashutosh Chauhan)
HIVE-20684 Make compute stats work for Druid tables (Nishant Bangarwa via Ashutosh Chauhan)
HIVE-20636 Improve number of null values estimation after outer join (Jesus Camacho Rodriguez, reviewed by Ashutosh Chauhan)
HIVE-20366 TPC-DS query78 stats estimates are off for is null filter(Vineet Garg, reviewed by Ashutosh Chauhan)
HIVE-19992 Vectorization: Follow-on to HIVE -19951 –> add call to SchemaEvolution.isOnlyImplicitConversion to disable encoded LLAP I/O for ORC only when data type conversion is not implicit (Matt McCline, reviewed by Prasanth Jayachandran)
HIVE-20988 Wrong results for group by queries with primary key on multiple columns (Vineet Garg, reviewed by Jesus Camacho Rodriguez)
HIVE-20985 If select operator inputs are temporary columns vectorization may reuse some of them as output (Zoltan Haindrich reviewed by Teddy Choi)
HIVE-20954 Vector RS operator is not using uniform hash function for TPC-DS query 95 (Teddy Choi, reviewed by Gopal V)
HIVE-20775 Factor cost of each SJ reduction when costing a follow-up reduction (Jesus Camacho Rodriguez, reviewed by Ashutosh Chauhan)
HIVE-20788 Extended SJ reduction may backtrack columns incorrectly when creating filters (Jesus Camacho Rodriguez, reviewed by Deepak Jaiswal)
HIVE-17043 Remove non unique columns from group by keys if not referenced later (Vineet Garg, reviewed by Jesus Camacho Rodriguez)
HIVE-20712 HivePointLookupOptimizer should extract deep cases (Zoltan Haindrich reviewed by Jesus Camacho Rodriguez)
HIVE-20563 Vectorization: CASE WHEN expression fails when THEN/ELSE type and result type are different (Matt McCline, reviewed by Teddy Choi)
HIVE-20609 Create SSD cache dir if it doesnt exist already (Miklos Gergely via Sergey Shelukhin)
HIVE-20657 pre-allocate LLAP cache at init time (Sergey Shelukhin, reviewed by Prasanth Jayachandran)
HIVE-20540 Vectorization : Support loading bucketed tables using sorted dynamic partition optimizer- II (Deepak Jaiswal, reviewed by Prasanth Jayachandran)
HIVE-20296 Improve HivePointLookupOptimizerRule to be able to extract from more sophisticated contexts (Zoltan Haindrich reviewed by Ashutosh Chauhan)
HIVE-20513 Vectorization: Improve Fast Vector MapJoin Bytes Hash Tables (Matt McCline, reviewed by Zoltan Haindrich)
HIVE-20013 Add an Implicit cast to date type for to_date function (Nishant Bangarwa via Ashutosh Chauhan)
HIVE-20315 Vectorization: Fix more NULL / Wrong Results issues and avoid unnecessary casts/conversions (Matt McCline, reviewed by Teddy Choi)
HIVE-20339 Vectorization: Lift unneeded restriction causing some PTF with RANK not to be vectorized (Matt McCline, reviewed by Teddy Choi)
HIVE-20367 Vectorization: Support streaming for PTF AVG, MAX, MIN, SUM (Matt McCline, reviewed by Teddy Choi)
HIVE-20418 LLAP IO may not handle ORC files that have row index disabled correctly for queries with no columns selected (Sergey Shelukhin, reviewed by Gopal Vijayaraghavan)
HIVE-20321 Vectorization: Cut down memory size of 1 col VectorHashKeyWrapper to <1 CacheLine (Matt McCline, reviewed by Gopal Vijayaraghavan)
HIVE-20177 Vectorization: Reduce KeyWrapper allocation in GroupBy Streaming mode (Gopal V, reviewed by Matt McCline)
HIVE-20300 VectorFileSinkArrowOperator (Eric Wohlstadter, reviewed by Jason Dere, Matt McCline, Teddy Choi)
HIVE-20277 Vectorization: Case expressions that return BOOLEAN are not supported for FILTER (Matt McCline, reviewed by Gopal Vijayaraghavan)
HIVE-20294 Vectorization: Fix NULL / Wrong Results issues in COALESCE / ELT (Matt McCline, reviewed by Teddy Choi)
HIVE-20252 Semijoin Reduction : Cycles due to semi join branch may remain undetected if small table side has a map join upstream.(Deepak Jaiswal, reviewed by Jesus Camacho Rodriguez)
HIVE-18201 Disable XPROD_EDGE for sq_count_check() created for scalar subqueries (Ashutosh Chauhan via Jesus Camacho Rodriguez)
HIVE-20244 forward port HIVE -19704 to master (Sergey Shelukhin, reviewed by Prasanth Jayachandran)
HIVE-19663 refactor LLAP IO report generation (Sergey Shelukhin, reviewed by Prasanth Jayachandran)
HIVE-20090 Extend creation of semijoin reduction filters to be able to discover new opportunities (Jesus Camacho Rodriguez via Deepak Jaiswal)
HIVE-20069 Fix reoptimization in case of DPP and Semijoin optimization (Zoltan Haindrich reviewed by Ashutosh Chauhan, Zoltan Haindrich)
HIVE-17896 TopNKey: Create a standalone vectorizable TopNKey operator (Teddy Choi, reviewed by Jesus Camacho Rodriguez)
HIVE-20245 Vectorization: Fix NULL / Wrong Results issues in BETWEEN / IN (Matt McCline, reviewed by Teddy Choi)
HIVE-20207 Vectorization: Fix NULL / Wrong Results issues in Filter / Compare (Matt McCline, reviewed by Teddy Choi)
HIVE-20203 Arrow SerDe leaks a DirectByteBuffer (Eric Wohlstadter, reviewed by Teddy Choi)
HIVE-20082 HiveDecimal to string conversion doesn’t format the decimal correctly (Jason Dere, reviewed by Ashutosh Chauhan)
HIVE-19891 inserting into external tables with custom partition directories may cause data loss (Sergey Shelukhin, reviewed by Ashutosh Chauhan)
HIVE-20204 Type conversion during IN () comparisons is using different rules from other comparison operations (Jason Dere, reviewed by Ashutosh Chauhan)
HIVE-19360 CBO: Add an “optimizedSQL” to QueryPlan object (Gopal V, reviewed by Jesus Camacho Rodriguez)
HIVE-20197 Vectorization: Add DECIMAL_64 testing, add Date/Interval/Timestamp arithmetic, and add more GROUP BY Aggregation tests (Matt McCline, reviewed by Teddy Choi)
HIVE-20498 Support date type for column stats autogather (Zoltan Haindrich reviewed by Prasanth Jayachandran)
HIVE-20537 Multi-column joins estimates with uncorrelated columns different in CBO and Hive (Jesus Camacho Rodriguez, reviewed by Ashutosh Chauhan)
HIVE-20503 Use datastructure aware estimations during mapjoin selection (Zoltan Haindrich reviewed by Ashutosh Chauhan)
HIVE-20432 Rewrite BETWEEN to IN for integer types for stats estimation(Vineet Garg, reviewed by Ashutosh Chauhan)
HIVE-20491 Fix mapjoin size estimations for Fast implementation (Zoltan Haindrich reviewed by Ashutosh Chauhan)
HIVE-20821 Rewrite SUM0 into SUM + COALESCE combination (Jesus Camacho Rodriguez, reviewed by Gopal V)
HIVE-20391 HiveAggregateReduceFunctionsRule may infer wrong return type when decomposing aggregate function (Jesus Camacho Rodriguez, reviewed by Ashutosh Chauhan)
HIVE-14431 Recognize COALESCE as CASE (Remus Rusanu, Jesus Camacho Rodriguez via Ashutosh Chauhan, Zoltan Haindrich)
HIVE-17683 Add explain locks <sql> command (Igor Kryvenko via Eugene Koifman)
HIVE-20898 For time related functions arguments may not be casted to a non nullable type (Zoltan Haindrich reviewed by Jesus Camacho Rodriguez)
HIVE-20704 Extend HivePreFilteringRule to support other functions (Jesus Camacho Rodriguez, reviewed by Ashutosh Chauhan)
HIVE-20710 Constant folding may not create null constants without types (Zoltan Haindrich reviewed by Ashutosh Chauhan)
HIVE-18871 hive on tez execution error due to set hive.aux.jars.path to hdfs:// (zhuwei reviewed by Prasanth Jayachandran)
HIVE-20612 Create new join multi-key correlation flag for CBO (Jesus Camacho Rodriguez, reviewed by Ashutosh Chauhan)
HIVE-20013 Add an Implicit cast to date type for to_date function (Nishant Bangarwa via Ashutosh Chauhan)
HIVE-20807 Refactor LlapStatusServiceDriver (Miklos Gergely via Sergey Shelukhin)
HIVE-20019 Ban commons-logging and log4j (Prasanth Jayachandran reviewed by Ashutosh Chauhan)
HIVE-20099 Fix logger for LlapServlet (Rajkumar Singh via Gopal V)
HIVE-20752 In case of LLAP start failure add info how to find YARN logs (Miklos Gergely via Ashutosh Chauhan)
HIVE-20676 HiveServer2: PrivilegeSynchronizer is not set to daemon status (Vaibhav Gumashta via Thejas Nair)
HIVE-20455 Log spew from security.authorization.PrivilegeSynchonizer.run (Daniel Dai, reviewed by Gopal V)
HIVE-20344 PrivilegeSynchronizer for SBA might hit AccessControlException (Daniel Dai, reviewed by Vaibhav Gumashta)
HIVE-20130 Better logging for information schema synchronizer (Daniel Dai, reviewed by Vaibhav Gumashta)
HIVE-20853 Expose ShuffleHandler.registerDag in the llap daemon API (Jaume Marhuenda, reviewed by Sergey Shelukhin)
HIVE-20772 record per-task CPU counters in LLAP (Sergey Shelukhin, reviewed by Prasanth Jayachandran)
HIVE-19568 Active/Passive HS2 HA: Disallow direct connection to passive HS2 instance (Sergey Shelukhin, reviewed by Prasanth Jayachandran)
HIVE-19097 related equals and in operators may cause inaccurate stats estimations (Zoltan Haindrich via Ashutosh Chauhan)
HIVE-20951 LLAP: Set Xms to 50% always (Gopal V via Ashutosh Chauhan)
HIVE-20937 Postgres jdbc query fail with “LIMIT must not be negative” (Daniel Dai, reviewed by Thejas Nair)
HIVE-20910 Insert in bucketed table fails due to dynamic partition sort optimization (Vineet Garg via Ashutosh Chauhan)
HIVE-19701 getDelegationTokenFromMetaStore doesn’t need to be synchronized (Sankar Hariappan, reviewed by Thejas M Nair)
HIVE-20899 Keytab URI for LLAP YARN Service is restrictive to support HDFS only (Gour Saha reviewd by Prasanth Jayachandran)
HIVE-20881 Constant propagation oversimplifies projections (Zoltan Haindrich reviewed by Ashutosh Chauhan)
HIVE-20817 Reading Timestamp datatype via HiveServer2 gives errors (Mahesh Kumar Behera, reviewed by Thejas M Nair)
HIVE-20834 Hive QueryResultCache entries keeping reference to SemanticAnalyzer from cached query (Jason Dere, reviewed by GopalV)
HIVE-20815 JdbcRecordReader.next shall not eat exception (Daniel Dai, reviewed by Thejas Nair)
HIVE-20792 Inserting timestamp with zones truncates the data (Jaume Marhuenda, reviewed by Jesus Camacho Rodriguez)
HIVE-20638 Upgrade version of Jetty to 9.3.25.v20180904 (Laszlo Bodor via Thejas Nair)
HIVE-20746 HiveProtoHookLogger does not close file at end of day. (Harish JP, reviewd by Anishek Agarwal)
HIVE-20768 Adding Tumbling Window UDF (Slim Bouguerra, via Gopal V)
HIVE-20763 Add google cloud storage (gs) to the exim uri schema whitelist (Siddharth Seth, reviewed by Prasanth Jayachandran, Sergey Shelukhin)
HIVE-20477 OptimizedSql is not shown if the expression contains INs (Zoltan Haindrich reviewed by Ashutosh Chauhan)
HIVE-20649 LLAP aware memory manager for Orc writers (Prasanth Jayachandran reviewed by Sergey Shelukhin)
HIVE-20696 msck_*.q tests are broken (Jesus Camacho Rodriguez, reviewed by Prasanth Jayachandran)
HIVE-20719 SELECT statement fails after UPDATE with hive.optimize.sort.dynamic.partition optimization and vectorization on (Eugene Koifman, reviewed by Gopal V)
HIVE-20644 Avoid exposing sensitive infomation through a Hive Runtime exception (Ashutosh Bapat reviewed by Thejas Nair, Sankar Hariappan)
HIVE-20705 Vectorization: Native Vector MapJoin doesn’t support Complex Big Table values (Matt McCline, reviewed by Jason Dere)
HIVE-20700 Add config to diable rollup for druid tables (Nishant Bangarwa via Ashutosh Chauhan)
HIVE-20698 Add better message for NPE when inserting rows with null timestamp to druid (Nishant Bangarwa via Ashutosh Chauhan)
HIVE-20686 Sync query IDs between druid and Hive (Nishant Bangarwa via Ashutosh Chauhan)
HIVE-20648 LLAP: Vector group by operator should use memory per executor
HIVE-20692 Enable folding of NOT x IS (NOT) [TRUE|FALSE] expressions (Zoltan Haindrich reviewed by Ashutosh Chauhan)
HIVE-20623 Shared work: Extend sharing of map-join cache entries in LLAP (Jesus Camacho Rodriguez, reviewed by Gopal V)
HIVE-20691 Fix org.apache.hadoop.hive.cli.TestMiniLlapCliDriver.testCliDriver[cttl] (Jesus Camacho Rodriguez, reviewed by Zoltan Haindrich)
HIVE-10296 Cast exception observed when hive runs a multi join query on metastore (postgres), since postgres pushes the filter into the join, and ignores the condition before applying cast (Karthik Manamcheri, reviewed by Sergey Shelukhin)
HIVE-20618 During join selection BucketMapJoin might be choosen for non bucketed tables (Zoltan Haindrich reviewed by Ashutosh Chauhan, Deepak Jaiswal)
HIVE-20593 Load Data for partitioned ACID tables fails with bucketId out of range: -1 (Deepak Jaiswal, reviewed by Eugene Koifman)
HIVE-20620 manifest collisions when inserting into bucketed sorted MM tables with dynamic partitioning (Sergey Shelukhin, reviewed by Deepak Jaiswal)
HIVE-20626 Log more details when druid metastore transaction fails in callback (Nishant Bangarwa via Ashutosh Chauhan)
HIVE-20267 Expanding WebUI to include form to dynamically config log levels (Zoltan Chovan reviewed by Prasanth Jayachandran)
HIVE-20349 Implement Retry Logic in HiveDruidSplit for Scan Queries (Nishant Bangarwa via Ashutosh Chauhan)
HIVE-20582 Make hflush in hive proto logging configurable (Prasanth Jayachandran reviewed by Thejas M Nair)
HIVE-20583 Use canonical hostname only for kerberos auth in HiveConnection (Prasanth Jayachandran reviewed by Gopal V)
HIVE-20558 Change default of hive.hashtable.key.count.adjustment to 0.99
HIVE-20524 Schema Evolution checking is broken in going from Hive version 2 to version 3 for ALTER TABLE VARCHAR to DECIMAL
HIVE-20541 REPL DUMP on external table with add partition event throws NoSuchElementException (Sankar Hariappan, reviewed by Anishek Agarwal)
HIVE-20412 NPE in HiveMetaHook (Jason Dere, reviewed by Sergey Shelukhin)
HIVE-19552 Enable TestMiniDruidKafkaCliDriver#druidkafkamini_basic.q (Nishant Bangarwa, reviewed by Jesus Camacho Rodriguez)
HIVE-20521 HS2 doAs=true has permission issue with hadoop.tmp.dir, with MR and S3A filesystem (Thejas Nair, reviewed by Zoltan Haindrich)
HIVE-20510 Vectorization: Support loading bucketed tables using sorted dynamic partition optimizer (Deepak Jaiswal, reviewed by Gopal Vijayarahavan, Matt Mccline, and Thejas Nair)
HIVE-20508 Hive does not support user names of type “user@realm” (Deepak Jaiswal, reviewed by Thejas Nair)
HIVE-20522 HiveFilterSetOpTransposeRule may throw assertion error due to nullability of fields (Jesus Camacho Rodriguez, reviewed by Ashutosh Chauhan)
HIVE-20515 Empty query results when using results cache and query temp dir, results cache dir in different filesystems (Jason Dere via Thejas Nair)
HIVE-20499 GetTablesOperation pull all the tables meta irrespective of auth. (Rajkumar Singh via Ashutosh Chauhan)
HIVE-19993 Using a table alias which also appears as a column name is not possible (Jaume M via Zoltan Haindrich)
HIVE-20476 CopyUtils used by REPL LOAD and EXPORT/IMPORT operations ignore distcp error (Sankar Hariappan, reviewed by Mahesh Kumar Behera, Thejas M Nair)
HIVE-20496 Vectorization: Vectorized PTF IllegalStateException (Matt McCline, reviewed by Teddy Choi)
HIVE-20433 Implicit String to Timestamp conversion is slow (Deepak Jaiswal, reviewed by Gopal Vijayaraghavan)
HIVE-20439 addendum
HIVE-20352 Vectorization: Support grouping function (Matt McCline, reviewed by Teddy Choi)
HIVE-20399 CTAS w/a custom table location that is not fully qualified fails for MM tables (Sergey Shelukhin, reviewed by Ashutosh Chauhan)
HIVE-20409 Hive ACID: Update/delete/merge does not clean hdfs staging directory (Rajkumar Singh, reviewed by Vineet Garg)
HIVE-18624 Parsing time is extremely high (~10 min) for queries with complex select expressions (Zoltan Haindrich reviewed by Ashutosh Chauhan)
HIVE-20406 Addendum patch
HIVE-20383 Invalid queue name and synchronisation issues in hive proto events hook.(Harish JP, reviewd by Anishek Agarwal)
HIVE-14898 HS2 shouldn’t log callstack for an empty auth header error
HIVE-20389 NPE in SessionStateUserAuthenticator when authenticator=SessionStateUserAuthenticator (Daniel Dai, reviewed by Thejas Nair)
HIVE-20345 Drop database may hang if the tables get deleted from a different call (Aihua Xu, reviewed by Naveen Gangam)
HIVE-19316 StatsTask fails due to ClassCastException (Jaume Marhuenda, reviewed by Jesus Camacho Rodriguez)
HIVE-20361 ReplDumpTaskTest is failing (Sankar Hariappan, reviewed by Jesus Camacho Rodriguez)
HIVE-20353 Follow redirects when hive connects to a passive druid overlord/coordinator (Nishant Bangarwa via Ashutosh Chauhan)
HIVE-20336 Masking and filtering policies for materialized views (Jesus Camacho Rodriguez, reviewed by Ashutosh Chauhan)
HIVE-20326 Create constraints with RELY as default instead of NO RELY (Vineet Garg, reviewed by Jesus Camacho Rodriguez)
HIVE-20118 SessionStateUserAuthenticator.getGroupNames() is always empty (Daniel Dai, reviewed by Thejas Nair)
HIVE-20290 Lazy initialize ArrowColumnarBatchSerDe so it doesn’t allocate buffers during GetSplits (Eric Wohlstadter, reviewed by Jason Dere)
HIVE-20301 Enable vectorization for materialized view rewriting tests (Jesus Camacho Rodriguez, reviewed by Ashutosh Chauhan)
HIVE-20302 LLAP: non-vectorized execution in IO ignores virtual columns, including ROW__ID (Jesus Camacho Rodriguez, reviewed by Sergey Shelukhin)
HIVE-20299 potential race in LLAP signer unit test (Sergey Shelukhin, reviewed by Prasanth Jayachandran)
HIVE-20281 SharedWorkOptimizer fails with ‘operator cache contents and actual plan differ’ (Jesus Camacho Rodriguez, reviewed by Ashutosh Chauhan)
HIVE-20101 BloomKFilter: Avoid using the local byte[] arrays entirely (Gopal V, reviewed by Prasanth Jayachandran)
HIVE-20263 Typo in HiveReduceExpressionsWithStatsRule variable (Jesus Camacho Rodriguez, reviewed by Ashutosh Chauhan)
HIVE-20212 Hiveserver2 in http mode emitting metric default.General.open_connections incorrectly (Jesus Camacho Rodriguez, reviewed by Zoltan Haindrich)
HIVE-20228 configure repl configuration directories based on user running hiveserver2 (Anishek Agarwal, reviewed by Sankar Hariappan)
HIVE-19935 Hive WM session killed: Failed to update LLAP tasks count (Sergey Shelukhin, reviewed by Prasanth Jayachandran)
HIVE-20192 HS2 with embedded metastore is leaking JDOPersistenceManager objects (Sankar Hariappan, reviewed by Vihang Karajgaonkar)
HIVE-20174 Vectorization: Fix NULL / Wrong Results issues in GROUP BY Aggregation Functions (Matt McCline, reviewed by Teddy Choi)
HIVE-20127 fix some issues with LLAP Parquet cache (Sergey Shelukhin, reviewed by Prasanth Jayachandran)
HIVE-20149 TestHiveCli failing/timing out(Vineet Garg, reviewed by Zoltan Haindrich)
HIVE-20183 Inserting from bucketed table can cause data loss, if the source table contains empty bucket (Peter Vary, reviewed by Naveen Gangam)
HIVE-20172 StatsUpdater failed with GSS Exception while trying to connect to remote metastore (Rajkumar Singh via Ashutosh Chauhan)
HIVE-20165 Enable ZLIB for streaming ingest (Prasanth Jayachandran, via Gopal V)
HIVE-20116 TezTask is using parent logger (Prasanth Jayachandran reviewed by Sergey Shelukhin)
HIVE-20152 reset db state, when repl dump fails, so rename table can be done (Anishek Agarwal, reviewed by Sankar Hariappan)
HIVE-20185 Backport HIVE -20111 to branch-3 (Daniel Voros via Zoltan Haindrich)
HIVE-20085 Druid-Hive (managed) table creation fails with strict managed table checks: Table is marked as a managed table but is not transactional (Nishant Bangarwa via Ashutosh Chauhan)
HIVE-19981 Managed tables converted to external tables by the HiveStrictManagedMigration utility should be set to delete data when the table is dropped (Jason Dere, reviewed by Daniel Dai)
HIVE-20147 Hive streaming ingest is contented on synchronized logging (Prasanth Jayachandran reviewed by Sergey Shelukhin)
HIVE-19387 Truncate table for Acid tables conflicts with ResultSet cache (Eugene Koifman, reviewed by Jason Dere)
HIVE-20088 Beeline config location path is assembled incorrectly (Denes Bodo via Zoltan Haindrich)
HIVE-20129 Revert to position based schema evolution for orc tables (Prasanth Jayachandran reviewed by Gunther Hagleitner)
HIVE-20126 OrcInputFormat does not pass conf to orc reader options (Prasanth Jayachandran reviewed by Sergey Shelukhin)
HIVE-19970 Replication dump has a NPE when table is empty (Mahesh Kumar Behera, reviewed by Peter Vary, Sankar Hariappan)
HIVE-20103 WM: Only Aggregate DAG counters if at least one is used (Gopal V, reviewed by Prasanth Jayachandran)
HIVE-19812 Disable external table replication by default via a configuration property (Mahesh Kumar Behera, reviewed by Sankar Hariappan)
HIVE-20182 Backport HIVE -20067 to branch-3 (Daniel Voros via Zoltan Haindrich)
HIVE-20025 Clean-up of event files created by HiveProtoLoggingHook (Sankar Hariappan, reviewed by Harish Jaiprakash, Anishek Agarwal)
HIVE-20039 Bucket pruning: Left Outer Join on bucketed table gives wrong result (Deepak Jaiswal, reviewed by Gopal V)
HIVE-19850 Dynamic partition pruning in Tez is leading to ‘No work found for tablescan’ error (Ganesha Shreedhara via Ashutosh Chauhan)
HIVE-19860 HiveServer2 ObjectInspectorFactory memory leak with cachedUnionStructObjectInspector (Rajkumar Singh)
HIVE-19326 stats auto gather: incorrect aggregation during UNION queries (may lead to incorrect results) (Zoltan Haindrich via Ashutosh Chauhan)
HIVE-19995 Aggregate row traffic for acid tables (Zoltan Haindrich reviewed by Ashutosh Chauhan)
HIVE-20051 Skip authorization for temp tables (Zoltan Haindrich reviewed by Igor Kryvenko, Jason Dere)
HIVE-17840 HiveMetaStore eats exception if transactionalListeners.notifyEvent fail (Sankar Hariappan, reviewed by Mahesh Kumar Behera, Daniel Dai)
HIVE-20059 Hive streaming should try shade prefix unconditionally on exception (Prasanth Jayachandran reviewed by Vineet Garg)
HIVE-20038 Update queries on non-bucketed + partitioned tables throws NPE (Prasanth Jayachandran reviewed by Gopal V)
HIVE-20028 Metastore client cache config is used incorrectly (Prasanth Jayachandran reviewed by Sergey Shelukhin)
HIVE-19967 SMB Join: Need Optraits for PTFOperator ala GBY Op (Deepak Jaiswal, reviewed by Jason Dere)
HIVE-20004 Wrong scale used by ConvertDecimal64ToDecimal results in incorrect results (addendum)
HIVE-20004 Wrong scale used by ConvertDecimal64ToDecimal results in incorrect results (Prasanth Jayachandran reviewed by Matt McCline)
HIVE-19432 GetTablesOperation is too slow if the hive has too many databases and tables (Rajkumar Singh via Ashutosh Chauhan)
HIVE-19285 Add logs to the subclasses of MetaDataOperation (Marta Kuczora, via Peter Vary)
HIVE-20009 Fix runtime stats for merge statement (Zoltan Haindrich via Ashutosh Chauhan)
HIVE-19989 Metastore uses wrong application name for HADOOP2 metrics (Vineet Garg, reviewed by Alan Gates)
HIVE-20011 Move away from append mode in proto logging hook (Harish JP, reviewd by Anishek Agarwal)
HIVE-19564 Vectorization: Fix NULL / Wrong Results issues in Arithmetic (Matt McCline, reviewed by Teddy Choi)
HIVE-20180 Backport HIVE -19759 to branch-3 (Daniel Voros via Zoltan Haindrich)
HIVE-6980 rop table by using direct sql (Peter Vary, reviewed by Alexander Kolbasov, Vihang Karajgaonkar)
HIVE-19628 possible NPE in LLAP testSigning (Sergey Shelukhin, reviewed by Jason Dere)
HIVE-19979 Backport HIVE -19304 to branch-3 (Daniel Voros via Zoltan Haindrich)
HIVE-19313 TestJdbcWithDBTokenStoreNoDoAs tests are failing (Peter Vary via Ashutosh Chauhan)
HIVE-18986 Table rename will run java.lang.StackOverflowError in dataNucleus if the table contains large number of columns (Aihua Xu, reviewed by Yongzhi Chen)
HIVE-19104 When test MetaStore is started with retry the instances should be independent (Peter Vary, reviewed by Sahil Takiar)
HIVE-21177 ACID:Optimize AcidUtils.getLogicalLength()
HIVE-21660 Wrong result when union all and later view with explode is used (Ganesha Shreedhara via Jesus Camacho Rodriguez)
HIVE-22891 To Skip PartitionDesc Extraction For Non-LLAP Execution Mode (Syed Shameerur Rahman, reviewed by Ádám Szita)