Backported Issues from Open-source Hive

This table contains the list of issues backported from open-source Hive.

List of issues backported from open-source Hive

Open Source JIRA ID

Description

HIVE-20660

Group by statistics estimation could be improved by bounding the total number of rows to source table (Vineet Garg, reviewed by Ashutosh Chauhan)

HIVE-20702

Account for overhead from datastructure aware estimations during mapjoin selection (Jesus Camacho Rodriguez, reviewed by Zoltan Haindrich)

HIVE-22243

Align Apache Thrift version to 0.9.3-1 in standalone-metastore as well (David Lavati via László Bodor)

HIVE-21173

Upgrade Apache Thrift to 0.9.3-1 (David Lavati via Laszlo Bodor)

HIVE-20953

Remove a function from function registry when it can not be added to the metastore when creating it. (Ashutosh Bapat, reviewed by Mahesh Kumar Behera)

HIVE-20762

NOTIFICATION_LOG cleanup interval is hardcoded as 60s and is too small (Sankar Hariappan, reviewed by Thejas Nair)

HIVE-20761

Select for update on notification_sequence table has retry interval and retries count too small (Sankar Hariappan, reviewed by Thejas M Nair)

HIVE-20646

Partition filter condition is not pushed down to metastore query if it has IS NOT NULL (Sankar Hariappan, reviewed by Daniel Dai)

HIVE-21924

Split text files even if header/footer exists (Mustafa Iman, reviewed by Sankar Hariappan)

HIVE-22134

HIVE-22129: Remove glassfish.jersey and mssql-jdbc classes from jdbc-standalone jar (Naveen Gangam, reviewed by Thejas Nair

HIVE-21915

Hive with TEZ UNION ALL and UDTF results in data loss (Wei Zhang, reviewed by Vineet Garg)

HIVE-21799

NullPointerException in DynamicPartitionPruningOptimization, when join key is on aggregation column (Jason Dere, reviewed by Vineet Garg)

HIVE-21182

Skip setting up hive scratch dir during planning (Vineet Garg, reviewed by Ashutosh Chauhan)

HIVE-20804

Further improvements to group by optimization with constraints (Vineet Garg, reviewed by Jesus Camacho Rodriguez)

HIVE-20703

Put dynamic sort partition optimization under cost based decision (Vineet Garg, reviewed by Prasanth Jayachandran, Sergey Shelukhin)

HIVE-20918

Flag to enable/disable pushdown of computation from Calcite into JDBC connection (Jesus Camacho Rodriguez, reviewed by Thejas M Nair)

HIVE-20744

Use SQL constraints to improve join reordering algorithm (Jesus Camacho Rodriguez, reviewed by Ashutosh Chauhan)

HIVE-20976

JDBC queries containing joins gives wrong results (Jesus Camacho Rodriguez, reviewed by Ashutosh Chauhan)

HIVE-20830

JdbcStorageHandler range query assertion failure in some cases (Daniel Dai, reviewed by Thejas Nair)

HIVE-20829

JdbcStorageHandler range split throws NPE (Daniel Dai, reviewed by Thejas Nair)

HIVE-20720

Add partition column option to JDBC handler (Daniel Dai, reviewed by Jesus Camacho Rodriguez)

HIVE-20651

JdbcStorageHandler password should be encrypted (Daniel Dai, reviewed by Sankar Hariappan)

HIVE-20652

JdbcStorageHandler push join of two different datasource to jdbc driver (Jesus Camacho Rodriguez, reviewed by Ashutosh Chauhan)

HIVE-20095

Fix feature to push computation to jdbc external tables (Jonathan Doron, reviewed by Jesus Camacho Rodriguez)

HIVE-20718

Add perf cli driver with constraints (Jesus Camacho Rodriguez, reviewed by Ashutosh Chauhan)

HIVE-18862

qfiles: prepare .q files for using datasets (Laszlo Bodor via Zoltan Haindrich) Signed-off-by: Zoltan Haindrich <kirk@rxd.hu> (cherry picked from commit 86b678f502bc6bb86c2279ef1288627bd8e7421a)

HIVE-17040

Join elimination in the presence of FK relationship (Jesus Camacho Rodriguez, reviewed by Vineet Garg)

HIVE-20213

Upgrade Calcite to 1.17.0 (Jesus Camacho Rodriguez, reviewed by Ashutosh Chauhan)

HIVE-22275

OperationManager.queryIdOperation does not properly clean up multiple queryIds (Jason Dere, reviewed by Prasanth Jayachandran)

HIVE-22221

Llap external client- Need to reduce LlapBaseInputFormat#getSplits() footprint (Shubham Chaurasia, reviewed by Jason Dere)

HIVE-21641

Llap external client returns decimal columns in different precision/scale as compared to beeline (Shubham Chaurasia, via Gopal V)

HIVE-21041

NPE, ParseException in getting schema from logical plan (Teddy Choi, reviewed by Jesus Camacho Rodriguez)

HIVE-21126

Allow session level queries in LlapBaseInputFormat#getSplits() before actual get_splits() call (Shubham Chaurasia, reviewed by Teddy Choi)

HIVE-20552

Get Schema from LogicalPlan faster (Teddy Choi, reviewed by Jesus Camacho Rodriguez)

HIVE-21270

A UDTF to show schema (column names and types) of given query (Shubham Chaurasia, reviewed by Mani M, Sankar Hariappan)

HIVE-21864

LlapBaseInputFormat#closeAll() throws ConcurrentModificationException (Shubham Chaurasia, reviewed by Jason Dere)

HIVE-22231

Hive query with big size via knox fails with Broken pipe Write failed (Denys Kuzmenko via Peter Vary)

HIVE-22219

Bringing a node manager down blocks restart of LLAP service (Jesus Camacho Rodriguez, reviewed by Slim Bouguerra)

HIVE-22204

Beeline option to show/not show execution report (Jesus Camacho Rodriguez, reviewed by Ashutosh Chauhan)

HIVE-21866

LLAP status service driver may get stuck with wrong Yarn app ID (Adam Szita, reviewed by Marta Kuczoram)

HIVE-21009

Adding ability for user to set bind user (David McGinnis reviewed by Prasanth Jayachandran) (addendum)

HIVE-21009

Adding ability for user to set bind user (David McGinnis reviewed by Prasanth Jayachandran)

HIVE-20767

Multiple project between join operators may affect join reordering using constraints (Jesus Camacho Rodriguez, reviewed by Ashutosh Chauhan)

HIVE-20320

Turn on hive.optimize.remove.sq_count_check flag (Vineet Garg, reviewed by Ashutosh Chauhan)

HIVE-20292

Bad join ordering in tpcds query93 with primary constraint defined (Vineet Garg, reviewed by Ashutosh Chauhan)

HIVE-22169

Tez: SplitGenerator tries to look for plan files which won’t exist for Tez (Gopal V via Vineet Garg)

HIVE-22121

Turning on hive.tez.bucket.pruning produce wrong results (Vineet Garg, reviewed by Gopal V)

HIVE-22115

Prevent the creation of query routing appender if property is set to false (Slim Bouguerra reviewed by Gopal V)

HIVE-21171

Skip creating scratch dirs for tez if RPC is on (Vineet Garg, reviewed by Ashutosh Chauhan)

HIVE-20940

Bridge cases in which Calcite’s type resolution is more stricter than Hive. (Zoltan Haindrich reviewed by Ashutosh Chauhan) (addendum)

HIVE-20940

Bridge cases in which Calcite’s type resolution is more stricter than Hive. (Zoltan Haindrich reviewed by Ashutosh Chauhan)

HIVE-22241

Implement UDF to interpret date/timestamp using its internal representation and Gregorian-Julian hybrid calendar (Jesus Camacho Rodriguez, reviewed by Adam Szita)

HIVE-22106

Remove cross-query synchronization for the partition-eval (Slim B via Gopal V)

HIVE-22168

Remove very expensive logging from the llap cache hotpath (Slim B via Jesus Camacho Rodriguez)

HIVE-22161

UDF: FunctionRegistry synchronizes on org.apache.hadoop.hive.ql.udf.UDFType class (Gopal V, reviewed by Ashutosh Chauhan)

HIVE-22120

Fix wrong results/ArrayOutOfBound exception in left outer map joins on specific boundary conditions (Ramesh Kumar Thangarajan, reviewed by Jason Dere)

HIVE-19661

switch Hive UDFs to use Re2J regex engine (Rajkumar Singh via Ashutosh Chauhan)

HIVE-20873

Use Murmur hash for VectorHashKeyWrapperTwoLong to reduce hash collision (Teddy Choi, reviewed by Gopal V)

HIVE-20617

Fix type of constants in IN expressions to have correct type (Zoltan Haindrich via Jesus Camacho Rodriguez)

HIVE-20193

cboInfo is not present in the explain plan json (Anishek Agarwal, reviewed by Ashutosh Chauhan)

HIVE-20684

Make compute stats work for Druid tables (Nishant Bangarwa via Ashutosh Chauhan)

HIVE-20636

Improve number of null values estimation after outer join (Jesus Camacho Rodriguez, reviewed by Ashutosh Chauhan)

HIVE-20366

TPC-DS query78 stats estimates are off for is null filter(Vineet Garg, reviewed by Ashutosh Chauhan)

HIVE-19992

Vectorization: Follow-on to HIVE -19951 –> add call to SchemaEvolution.isOnlyImplicitConversion to disable encoded LLAP I/O for ORC only when data type conversion is not implicit (Matt McCline, reviewed by Prasanth Jayachandran)

HIVE-20988

Wrong results for group by queries with primary key on multiple columns (Vineet Garg, reviewed by Jesus Camacho Rodriguez)

HIVE-20985

If select operator inputs are temporary columns vectorization may reuse some of them as output (Zoltan Haindrich reviewed by Teddy Choi)

HIVE-20954

Vector RS operator is not using uniform hash function for TPC-DS query 95 (Teddy Choi, reviewed by Gopal V)

HIVE-20775

Factor cost of each SJ reduction when costing a follow-up reduction (Jesus Camacho Rodriguez, reviewed by Ashutosh Chauhan)

HIVE-20788

Extended SJ reduction may backtrack columns incorrectly when creating filters (Jesus Camacho Rodriguez, reviewed by Deepak Jaiswal)

HIVE-17043

Remove non unique columns from group by keys if not referenced later (Vineet Garg, reviewed by Jesus Camacho Rodriguez)

HIVE-20712

HivePointLookupOptimizer should extract deep cases (Zoltan Haindrich reviewed by Jesus Camacho Rodriguez)

HIVE-20563

Vectorization: CASE WHEN expression fails when THEN/ELSE type and result type are different (Matt McCline, reviewed by Teddy Choi)

HIVE-20609

Create SSD cache dir if it doesnt exist already (Miklos Gergely via Sergey Shelukhin)

HIVE-20657

pre-allocate LLAP cache at init time (Sergey Shelukhin, reviewed by Prasanth Jayachandran)

HIVE-20540

Vectorization : Support loading bucketed tables using sorted dynamic partition optimizer- II (Deepak Jaiswal, reviewed by Prasanth Jayachandran)

HIVE-20296

Improve HivePointLookupOptimizerRule to be able to extract from more sophisticated contexts (Zoltan Haindrich reviewed by Ashutosh Chauhan)

HIVE-20513

Vectorization: Improve Fast Vector MapJoin Bytes Hash Tables (Matt McCline, reviewed by Zoltan Haindrich)

HIVE-20013

Add an Implicit cast to date type for to_date function (Nishant Bangarwa via Ashutosh Chauhan)

HIVE-20315

Vectorization: Fix more NULL / Wrong Results issues and avoid unnecessary casts/conversions (Matt McCline, reviewed by Teddy Choi)

HIVE-20339

Vectorization: Lift unneeded restriction causing some PTF with RANK not to be vectorized (Matt McCline, reviewed by Teddy Choi)

HIVE-20367

Vectorization: Support streaming for PTF AVG, MAX, MIN, SUM (Matt McCline, reviewed by Teddy Choi)

HIVE-20418

LLAP IO may not handle ORC files that have row index disabled correctly for queries with no columns selected (Sergey Shelukhin, reviewed by Gopal Vijayaraghavan)

HIVE-20321

Vectorization: Cut down memory size of 1 col VectorHashKeyWrapper to <1 CacheLine (Matt McCline, reviewed by Gopal Vijayaraghavan)

HIVE-20177

Vectorization: Reduce KeyWrapper allocation in GroupBy Streaming mode (Gopal V, reviewed by Matt McCline)

HIVE-20300

VectorFileSinkArrowOperator (Eric Wohlstadter, reviewed by Jason Dere, Matt McCline, Teddy Choi)

HIVE-20277

Vectorization: Case expressions that return BOOLEAN are not supported for FILTER (Matt McCline, reviewed by Gopal Vijayaraghavan)

HIVE-20294

Vectorization: Fix NULL / Wrong Results issues in COALESCE / ELT (Matt McCline, reviewed by Teddy Choi)

HIVE-20252

Semijoin Reduction : Cycles due to semi join branch may remain undetected if small table side has a map join upstream.(Deepak Jaiswal, reviewed by Jesus Camacho Rodriguez)

HIVE-18201

Disable XPROD_EDGE for sq_count_check() created for scalar subqueries (Ashutosh Chauhan via Jesus Camacho Rodriguez)

HIVE-20244

forward port HIVE -19704 to master (Sergey Shelukhin, reviewed by Prasanth Jayachandran)

HIVE-19663

refactor LLAP IO report generation (Sergey Shelukhin, reviewed by Prasanth Jayachandran)

HIVE-20090

Extend creation of semijoin reduction filters to be able to discover new opportunities (Jesus Camacho Rodriguez via Deepak Jaiswal)

HIVE-20069

Fix reoptimization in case of DPP and Semijoin optimization (Zoltan Haindrich reviewed by Ashutosh Chauhan, Zoltan Haindrich)

HIVE-17896

TopNKey: Create a standalone vectorizable TopNKey operator (Teddy Choi, reviewed by Jesus Camacho Rodriguez)

HIVE-20245

Vectorization: Fix NULL / Wrong Results issues in BETWEEN / IN (Matt McCline, reviewed by Teddy Choi)

HIVE-20207

Vectorization: Fix NULL / Wrong Results issues in Filter / Compare (Matt McCline, reviewed by Teddy Choi)

HIVE-20203

Arrow SerDe leaks a DirectByteBuffer (Eric Wohlstadter, reviewed by Teddy Choi)

HIVE-20082

HiveDecimal to string conversion doesn’t format the decimal correctly (Jason Dere, reviewed by Ashutosh Chauhan)

HIVE-19891

inserting into external tables with custom partition directories may cause data loss (Sergey Shelukhin, reviewed by Ashutosh Chauhan)

HIVE-20204

Type conversion during IN () comparisons is using different rules from other comparison operations (Jason Dere, reviewed by Ashutosh Chauhan)

HIVE-19360

CBO: Add an “optimizedSQL” to QueryPlan object (Gopal V, reviewed by Jesus Camacho Rodriguez)

HIVE-20197

Vectorization: Add DECIMAL_64 testing, add Date/Interval/Timestamp arithmetic, and add more GROUP BY Aggregation tests (Matt McCline, reviewed by Teddy Choi)

HIVE-20498

Support date type for column stats autogather (Zoltan Haindrich reviewed by Prasanth Jayachandran)

HIVE-20537

Multi-column joins estimates with uncorrelated columns different in CBO and Hive (Jesus Camacho Rodriguez, reviewed by Ashutosh Chauhan)

HIVE-20503

Use datastructure aware estimations during mapjoin selection (Zoltan Haindrich reviewed by Ashutosh Chauhan)

HIVE-20432

Rewrite BETWEEN to IN for integer types for stats estimation(Vineet Garg, reviewed by Ashutosh Chauhan)

HIVE-20491

Fix mapjoin size estimations for Fast implementation (Zoltan Haindrich reviewed by Ashutosh Chauhan)

HIVE-20821

Rewrite SUM0 into SUM + COALESCE combination (Jesus Camacho Rodriguez, reviewed by Gopal V)

HIVE-20391

HiveAggregateReduceFunctionsRule may infer wrong return type when decomposing aggregate function (Jesus Camacho Rodriguez, reviewed by Ashutosh Chauhan)

HIVE-14431

Recognize COALESCE as CASE (Remus Rusanu, Jesus Camacho Rodriguez via Ashutosh Chauhan, Zoltan Haindrich)

HIVE-17683

Add explain locks <sql> command (Igor Kryvenko via Eugene Koifman)

HIVE-20898

For time related functions arguments may not be casted to a non nullable type (Zoltan Haindrich reviewed by Jesus Camacho Rodriguez)

HIVE-20704

Extend HivePreFilteringRule to support other functions (Jesus Camacho Rodriguez, reviewed by Ashutosh Chauhan)

HIVE-20710

Constant folding may not create null constants without types (Zoltan Haindrich reviewed by Ashutosh Chauhan)

HIVE-18871

hive on tez execution error due to set hive.aux.jars.path to hdfs:// (zhuwei reviewed by Prasanth Jayachandran)

HIVE-20612

Create new join multi-key correlation flag for CBO (Jesus Camacho Rodriguez, reviewed by Ashutosh Chauhan)

HIVE-20013

Add an Implicit cast to date type for to_date function (Nishant Bangarwa via Ashutosh Chauhan)

HIVE-20807

Refactor LlapStatusServiceDriver (Miklos Gergely via Sergey Shelukhin)

HIVE-20019

Ban commons-logging and log4j (Prasanth Jayachandran reviewed by Ashutosh Chauhan)

HIVE-20099

Fix logger for LlapServlet (Rajkumar Singh via Gopal V)

HIVE-20752

In case of LLAP start failure add info how to find YARN logs (Miklos Gergely via Ashutosh Chauhan)

HIVE-20676

HiveServer2: PrivilegeSynchronizer is not set to daemon status (Vaibhav Gumashta via Thejas Nair)

HIVE-20455

Log spew from security.authorization.PrivilegeSynchonizer.run (Daniel Dai, reviewed by Gopal V)

HIVE-20344

PrivilegeSynchronizer for SBA might hit AccessControlException (Daniel Dai, reviewed by Vaibhav Gumashta)

HIVE-20130

Better logging for information schema synchronizer (Daniel Dai, reviewed by Vaibhav Gumashta)

HIVE-20853

Expose ShuffleHandler.registerDag in the llap daemon API (Jaume Marhuenda, reviewed by Sergey Shelukhin)

HIVE-20772

record per-task CPU counters in LLAP (Sergey Shelukhin, reviewed by Prasanth Jayachandran)

HIVE-19568

Active/Passive HS2 HA: Disallow direct connection to passive HS2 instance (Sergey Shelukhin, reviewed by Prasanth Jayachandran)

HIVE-19097

related equals and in operators may cause inaccurate stats estimations (Zoltan Haindrich via Ashutosh Chauhan)

HIVE-20951

LLAP: Set Xms to 50% always (Gopal V via Ashutosh Chauhan)

HIVE-20937

Postgres jdbc query fail with “LIMIT must not be negative” (Daniel Dai, reviewed by Thejas Nair)

HIVE-20910

Insert in bucketed table fails due to dynamic partition sort optimization (Vineet Garg via Ashutosh Chauhan)

HIVE-19701

getDelegationTokenFromMetaStore doesn’t need to be synchronized (Sankar Hariappan, reviewed by Thejas M Nair)

HIVE-20899

Keytab URI for LLAP YARN Service is restrictive to support HDFS only (Gour Saha reviewd by Prasanth Jayachandran)

HIVE-20881

Constant propagation oversimplifies projections (Zoltan Haindrich reviewed by Ashutosh Chauhan)

HIVE-20817

Reading Timestamp datatype via HiveServer2 gives errors (Mahesh Kumar Behera, reviewed by Thejas M Nair)

HIVE-20834

Hive QueryResultCache entries keeping reference to SemanticAnalyzer from cached query (Jason Dere, reviewed by GopalV)

HIVE-20815

JdbcRecordReader.next shall not eat exception (Daniel Dai, reviewed by Thejas Nair)

HIVE-20792

Inserting timestamp with zones truncates the data (Jaume Marhuenda, reviewed by Jesus Camacho Rodriguez)

HIVE-20638

Upgrade version of Jetty to 9.3.25.v20180904 (Laszlo Bodor via Thejas Nair)

HIVE-20746

HiveProtoHookLogger does not close file at end of day. (Harish JP, reviewd by Anishek Agarwal)

HIVE-20768

Adding Tumbling Window UDF (Slim Bouguerra, via Gopal V)

HIVE-20763

Add google cloud storage (gs) to the exim uri schema whitelist (Siddharth Seth, reviewed by Prasanth Jayachandran, Sergey Shelukhin)

HIVE-20477

OptimizedSql is not shown if the expression contains INs (Zoltan Haindrich reviewed by Ashutosh Chauhan)

HIVE-20649

LLAP aware memory manager for Orc writers (Prasanth Jayachandran reviewed by Sergey Shelukhin)

HIVE-20696

msck_*.q tests are broken (Jesus Camacho Rodriguez, reviewed by Prasanth Jayachandran)

HIVE-20719

SELECT statement fails after UPDATE with hive.optimize.sort.dynamic.partition optimization and vectorization on (Eugene Koifman, reviewed by Gopal V)

HIVE-20644

Avoid exposing sensitive infomation through a Hive Runtime exception (Ashutosh Bapat reviewed by Thejas Nair, Sankar Hariappan)

HIVE-20705

Vectorization: Native Vector MapJoin doesn’t support Complex Big Table values (Matt McCline, reviewed by Jason Dere)

HIVE-20700

Add config to diable rollup for druid tables (Nishant Bangarwa via Ashutosh Chauhan)

HIVE-20698

Add better message for NPE when inserting rows with null timestamp to druid (Nishant Bangarwa via Ashutosh Chauhan)

HIVE-20686

Sync query IDs between druid and Hive (Nishant Bangarwa via Ashutosh Chauhan)

HIVE-20648

LLAP: Vector group by operator should use memory per executor

HIVE-20692

Enable folding of NOT x IS (NOT) [TRUE|FALSE] expressions (Zoltan Haindrich reviewed by Ashutosh Chauhan)

HIVE-20623

Shared work: Extend sharing of map-join cache entries in LLAP (Jesus Camacho Rodriguez, reviewed by Gopal V)

HIVE-20691

Fix org.apache.hadoop.hive.cli.TestMiniLlapCliDriver.testCliDriver[cttl] (Jesus Camacho Rodriguez, reviewed by Zoltan Haindrich)

HIVE-10296

Cast exception observed when hive runs a multi join query on metastore (postgres), since postgres pushes the filter into the join, and ignores the condition before applying cast (Karthik Manamcheri, reviewed by Sergey Shelukhin)

HIVE-20618

During join selection BucketMapJoin might be choosen for non bucketed tables (Zoltan Haindrich reviewed by Ashutosh Chauhan, Deepak Jaiswal)

HIVE-20593

Load Data for partitioned ACID tables fails with bucketId out of range: -1 (Deepak Jaiswal, reviewed by Eugene Koifman)

HIVE-20620

manifest collisions when inserting into bucketed sorted MM tables with dynamic partitioning (Sergey Shelukhin, reviewed by Deepak Jaiswal)

HIVE-20626

Log more details when druid metastore transaction fails in callback (Nishant Bangarwa via Ashutosh Chauhan)

HIVE-20267

Expanding WebUI to include form to dynamically config log levels (Zoltan Chovan reviewed by Prasanth Jayachandran)

HIVE-20349

Implement Retry Logic in HiveDruidSplit for Scan Queries (Nishant Bangarwa via Ashutosh Chauhan)

HIVE-20582

Make hflush in hive proto logging configurable (Prasanth Jayachandran reviewed by Thejas M Nair)

HIVE-20583

Use canonical hostname only for kerberos auth in HiveConnection (Prasanth Jayachandran reviewed by Gopal V)

HIVE-20558

Change default of hive.hashtable.key.count.adjustment to 0.99

HIVE-20524

Schema Evolution checking is broken in going from Hive version 2 to version 3 for ALTER TABLE VARCHAR to DECIMAL

HIVE-20541

REPL DUMP on external table with add partition event throws NoSuchElementException (Sankar Hariappan, reviewed by Anishek Agarwal)

HIVE-20412

NPE in HiveMetaHook (Jason Dere, reviewed by Sergey Shelukhin)

HIVE-19552

Enable TestMiniDruidKafkaCliDriver#druidkafkamini_basic.q (Nishant Bangarwa, reviewed by Jesus Camacho Rodriguez)

HIVE-20521

HS2 doAs=true has permission issue with hadoop.tmp.dir, with MR and S3A filesystem (Thejas Nair, reviewed by Zoltan Haindrich)

HIVE-20510

Vectorization: Support loading bucketed tables using sorted dynamic partition optimizer (Deepak Jaiswal, reviewed by Gopal Vijayarahavan, Matt Mccline, and Thejas Nair)

HIVE-20508

Hive does not support user names of type “user@realm” (Deepak Jaiswal, reviewed by Thejas Nair)

HIVE-20522

HiveFilterSetOpTransposeRule may throw assertion error due to nullability of fields (Jesus Camacho Rodriguez, reviewed by Ashutosh Chauhan)

HIVE-20515

Empty query results when using results cache and query temp dir, results cache dir in different filesystems (Jason Dere via Thejas Nair)

HIVE-20499

GetTablesOperation pull all the tables meta irrespective of auth. (Rajkumar Singh via Ashutosh Chauhan)

HIVE-19993

Using a table alias which also appears as a column name is not possible (Jaume M via Zoltan Haindrich)

HIVE-20476

CopyUtils used by REPL LOAD and EXPORT/IMPORT operations ignore distcp error (Sankar Hariappan, reviewed by Mahesh Kumar Behera, Thejas M Nair)

HIVE-20496

Vectorization: Vectorized PTF IllegalStateException (Matt McCline, reviewed by Teddy Choi)

HIVE-20433

Implicit String to Timestamp conversion is slow (Deepak Jaiswal, reviewed by Gopal Vijayaraghavan)

HIVE-20439

addendum

HIVE-20352

Vectorization: Support grouping function (Matt McCline, reviewed by Teddy Choi)

HIVE-20399

CTAS w/a custom table location that is not fully qualified fails for MM tables (Sergey Shelukhin, reviewed by Ashutosh Chauhan)

HIVE-20409

Hive ACID: Update/delete/merge does not clean hdfs staging directory (Rajkumar Singh, reviewed by Vineet Garg)

HIVE-18624

Parsing time is extremely high (~10 min) for queries with complex select expressions (Zoltan Haindrich reviewed by Ashutosh Chauhan)

HIVE-20406

Addendum patch

HIVE-20383

Invalid queue name and synchronisation issues in hive proto events hook.(Harish JP, reviewd by Anishek Agarwal)

HIVE-14898

HS2 shouldn’t log callstack for an empty auth header error

HIVE-20389

NPE in SessionStateUserAuthenticator when authenticator=SessionStateUserAuthenticator (Daniel Dai, reviewed by Thejas Nair)

HIVE-20345

Drop database may hang if the tables get deleted from a different call (Aihua Xu, reviewed by Naveen Gangam)

HIVE-19316

StatsTask fails due to ClassCastException (Jaume Marhuenda, reviewed by Jesus Camacho Rodriguez)

HIVE-20361

ReplDumpTaskTest is failing (Sankar Hariappan, reviewed by Jesus Camacho Rodriguez)

HIVE-20353

Follow redirects when hive connects to a passive druid overlord/coordinator (Nishant Bangarwa via Ashutosh Chauhan)

HIVE-20336

Masking and filtering policies for materialized views (Jesus Camacho Rodriguez, reviewed by Ashutosh Chauhan)

HIVE-20326

Create constraints with RELY as default instead of NO RELY (Vineet Garg, reviewed by Jesus Camacho Rodriguez)

HIVE-20118

SessionStateUserAuthenticator.getGroupNames() is always empty (Daniel Dai, reviewed by Thejas Nair)

HIVE-20290

Lazy initialize ArrowColumnarBatchSerDe so it doesn’t allocate buffers during GetSplits (Eric Wohlstadter, reviewed by Jason Dere)

HIVE-20301

Enable vectorization for materialized view rewriting tests (Jesus Camacho Rodriguez, reviewed by Ashutosh Chauhan)

HIVE-20302

LLAP: non-vectorized execution in IO ignores virtual columns, including ROW__ID (Jesus Camacho Rodriguez, reviewed by Sergey Shelukhin)

HIVE-20299

potential race in LLAP signer unit test (Sergey Shelukhin, reviewed by Prasanth Jayachandran)

HIVE-20281

SharedWorkOptimizer fails with ‘operator cache contents and actual plan differ’ (Jesus Camacho Rodriguez, reviewed by Ashutosh Chauhan)

HIVE-20101

BloomKFilter: Avoid using the local byte[] arrays entirely (Gopal V, reviewed by Prasanth Jayachandran)

HIVE-20263

Typo in HiveReduceExpressionsWithStatsRule variable (Jesus Camacho Rodriguez, reviewed by Ashutosh Chauhan)

HIVE-20212

Hiveserver2 in http mode emitting metric default.General.open_connections incorrectly (Jesus Camacho Rodriguez, reviewed by Zoltan Haindrich)

HIVE-20228

configure repl configuration directories based on user running hiveserver2 (Anishek Agarwal, reviewed by Sankar Hariappan)

HIVE-19935

Hive WM session killed: Failed to update LLAP tasks count (Sergey Shelukhin, reviewed by Prasanth Jayachandran)

HIVE-20192

HS2 with embedded metastore is leaking JDOPersistenceManager objects (Sankar Hariappan, reviewed by Vihang Karajgaonkar)

HIVE-20174

Vectorization: Fix NULL / Wrong Results issues in GROUP BY Aggregation Functions (Matt McCline, reviewed by Teddy Choi)

HIVE-20127

fix some issues with LLAP Parquet cache (Sergey Shelukhin, reviewed by Prasanth Jayachandran)

HIVE-20149

TestHiveCli failing/timing out(Vineet Garg, reviewed by Zoltan Haindrich)

HIVE-20183

Inserting from bucketed table can cause data loss, if the source table contains empty bucket (Peter Vary, reviewed by Naveen Gangam)

HIVE-20172

StatsUpdater failed with GSS Exception while trying to connect to remote metastore (Rajkumar Singh via Ashutosh Chauhan)

HIVE-20165

Enable ZLIB for streaming ingest (Prasanth Jayachandran, via Gopal V)

HIVE-20116

TezTask is using parent logger (Prasanth Jayachandran reviewed by Sergey Shelukhin)

HIVE-20152

reset db state, when repl dump fails, so rename table can be done (Anishek Agarwal, reviewed by Sankar Hariappan)

HIVE-20185

Backport HIVE -20111 to branch-3 (Daniel Voros via Zoltan Haindrich)

HIVE-20085

Druid-Hive (managed) table creation fails with strict managed table checks: Table is marked as a managed table but is not transactional (Nishant Bangarwa via Ashutosh Chauhan)

HIVE-19981

Managed tables converted to external tables by the HiveStrictManagedMigration utility should be set to delete data when the table is dropped (Jason Dere, reviewed by Daniel Dai)

HIVE-20147

Hive streaming ingest is contented on synchronized logging (Prasanth Jayachandran reviewed by Sergey Shelukhin)

HIVE-19387

Truncate table for Acid tables conflicts with ResultSet cache (Eugene Koifman, reviewed by Jason Dere)

HIVE-20088

Beeline config location path is assembled incorrectly (Denes Bodo via Zoltan Haindrich)

HIVE-20129

Revert to position based schema evolution for orc tables (Prasanth Jayachandran reviewed by Gunther Hagleitner)

HIVE-20126

OrcInputFormat does not pass conf to orc reader options (Prasanth Jayachandran reviewed by Sergey Shelukhin)

HIVE-19970

Replication dump has a NPE when table is empty (Mahesh Kumar Behera, reviewed by Peter Vary, Sankar Hariappan)

HIVE-20103

WM: Only Aggregate DAG counters if at least one is used (Gopal V, reviewed by Prasanth Jayachandran)

HIVE-19812

Disable external table replication by default via a configuration property (Mahesh Kumar Behera, reviewed by Sankar Hariappan)

HIVE-20182

Backport HIVE -20067 to branch-3 (Daniel Voros via Zoltan Haindrich)

HIVE-20025

Clean-up of event files created by HiveProtoLoggingHook (Sankar Hariappan, reviewed by Harish Jaiprakash, Anishek Agarwal)

HIVE-20039

Bucket pruning: Left Outer Join on bucketed table gives wrong result (Deepak Jaiswal, reviewed by Gopal V)

HIVE-19850

Dynamic partition pruning in Tez is leading to ‘No work found for tablescan’ error (Ganesha Shreedhara via Ashutosh Chauhan)

HIVE-19860

HiveServer2 ObjectInspectorFactory memory leak with cachedUnionStructObjectInspector (Rajkumar Singh)

HIVE-19326

stats auto gather: incorrect aggregation during UNION queries (may lead to incorrect results) (Zoltan Haindrich via Ashutosh Chauhan)

HIVE-19995

Aggregate row traffic for acid tables (Zoltan Haindrich reviewed by Ashutosh Chauhan)

HIVE-20051

Skip authorization for temp tables (Zoltan Haindrich reviewed by Igor Kryvenko, Jason Dere)

HIVE-17840

HiveMetaStore eats exception if transactionalListeners.notifyEvent fail (Sankar Hariappan, reviewed by Mahesh Kumar Behera, Daniel Dai)

HIVE-20059

Hive streaming should try shade prefix unconditionally on exception (Prasanth Jayachandran reviewed by Vineet Garg)

HIVE-20038

Update queries on non-bucketed + partitioned tables throws NPE (Prasanth Jayachandran reviewed by Gopal V)

HIVE-20028

Metastore client cache config is used incorrectly (Prasanth Jayachandran reviewed by Sergey Shelukhin)

HIVE-19967

SMB Join: Need Optraits for PTFOperator ala GBY Op (Deepak Jaiswal, reviewed by Jason Dere)

HIVE-20004

Wrong scale used by ConvertDecimal64ToDecimal results in incorrect results (addendum)

HIVE-20004

Wrong scale used by ConvertDecimal64ToDecimal results in incorrect results (Prasanth Jayachandran reviewed by Matt McCline)

HIVE-19432

GetTablesOperation is too slow if the hive has too many databases and tables (Rajkumar Singh via Ashutosh Chauhan)

HIVE-19285

Add logs to the subclasses of MetaDataOperation (Marta Kuczora, via Peter Vary)

HIVE-20009

Fix runtime stats for merge statement (Zoltan Haindrich via Ashutosh Chauhan)

HIVE-19989

Metastore uses wrong application name for HADOOP2 metrics (Vineet Garg, reviewed by Alan Gates)

HIVE-20011

Move away from append mode in proto logging hook (Harish JP, reviewd by Anishek Agarwal)

HIVE-19564

Vectorization: Fix NULL / Wrong Results issues in Arithmetic (Matt McCline, reviewed by Teddy Choi)

HIVE-20180

Backport HIVE -19759 to branch-3 (Daniel Voros via Zoltan Haindrich)

HIVE-6980

rop table by using direct sql (Peter Vary, reviewed by Alexander Kolbasov, Vihang Karajgaonkar)

HIVE-19628

possible NPE in LLAP testSigning (Sergey Shelukhin, reviewed by Jason Dere)

HIVE-19979

Backport HIVE -19304 to branch-3 (Daniel Voros via Zoltan Haindrich)

HIVE-19313

TestJdbcWithDBTokenStoreNoDoAs tests are failing (Peter Vary via Ashutosh Chauhan)

HIVE-18986

Table rename will run java.lang.StackOverflowError in dataNucleus if the table contains large number of columns (Aihua Xu, reviewed by Yongzhi Chen)

HIVE-19104

When test MetaStore is started with retry the instances should be independent (Peter Vary, reviewed by Sahil Takiar)

HIVE-21177

ACID:Optimize AcidUtils.getLogicalLength()

HIVE-21660

Wrong result when union all and later view with explode is used (Ganesha Shreedhara via Jesus Camacho Rodriguez)

HIVE-22891

To Skip PartitionDesc Extraction For Non-LLAP Execution Mode (Syed Shameerur Rahman, reviewed by Ádám Szita)