Backported Issues from Open-source Hive¶
This table contains the list of issues backported from open-source Hive.
Open Source JIRA ID | Description |
---|---|
HIVE-20660 | Group by statistics estimation could be improved by bounding the total number of rows to source table (Vineet Garg, reviewed by Ashutosh Chauhan) |
HIVE-20702 | Account for overhead from datastructure aware estimations during mapjoin selection (Jesus Camacho Rodriguez, reviewed by Zoltan Haindrich) |
HIVE-22243 | Align Apache Thrift version to 0.9.3-1 in standalone-metastore as well (David Lavati via László Bodor) |
HIVE-21173 | Upgrade Apache Thrift to 0.9.3-1 (David Lavati via Laszlo Bodor) |
HIVE-20953 | Remove a function from function registry when it can not be added to the metastore when creating it. (Ashutosh Bapat, reviewed by Mahesh Kumar Behera) |
HIVE-20762 | NOTIFICATION_LOG cleanup interval is hardcoded as 60s and is too small (Sankar Hariappan, reviewed by Thejas Nair) |
HIVE-20761 | Select for update on notification_sequence table has retry interval and retries count too small (Sankar Hariappan, reviewed by Thejas M Nair) |
HIVE-20646 | Partition filter condition is not pushed down to metastore query if it has IS NOT NULL (Sankar Hariappan, reviewed by Daniel Dai) |
HIVE-21924 | Split text files even if header/footer exists (Mustafa Iman, reviewed by Sankar Hariappan) |
HIVE-22134 | HIVE-22129: Remove glassfish.jersey and mssql-jdbc classes from jdbc-standalone jar (Naveen Gangam, reviewed by Thejas Nair |
HIVE-21915 | Hive with TEZ UNION ALL and UDTF results in data loss (Wei Zhang, reviewed by Vineet Garg) |
HIVE-21799 | NullPointerException in DynamicPartitionPruningOptimization, when join key is on aggregation column (Jason Dere, reviewed by Vineet Garg) |
HIVE-21182 | Skip setting up hive scratch dir during planning (Vineet Garg, reviewed by Ashutosh Chauhan) |
HIVE-20804 | Further improvements to group by optimization with constraints (Vineet Garg, reviewed by Jesus Camacho Rodriguez) |
HIVE-20703 | Put dynamic sort partition optimization under cost based decision (Vineet Garg, reviewed by Prasanth Jayachandran, Sergey Shelukhin) |
HIVE-20918 | Flag to enable/disable pushdown of computation from Calcite into JDBC connection (Jesus Camacho Rodriguez, reviewed by Thejas M Nair) |
HIVE-20744 | Use SQL constraints to improve join reordering algorithm (Jesus Camacho Rodriguez, reviewed by Ashutosh Chauhan) |
HIVE-20976 | JDBC queries containing joins gives wrong results (Jesus Camacho Rodriguez, reviewed by Ashutosh Chauhan) |
HIVE-20830 | JdbcStorageHandler range query assertion failure in some cases (Daniel Dai, reviewed by Thejas Nair) |
HIVE-20829 | JdbcStorageHandler range split throws NPE (Daniel Dai, reviewed by Thejas Nair) |
HIVE-20720 | Add partition column option to JDBC handler (Daniel Dai, reviewed by Jesus Camacho Rodriguez) |
HIVE-20651 | JdbcStorageHandler password should be encrypted (Daniel Dai, reviewed by Sankar Hariappan) |
HIVE-20652 | JdbcStorageHandler push join of two different datasource to jdbc driver (Jesus Camacho Rodriguez, reviewed by Ashutosh Chauhan) |
HIVE-20095 | Fix feature to push computation to jdbc external tables (Jonathan Doron, reviewed by Jesus Camacho Rodriguez) |
HIVE-20718 | Add perf cli driver with constraints (Jesus Camacho Rodriguez, reviewed by Ashutosh Chauhan) |
HIVE-18862 | qfiles: prepare .q files for using datasets (Laszlo Bodor via Zoltan Haindrich) Signed-off-by: Zoltan Haindrich <kirk@rxd.hu> (cherry picked from commit 86b678f502bc6bb86c2279ef1288627bd8e7421a) |
HIVE-17040 | Join elimination in the presence of FK relationship (Jesus Camacho Rodriguez, reviewed by Vineet Garg) |
HIVE-20213 | Upgrade Calcite to 1.17.0 (Jesus Camacho Rodriguez, reviewed by Ashutosh Chauhan) |
HIVE-22275 | OperationManager.queryIdOperation does not properly clean up multiple queryIds (Jason Dere, reviewed by Prasanth Jayachandran) |
HIVE-22221 | Llap external client- Need to reduce LlapBaseInputFormat#getSplits() footprint (Shubham Chaurasia, reviewed by Jason Dere) |
HIVE-21641 | Llap external client returns decimal columns in different precision/scale as compared to beeline (Shubham Chaurasia, via Gopal V) |
HIVE-21041 | NPE, ParseException in getting schema from logical plan (Teddy Choi, reviewed by Jesus Camacho Rodriguez) |
HIVE-21126 | Allow session level queries in LlapBaseInputFormat#getSplits() before actual get_splits() call (Shubham Chaurasia, reviewed by Teddy Choi) |
HIVE-20552 | Get Schema from LogicalPlan faster (Teddy Choi, reviewed by Jesus Camacho Rodriguez) |
HIVE-21270 | A UDTF to show schema (column names and types) of given query (Shubham Chaurasia, reviewed by Mani M, Sankar Hariappan) |
HIVE-21864 | LlapBaseInputFormat#closeAll() throws ConcurrentModificationException (Shubham Chaurasia, reviewed by Jason Dere) |
HIVE-22231 | Hive query with big size via knox fails with Broken pipe Write failed (Denys Kuzmenko via Peter Vary) |
HIVE-22219 | Bringing a node manager down blocks restart of LLAP service (Jesus Camacho Rodriguez, reviewed by Slim Bouguerra) |
HIVE-22204 | Beeline option to show/not show execution report (Jesus Camacho Rodriguez, reviewed by Ashutosh Chauhan) |
HIVE-21866 | LLAP status service driver may get stuck with wrong Yarn app ID (Adam Szita, reviewed by Marta Kuczoram) |
HIVE-21009 | Adding ability for user to set bind user (David McGinnis reviewed by Prasanth Jayachandran) (addendum) |
HIVE-21009 | Adding ability for user to set bind user (David McGinnis reviewed by Prasanth Jayachandran) |
HIVE-20767 | Multiple project between join operators may affect join reordering using constraints (Jesus Camacho Rodriguez, reviewed by Ashutosh Chauhan) |
HIVE-20320 | Turn on hive.optimize.remove.sq_count_check flag (Vineet Garg, reviewed by Ashutosh Chauhan) |
HIVE-20292 | Bad join ordering in tpcds query93 with primary constraint defined (Vineet Garg, reviewed by Ashutosh Chauhan) |
HIVE-22169 | Tez: SplitGenerator tries to look for plan files which won’t exist for Tez (Gopal V via Vineet Garg) |
HIVE-22121 | Turning on hive.tez.bucket.pruning produce wrong results (Vineet Garg, reviewed by Gopal V) |
HIVE-22115 | Prevent the creation of query routing appender if property is set to false (Slim Bouguerra reviewed by Gopal V) |
HIVE-21171 | Skip creating scratch dirs for tez if RPC is on (Vineet Garg, reviewed by Ashutosh Chauhan) |
HIVE-20940 | Bridge cases in which Calcite’s type resolution is more stricter than Hive. (Zoltan Haindrich reviewed by Ashutosh Chauhan) (addendum) |
HIVE-20940 | Bridge cases in which Calcite’s type resolution is more stricter than Hive. (Zoltan Haindrich reviewed by Ashutosh Chauhan) |
HIVE-22241 | Implement UDF to interpret date/timestamp using its internal representation and Gregorian-Julian hybrid calendar (Jesus Camacho Rodriguez, reviewed by Adam Szita) |
HIVE-22106 | Remove cross-query synchronization for the partition-eval (Slim B via Gopal V) |
HIVE-22168 | Remove very expensive logging from the llap cache hotpath (Slim B via Jesus Camacho Rodriguez) |
HIVE-22161 | UDF: FunctionRegistry synchronizes on org.apache.hadoop.hive.ql.udf.UDFType class (Gopal V, reviewed by Ashutosh Chauhan) |
HIVE-22120 | Fix wrong results/ArrayOutOfBound exception in left outer map joins on specific boundary conditions (Ramesh Kumar Thangarajan, reviewed by Jason Dere) |
HIVE-19661 | switch Hive UDFs to use Re2J regex engine (Rajkumar Singh via Ashutosh Chauhan) |
HIVE-20873 | Use Murmur hash for VectorHashKeyWrapperTwoLong to reduce hash collision (Teddy Choi, reviewed by Gopal V) |
HIVE-20617 | Fix type of constants in IN expressions to have correct type (Zoltan Haindrich via Jesus Camacho Rodriguez) |
HIVE-20193 | cboInfo is not present in the explain plan json (Anishek Agarwal, reviewed by Ashutosh Chauhan) |
HIVE-20684 | Make compute stats work for Druid tables (Nishant Bangarwa via Ashutosh Chauhan) |
HIVE-20636 | Improve number of null values estimation after outer join (Jesus Camacho Rodriguez, reviewed by Ashutosh Chauhan) |
HIVE-20366 | TPC-DS query78 stats estimates are off for is null filter(Vineet Garg, reviewed by Ashutosh Chauhan) |
HIVE-19992 | Vectorization: Follow-on to HIVE -19951 –> add call to SchemaEvolution.isOnlyImplicitConversion to disable encoded LLAP I/O for ORC only when data type conversion is not implicit (Matt McCline, reviewed by Prasanth Jayachandran) |
HIVE-20988 | Wrong results for group by queries with primary key on multiple columns (Vineet Garg, reviewed by Jesus Camacho Rodriguez) |
HIVE-20985 | If select operator inputs are temporary columns vectorization may reuse some of them as output (Zoltan Haindrich reviewed by Teddy Choi) |
HIVE-20954 | Vector RS operator is not using uniform hash function for TPC-DS query 95 (Teddy Choi, reviewed by Gopal V) |
HIVE-20775 | Factor cost of each SJ reduction when costing a follow-up reduction (Jesus Camacho Rodriguez, reviewed by Ashutosh Chauhan) |
HIVE-20788 | Extended SJ reduction may backtrack columns incorrectly when creating filters (Jesus Camacho Rodriguez, reviewed by Deepak Jaiswal) |
HIVE-17043 | Remove non unique columns from group by keys if not referenced later (Vineet Garg, reviewed by Jesus Camacho Rodriguez) |
HIVE-20712 | HivePointLookupOptimizer should extract deep cases (Zoltan Haindrich reviewed by Jesus Camacho Rodriguez) |
HIVE-20563 | Vectorization: CASE WHEN expression fails when THEN/ELSE type and result type are different (Matt McCline, reviewed by Teddy Choi) |
HIVE-20609 | Create SSD cache dir if it doesnt exist already (Miklos Gergely via Sergey Shelukhin) |
HIVE-20657 | pre-allocate LLAP cache at init time (Sergey Shelukhin, reviewed by Prasanth Jayachandran) |
HIVE-20540 | Vectorization : Support loading bucketed tables using sorted dynamic partition optimizer- II (Deepak Jaiswal, reviewed by Prasanth Jayachandran) |
HIVE-20296 | Improve HivePointLookupOptimizerRule to be able to extract from more sophisticated contexts (Zoltan Haindrich reviewed by Ashutosh Chauhan) |
HIVE-20513 | Vectorization: Improve Fast Vector MapJoin Bytes Hash Tables (Matt McCline, reviewed by Zoltan Haindrich) |
HIVE-20013 | Add an Implicit cast to date type for to_date function (Nishant Bangarwa via Ashutosh Chauhan) |
HIVE-20315 | Vectorization: Fix more NULL / Wrong Results issues and avoid unnecessary casts/conversions (Matt McCline, reviewed by Teddy Choi) |
HIVE-20339 | Vectorization: Lift unneeded restriction causing some PTF with RANK not to be vectorized (Matt McCline, reviewed by Teddy Choi) |
HIVE-20367 | Vectorization: Support streaming for PTF AVG, MAX, MIN, SUM (Matt McCline, reviewed by Teddy Choi) |
HIVE-20418 | LLAP IO may not handle ORC files that have row index disabled correctly for queries with no columns selected (Sergey Shelukhin, reviewed by Gopal Vijayaraghavan) |
HIVE-20321 | Vectorization: Cut down memory size of 1 col VectorHashKeyWrapper to <1 CacheLine (Matt McCline, reviewed by Gopal Vijayaraghavan) |
HIVE-20177 | Vectorization: Reduce KeyWrapper allocation in GroupBy Streaming mode (Gopal V, reviewed by Matt McCline) |
HIVE-20300 | VectorFileSinkArrowOperator (Eric Wohlstadter, reviewed by Jason Dere, Matt McCline, Teddy Choi) |
HIVE-20277 | Vectorization: Case expressions that return BOOLEAN are not supported for FILTER (Matt McCline, reviewed by Gopal Vijayaraghavan) |
HIVE-20294 | Vectorization: Fix NULL / Wrong Results issues in COALESCE / ELT (Matt McCline, reviewed by Teddy Choi) |
HIVE-20252 | Semijoin Reduction : Cycles due to semi join branch may remain undetected if small table side has a map join upstream.(Deepak Jaiswal, reviewed by Jesus Camacho Rodriguez) |
HIVE-18201 | Disable XPROD_EDGE for sq_count_check() created for scalar subqueries (Ashutosh Chauhan via Jesus Camacho Rodriguez) |
HIVE-20244 | forward port HIVE -19704 to master (Sergey Shelukhin, reviewed by Prasanth Jayachandran) |
HIVE-19663 | refactor LLAP IO report generation (Sergey Shelukhin, reviewed by Prasanth Jayachandran) |
HIVE-20090 | Extend creation of semijoin reduction filters to be able to discover new opportunities (Jesus Camacho Rodriguez via Deepak Jaiswal) |
HIVE-20069 | Fix reoptimization in case of DPP and Semijoin optimization (Zoltan Haindrich reviewed by Ashutosh Chauhan, Zoltan Haindrich) |
HIVE-17896 | TopNKey: Create a standalone vectorizable TopNKey operator (Teddy Choi, reviewed by Jesus Camacho Rodriguez) |
HIVE-20245 | Vectorization: Fix NULL / Wrong Results issues in BETWEEN / IN (Matt McCline, reviewed by Teddy Choi) |
HIVE-20207 | Vectorization: Fix NULL / Wrong Results issues in Filter / Compare (Matt McCline, reviewed by Teddy Choi) |
HIVE-20203 | Arrow SerDe leaks a DirectByteBuffer (Eric Wohlstadter, reviewed by Teddy Choi) |
HIVE-20082 | HiveDecimal to string conversion doesn’t format the decimal correctly (Jason Dere, reviewed by Ashutosh Chauhan) |
HIVE-19891 | inserting into external tables with custom partition directories may cause data loss (Sergey Shelukhin, reviewed by Ashutosh Chauhan) |
HIVE-20204 | Type conversion during IN () comparisons is using different rules from other comparison operations (Jason Dere, reviewed by Ashutosh Chauhan) |
HIVE-19360 | CBO: Add an “optimizedSQL” to QueryPlan object (Gopal V, reviewed by Jesus Camacho Rodriguez) |
HIVE-20197 | Vectorization: Add DECIMAL_64 testing, add Date/Interval/Timestamp arithmetic, and add more GROUP BY Aggregation tests (Matt McCline, reviewed by Teddy Choi) |
HIVE-20498 | Support date type for column stats autogather (Zoltan Haindrich reviewed by Prasanth Jayachandran) |
HIVE-20537 | Multi-column joins estimates with uncorrelated columns different in CBO and Hive (Jesus Camacho Rodriguez, reviewed by Ashutosh Chauhan) |
HIVE-20503 | Use datastructure aware estimations during mapjoin selection (Zoltan Haindrich reviewed by Ashutosh Chauhan) |
HIVE-20432 | Rewrite BETWEEN to IN for integer types for stats estimation(Vineet Garg, reviewed by Ashutosh Chauhan) |
HIVE-20491 | Fix mapjoin size estimations for Fast implementation (Zoltan Haindrich reviewed by Ashutosh Chauhan) |
HIVE-20821 | Rewrite SUM0 into SUM + COALESCE combination (Jesus Camacho Rodriguez, reviewed by Gopal V) |
HIVE-20391 | HiveAggregateReduceFunctionsRule may infer wrong return type when decomposing aggregate function (Jesus Camacho Rodriguez, reviewed by Ashutosh Chauhan) |
HIVE-14431 | Recognize COALESCE as CASE (Remus Rusanu, Jesus Camacho Rodriguez via Ashutosh Chauhan, Zoltan Haindrich) |
HIVE-17683 | Add explain locks <sql> command (Igor Kryvenko via Eugene Koifman) |
HIVE-20898 | For time related functions arguments may not be casted to a non nullable type (Zoltan Haindrich reviewed by Jesus Camacho Rodriguez) |
HIVE-20704 | Extend HivePreFilteringRule to support other functions (Jesus Camacho Rodriguez, reviewed by Ashutosh Chauhan) |
HIVE-20710 | Constant folding may not create null constants without types (Zoltan Haindrich reviewed by Ashutosh Chauhan) |
HIVE-18871 | hive on tez execution error due to set hive.aux.jars.path to hdfs:// (zhuwei reviewed by Prasanth Jayachandran) |
HIVE-20612 | Create new join multi-key correlation flag for CBO (Jesus Camacho Rodriguez, reviewed by Ashutosh Chauhan) |
HIVE-20013 | Add an Implicit cast to date type for to_date function (Nishant Bangarwa via Ashutosh Chauhan) |
HIVE-20807 | Refactor LlapStatusServiceDriver (Miklos Gergely via Sergey Shelukhin) |
HIVE-20019 | Ban commons-logging and log4j (Prasanth Jayachandran reviewed by Ashutosh Chauhan) |
HIVE-20099 | Fix logger for LlapServlet (Rajkumar Singh via Gopal V) |
HIVE-20752 | In case of LLAP start failure add info how to find YARN logs (Miklos Gergely via Ashutosh Chauhan) |
HIVE-20676 | HiveServer2: PrivilegeSynchronizer is not set to daemon status (Vaibhav Gumashta via Thejas Nair) |
HIVE-20455 | Log spew from security.authorization.PrivilegeSynchonizer.run (Daniel Dai, reviewed by Gopal V) |
HIVE-20344 | PrivilegeSynchronizer for SBA might hit AccessControlException (Daniel Dai, reviewed by Vaibhav Gumashta) |
HIVE-20130 | Better logging for information schema synchronizer (Daniel Dai, reviewed by Vaibhav Gumashta) |
HIVE-20853 | Expose ShuffleHandler.registerDag in the llap daemon API (Jaume Marhuenda, reviewed by Sergey Shelukhin) |
HIVE-20772 | record per-task CPU counters in LLAP (Sergey Shelukhin, reviewed by Prasanth Jayachandran) |
HIVE-19568 | Active/Passive HS2 HA: Disallow direct connection to passive HS2 instance (Sergey Shelukhin, reviewed by Prasanth Jayachandran) |
HIVE-19097 | related equals and in operators may cause inaccurate stats estimations (Zoltan Haindrich via Ashutosh Chauhan) |
HIVE-20951 | LLAP: Set Xms to 50% always (Gopal V via Ashutosh Chauhan) |
HIVE-20937 | Postgres jdbc query fail with “LIMIT must not be negative” (Daniel Dai, reviewed by Thejas Nair) |
HIVE-20910 | Insert in bucketed table fails due to dynamic partition sort optimization (Vineet Garg via Ashutosh Chauhan) |
HIVE-19701 | getDelegationTokenFromMetaStore doesn’t need to be synchronized (Sankar Hariappan, reviewed by Thejas M Nair) |
HIVE-20899 | Keytab URI for LLAP YARN Service is restrictive to support HDFS only (Gour Saha reviewd by Prasanth Jayachandran) |
HIVE-20881 | Constant propagation oversimplifies projections (Zoltan Haindrich reviewed by Ashutosh Chauhan) |
HIVE-20817 | Reading Timestamp datatype via HiveServer2 gives errors (Mahesh Kumar Behera, reviewed by Thejas M Nair) |
HIVE-20834 | Hive QueryResultCache entries keeping reference to SemanticAnalyzer from cached query (Jason Dere, reviewed by GopalV) |
HIVE-20815 | JdbcRecordReader.next shall not eat exception (Daniel Dai, reviewed by Thejas Nair) |
HIVE-20792 | Inserting timestamp with zones truncates the data (Jaume Marhuenda, reviewed by Jesus Camacho Rodriguez) |
HIVE-20638 | Upgrade version of Jetty to 9.3.25.v20180904 (Laszlo Bodor via Thejas Nair) |
HIVE-20746 | HiveProtoHookLogger does not close file at end of day. (Harish JP, reviewd by Anishek Agarwal) |
HIVE-20768 | Adding Tumbling Window UDF (Slim Bouguerra, via Gopal V) |
HIVE-20763 | Add google cloud storage (gs) to the exim uri schema whitelist (Siddharth Seth, reviewed by Prasanth Jayachandran, Sergey Shelukhin) |
HIVE-20477 | OptimizedSql is not shown if the expression contains INs (Zoltan Haindrich reviewed by Ashutosh Chauhan) |
HIVE-20649 | LLAP aware memory manager for Orc writers (Prasanth Jayachandran reviewed by Sergey Shelukhin) |
HIVE-20696 | msck_*.q tests are broken (Jesus Camacho Rodriguez, reviewed by Prasanth Jayachandran) |
HIVE-20719 | SELECT statement fails after UPDATE with hive.optimize.sort.dynamic.partition optimization and vectorization on (Eugene Koifman, reviewed by Gopal V) |
HIVE-20644 | Avoid exposing sensitive infomation through a Hive Runtime exception (Ashutosh Bapat reviewed by Thejas Nair, Sankar Hariappan) |
HIVE-20705 | Vectorization: Native Vector MapJoin doesn’t support Complex Big Table values (Matt McCline, reviewed by Jason Dere) |
HIVE-20700 | Add config to diable rollup for druid tables (Nishant Bangarwa via Ashutosh Chauhan) |
HIVE-20698 | Add better message for NPE when inserting rows with null timestamp to druid (Nishant Bangarwa via Ashutosh Chauhan) |
HIVE-20686 | Sync query IDs between druid and Hive (Nishant Bangarwa via Ashutosh Chauhan) |
HIVE-20648 | LLAP: Vector group by operator should use memory per executor |
HIVE-20692 | Enable folding of NOT x IS (NOT) [TRUE|FALSE] expressions (Zoltan Haindrich reviewed by Ashutosh Chauhan) |
HIVE-20623 | Shared work: Extend sharing of map-join cache entries in LLAP (Jesus Camacho Rodriguez, reviewed by Gopal V) |
HIVE-20691 | Fix org.apache.hadoop.hive.cli.TestMiniLlapCliDriver.testCliDriver[cttl] (Jesus Camacho Rodriguez, reviewed by Zoltan Haindrich) |
HIVE-10296 | Cast exception observed when hive runs a multi join query on metastore (postgres), since postgres pushes the filter into the join, and ignores the condition before applying cast (Karthik Manamcheri, reviewed by Sergey Shelukhin) |
HIVE-20618 | During join selection BucketMapJoin might be choosen for non bucketed tables (Zoltan Haindrich reviewed by Ashutosh Chauhan, Deepak Jaiswal) |
HIVE-20593 | Load Data for partitioned ACID tables fails with bucketId out of range: -1 (Deepak Jaiswal, reviewed by Eugene Koifman) |
HIVE-20620 | manifest collisions when inserting into bucketed sorted MM tables with dynamic partitioning (Sergey Shelukhin, reviewed by Deepak Jaiswal) |
HIVE-20626 | Log more details when druid metastore transaction fails in callback (Nishant Bangarwa via Ashutosh Chauhan) |
HIVE-20267 | Expanding WebUI to include form to dynamically config log levels (Zoltan Chovan reviewed by Prasanth Jayachandran) |
HIVE-20349 | Implement Retry Logic in HiveDruidSplit for Scan Queries (Nishant Bangarwa via Ashutosh Chauhan) |
HIVE-20582 | Make hflush in hive proto logging configurable (Prasanth Jayachandran reviewed by Thejas M Nair) |
HIVE-20583 | Use canonical hostname only for kerberos auth in HiveConnection (Prasanth Jayachandran reviewed by Gopal V) |
HIVE-20558 | Change default of hive.hashtable.key.count.adjustment to 0.99 |
HIVE-20524 | Schema Evolution checking is broken in going from Hive version 2 to version 3 for ALTER TABLE VARCHAR to DECIMAL |
HIVE-20541 | REPL DUMP on external table with add partition event throws NoSuchElementException (Sankar Hariappan, reviewed by Anishek Agarwal) |
HIVE-20412 | NPE in HiveMetaHook (Jason Dere, reviewed by Sergey Shelukhin) |
HIVE-19552 | Enable TestMiniDruidKafkaCliDriver#druidkafkamini_basic.q (Nishant Bangarwa, reviewed by Jesus Camacho Rodriguez) |
HIVE-20521 | HS2 doAs=true has permission issue with hadoop.tmp.dir, with MR and S3A filesystem (Thejas Nair, reviewed by Zoltan Haindrich) |
HIVE-20510 | Vectorization: Support loading bucketed tables using sorted dynamic partition optimizer (Deepak Jaiswal, reviewed by Gopal Vijayarahavan, Matt Mccline, and Thejas Nair) |
HIVE-20508 | Hive does not support user names of type “user@realm” (Deepak Jaiswal, reviewed by Thejas Nair) |
HIVE-20522 | HiveFilterSetOpTransposeRule may throw assertion error due to nullability of fields (Jesus Camacho Rodriguez, reviewed by Ashutosh Chauhan) |
HIVE-20515 | Empty query results when using results cache and query temp dir, results cache dir in different filesystems (Jason Dere via Thejas Nair) |
HIVE-20499 | GetTablesOperation pull all the tables meta irrespective of auth. (Rajkumar Singh via Ashutosh Chauhan) |
HIVE-19993 | Using a table alias which also appears as a column name is not possible (Jaume M via Zoltan Haindrich) |
HIVE-20476 | CopyUtils used by REPL LOAD and EXPORT/IMPORT operations ignore distcp error (Sankar Hariappan, reviewed by Mahesh Kumar Behera, Thejas M Nair) |
HIVE-20496 | Vectorization: Vectorized PTF IllegalStateException (Matt McCline, reviewed by Teddy Choi) |
HIVE-20433 | Implicit String to Timestamp conversion is slow (Deepak Jaiswal, reviewed by Gopal Vijayaraghavan) |
HIVE-20439 | addendum |
HIVE-20352 | Vectorization: Support grouping function (Matt McCline, reviewed by Teddy Choi) |
HIVE-20399 | CTAS w/a custom table location that is not fully qualified fails for MM tables (Sergey Shelukhin, reviewed by Ashutosh Chauhan) |
HIVE-20409 | Hive ACID: Update/delete/merge does not clean hdfs staging directory (Rajkumar Singh, reviewed by Vineet Garg) |
HIVE-18624 | Parsing time is extremely high (~10 min) for queries with complex select expressions (Zoltan Haindrich reviewed by Ashutosh Chauhan) |
HIVE-20406 | Addendum patch |
HIVE-20383 | Invalid queue name and synchronisation issues in hive proto events hook.(Harish JP, reviewd by Anishek Agarwal) |
HIVE-14898 | HS2 shouldn’t log callstack for an empty auth header error |
HIVE-20389 | NPE in SessionStateUserAuthenticator when authenticator=SessionStateUserAuthenticator (Daniel Dai, reviewed by Thejas Nair) |
HIVE-20345 | Drop database may hang if the tables get deleted from a different call (Aihua Xu, reviewed by Naveen Gangam) |
HIVE-19316 | StatsTask fails due to ClassCastException (Jaume Marhuenda, reviewed by Jesus Camacho Rodriguez) |
HIVE-20361 | ReplDumpTaskTest is failing (Sankar Hariappan, reviewed by Jesus Camacho Rodriguez) |
HIVE-20353 | Follow redirects when hive connects to a passive druid overlord/coordinator (Nishant Bangarwa via Ashutosh Chauhan) |
HIVE-20336 | Masking and filtering policies for materialized views (Jesus Camacho Rodriguez, reviewed by Ashutosh Chauhan) |
HIVE-20326 | Create constraints with RELY as default instead of NO RELY (Vineet Garg, reviewed by Jesus Camacho Rodriguez) |
HIVE-20118 | SessionStateUserAuthenticator.getGroupNames() is always empty (Daniel Dai, reviewed by Thejas Nair) |
HIVE-20290 | Lazy initialize ArrowColumnarBatchSerDe so it doesn’t allocate buffers during GetSplits (Eric Wohlstadter, reviewed by Jason Dere) |
HIVE-20301 | Enable vectorization for materialized view rewriting tests (Jesus Camacho Rodriguez, reviewed by Ashutosh Chauhan) |
HIVE-20302 | LLAP: non-vectorized execution in IO ignores virtual columns, including ROW__ID (Jesus Camacho Rodriguez, reviewed by Sergey Shelukhin) |
HIVE-20299 | potential race in LLAP signer unit test (Sergey Shelukhin, reviewed by Prasanth Jayachandran) |
HIVE-20281 | SharedWorkOptimizer fails with ‘operator cache contents and actual plan differ’ (Jesus Camacho Rodriguez, reviewed by Ashutosh Chauhan) |
HIVE-20101 | BloomKFilter: Avoid using the local byte[] arrays entirely (Gopal V, reviewed by Prasanth Jayachandran) |
HIVE-20263 | Typo in HiveReduceExpressionsWithStatsRule variable (Jesus Camacho Rodriguez, reviewed by Ashutosh Chauhan) |
HIVE-20212 | Hiveserver2 in http mode emitting metric default.General.open_connections incorrectly (Jesus Camacho Rodriguez, reviewed by Zoltan Haindrich) |
HIVE-20228 | configure repl configuration directories based on user running hiveserver2 (Anishek Agarwal, reviewed by Sankar Hariappan) |
HIVE-19935 | Hive WM session killed: Failed to update LLAP tasks count (Sergey Shelukhin, reviewed by Prasanth Jayachandran) |
HIVE-20192 | HS2 with embedded metastore is leaking JDOPersistenceManager objects (Sankar Hariappan, reviewed by Vihang Karajgaonkar) |
HIVE-20174 | Vectorization: Fix NULL / Wrong Results issues in GROUP BY Aggregation Functions (Matt McCline, reviewed by Teddy Choi) |
HIVE-20127 | fix some issues with LLAP Parquet cache (Sergey Shelukhin, reviewed by Prasanth Jayachandran) |
HIVE-20149 | TestHiveCli failing/timing out(Vineet Garg, reviewed by Zoltan Haindrich) |
HIVE-20183 | Inserting from bucketed table can cause data loss, if the source table contains empty bucket (Peter Vary, reviewed by Naveen Gangam) |
HIVE-20172 | StatsUpdater failed with GSS Exception while trying to connect to remote metastore (Rajkumar Singh via Ashutosh Chauhan) |
HIVE-20165 | Enable ZLIB for streaming ingest (Prasanth Jayachandran, via Gopal V) |
HIVE-20116 | TezTask is using parent logger (Prasanth Jayachandran reviewed by Sergey Shelukhin) |
HIVE-20152 | reset db state, when repl dump fails, so rename table can be done (Anishek Agarwal, reviewed by Sankar Hariappan) |
HIVE-20185 | Backport HIVE -20111 to branch-3 (Daniel Voros via Zoltan Haindrich) |
HIVE-20085 | Druid-Hive (managed) table creation fails with strict managed table checks: Table is marked as a managed table but is not transactional (Nishant Bangarwa via Ashutosh Chauhan) |
HIVE-19981 | Managed tables converted to external tables by the HiveStrictManagedMigration utility should be set to delete data when the table is dropped (Jason Dere, reviewed by Daniel Dai) |
HIVE-20147 | Hive streaming ingest is contented on synchronized logging (Prasanth Jayachandran reviewed by Sergey Shelukhin) |
HIVE-19387 | Truncate table for Acid tables conflicts with ResultSet cache (Eugene Koifman, reviewed by Jason Dere) |
HIVE-20088 | Beeline config location path is assembled incorrectly (Denes Bodo via Zoltan Haindrich) |
HIVE-20129 | Revert to position based schema evolution for orc tables (Prasanth Jayachandran reviewed by Gunther Hagleitner) |
HIVE-20126 | OrcInputFormat does not pass conf to orc reader options (Prasanth Jayachandran reviewed by Sergey Shelukhin) |
HIVE-19970 | Replication dump has a NPE when table is empty (Mahesh Kumar Behera, reviewed by Peter Vary, Sankar Hariappan) |
HIVE-20103 | WM: Only Aggregate DAG counters if at least one is used (Gopal V, reviewed by Prasanth Jayachandran) |
HIVE-19812 | Disable external table replication by default via a configuration property (Mahesh Kumar Behera, reviewed by Sankar Hariappan) |
HIVE-20182 | Backport HIVE -20067 to branch-3 (Daniel Voros via Zoltan Haindrich) |
HIVE-20025 | Clean-up of event files created by HiveProtoLoggingHook (Sankar Hariappan, reviewed by Harish Jaiprakash, Anishek Agarwal) |
HIVE-20039 | Bucket pruning: Left Outer Join on bucketed table gives wrong result (Deepak Jaiswal, reviewed by Gopal V) |
HIVE-19850 | Dynamic partition pruning in Tez is leading to ‘No work found for tablescan’ error (Ganesha Shreedhara via Ashutosh Chauhan) |
HIVE-19860 | HiveServer2 ObjectInspectorFactory memory leak with cachedUnionStructObjectInspector (Rajkumar Singh) |
HIVE-19326 | stats auto gather: incorrect aggregation during UNION queries (may lead to incorrect results) (Zoltan Haindrich via Ashutosh Chauhan) |
HIVE-19995 | Aggregate row traffic for acid tables (Zoltan Haindrich reviewed by Ashutosh Chauhan) |
HIVE-20051 | Skip authorization for temp tables (Zoltan Haindrich reviewed by Igor Kryvenko, Jason Dere) |
HIVE-17840 | HiveMetaStore eats exception if transactionalListeners.notifyEvent fail (Sankar Hariappan, reviewed by Mahesh Kumar Behera, Daniel Dai) |
HIVE-20059 | Hive streaming should try shade prefix unconditionally on exception (Prasanth Jayachandran reviewed by Vineet Garg) |
HIVE-20038 | Update queries on non-bucketed + partitioned tables throws NPE (Prasanth Jayachandran reviewed by Gopal V) |
HIVE-20028 | Metastore client cache config is used incorrectly (Prasanth Jayachandran reviewed by Sergey Shelukhin) |
HIVE-19967 | SMB Join: Need Optraits for PTFOperator ala GBY Op (Deepak Jaiswal, reviewed by Jason Dere) |
HIVE-20004 | Wrong scale used by ConvertDecimal64ToDecimal results in incorrect results (addendum) |
HIVE-20004 | Wrong scale used by ConvertDecimal64ToDecimal results in incorrect results (Prasanth Jayachandran reviewed by Matt McCline) |
HIVE-19432 | GetTablesOperation is too slow if the hive has too many databases and tables (Rajkumar Singh via Ashutosh Chauhan) |
HIVE-19285 | Add logs to the subclasses of MetaDataOperation (Marta Kuczora, via Peter Vary) |
HIVE-20009 | Fix runtime stats for merge statement (Zoltan Haindrich via Ashutosh Chauhan) |
HIVE-19989 | Metastore uses wrong application name for HADOOP2 metrics (Vineet Garg, reviewed by Alan Gates) |
HIVE-20011 | Move away from append mode in proto logging hook (Harish JP, reviewd by Anishek Agarwal) |
HIVE-19564 | Vectorization: Fix NULL / Wrong Results issues in Arithmetic (Matt McCline, reviewed by Teddy Choi) |
HIVE-20180 | Backport HIVE -19759 to branch-3 (Daniel Voros via Zoltan Haindrich) |
HIVE-6980 | rop table by using direct sql (Peter Vary, reviewed by Alexander Kolbasov, Vihang Karajgaonkar) |
HIVE-19628 | possible NPE in LLAP testSigning (Sergey Shelukhin, reviewed by Jason Dere) |
HIVE-19979 | Backport HIVE -19304 to branch-3 (Daniel Voros via Zoltan Haindrich) |
HIVE-19313 | TestJdbcWithDBTokenStoreNoDoAs tests are failing (Peter Vary via Ashutosh Chauhan) |
HIVE-18986 | Table rename will run java.lang.StackOverflowError in dataNucleus if the table contains large number of columns (Aihua Xu, reviewed by Yongzhi Chen) |
HIVE-19104 | When test MetaStore is started with retry the instances should be independent (Peter Vary, reviewed by Sahil Takiar) |
HIVE-21177 | ACID:Optimize AcidUtils.getLogicalLength() |
HIVE-21660 | Wrong result when union all and later view with explode is used (Ganesha Shreedhara via Jesus Camacho Rodriguez) |
HIVE-22891 | To Skip PartitionDesc Extraction For Non-LLAP Execution Mode (Syed Shameerur Rahman, reviewed by Ádám Szita) |