Backported Issues from Open-source Hive
This table contains the list of issues backported from open-source Hive.
Open Source JIRA ID |
Description |
---|---|
Group by statistics estimation could be improved by bounding the total number of rows to source table (Vineet Garg, reviewed by Ashutosh Chauhan) |
|
Account for overhead from datastructure aware estimations during mapjoin selection (Jesus Camacho Rodriguez, reviewed by Zoltan Haindrich) |
|
Align Apache Thrift version to 0.9.3-1 in standalone-metastore as well (David Lavati via László Bodor) |
|
Upgrade Apache Thrift to 0.9.3-1 (David Lavati via Laszlo Bodor) |
|
Remove a function from function registry when it can not be added to the metastore when creating it. (Ashutosh Bapat, reviewed by Mahesh Kumar Behera) |
|
NOTIFICATION_LOG cleanup interval is hardcoded as 60s and is too small (Sankar Hariappan, reviewed by Thejas Nair) |
|
Select for update on notification_sequence table has retry interval and retries count too small (Sankar Hariappan, reviewed by Thejas M Nair) |
|
Partition filter condition is not pushed down to metastore query if it has IS NOT NULL (Sankar Hariappan, reviewed by Daniel Dai) |
|
Split text files even if header/footer exists (Mustafa Iman, reviewed by Sankar Hariappan) |
|
HIVE-22129: Remove glassfish.jersey and mssql-jdbc classes from jdbc-standalone jar (Naveen Gangam, reviewed by Thejas Nair) |
|
Hive with TEZ UNION ALL and UDTF results in data loss (Wei Zhang, reviewed by Vineet Garg) |
|
NullPointerException in DynamicPartitionPruningOptimization, when join key is on aggregation column (Jason Dere, reviewed by Vineet Garg) |
|
Skip setting up hive scratch dir during planning (Vineet Garg, reviewed by Ashutosh Chauhan) |
|
Further improvements to group by optimization with constraints (Vineet Garg, reviewed by Jesus Camacho Rodriguez) |
|
Put dynamic sort partition optimization under cost based decision (Vineet Garg, reviewed by Prasanth Jayachandran, Sergey Shelukhin) |
|
Flag to enable/disable pushdown of computation from Calcite into JDBC connection (Jesus Camacho Rodriguez, reviewed by Thejas M Nair) |
|
Use SQL constraints to improve join reordering algorithm (Jesus Camacho Rodriguez, reviewed by Ashutosh Chauhan) |
|
JDBC queries containing joins give wrong results (Jesus Camacho Rodriguez, reviewed by Ashutosh Chauhan) |
|
JdbcStorageHandler range query assertion failure in some cases (Daniel Dai, reviewed by Thejas Nair) |
|
JdbcStorageHandler range split throws NPE (Daniel Dai, reviewed by Thejas Nair) |
|
Add partition column option to JDBC handler (Daniel Dai, reviewed by Jesus Camacho Rodriguez) |
|
JdbcStorageHandler password should be encrypted (Daniel Dai, reviewed by Sankar Hariappan) |
|
JdbcStorageHandler push join of two different datasource to jdbc driver (Jesus Camacho Rodriguez, reviewed by Ashutosh Chauhan) |
|
Fix feature to push computation to jdbc external tables (Jonathan Doron, reviewed by Jesus Camacho Rodriguez) |
|
Add perf cli driver with constraints (Jesus Camacho Rodriguez, reviewed by Ashutosh Chauhan) |
|
qfiles: prepare .q files for using datasets (Laszlo Bodor via Zoltan Haindrich) |
|
Join elimination in the presence of FK relationship (Jesus Camacho Rodriguez, reviewed by Vineet Garg) |
|
Upgrade Calcite to 1.17.0 (Jesus Camacho Rodriguez, reviewed by Ashutosh Chauhan) |
|
OperationManager.queryIdOperation does not properly clean up multiple queryIds (Jason Dere, reviewed by Prasanth Jayachandran) |
|
Llap external client: Need to reduce LlapBaseInputFormat#getSplits() footprint (Shubham Chaurasia, reviewed by Jason Dere) |
|
Llap external client returns decimal columns in different precision/scale as compared to beeline (Shubham Chaurasia, via Gopal V) |
|
NPE, ParseException in getting schema from logical plan (Teddy Choi, reviewed by Jesus Camacho Rodriguez) |
|
Allow session level queries in LlapBaseInputFormat#getSplits() before actual get_splits() call (Shubham Chaurasia, reviewed by Teddy Choi) |
|
Get Schema from LogicalPlan faster (Teddy Choi, reviewed by Jesus Camacho Rodriguez) |
|
A UDTF to show schema (column names and types) of given query (Shubham Chaurasia, reviewed by Mani M, Sankar Hariappan) |
|
LlapBaseInputFormat#closeAll() throws ConcurrentModificationException (Shubham Chaurasia, reviewed by Jason Dere) |
|
Hive query with big size via knox fails with Broken pipe Write failed (Denys Kuzmenko via Peter Vary) |
|
Bringing a node manager down blocks restart of LLAP service (Jesus Camacho Rodriguez, reviewed by Slim Bouguerra) |
|
Beeline option to show/not show execution report (Jesus Camacho Rodriguez, reviewed by Ashutosh Chauhan) |
|
LLAP status service driver may get stuck with wrong Yarn app ID (Adam Szita, reviewed by Marta Kuczoram) |
|
Adding ability for user to set bind user (David McGinnis reviewed by Prasanth Jayachandran) (addendum) |
|
Adding ability for user to set bind user (David McGinnis reviewed by Prasanth Jayachandran) |
|
Multiple project between join operators may affect join reordering using constraints (Jesus Camacho Rodriguez, reviewed by Ashutosh Chauhan) |
|
Turn on hive.optimize.remove.sq_count_check flag (Vineet Garg, reviewed by Ashutosh Chauhan) |
|
Bad join ordering in tpcds query93 with primary constraint defined (Vineet Garg, reviewed by Ashutosh Chauhan) |
|
Tez: SplitGenerator tries to look for plan files which won’t exist for Tez (Gopal V via Vineet Garg) |
|
Turning on hive.tez.bucket.pruning produces wrong results (Vineet Garg, reviewed by Gopal V) |
|
Prevent the creation of query routing appender if property is set to false (Slim Bouguerra reviewed by Gopal V) |
|
Skip creating scratch dirs for tez if RPC is on (Vineet Garg, reviewed by Ashutosh Chauhan) |
|
Bridge cases in which Calcite’s type resolution is stricter than Hive. (Zoltan Haindrich reviewed by Ashutosh Chauhan) (addendum) |
|
Bridge cases in which Calcite’s type resolution is stricter than Hive. (Zoltan Haindrich reviewed by Ashutosh Chauhan) |
|
Implement UDF to interpret date/timestamp using its internal representation and Gregorian-Julian hybrid calendar (Jesus Camacho Rodriguez, reviewed by Adam Szita) |
|
Remove cross-query synchronization for the partition-eval (Slim B via Gopal V) |
|
Remove very expensive logging from the llap cache hotpath (Slim B via Jesus Camacho Rodriguez) |
|
UDF: FunctionRegistry synchronizes on org.apache.hadoop.hive.ql.udf.UDFType class (Gopal V, reviewed by Ashutosh Chauhan) |
|
Fix wrong results/ArrayOutOfBound exception in left outer map joins on specific boundary conditions (Ramesh Kumar Thangarajan, reviewed by Jason Dere) |
|
switch Hive UDFs to use Re2J regex engine (Rajkumar Singh via Ashutosh Chauhan) |
|
Use Murmur hash for VectorHashKeyWrapperTwoLong to reduce hash collision (Teddy Choi, reviewed by Gopal V) |
|
Fix type of constants in IN expressions to have correct type (Zoltan Haindrich via Jesus Camacho Rodriguez) |
|
cboInfo is not present in the explain plan json (Anishek Agarwal, reviewed by Ashutosh Chauhan) |
|
Make compute stats work for Druid tables (Nishant Bangarwa via Ashutosh Chauhan) |
|
Improve number of null values estimation after outer join (Jesus Camacho Rodriguez, reviewed by Ashutosh Chauhan) |
|
TPC-DS query78 stats estimates are off for is null filter (Vineet Garg, reviewed by Ashutosh Chauhan) |
|
Vectorization: Follow-on to HIVE-19951 --> add call to SchemaEvolution.isOnlyImplicitConversion to disable encoded LLAP I/O for ORC only when data type conversion is not implicit (Matt McCline, reviewed by Prasanth Jayachandran) |
|
Wrong results for group by queries with primary key on multiple columns (Vineet Garg, reviewed by Jesus Camacho Rodriguez) |
|
If select operator inputs are temporary columns vectorization may reuse some of them as output (Zoltan Haindrich reviewed by Teddy Choi) |
|
Vector RS operator is not using uniform hash function for TPC-DS query 95 (Teddy Choi, reviewed by Gopal V) |
|
Factor cost of each SJ reduction when costing a follow-up reduction (Jesus Camacho Rodriguez, reviewed by Ashutosh Chauhan) |
|
Extended SJ reduction may backtrack columns incorrectly when creating filters (Jesus Camacho Rodriguez, reviewed by Deepak Jaiswal) |
|
Remove non unique columns from group by keys if not referenced later (Vineet Garg, reviewed by Jesus Camacho Rodriguez) |
|
HivePointLookupOptimizer should extract deep cases (Zoltan Haindrich reviewed by Jesus Camacho Rodriguez) |
|
Vectorization: CASE WHEN expression fails when THEN/ELSE type and result type are different (Matt McCline, reviewed by Teddy Choi) |
|
Create SSD cache dir if it doesn’t exist already (Miklos Gergely via Sergey Shelukhin) |
|
pre-allocate LLAP cache at init time (Sergey Shelukhin, reviewed by Prasanth Jayachandran) |
|
Vectorization : Support loading bucketed tables using sorted dynamic partition optimizer- II (Deepak Jaiswal, reviewed by Prasanth Jayachandran) |
|
Improve HivePointLookupOptimizerRule to be able to extract from more sophisticated contexts (Zoltan Haindrich reviewed by Ashutosh Chauhan) |
|
Vectorization: Improve Fast Vector MapJoin Bytes Hash Tables (Matt McCline, reviewed by Zoltan Haindrich) |
|
Add an Implicit cast to date type for to_date function (Nishant Bangarwa via Ashutosh Chauhan) |
|
Vectorization: Fix more NULL / Wrong Results issues and avoid unnecessary casts/conversions (Matt McCline, reviewed by Teddy Choi) |
|
Vectorization: Lift unneeded restriction causing some PTF with RANK not to be vectorized (Matt McCline, reviewed by Teddy Choi) |
|
Vectorization: Support streaming for PTF AVG, MAX, MIN, SUM (Matt McCline, reviewed by Teddy Choi) |
|
LLAP IO may not handle ORC files that have row index disabled correctly for queries with no columns selected (Sergey Shelukhin, reviewed by Gopal Vijayaraghavan) |
|
Vectorization: Cut down memory size of 1 col VectorHashKeyWrapper to <1 CacheLine (Matt McCline, reviewed by Gopal Vijayaraghavan) |
|
Vectorization: Reduce KeyWrapper allocation in GroupBy Streaming mode (Gopal V, reviewed by Matt McCline) |
|
VectorFileSinkArrowOperator (Eric Wohlstadter, reviewed by Jason Dere, Matt McCline, Teddy Choi) |
|
Vectorization: Case expressions that return BOOLEAN are not supported for FILTER (Matt McCline, reviewed by Gopal Vijayaraghavan) |
|
Vectorization: Fix NULL / Wrong Results issues in COALESCE / ELT (Matt McCline, reviewed by Teddy Choi) |
|
Semijoin Reduction : Cycles due to semi join branch may remain undetected if small table side has a map join upstream. (Deepak Jaiswal, reviewed by Jesus Camacho Rodriguez) |
|
Disable XPROD_EDGE for sq_count_check() created for scalar subqueries (Ashutosh Chauhan via Jesus Camacho Rodriguez) |
|
forward port HIVE-19704 to master (Sergey Shelukhin, reviewed by Prasanth Jayachandran) |
|
refactor LLAP IO report generation (Sergey Shelukhin, reviewed by Prasanth Jayachandran) |
|
Extend creation of semijoin reduction filters to be able to discover new opportunities (Jesus Camacho Rodriguez via Deepak Jaiswal) |
|
Fix reoptimization in case of DPP and Semijoin optimization (Zoltan Haindrich reviewed by Ashutosh Chauhan, Zoltan Haindrich) |
|
TopNKey: Create a standalone vectorizable TopNKey operator (Teddy Choi, reviewed by Jesus Camacho Rodriguez) |
|
Vectorization: Fix NULL / Wrong Results issues in BETWEEN / IN (Matt McCline, reviewed by Teddy Choi) |
|
Vectorization: Fix NULL / Wrong Results issues in Filter / Compare (Matt McCline, reviewed by Teddy Choi) |
|
Arrow SerDe leaks a DirectByteBuffer (Eric Wohlstadter, reviewed by Teddy Choi) |
|
HiveDecimal to string conversion doesn’t format the decimal correctly (Jason Dere, reviewed by Ashutosh Chauhan) |
|
inserting into external tables with custom partition directories may cause data loss (Sergey Shelukhin, reviewed by Ashutosh Chauhan) |
|
Type conversion during IN () comparisons is using different rules from other comparison operations (Jason Dere, reviewed by Ashutosh Chauhan) |
|
CBO: Add an “optimizedSQL” to QueryPlan object (Gopal V, reviewed by Jesus Camacho Rodriguez) |
|
Vectorization: Add DECIMAL_64 testing, add Date/Interval/Timestamp arithmetic, and add more GROUP BY Aggregation tests (Matt McCline, reviewed by Teddy Choi) |
|
Support date type for column stats autogather (Zoltan Haindrich reviewed by Prasanth Jayachandran) |
|
Multi-column joins estimates with uncorrelated columns different in CBO and Hive (Jesus Camacho Rodriguez, reviewed by Ashutosh Chauhan) |
|
Use datastructure aware estimations during mapjoin selection (Zoltan Haindrich reviewed by Ashutosh Chauhan) |
|
Rewrite BETWEEN to IN for integer types for stats estimation (Vineet Garg, reviewed by Ashutosh Chauhan) |
|
Fix mapjoin size estimations for Fast implementation (Zoltan Haindrich reviewed by Ashutosh Chauhan) |
|
Rewrite SUM0 into SUM + COALESCE combination (Jesus Camacho Rodriguez, reviewed by Gopal V) |
|
HiveAggregateReduceFunctionsRule may infer wrong return type when decomposing aggregate function (Jesus Camacho Rodriguez, reviewed by Ashutosh Chauhan) |
|
Recognize COALESCE as CASE (Remus Rusanu, Jesus Camacho Rodriguez via Ashutosh Chauhan, Zoltan Haindrich) |
|
Add explain locks <sql> command (Igor Kryvenko via Eugene Koifman) |
|
For time related functions arguments may not be casted to a non nullable type (Zoltan Haindrich reviewed by Jesus Camacho Rodriguez) |
|
Extend HivePreFilteringRule to support other functions (Jesus Camacho Rodriguez, reviewed by Ashutosh Chauhan) |
|
Constant folding may not create null constants without types (Zoltan Haindrich reviewed by Ashutosh Chauhan) |
|
hive on tez execution error due to set hive.aux.jars.path to hdfs:// (zhuwei reviewed by Prasanth Jayachandran) |
|
Create new join multi-key correlation flag for CBO (Jesus Camacho Rodriguez, reviewed by Ashutosh Chauhan) |
|
Add an Implicit cast to date type for to_date function (Nishant Bangarwa via Ashutosh Chauhan) |
|
Refactor LlapStatusServiceDriver (Miklos Gergely via Sergey Shelukhin) |
|
Ban commons-logging and log4j (Prasanth Jayachandran reviewed by Ashutosh Chauhan) |
|
Fix logger for LlapServlet (Rajkumar Singh via Gopal V) |
|
In case of LLAP start failure add info how to find YARN logs (Miklos Gergely via Ashutosh Chauhan) |
|
HiveServer2: PrivilegeSynchronizer is not set to daemon status (Vaibhav Gumashta via Thejas Nair) |
|
Log spew from security.authorization.PrivilegeSynchronizer.run (Daniel Dai, reviewed by Gopal V) |
|
PrivilegeSynchronizer for SBA might hit AccessControlException (Daniel Dai, reviewed by Vaibhav Gumashta) |
|
Better logging for information schema synchronizer (Daniel Dai, reviewed by Vaibhav Gumashta) |
|
Expose ShuffleHandler.registerDag in the llap daemon API (Jaume Marhuenda, reviewed by Sergey Shelukhin) |
|
record per-task CPU counters in LLAP (Sergey Shelukhin, reviewed by Prasanth Jayachandran) |
|
Active/Passive HS2 HA: Disallow direct connection to passive HS2 instance (Sergey Shelukhin, reviewed by Prasanth Jayachandran) |
|
related equals and in operators may cause inaccurate stats estimations (Zoltan Haindrich via Ashutosh Chauhan) |
|
LLAP: Set Xms to 50% always (Gopal V via Ashutosh Chauhan) |
|
Postgres jdbc query fail with “LIMIT must not be negative” (Daniel Dai, reviewed by Thejas Nair) |
|
Insert in bucketed table fails due to dynamic partition sort optimization (Vineet Garg via Ashutosh Chauhan) |
|
getDelegationTokenFromMetaStore doesn’t need to be synchronized (Sankar Hariappan, reviewed by Thejas M Nair) |
|
Keytab URI for LLAP YARN Service is restrictive to support HDFS only (Gour Saha reviewed by Prasanth Jayachandran) |
|
Constant propagation oversimplifies projections (Zoltan Haindrich reviewed by Ashutosh Chauhan) |
|
Reading Timestamp datatype via HiveServer2 gives errors (Mahesh Kumar Behera, reviewed by Thejas M Nair) |
|
Hive QueryResultCache entries keeping reference to SemanticAnalyzer from cached query (Jason Dere, reviewed by GopalV) |
|
JdbcRecordReader.next shall not eat exception (Daniel Dai, reviewed by Thejas Nair) |
|
Inserting timestamp with zones truncates the data (Jaume Marhuenda, reviewed by Jesus Camacho Rodriguez) |
|
Upgrade version of Jetty to 9.3.25.v20180904 (Laszlo Bodor via Thejas Nair) |
|
HiveProtoHookLogger does not close file at end of day. (Harish JP, reviewed by Anishek Agarwal) |
|
Adding Tumbling Window UDF (Slim Bouguerra, via Gopal V) |
|
Add google cloud storage (gs) to the exim uri schema whitelist (Siddharth Seth, reviewed by Prasanth Jayachandran, Sergey Shelukhin) |
|
OptimizedSql is not shown if the expression contains INs (Zoltan Haindrich reviewed by Ashutosh Chauhan) |
|
LLAP aware memory manager for Orc writers (Prasanth Jayachandran reviewed by Sergey Shelukhin) |
|
msck_*.q tests are broken (Jesus Camacho Rodriguez, reviewed by Prasanth Jayachandran) |
|
SELECT statement fails after UPDATE with hive.optimize.sort.dynamic.partition optimization and vectorization on (Eugene Koifman, reviewed by Gopal V) |
|
Avoid exposing sensitive information through a Hive Runtime exception (Ashutosh Bapat reviewed by Thejas Nair, Sankar Hariappan) |
|
Vectorization: Native Vector MapJoin doesn’t support Complex Big Table values (Matt McCline, reviewed by Jason Dere) |
|
Add config to disable rollup for druid tables (Nishant Bangarwa via Ashutosh Chauhan) |
|
Add better message for NPE when inserting rows with null timestamp to druid (Nishant Bangarwa via Ashutosh Chauhan) |
|
Sync query IDs between druid and Hive (Nishant Bangarwa via Ashutosh Chauhan) |
|
LLAP: Vector group by operator should use memory per executor |
|
Enable folding of NOT x IS (NOT) [TRUE|FALSE] expressions (Zoltan Haindrich reviewed by Ashutosh Chauhan) |
|
Shared work: Extend sharing of map-join cache entries in LLAP (Jesus Camacho Rodriguez, reviewed by Gopal V) |
|
Fix org.apache.hadoop.hive.cli.TestMiniLlapCliDriver.testCliDriver[cttl] (Jesus Camacho Rodriguez, reviewed by Zoltan Haindrich) |
|
Cast exception observed when hive runs a multi join query on metastore (postgres), since postgres pushes the filter into the join, and ignores the condition before applying cast (Karthik Manamcheri, reviewed by Sergey Shelukhin) |
|
During join selection BucketMapJoin might be chosen for non bucketed tables (Zoltan Haindrich reviewed by Ashutosh Chauhan, Deepak Jaiswal) |
|
Load Data for partitioned ACID tables fails with bucketId out of range: -1 (Deepak Jaiswal, reviewed by Eugene Koifman) |
|
manifest collisions when inserting into bucketed sorted MM tables with dynamic partitioning (Sergey Shelukhin, reviewed by Deepak Jaiswal) |
|
Log more details when druid metastore transaction fails in callback (Nishant Bangarwa via Ashutosh Chauhan) |
|
Expanding WebUI to include form to dynamically config log levels (Zoltan Chovan reviewed by Prasanth Jayachandran) |
|
Implement Retry Logic in HiveDruidSplit for Scan Queries (Nishant Bangarwa via Ashutosh Chauhan) |
|
Make hflush in hive proto logging configurable (Prasanth Jayachandran reviewed by Thejas M Nair) |
|
Use canonical hostname only for kerberos auth in HiveConnection (Prasanth Jayachandran reviewed by Gopal V) |
|
Change default of hive.hashtable.key.count.adjustment to 0.99 |
|
Schema Evolution checking is broken in going from Hive version 2 to version 3 for ALTER TABLE VARCHAR to DECIMAL |
|
REPL DUMP on external table with add partition event throws NoSuchElementException (Sankar Hariappan, reviewed by Anishek Agarwal) |
|
NPE in HiveMetaHook (Jason Dere, reviewed by Sergey Shelukhin) |
|
Enable TestMiniDruidKafkaCliDriver#druidkafkamini_basic.q (Nishant Bangarwa, reviewed by Jesus Camacho Rodriguez) |
|
HS2 doAs=true has permission issue with hadoop.tmp.dir, with MR and S3A filesystem (Thejas Nair, reviewed by Zoltan Haindrich) |
|
Vectorization: Support loading bucketed tables using sorted dynamic partition optimizer (Deepak Jaiswal, reviewed by Gopal Vijayaraghavan, Matt McCline, and Thejas Nair) |
|
Hive does not support user names of type “user@realm” (Deepak Jaiswal, reviewed by Thejas Nair) |
|
HiveFilterSetOpTransposeRule may throw assertion error due to nullability of fields (Jesus Camacho Rodriguez, reviewed by Ashutosh Chauhan) |
|
Empty query results when using results cache and query temp dir, results cache dir in different filesystems (Jason Dere via Thejas Nair) |
|
GetTablesOperation pulls all the tables meta irrespective of auth. (Rajkumar Singh via Ashutosh Chauhan) |
|
Using a table alias which also appears as a column name is not possible (Jaume M via Zoltan Haindrich) |
|
CopyUtils used by REPL LOAD and EXPORT/IMPORT operations ignore distcp error (Sankar Hariappan, reviewed by Mahesh Kumar Behera, Thejas M Nair) |
|
Vectorization: Vectorized PTF IllegalStateException (Matt McCline, reviewed by Teddy Choi) |
|
Implicit String to Timestamp conversion is slow (Deepak Jaiswal, reviewed by Gopal Vijayaraghavan) |
|
addendum |
|
Vectorization: Support grouping function (Matt McCline, reviewed by Teddy Choi) |
|
CTAS w/a custom table location that is not fully qualified fails for MM tables (Sergey Shelukhin, reviewed by Ashutosh Chauhan) |
|
Hive ACID: Update/delete/merge does not clean hdfs staging directory (Rajkumar Singh, reviewed by Vineet Garg) |
|
Parsing time is extremely high (~10 min) for queries with complex select expressions (Zoltan Haindrich reviewed by Ashutosh Chauhan) |
|
Addendum patch |
|
Invalid queue name and synchronisation issues in hive proto events hook. (Harish JP, reviewed by Anishek Agarwal) |
|
HS2 shouldn’t log callstack for an empty auth header error |
|
NPE in SessionStateUserAuthenticator when authenticator=SessionStateUserAuthenticator (Daniel Dai, reviewed by Thejas Nair) |
|
Drop database may hang if the tables get deleted from a different call (Aihua Xu, reviewed by Naveen Gangam) |
|
StatsTask fails due to ClassCastException (Jaume Marhuenda, reviewed by Jesus Camacho Rodriguez) |
|
ReplDumpTaskTest is failing (Sankar Hariappan, reviewed by Jesus Camacho Rodriguez) |
|
Follow redirects when hive connects to a passive druid overlord/coordinator (Nishant Bangarwa via Ashutosh Chauhan) |
|
Masking and filtering policies for materialized views (Jesus Camacho Rodriguez, reviewed by Ashutosh Chauhan) |
|
Create constraints with RELY as default instead of NO RELY (Vineet Garg, reviewed by Jesus Camacho Rodriguez) |
|
SessionStateUserAuthenticator.getGroupNames() is always empty (Daniel Dai, reviewed by Thejas Nair) |
|
Lazy initialize ArrowColumnarBatchSerDe so it doesn’t allocate buffers during GetSplits (Eric Wohlstadter, reviewed by Jason Dere) |
|
Enable vectorization for materialized view rewriting tests (Jesus Camacho Rodriguez, reviewed by Ashutosh Chauhan) |
|
LLAP: non-vectorized execution in IO ignores virtual columns, including ROW__ID (Jesus Camacho Rodriguez, reviewed by Sergey Shelukhin) |
|
potential race in LLAP signer unit test (Sergey Shelukhin, reviewed by Prasanth Jayachandran) |
|
SharedWorkOptimizer fails with ‘operator cache contents and actual plan differ’ (Jesus Camacho Rodriguez, reviewed by Ashutosh Chauhan) |
|
BloomKFilter: Avoid using the local byte[] arrays entirely (Gopal V, reviewed by Prasanth Jayachandran) |
|
Typo in HiveReduceExpressionsWithStatsRule variable (Jesus Camacho Rodriguez, reviewed by Ashutosh Chauhan) |
|
Hiveserver2 in http mode emitting metric default.General.open_connections incorrectly (Jesus Camacho Rodriguez, reviewed by Zoltan Haindrich) |
|
configure repl configuration directories based on user running hiveserver2 (Anishek Agarwal, reviewed by Sankar Hariappan) |
|
Hive WM session killed: Failed to update LLAP tasks count (Sergey Shelukhin, reviewed by Prasanth Jayachandran) |
|
HS2 with embedded metastore is leaking JDOPersistenceManager objects (Sankar Hariappan, reviewed by Vihang Karajgaonkar) |
|
Vectorization: Fix NULL / Wrong Results issues in GROUP BY Aggregation Functions (Matt McCline, reviewed by Teddy Choi) |
|
fix some issues with LLAP Parquet cache (Sergey Shelukhin, reviewed by Prasanth Jayachandran) |
|
TestHiveCli failing/timing out (Vineet Garg, reviewed by Zoltan Haindrich) |
|
Inserting from bucketed table can cause data loss, if the source table contains empty bucket (Peter Vary, reviewed by Naveen Gangam) |
|
StatsUpdater failed with GSS Exception while trying to connect to remote metastore (Rajkumar Singh via Ashutosh Chauhan) |
|
Enable ZLIB for streaming ingest (Prasanth Jayachandran, via Gopal V) |
|
TezTask is using parent logger (Prasanth Jayachandran reviewed by Sergey Shelukhin) |
|
reset db state, when repl dump fails, so rename table can be done (Anishek Agarwal, reviewed by Sankar Hariappan) |
|
Backport HIVE-20111 to branch-3 (Daniel Voros via Zoltan Haindrich) |
|
Druid-Hive (managed) table creation fails with strict managed table checks: Table is marked as a managed table but is not transactional (Nishant Bangarwa via Ashutosh Chauhan) |
|
Managed tables converted to external tables by the HiveStrictManagedMigration utility should be set to delete data when the table is dropped (Jason Dere, reviewed by Daniel Dai) |
|
Hive streaming ingest is contended on synchronized logging (Prasanth Jayachandran reviewed by Sergey Shelukhin) |
|
Truncate table for Acid tables conflicts with ResultSet cache (Eugene Koifman, reviewed by Jason Dere) |
|
Beeline config location path is assembled incorrectly (Denes Bodo via Zoltan Haindrich) |
|
Revert to position based schema evolution for orc tables (Prasanth Jayachandran reviewed by Gunther Hagleitner) |
|
OrcInputFormat does not pass conf to orc reader options (Prasanth Jayachandran reviewed by Sergey Shelukhin) |
|
Replication dump has a NPE when table is empty (Mahesh Kumar Behera, reviewed by Peter Vary, Sankar Hariappan) |
|
WM: Only Aggregate DAG counters if at least one is used (Gopal V, reviewed by Prasanth Jayachandran) |
|
Disable external table replication by default via a configuration property (Mahesh Kumar Behera, reviewed by Sankar Hariappan) |
|
Backport HIVE-20067 to branch-3 (Daniel Voros via Zoltan Haindrich) |
|
Clean-up of event files created by HiveProtoLoggingHook (Sankar Hariappan, reviewed by Harish Jaiprakash, Anishek Agarwal) |
|
Bucket pruning: Left Outer Join on bucketed table gives wrong result (Deepak Jaiswal, reviewed by Gopal V) |
|
Dynamic partition pruning in Tez is leading to ‘No work found for tablescan’ error (Ganesha Shreedhara via Ashutosh Chauhan) |
|
HiveServer2 ObjectInspectorFactory memory leak with cachedUnionStructObjectInspector (Rajkumar Singh) |
|
stats auto gather: incorrect aggregation during UNION queries (may lead to incorrect results) (Zoltan Haindrich via Ashutosh Chauhan) |
|
Aggregate row traffic for acid tables (Zoltan Haindrich reviewed by Ashutosh Chauhan) |
|
Skip authorization for temp tables (Zoltan Haindrich reviewed by Igor Kryvenko, Jason Dere) |
|
HiveMetaStore eats exception if transactionalListeners.notifyEvent fail (Sankar Hariappan, reviewed by Mahesh Kumar Behera, Daniel Dai) |
|
Hive streaming should try shade prefix unconditionally on exception (Prasanth Jayachandran reviewed by Vineet Garg) |
|
Update queries on non-bucketed + partitioned tables throw NPE (Prasanth Jayachandran reviewed by Gopal V) |
|
Metastore client cache config is used incorrectly (Prasanth Jayachandran reviewed by Sergey Shelukhin) |
|
SMB Join: Need Optraits for PTFOperator ala GBY Op (Deepak Jaiswal, reviewed by Jason Dere) |
|
Wrong scale used by ConvertDecimal64ToDecimal results in incorrect results (addendum) |
|
Wrong scale used by ConvertDecimal64ToDecimal results in incorrect results (Prasanth Jayachandran reviewed by Matt McCline) |
|
GetTablesOperation is too slow if the hive has too many databases and tables (Rajkumar Singh via Ashutosh Chauhan) |
|
Add logs to the subclasses of MetaDataOperation (Marta Kuczora, via Peter Vary) |
|
Fix runtime stats for merge statement (Zoltan Haindrich via Ashutosh Chauhan) |
|
Metastore uses wrong application name for HADOOP2 metrics (Vineet Garg, reviewed by Alan Gates) |
|
Move away from append mode in proto logging hook (Harish JP, reviewed by Anishek Agarwal) |
|
Vectorization: Fix NULL / Wrong Results issues in Arithmetic (Matt McCline, reviewed by Teddy Choi) |
|
Backport HIVE-19759 to branch-3 (Daniel Voros via Zoltan Haindrich) |
|
Drop table by using direct sql (Peter Vary, reviewed by Alexander Kolbasov, Vihang Karajgaonkar) |
|
possible NPE in LLAP testSigning (Sergey Shelukhin, reviewed by Jason Dere) |
|
Backport HIVE-19304 to branch-3 (Daniel Voros via Zoltan Haindrich) |
|
TestJdbcWithDBTokenStoreNoDoAs tests are failing (Peter Vary via Ashutosh Chauhan) |
|
Table rename will run java.lang.StackOverflowError in dataNucleus if the table contains large number of columns (Aihua Xu, reviewed by Yongzhi Chen) |
|
When test MetaStore is started with retry the instances should be independent (Peter Vary, reviewed by Sahil Takiar) |
|
ACID: Optimize AcidUtils.getLogicalLength() |
|
Wrong result when union all and later view with explode is used (Ganesha Shreedhara via Jesus Camacho Rodriguez) |
|
To Skip PartitionDesc Extraction For Non-LLAP Execution Mode (Syed Shameerur Rahman, reviewed by Ádám Szita) |