Builder that produces a Hive-aware SessionState
.
Builder that produces a Hive-aware SessionState
.
Relation conversion from metastore relations to data source relations for better performance
Relation conversion from metastore relations to data source relations for better performance
- When writing to non-partitioned Hive-serde Parquet/Orc tables - When scanning Hive-serde Parquet/ORC tables
This rule must be run before all other DDL post-hoc resolution rules, i.e.
PreprocessTableCreation
, PreprocessTableInsertion
, DataSourceAnalysis
and HiveAnalysis
.
Determine the database, serde/format and schema of the Hive serde table, according to the storage properties.
An instance of the Spark SQL execution engine that integrates with data stored in Hive.
An instance of the Spark SQL execution engine that integrates with data stored in Hive. Configuration for Hive is read from hive-site.xml on the classpath.
(Since version 2.0.0) Use SparkSession.builder.enableHiveSupport instead
Replaces generic operations with specific variants that are designed to work with Hive.
Replaces generic operations with specific variants that are designed to work with Hive.
Note that, this rule must be run after PreprocessTableCreation
and
PreprocessTableInsertion
.
Support for running Spark SQL queries using functionality from Apache Hive (does not require an existing Hive installation). Supported Hive features include:
Users that would like access to this functionality should create a HiveContext instead of a SQLContext.