val q1 = spark.read.option("header", true).csv("../datasets/people.csv")
scala> println(q1.queryExecution.logical.numberedTreeString)
00 Relation[id#72,name#73,age#74] csv
val q2 = sql("select * from `csv`.`../datasets/people.csv`")
scala> println(q2.queryExecution.optimizedPlan.numberedTreeString)
00 Relation[_c0#175,_c1#176,_c2#177] csv
LogicalRelation Leaf Logical Operator — Representing BaseRelations in Logical Plan
LogicalRelation is a leaf logical operator that represents a BaseRelation in a logical query plan.
LogicalRelation is created when:

- DataFrameReader loads data from a data source that supports multiple paths (through SparkSession.baseRelationToDataFrame)
- DataFrameReader is requested to load data from an external table using JDBC (through SparkSession.baseRelationToDataFrame)
- TextInputCSVDataSource and TextInputJsonDataSource are requested to infer schema
- ResolveSQLOnFile converts a logical plan
- FindDataSourceTable logical evaluation rule is executed
- RelationConversions logical evaluation rule is executed
- CreateTempViewUsing logical command is requested to run
- Structured Streaming's FileStreamSource creates batches of records
The simple text representation of a LogicalRelation (aka simpleString) is Relation[output] [relation] (using the output and the BaseRelation).
val q = spark.read.text("README.md")
val logicalPlan = q.queryExecution.logical
scala> println(logicalPlan.simpleString)
Relation[value#2] text
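As a self-contained sketch (illustrative stand-in types, not Spark's actual classes), the Relation[output] [relation] text above can be modeled as a simple string assembly over the operator's output attributes and the relation's short name:

```scala
// Sketch only: Attribute and simpleString are hypothetical stand-ins for
// Spark's AttributeReference and LogicalRelation.simpleString.
case class Attribute(name: String, exprId: Int) {
  // Render an attribute the way plans print it, e.g. value#2
  override def toString: String = s"$name#$exprId"
}

def simpleString(output: Seq[Attribute], relation: String): String =
  s"Relation[${output.mkString(",")}] $relation"

// Mirrors the text relation shown above
println(simpleString(Seq(Attribute("value", 2)), "text"))
// Relation[value#2] text
```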
Creating LogicalRelation Instance

LogicalRelation takes the following when created:

- BaseRelation
- Output schema attributes (AttributeReferences)
- Optional CatalogTable
- isStreaming flag
apply Factory Utility

apply(
  relation: BaseRelation,
  isStreaming: Boolean = false): LogicalRelation

apply(
  relation: BaseRelation,
  table: CatalogTable): LogicalRelation

apply creates a LogicalRelation for the input BaseRelation (with the given CatalogTable or the optional isStreaming flag).
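The two overloads can be sketched in a self-contained way (stand-in types only, not Spark's API; the factory name LogicalRelationFactory is hypothetical to avoid clashing with the case class companion):

```scala
// Illustrative stand-ins for Spark's types
case class BaseRelation(name: String)
case class CatalogTable(identifier: String)
case class LogicalRelation(
    relation: BaseRelation,
    catalogTable: Option[CatalogTable],
    isStreaming: Boolean)

object LogicalRelationFactory {
  // Overload 1: no catalog table, optional isStreaming flag
  def apply(relation: BaseRelation, isStreaming: Boolean = false): LogicalRelation =
    LogicalRelation(relation, catalogTable = None, isStreaming)

  // Overload 2: relation backed by a catalog table, always batch
  def apply(relation: BaseRelation, table: CatalogTable): LogicalRelation =
    LogicalRelation(relation, Some(table), isStreaming = false)
}

val streaming = LogicalRelationFactory(BaseRelation("files"), isStreaming = true)
val fromTable = LogicalRelationFactory(BaseRelation("csv"), CatalogTable("people"))
```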
refresh Method

refresh(): Unit

Note: refresh is part of the LogicalPlan contract to refresh itself.

Note: refresh does the work for HadoopFsRelation relations only.
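The note above can be sketched with stand-in types (not Spark's actual classes): refresh only touches a HadoopFsRelation-backed relation and is a no-op for everything else.

```scala
// Hypothetical stand-ins modeling the dispatch described in the note
sealed trait BaseRelation
class HadoopFsRelation extends BaseRelation {
  var fileListingRefreshed = false
  // Stand-in for re-listing the input files of a file-based relation
  def refreshFileIndex(): Unit = fileListingRefreshed = true
}
class JdbcRelation extends BaseRelation

def refresh(relation: BaseRelation): Unit = relation match {
  case fs: HadoopFsRelation => fs.refreshFileIndex() // file-based: do the work
  case _                    => ()                    // anything else: no-op
}

val fs = new HadoopFsRelation
refresh(fs)               // marks the file listing as refreshed
refresh(new JdbcRelation) // leaves the relation untouched
```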