doExecute(): RDD[InternalRow]
StreamingGlobalLimitExec Unary Physical Operator
StreamingGlobalLimitExec
is a unary physical operator that represents a Limit
logical operator of a streaming query at execution time.
Note
|
A unary physical operator ( Read up on UnaryExecNode (and physical operators in general) in The Internals of Spark SQL book. |
StreamingGlobalLimitExec
is created exclusively when StreamingGlobalLimitStrategy execution planning strategy is requested to plan a Limit
logical operator (in the logical plan of a streaming query) for execution.
Note
|
Read up on Limit Logical Operator in The Internals of Spark SQL book. |
StreamingGlobalLimitExec
is a stateful physical operator that can write to a state store.
StreamingGlobalLimitExec
supports Append output mode only.
The optional properties, i.e. the StatefulOperatorStateInfo and the output mode, are initially undefined when StreamingGlobalLimitExec
is created. StreamingGlobalLimitExec
is updated to hold execution-specific configuration when IncrementalExecution
is requested to prepare the logical plan (of a streaming query) for execution (when the state preparation rule is executed).
Creating StreamingGlobalLimitExec Instance
StreamingGlobalLimitExec
takes the following to be created:
-
StatefulOperatorStateInfo (default:
None
) -
OutputMode (default:
None
)
StreamingGlobalLimitExec
initializes the internal properties.
StreamingGlobalLimitExec as StateStoreWriter
StreamingGlobalLimitExec
is a stateful physical operator that can write to a state store.
Performance Metrics
StreamingGlobalLimitExec
uses the performance metrics of the parent StateStoreWriter.
Executing Physical Operator (Generating RDD[InternalRow]) — doExecute
Method
Note
|
doExecute is part of SparkPlan Contract to generate the runtime representation of an physical operator as a recipe for distributed computation over internal binary rows on Apache Spark (RDD[InternalRow] ).
|
doExecute
…FIXME