Deactivating instances related to checkpoint location [runId]: [storeIdsToRemove]
StateStoreCoordinator RPC Endpoint — Tracking Locations of StateStores for StateStoreRDD
StateStoreCoordinator
keeps track of state stores on Spark executors (per host and executor ID).
StateStoreCoordinator
is used by StateStoreRDD
when requested to get the location preferences of partitions (based on the location of the stores).
StateStoreCoordinator
is a ThreadSafeRpcEndpoint
RPC endpoint that manipulates instances registry through RPC messages.
Message | Message Handler |
---|---|
|
Removes StateStoreProviderIds of a streaming query (given Internally,
|
|
Gives the location of StateStoreProviderId (from instances) with the host and an executor id on that host. You should see the following DEBUG message in the logs:
|
|
One-way asynchronous (fire-and-forget) message to register a new StateStoreProviderId on an executor (given Sent out exclusively when StateStoreCoordinatorRef RPC endpoint reference is requested to reportActiveInstance (when Internally,
In the end, |
|
|
|
Verifies if a given StateStoreProviderId is registered (in instances) on You should see the following DEBUG message in the logs:
|
Tip
|
Enable Add the following line to
Refer to Logging. |
instances
Internal Registry
instances: HashMap[StateStoreProviderId, ExecutorCacheTaskLocation]
instances
is an internal registry of StateStoreProviders by their StateStoreProviderIds and ExecutorCacheTaskLocations
(with a host
and a executorId
).
-
A new
StateStoreProviderId
added whenStateStoreCoordinator
is requested to handle a ReportActiveInstance message -
All
StateStoreProviderIds
of a streaming query are removed whenStateStoreCoordinator
is requested to handle a DeactivateInstances message