Deactivating instances related to checkpoint location [runId]: [storeIdsToRemove]
StateStoreCoordinator RPC Endpoint — Tracking Locations of StateStores for StateStoreRDD
StateStoreCoordinator keeps track of state stores on Spark executors (per host and executor ID).
StateStoreCoordinator is used by StateStoreRDD when requested to get the location preferences of partitions (based on the location of the stores).
StateStoreCoordinator is a ThreadSafeRpcEndpoint RPC endpoint that manipulates instances registry through RPC messages.
| Message | Message Handler |
|---|---|
|
Removes StateStoreProviderIds of a streaming query (given Internally,
|
|
Gives the location of StateStoreProviderId (from instances) with the host and an executor id on that host. You should see the following DEBUG message in the logs:
|
|
One-way asynchronous (fire-and-forget) message to register a new StateStoreProviderId on an executor (given Sent out exclusively when StateStoreCoordinatorRef RPC endpoint reference is requested to reportActiveInstance (when Internally,
In the end, |
|
|
|
Verifies if a given StateStoreProviderId is registered (in instances) on You should see the following DEBUG message in the logs:
|
|
Tip
|
Enable Add the following line to
Refer to Logging. |
instances Internal Registry
instances: HashMap[StateStoreProviderId, ExecutorCacheTaskLocation]
instances is an internal registry of StateStoreProviders by their StateStoreProviderIds and ExecutorCacheTaskLocations (with a host and a executorId).
-
A new
StateStoreProviderIdadded whenStateStoreCoordinatoris requested to handle a ReportActiveInstance message -
All
StateStoreProviderIdsof a streaming query are removed whenStateStoreCoordinatoris requested to handle a DeactivateInstances message