pyspark.streaming.StreamingContext.checkpoint¶
-
StreamingContext.
checkpoint
(directory: str) → None[source]¶ Sets the context to periodically checkpoint the DStream operations for master fault-tolerance. The graph will be checkpointed every batch interval.
- Parameters
- directorystr
HDFS-compatible directory where the checkpoint data will be reliably stored