pyspark.Broadcast
A broadcast variable created with SparkContext.broadcast(). Access its value through the value attribute.
Examples
>>> from pyspark.context import SparkContext
>>> sc = SparkContext('local', 'test')
>>> b = sc.broadcast([1, 2, 3, 4, 5])
>>> b.value
[1, 2, 3, 4, 5]
>>> sc.parallelize([0, 0]).flatMap(lambda x: b.value).collect()
[1, 2, 3, 4, 5, 1, 2, 3, 4, 5]
>>> b.unpersist()
>>> large_broadcast = sc.broadcast(range(10000))
Methods
destroy([blocking])
Destroy all data and metadata related to this broadcast variable.
dump(value, f)
load(file)
load_from_path(path)
unpersist([blocking])
Delete cached copies of this broadcast on the executors.
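The difference between unpersist() and destroy() is easiest to see in a complete lifecycle. The following is a minimal sketch, not part of the PySpark API: the helper name lookup_with_broadcast and its parameters keys and table are hypothetical, and sc is assumed to be an existing SparkContext such as the one created in the Examples section.

```python
def lookup_with_broadcast(sc, keys, table):
    """Map each key through a broadcast lookup table, then release the broadcast.

    `sc` is assumed to be a live SparkContext; `keys` is an iterable of keys
    and `table` is a dict. All three names are illustrative assumptions.
    """
    # Ship the lookup table to the executors once instead of once per task.
    b = sc.broadcast(table)
    try:
        # Tasks read the shared copy through b.value.
        result = sc.parallelize(keys).map(lambda k: b.value.get(k)).collect()
    finally:
        # Delete the cached copies on the executors. The broadcast could still
        # be used afterwards, at the cost of being sent to the executors again.
        b.unpersist()
        # Remove all data and metadata. The broadcast must not be used again.
        b.destroy()
    return result
```

Calling lookup_with_broadcast(sc, ["a", "z"], {"a": 1}) would return the looked-up value for each key, with None for keys missing from the table. Calling unpersist() before destroy() is redundant in this sketch and is shown only to contrast the two methods.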
Attributes
value
Return the broadcasted value.