pyspark.RDD.stats

RDD.stats()

Return a StatCounter object that captures the mean, variance and count of the RDD’s elements in one operation.

New in version 0.9.1.

Returns
StatCounter
    a StatCounter capturing the mean, variance and count of all elements

See also
RDD.stdev()
RDD.sampleStdev()
RDD.variance()
RDD.sampleVariance()
RDD.histogram()
pyspark.sql.DataFrame.stat()
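Examples

The sketch below illustrates, in plain Python, how a mean, variance and count can be gathered in a single pass and then merged across partitions, which is the kind of computation "in one operation" implies. It is an illustration under stated assumptions, not PySpark's implementation: the names RunningStats and summarize are hypothetical, and the nested lists stand in for an RDD's partitions.

```python
from dataclasses import dataclass


@dataclass
class RunningStats:
    """Hypothetical stand-in for a StatCounter-like accumulator (not PySpark's class)."""
    n: int = 0        # number of elements seen so far
    mu: float = 0.0   # running mean
    m2: float = 0.0   # running sum of squared deviations from the mean

    def add(self, x: float) -> "RunningStats":
        # Welford's update: fold one value into the running summary.
        self.n += 1
        delta = x - self.mu
        self.mu += delta / self.n
        self.m2 += delta * (x - self.mu)
        return self

    def merge(self, other: "RunningStats") -> "RunningStats":
        # Combine two partial summaries, e.g. one per partition.
        if other.n == 0:
            return self
        if self.n == 0:
            self.n, self.mu, self.m2 = other.n, other.mu, other.m2
            return self
        total = self.n + other.n
        delta = other.mu - self.mu
        self.m2 += other.m2 + delta * delta * self.n * other.n / total
        self.mu += delta * other.n / total
        self.n = total
        return self

    def variance(self) -> float:
        # Population variance (divides by n).
        return self.m2 / self.n if self.n else float("nan")

    def sample_variance(self) -> float:
        # Sample variance (divides by n - 1).
        return self.m2 / (self.n - 1) if self.n > 1 else float("nan")


def summarize(partitions):
    """Summarize each partition independently, then merge the partial results."""
    total = RunningStats()
    for part in partitions:
        partial = RunningStats()
        for x in part:
            partial.add(x)
        total.merge(partial)
    return total


# Two lists play the role of two partitions of one dataset.
stats = summarize([[1.0, 2.0], [3.0, 4.0]])
print(stats.n, stats.mu, stats.variance(), stats.sample_variance())
```

In PySpark itself, assuming an existing SparkContext named sc, the equivalent call would be sc.parallelize([1.0, 2.0, 3.0, 4.0]).stats(), and the returned StatCounter exposes the values through methods such as count(), mean() and variance(). The population and sample variants in the sketch correspond to the distinction between RDD.variance() and RDD.sampleVariance().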