cume_dist {SparkR} | R Documentation |
Window function: returns the cumulative distribution of values within a window partition, i.e. the fraction of rows that are below the current row.
cume_dist(x = "missing") ## S4 method for signature 'missing' cume_dist()
x |
empty. Should be used with no argument. |
N = total number of rows in the partition cume_dist(x) = number of values before (and including) x / N
This is equivalent to the CUME_DIST
function in SQL.
cume_dist since 1.6.0
Other window_funcs: dense_rank
,
lag
, lead
,
ntile
, percent_rank
,
rank
, row_number
## Not run:
##D df <- createDataFrame(mtcars)
##D ws <- orderBy(windowPartitionBy("am"), "hp")
##D out <- select(df, over(cume_dist(), ws), df$hp, df$am)
## End(Not run)