countDistinct {SparkR} | R Documentation |
Count Distinct
Aggregate function: returns the number of distinct items in a group.
## S4 method for signature 'Column' countDistinct(x, ...) ## S4 method for signature 'Column' n_distinct(x, ...) countDistinct(x, ...) n_distinct(x, ...)
x |
Column to compute on |
... |
other columns |
x |
Column to compute on |
... |
other columns |
the number of distinct items in a group.
countDistinct since 1.4.0
n_distinct since 1.4.0
Other agg_funcs: agg
, agg
,
agg
, agg,GroupedData-method
,
agg,SparkDataFrame-method
,
summarize
, summarize
,
summarize
,
summarize,GroupedData-method
,
summarize,SparkDataFrame-method
;
avg
, avg
,
avg,Column-method
; count
,
count
, count,Column-method
,
count,SparkDataFrame-method
,
n
, n
,
n,Column-method
, nrow
,
nrow,SparkDataFrame-method
;
first
, first
,
first,SparkDataFrame-method
,
first,characterOrColumn-method
;
kurtosis
, kurtosis
,
kurtosis,Column-method
; last
,
last
,
last,characterOrColumn-method
;
max
, max,Column-method
;
mean
, mean,Column-method
;
min
, min,Column-method
;
sd
, sd
,
sd,Column-method
, stddev
,
stddev
, stddev,Column-method
;
skewness
, skewness
,
skewness,Column-method
;
stddev_pop
, stddev_pop
,
stddev_pop,Column-method
;
stddev_samp
, stddev_samp
,
stddev_samp,Column-method
;
sumDistinct
, sumDistinct
,
sumDistinct,Column-method
;
sum
, sum,Column-method
;
var_pop
, var_pop
,
var_pop,Column-method
;
var_samp
, var_samp
,
var_samp,Column-method
; var
,
var
, var,Column-method
,
variance
, variance
,
variance,Column-method
## Not run: countDistinct(df$c)
## Not run: n_distinct(df$c)