R Frontend for Apache Spark


[Up] [Top]

Documentation for package ‘SparkR’ version 2.0.1

Help Pages

A B C D E F G H I J K L M N O P Q R S T U V W Y misc

-- A --

abs abs
abs-method abs
acos acos
acos-method acos
add_months add_months
add_months-method add_months
AFTSurvivalRegressionModel-class S4 class that represents a AFTSurvivalRegressionModel
agg Summarize data across columns
agg-method Summarize data across columns
alias alias
alias-method alias
approxCountDistinct Returns the approximate number of distinct items in a group
approxCountDistinct-method Returns the approximate number of distinct items in a group
approxQuantile Calculates the approximate quantiles of a numerical column of a SparkDataFrame
approxQuantile-method Calculates the approximate quantiles of a numerical column of a SparkDataFrame
arrange Arrange Rows by Variables
arrange-method Arrange Rows by Variables
array_contains array_contains
array_contains-method array_contains
as.data.frame Download data from a SparkDataFrame into a R data.frame
as.data.frame-method Download data from a SparkDataFrame into a R data.frame
as.DataFrame Create a SparkDataFrame
as.DataFrame.default Create a SparkDataFrame
asc A set of operations working with SparkDataFrame columns
ascii ascii
ascii-method ascii
asin asin
asin-method asin
atan atan
atan-method atan
atan2 atan2
atan2-method atan2
attach Attach SparkDataFrame to R search path
attach-method Attach SparkDataFrame to R search path
avg avg
avg-method avg

-- B --

base64 base64
base64-method base64
between between
between-method between
bin bin
bin-method bin
bitwiseNOT bitwiseNOT
bitwiseNOT-method bitwiseNOT
bround bround
bround-method bround

-- C --

cache Cache
cache-method Cache
cacheTable Cache Table
cacheTable.default Cache Table
cancelJobGroup Cancel active jobs for the specified group
cancelJobGroup.default Cancel active jobs for the specified group
cast Casts the column to a different data type.
cast-method Casts the column to a different data type.
cbrt cbrt
cbrt-method cbrt
ceil Computes the ceiling of the given value
ceil-method Computes the ceiling of the given value
ceiling Computes the ceiling of the given value
ceiling-method Computes the ceiling of the given value
clearCache Clear Cache
clearCache.default Clear Cache
clearJobGroup Clear current job group ID and its description
clearJobGroup.default Clear current job group ID and its description
collect Collects all the elements of a SparkDataFrame and coerces them into an R data.frame.
collect-method Collects all the elements of a SparkDataFrame and coerces them into an R data.frame.
colnames Column Names of SparkDataFrame
colnames-method Column Names of SparkDataFrame
colnames<- Column Names of SparkDataFrame
colnames<--method Column Names of SparkDataFrame
coltypes coltypes
coltypes-method coltypes
coltypes<- coltypes
coltypes<--method coltypes
column S4 class that represents a SparkDataFrame column
Column-class S4 class that represents a SparkDataFrame column
column-method S4 class that represents a SparkDataFrame column
columnfunctions A set of operations working with SparkDataFrame columns
columns Column Names of SparkDataFrame
columns-method Column Names of SparkDataFrame
concat concat
concat-method concat
concat_ws concat_ws
concat_ws-method concat_ws
contains A set of operations working with SparkDataFrame columns
conv conv
conv-method conv
corr corr
corr-method corr
cos cos
cos-method cos
cosh cosh
cosh-method cosh
count Returns the number of items in a group
count-method Returns the number of items in a group
count-method Returns the number of rows in a SparkDataFrame
countDistinct Count Distinct Values
countDistinct-method Count Distinct Values
cov cov
cov-method cov
covar_pop covar_pop
covar_pop-method covar_pop
covar_samp cov
covar_samp-method cov
crc32 crc32
crc32-method crc32
createDataFrame Create a SparkDataFrame
createDataFrame.default Create a SparkDataFrame
createExternalTable Create an external table
createExternalTable.default Create an external table
createOrReplaceTempView Creates a temporary view using the given name.
createOrReplaceTempView-method Creates a temporary view using the given name.
crosstab Computes a pair-wise frequency table of the given columns
crosstab-method Computes a pair-wise frequency table of the given columns
cume_dist cume_dist
cume_dist-method cume_dist

-- D --

dapply dapply
dapply-method dapply
dapplyCollect dapplyCollect
dapplyCollect-method dapplyCollect
datediff datediff
datediff-method datediff
date_add date_add
date_add-method date_add
date_format date_format
date_format-method date_format
date_sub date_sub
date_sub-method date_sub
dayofmonth dayofmonth
dayofmonth-method dayofmonth
dayofyear dayofyear
dayofyear-method dayofyear
decode decode
decode-method decode
dense_rank dense_rank
dense_rank-method dense_rank
desc A set of operations working with SparkDataFrame columns
describe summary
describe-method summary
dim Returns the dimensions of SparkDataFrame
dim-method Returns the dimensions of SparkDataFrame
distinct Distinct
distinct-method Distinct
drop drop
drop-method drop
dropDuplicates dropDuplicates
dropDuplicates-method dropDuplicates
dropna A set of SparkDataFrame functions working with NA values
dropna-method A set of SparkDataFrame functions working with NA values
dropTempTable (Deprecated) Drop Temporary Table
dropTempTable.default (Deprecated) Drop Temporary Table
dropTempView Drops the temporary view with the given view name in the catalog.
dtypes DataTypes
dtypes-method DataTypes

-- E --

encode encode
encode-method encode
endsWith endsWith
endsWith-method endsWith
except except
except-method except
exp exp
exp-method exp
explain Explain
explain-method Explain
explode explode
explode-method explode
expm1 expm1
expm1-method expm1
expr expr
expr-method expr

-- F --

factorial factorial
factorial-method factorial
fillna A set of SparkDataFrame functions working with NA values
fillna-method A set of SparkDataFrame functions working with NA values
filter Filter
filter-method Filter
first Return the first row of a SparkDataFrame
first-method Return the first row of a SparkDataFrame
fitted Get fitted result from a k-means model
fitted-method Get fitted result from a k-means model
floor floor
floor-method floor
format_number format_number
format_number-method format_number
format_string format_string
format_string-method format_string
freqItems Finding frequent items for columns, possibly with false positives
freqItems-method Finding frequent items for columns, possibly with false positives
from_unixtime from_unixtime
from_unixtime-method from_unixtime
from_utc_timestamp from_utc_timestamp
from_utc_timestamp-method from_utc_timestamp

-- G --

gapply gapply
gapply-method gapply
gapplyCollect gapplyCollect
gapplyCollect-method gapplyCollect
GeneralizedLinearRegressionModel-class S4 class that represents a generalized linear model
generateAliasesForIntersectedCols Creates a list of columns by replacing the intersected ones with aliases
getField A set of operations working with SparkDataFrame columns
getItem A set of operations working with SparkDataFrame columns
glm Generalized Linear Models (R-compliant)
glm-method Generalized Linear Models (R-compliant)
greatest greatest
greatest-method greatest
groupBy GroupBy
groupBy-method GroupBy
groupedData S4 class that represents a GroupedData
GroupedData-class S4 class that represents a GroupedData
group_by GroupBy
group_by-method GroupBy

-- H --

hash hash
hash-method hash
hashCode Compute the hashCode of an object
head Head
head-method Head
hex hex
hex-method hex
histogram Compute histogram statistics for given column
histogram-method Compute histogram statistics for given column
hour hour
hour-method hour
hypot hypot
hypot-method hypot

-- I --

ifelse ifelse
ifelse-method ifelse
initcap initcap
initcap-method initcap
insertInto insertInto
insertInto-method insertInto
install.spark Download and Install Apache Spark to a Local Directory
instr instr
instr-method instr
intersect Intersect
intersect-method Intersect
is.nan is.nan
is.nan-method is.nan
isLocal isLocal
isLocal-method isLocal
isNaN A set of operations working with SparkDataFrame columns
isnan is.nan
isnan-method is.nan
isNotNull A set of operations working with SparkDataFrame columns
isNull A set of operations working with SparkDataFrame columns

-- J --

join Join
join-method Join
jsonFile Create a SparkDataFrame from a JSON file.
jsonFile.default Create a SparkDataFrame from a JSON file.

-- K --

KMeansModel-class S4 class that represents a KMeansModel
kurtosis kurtosis
kurtosis-method kurtosis

-- L --

lag lag
lag-method lag
last last
last-method last
last_day last_day
last_day-method last_day
lead lead
lead-method lead
least least
least-method least
length length
length-method length
levenshtein levenshtein
levenshtein-method levenshtein
like A set of operations working with SparkDataFrame columns
limit Limit
limit-method Limit
lit lit
lit-method lit
loadDF Load a SparkDataFrame
loadDF.default Load a SparkDataFrame
locate locate
locate-method locate
log log
log-method log
log10 log10
log10-method log10
log1p log1p
log1p-method log1p
log2 log2
log2-method log2
lower lower
lower-method lower
lpad lpad
lpad-method lpad
ltrim ltrim
ltrim-method ltrim

-- M --

max max
max-method max
md5 md5
md5-method md5
mean mean
mean-method mean
merge Merges two data frames
merge-method Merges two data frames
min min
min-method min
minute minute
minute-method minute
monotonically_increasing_id monotonically_increasing_id
monotonically_increasing_id-method monotonically_increasing_id
month month
month-method month
months_between months_between
months_between-method months_between
mutate Mutate
mutate-method Mutate

-- N --

n Returns the number of items in a group
n-method Returns the number of items in a group
na.omit A set of SparkDataFrame functions working with NA values
na.omit-method A set of SparkDataFrame functions working with NA values
NaiveBayesModel-class S4 class that represents a NaiveBayesModel
names Column Names of SparkDataFrame
names-method Column Names of SparkDataFrame
names<- Column Names of SparkDataFrame
names<--method Column Names of SparkDataFrame
nanvl nanvl
nanvl-method nanvl
ncol Returns the number of columns in a SparkDataFrame
ncol-method Returns the number of columns in a SparkDataFrame
negate negate
negate-method negate
next_day next_day
next_day-method next_day
nrow Returns the number of rows in a SparkDataFrame
nrow-method Returns the number of rows in a SparkDataFrame
ntile ntile
ntile-method ntile
n_distinct Count Distinct Values
n_distinct-method Count Distinct Values

-- O --

orderBy Ordering Columns in a WindowSpec
orderBy-method Arrange Rows by Variables
orderBy-method Ordering Columns in a WindowSpec
otherwise otherwise
otherwise-method otherwise
over over
over-method over

-- P --

parquetFile Create a SparkDataFrame from a Parquet file.
parquetFile.default Create a SparkDataFrame from a Parquet file.
partitionBy partitionBy
partitionBy-method partitionBy
percent_rank percent_rank
percent_rank-method percent_rank
persist Persist
persist-method Persist
pivot Pivot a column of the GroupedData and perform the specified aggregation.
pivot-method Pivot a column of the GroupedData and perform the specified aggregation.
pmod pmod
pmod-method pmod
posexplode posexplode
posexplode-method posexplode
predict Makes predictions from a MLlib model
predict-method Generalized Linear Models
predict-method K-Means Clustering Model
predict-method Naive Bayes Models
predict-method Accelerated Failure Time (AFT) Survival Regression Model
print.jobj Print a JVM object reference.
print.structField Print a Spark StructField.
print.structType Print a Spark StructType.
print.summary.GeneralizedLinearRegressionModel Generalized Linear Models
printSchema Print Schema of a SparkDataFrame
printSchema-method Print Schema of a SparkDataFrame

-- Q --

quarter quarter
quarter-method quarter

-- R --

rand rand
rand-method rand
randn randn
randn-method randn
randomSplit randomSplit
randomSplit-method randomSplit
rangeBetween rangeBetween
rangeBetween-method rangeBetween
rank rank
rank-method rank
rbind Union two or more SparkDataFrames
rbind-method Union two or more SparkDataFrames
read.df Load a SparkDataFrame
read.df.default Load a SparkDataFrame
read.jdbc Create a SparkDataFrame representing the database table accessible via JDBC URL
read.json Create a SparkDataFrame from a JSON file.
read.json.default Create a SparkDataFrame from a JSON file.
read.ml Load a fitted MLlib model from the input path.
read.orc Create a SparkDataFrame from an ORC file.
read.parquet Create a SparkDataFrame from a Parquet file.
read.parquet.default Create a SparkDataFrame from a Parquet file.
read.text Create a SparkDataFrame from a text file.
read.text.default Create a SparkDataFrame from a text file.
regexp_extract regexp_extract
regexp_extract-method regexp_extract
regexp_replace regexp_replace
regexp_replace-method regexp_replace
registerTempTable (Deprecated) Register Temporary Table
registerTempTable-method (Deprecated) Register Temporary Table
rename rename
rename-method rename
repartition Repartition
repartition-method Repartition
reverse reverse
reverse-method reverse
rint rint
rint-method rint
rlike A set of operations working with SparkDataFrame columns
round round
round-method round
rowsBetween rowsBetween
rowsBetween-method rowsBetween
row_number row_number
row_number-method row_number
rpad rpad
rpad-method rpad
rtrim rtrim
rtrim-method rtrim

-- S --

sample Sample
sample-method Sample
sampleBy Returns a stratified sample without replacement
sampleBy-method Returns a stratified sample without replacement
sample_frac Sample
sample_frac-method Sample
saveAsParquetFile Save the contents of SparkDataFrame as a Parquet file, preserving the schema.
saveAsParquetFile-method Save the contents of SparkDataFrame as a Parquet file, preserving the schema.
saveAsTable Save the contents of the SparkDataFrame to a data source as a table
saveAsTable-method Save the contents of the SparkDataFrame to a data source as a table
saveDF Save the contents of SparkDataFrame to a data source.
saveDF-method Save the contents of SparkDataFrame to a data source.
schema Get schema object
schema-method Get schema object
sd sd
sd-method sd
second second
second-method second
select Select
select-method Select
selectExpr SelectExpr
selectExpr-method SelectExpr
setJobGroup Assigns a group ID to all the jobs started by this thread until the group ID is set to a different value or cleared.
setJobGroup.default Assigns a group ID to all the jobs started by this thread until the group ID is set to a different value or cleared.
setLogLevel Set new log level
sha1 sha1
sha1-method sha1
sha2 sha2
sha2-method sha2
shiftLeft shiftLeft
shiftLeft-method shiftLeft
shiftRight shiftRight
shiftRight-method shiftRight
shiftRightUnsigned shiftRightUnsigned
shiftRightUnsigned-method shiftRightUnsigned
show show
show-method show
showDF showDF
showDF-method showDF
sign signum
sign-method signum
signum signum
signum-method signum
sin sin
sin-method sin
sinh sinh
sinh-method sinh
size size
size-method size
skewness skewness
skewness-method skewness
sort_array sort_array
sort_array-method sort_array
soundex soundex
soundex-method soundex
spark.glm Generalized Linear Models
spark.glm-method Generalized Linear Models
spark.kmeans K-Means Clustering Model
spark.kmeans-method K-Means Clustering Model
spark.lapply Run a function over a list of elements, distributing the computations with Spark
spark.naiveBayes Naive Bayes Models
spark.naiveBayes-method Naive Bayes Models
spark.survreg Accelerated Failure Time (AFT) Survival Regression Model
spark.survreg-method Accelerated Failure Time (AFT) Survival Regression Model
SparkDataFrame-class S4 class that represents a SparkDataFrame
sparkR.callJMethod Call Java Methods
sparkR.callJStatic Call Static Java Methods
sparkR.conf Get Runtime Config from the current active SparkSession
sparkR.init (Deprecated) Initialize a new Spark Context
sparkR.newJObject Create Java Objects
sparkR.session Get the existing SparkSession or initialize a new SparkSession.
sparkR.session.stop Stop the Spark Session and Spark Context
sparkR.stop Stop the Spark Session and Spark Context
sparkR.version Get version of Spark on which this application is running
sparkRHive.init (Deprecated) Initialize a new HiveContext
sparkRSQL.init (Deprecated) Initialize a new SQLContext
spark_partition_id Return the partition ID as a column
spark_partition_id-method Return the partition ID as a column
sql SQL Query
sql.default SQL Query
sqrt sqrt
sqrt-method sqrt
startsWith startsWith
startsWith-method startsWith
stddev sd
stddev-method sd
stddev_pop stddev_pop
stddev_pop-method stddev_pop
stddev_samp stddev_samp
stddev_samp-method stddev_samp
str Compactly display the structure of a dataset
str-method Compactly display the structure of a dataset
struct struct
struct-method struct
structField structField
structField.character structField
structField.jobj structField
structType structType
structType.jobj structType
structType.structField structType
subset Subset
subset-method Subset
substr substr
substr-method substr
substring_index substring_index
substring_index-method substring_index
sum sum
sum-method sum
sumDistinct sumDistinct
sumDistinct-method sumDistinct
summarize Summarize data across columns
summarize-method Summarize data across columns
summary summary
summary-method Generalized Linear Models
summary-method K-Means Clustering Model
summary-method Naive Bayes Models
summary-method Accelerated Failure Time (AFT) Survival Regression Model
summary-method summary

-- T --

tableNames Table Names
tableNames.default Table Names
tables Tables
tables.default Tables
tableToDF Create a SparkDataFrame from a SparkSQL Table
take Take the first NUM rows of a SparkDataFrame and return the results as a R data.frame
take-method Take the first NUM rows of a SparkDataFrame and return the results as a R data.frame
tan tan
tan-method tan
tanh tanh
tanh-method tanh
toDegrees toDegrees
toDegrees-method toDegrees
toRadians toRadians
toRadians-method toRadians
to_date to_date
to_date-method to_date
to_utc_timestamp to_utc_timestamp
to_utc_timestamp-method to_utc_timestamp
transform Mutate
transform-method Mutate
translate translate
translate-method translate
trim trim
trim-method trim

-- U --

unbase64 unbase64
unbase64-method unbase64
uncacheTable Uncache Table
uncacheTable.default Uncache Table
unhex unhex
unhex-method unhex
union Return a new SparkDataFrame containing the union of rows
union-method Return a new SparkDataFrame containing the union of rows
unionAll Return a new SparkDataFrame containing the union of rows
unionAll-method Return a new SparkDataFrame containing the union of rows
unique Distinct
unique-method Distinct
unix_timestamp unix_timestamp
unix_timestamp-method unix_timestamp
unpersist Unpersist
unpersist-method Unpersist
upper upper
upper-method upper

-- V --

var var
var-method var
variance var
variance-method var
var_pop var_pop
var_pop-method var_pop
var_samp var_samp
var_samp-method var_samp

-- W --

weekofyear weekofyear
weekofyear-method weekofyear
when when
when-method when
where Filter
where-method Filter
window window
window-method window
windowOrderBy windowOrderBy
windowOrderBy-method windowOrderBy
windowPartitionBy windowPartitionBy
windowPartitionBy-method windowPartitionBy
WindowSpec-class S4 class that represents a WindowSpec
with Evaluate a R expression in an environment constructed from a SparkDataFrame
with-method Evaluate a R expression in an environment constructed from a SparkDataFrame
withColumn WithColumn
withColumn-method WithColumn
withColumnRenamed rename
withColumnRenamed-method rename
write.df Save the contents of SparkDataFrame to a data source.
write.df-method Save the contents of SparkDataFrame to a data source.
write.jdbc Save the content of SparkDataFrame to an external database table via JDBC.
write.jdbc-method Save the content of SparkDataFrame to an external database table via JDBC.
write.json Save the contents of SparkDataFrame as a JSON file
write.json-method Save the contents of SparkDataFrame as a JSON file
write.ml Saves the MLlib model to the input path
write.ml-method Generalized Linear Models
write.ml-method K-Means Clustering Model
write.ml-method Naive Bayes Models
write.ml-method Accelerated Failure Time (AFT) Survival Regression Model
write.orc Save the contents of SparkDataFrame as an ORC file, preserving the schema.
write.orc-method Save the contents of SparkDataFrame as an ORC file, preserving the schema.
write.parquet Save the contents of SparkDataFrame as a Parquet file, preserving the schema.
write.parquet-method Save the contents of SparkDataFrame as a Parquet file, preserving the schema.
write.text Save the content of SparkDataFrame in a text file at the specified path.
write.text-method Save the content of SparkDataFrame in a text file at the specified path.

-- Y --

year year
year-method year

-- misc --

$ Select
$-method Select
$<- Select
$<--method Select
%in% Match a column with given values.
%in%-method Match a column with given values.
[ Subset
[-method Subset
[[ Subset
[[-method Subset