R frontend for Spark


Documentation for package ‘SparkR’ version 1.5.2

Help Pages

A B C D E F G H I J L M N O P Q R S T U W Y misc

-- A --

abs abs
acos acos
add_months add_months
agg Summarize data across columns
agg-method Summarize data across columns
alias alias
approxCountDistinct approxCountDistinct
arrange Arrange
asc S4 class that represents a DataFrame column
ascii ascii
asin asin
atan atan
atan2 atan2
avg avg

-- B --

base64 base64
between between
between S4 class that represents a DataFrame column
bin bin
bitwiseNOT bitwiseNOT

-- C --

cache Cache
cacheTable Cache Table
cancelJobGroup Cancel active jobs for the specified group
cast Casts the column to a different data type.
cast S4 class that represents a DataFrame column
cbrt cbrt
ceil ceil
clearCache Clear Cache
clearJobGroup Clear current job group ID and its description
collect Collects all the elements of a Spark DataFrame and coerces them into an R data.frame.
Column-class S4 class that represents a DataFrame column
columns Column names
columns Get schema object
concat concat
concat_ws concat_ws
contains S4 class that represents a DataFrame column
conv conv
cos cos
cosh cosh
count Count
count Number of rows for a DataFrame
count-method Summarize data across columns
countDistinct Count Distinct
crc32 crc32
createDataFrame Create a DataFrame from an RDD
createExternalTable Create an external table
crosstab crosstab

-- D --

dataFrame S4 class that represents a DataFrame
DataFrame-class S4 class that represents a DataFrame
datediff datediff
date_add date_add
date_format date_format
date_sub date_sub
dayofmonth dayofmonth
dayofyear dayofyear
desc S4 class that represents a DataFrame column
describe describe
dim Returns the dimensions (number of rows and columns) of a DataFrame
distinct Distinct
distinct Distinct rows in a DataFrame
dropna dropna
dropTempTable Drop Temporary Table
dtypes DataTypes
dtypes Get schema object

-- E --

endsWith S4 class that represents a DataFrame column
except except
exp exp
explain Explain
explode explode
expm1 expm1
expr expr

-- F --

factorial factorial
fillna dropna
filter Filter
first Return the first row of a DataFrame
floor floor
format_number format_number
format_string format_string
from_unixtime from_unixtime
from_utc_timestamp from_utc_timestamp

-- G --

getField S4 class that represents a DataFrame column
getItem S4 class that represents a DataFrame column
glm Fits a generalized linear model
glm-method Fits a generalized linear model
greatest greatest
groupBy GroupBy
groupedData S4 class that represents a DataFrame
GroupedData-class S4 class that represents a GroupedData
group_by GroupBy

-- H --

hashCode Compute the hashCode of an object
head Head
hex hex
hour hour
hypot hypot

-- I --

ifelse ifelse
infer_type infer the SQL type
initcap initcap
insertInto insertInto
instr instr
intersect Intersect
isLocal isLocal
isNaN isNaN
isNotNull S4 class that represents a DataFrame column
isNull S4 class that represents a DataFrame column

-- J --

join Join
join rdname merge
jsonFile Create a DataFrame from a JSON file.

-- L --

last last
last_day last_day
least least
length length
levenshtein levenshtein
like S4 class that represents a DataFrame column
limit Limit
lit Creates a 'Column' of literal value.
loadDF Load a DataFrame
locate locate
log log
log10 log10
log1p log1p
log2 log2
lower lower
lpad lpad
ltrim ltrim

-- M --

max max
md5 md5
mean mean
merge rdname merge
min min
minute minute
month month
months_between months_between
mutate WithColumn

-- N --

n Count
na.omit dropna
names Column names
names<- Column names
nanvl nanvl
ncol Returns the number of columns in a DataFrame
negate negate
next_day next_day
nrow Count
nrow Number of rows for a DataFrame
n_distinct Count Distinct

-- O --

orderBy Arrange
orderby Arrange
otherwise S4 class that represents a DataFrame column
otherwise otherwise

-- P --

parquetFile Create a DataFrame from a Parquet file.
persist Persist
PipelineModel-class S4 class that represents a PipelineModel
pmod pmod
predict-method Make predictions from a model
print.jobj Print a JVM object reference.
print.structField Print a Spark StructField.
print.structType Print a Spark StructType.
printSchema Print Schema of a DataFrame
printSchema Get schema object

-- Q --

quarter quarter

-- R --

rand rand
randn randn
rbind Union two or more DataFrames
read.df Load a DataFrame
regexp_extract regexp_extract
regexp_replace regexp_replace
registerTempTable Register Temporary Table
rename WithColumnRenamed
repartition Repartition
reverse reverse
rint rint
rlike S4 class that represents a DataFrame column
round round
rpad rpad
rtrim rtrim

-- S --

sample Sample
sample-method Sample
sample_frac Sample
saveAsParquetFile saveAsParquetFile
saveAsTable saveAsTable
saveDF Save the contents of the DataFrame to a data source
schema Get schema object
second second
select Select
select-method Select
selectExpr Select
selectExpr SelectExpr
setJobGroup Assigns a group ID to all the jobs started by this thread until the group ID is set to a different value or cleared.
sha1 sha1
sha2 sha2
shiftLeft shiftLeft
shiftRight shiftRight
shiftRightUnsigned shiftRightUnsigned
show show
show-method show
showDF showDF
signum signum
sin sin
sinh sinh
size size
soundex soundex
sparkR.init Initialize a new Spark Context.
sparkR.stop Stop the Spark context.
sparkRHive.init Initialize a new HiveContext.
sparkRSQL.init Initialize a new SQLContext.
sql SQL Query
sqrt sqrt
startsWith S4 class that represents a DataFrame column
structField structField
structType structType
subset Subset
substr substr
substring_index substring_index
sum sum
sumDistinct sumDistinct
summarize Summarize data across columns
summarize-method Summarize data across columns
summary describe
summary Summary
summary-method Summary

-- T --

table Create a DataFrame from a SparkSQL Table
tableNames Table Names
tables Tables
take Take the first NUM rows of a DataFrame and return the results as a data.frame
tan tan
tanh tanh
toDegrees toDegrees
toRadians toRadians
toRDD S4 class that represents a DataFrame
to_date to_date
to_utc_timestamp to_utc_timestamp
transform WithColumn
translate translate
trim trim

-- U --

unbase64 unbase64
uncacheTable Uncache Table
unhex unhex
unionAll Union two or more DataFrames
unionAll UnionAll
unique Distinct rows in a DataFrame
unix_timestamp unix_timestamp
unpersist Unpersist
upper upper

-- W --

weekofyear weekofyear
when S4 class that represents a DataFrame column
when when
where Filter
withColumn WithColumn
withColumnRenamed WithColumnRenamed
write.df Save the contents of the DataFrame to a data source

-- Y --

year year

-- misc --

$ Select
$<- Select
%in% Match a column with given values.
[ Subset
[[ Subset