1.6.3
Decision trees - spark.ml
This section has been moved into the classification and regression section.