pyspark.mllib.classification.
LogisticRegressionWithSGD
Train a classification model for Binary Logistic Regression using Stochastic Gradient Descent.
New in version 0.9.0.
Deprecated since version 2.0.0: Use ml.classification.LogisticRegression or LogisticRegressionWithLBFGS.
Methods
train(data[, iterations, step, …])
train
Train a logistic regression model on the given data.
Methods Documentation
pyspark.RDD
The training data, an RDD of pyspark.mllib.regression.LabeledPoint.
pyspark.mllib.regression.LabeledPoint
The number of iterations. (default: 100)
The step parameter used in SGD. (default: 1.0)
Fraction of data to be used for each SGD iteration. (default: 1.0)
pyspark.mllib.linalg.Vector
The initial weights. (default: None)
The regularizer parameter. (default: 0.01)
The type of regularizer used for training our model. Supported values:
“l1” for using L1 regularization
“l2” for using L2 regularization (default)
None for no regularization
Boolean parameter which indicates the use or not of the augmented representation for training data (i.e., whether bias features are activated or not). (default: False)
Boolean parameter which indicates if the algorithm should validate data before training. (default: True)
A condition which decides iteration termination. (default: 0.001)