pyspark.ml.clustering.GaussianMixtureSummary
Gaussian mixture clustering results for a given model.
New in version 2.1.0.
Attributes
cluster
DataFrame of predicted cluster assignments (the prediction column only) for each training data point.
clusterSizes
Size of (number of data points in) each cluster.
featuresCol
Name for column of features in predictions.
k
The number of clusters the model was trained with.
logLikelihood
Total log-likelihood for this model on the given data.
numIter
Number of iterations.
predictionCol
Name for column of predicted clusters in predictions.
predictions
DataFrame produced by the model’s transform method.
probability
DataFrame of probabilities of each cluster for each training data point.
probabilityCol
Name for column of predicted probability of each cluster in predictions.
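The attributes above can be read from the summary attached to a fitted model. The following is a minimal sketch, not an official example: the helper names summarize_gmm and run_example, the toy data, and the local[1] master setting are all assumptions made for illustration. It assumes a working PySpark installation and that the fitted model reports hasSummary as true.

```python
def summarize_gmm(summary):
    """Collect the scalar attributes of a GaussianMixtureSummary into a dict.

    Hypothetical helper for illustration; it only reads the attributes
    listed in this section.
    """
    return {
        "k": summary.k,
        "clusterSizes": list(summary.clusterSizes),
        "logLikelihood": summary.logLikelihood,
        "numIter": summary.numIter,
        "featuresCol": summary.featuresCol,
        "predictionCol": summary.predictionCol,
        "probabilityCol": summary.probabilityCol,
    }


def run_example():
    """Fit a small mixture on toy data and inspect its training summary."""
    # Imported here so that the helper above can be used without Spark.
    from pyspark.ml.clustering import GaussianMixture
    from pyspark.ml.linalg import Vectors
    from pyspark.sql import SparkSession

    spark = (
        SparkSession.builder.master("local[1]")
        .appName("gmm-summary-sketch")
        .getOrCreate()
    )
    try:
        # Toy data (assumed): two loose groups of 2-D points.
        rows = [
            (Vectors.dense([0.0, 0.1]),),
            (Vectors.dense([0.2, -0.1]),),
            (Vectors.dense([-0.1, 0.3]),),
            (Vectors.dense([0.1, -0.2]),),
            (Vectors.dense([9.0, 9.2]),),
            (Vectors.dense([9.3, 8.9]),),
            (Vectors.dense([8.8, 9.1]),),
            (Vectors.dense([9.1, 8.7]),),
        ]
        df = spark.createDataFrame(rows, ["features"])

        model = GaussianMixture(k=2, seed=10, maxIter=50).fit(df)

        # The summary is only available on a model that has just been trained.
        if model.hasSummary:
            summary = model.summary
            print(summarize_gmm(summary))
            summary.predictions.show()   # full output of transform()
            summary.cluster.show()       # prediction column only
            summary.probability.show()   # probability column only
    finally:
        spark.stop()
```

Calling run_example() starts a local Spark session, fits the model, and prints the summary. The DataFrame attributes (predictions, cluster, probability) are left as DataFrames rather than collected, since on real training data they can be large.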
Attributes Documentation
logLikelihood
New in version 2.2.0.
numIter
New in version 2.4.0.