
EM algorithm with weights starting with M-step for parameterized Gaussian mixture models
me.weighted.RdImplements the EM algorithm for fitting Gaussian mixture models parameterized by eigenvalue decomposition, when observations have weights, starting with the maximization step.
Usage
me.weighted(data, modelName, z, weights = NULL, prior = NULL,
control = emControl(), Vinv = NULL, warn = NULL, ...)Arguments
- data
A numeric vector, matrix, or data frame of observations. Categorical variables are not allowed. If a matrix or data frame, rows correspond to observations and columns correspond to variables.
- modelName
A character string indicating the model. The help file for
mclustModelNamesdescribes the available models.- z
A matrix whose
[i,k]th entry is an initial estimate of the conditional probability of the ith observation belonging to the kth component of the mixture.- weights
A vector of positive weights, where the
[i]th entry is the weight for the ith observation. If any of the weights are greater than one, then they are scaled so that the maximum weight is one.- prior
Specification of a conjugate prior on the means and variances. See the help file for
priorControlfor further information. The default assumes no prior.- control
A list of control parameters for EM. The defaults are set by the call
emControl.- Vinv
If the model is to include a noise term,
Vinvis an estimate of the reciprocal hypervolume of the data region. If set to a negative value or 0, the model will include a noise term with the reciprocal hypervolume estimated by the functionhypvol. The default is not to assume a noise term in the model through the settingVinv=NULL.- warn
A logical value indicating whether or not certain warnings (usually related to singularity) should be issued when the estimation fails. The default is set by
warnusingmclust.options.- ...
Catches unused arguments in indirect or list calls via
do.call.
Value
A list including the following components:
- modelName
A character string identifying the model (same as the input argument).
- z
A matrix whose
[i,k]th entry is the conditional probability of the ith observation belonging to the kth component of the mixture.- parameters
proA vector whose kth component is the mixing proportion for the kth component of the mixture model. If the model includes a Poisson term for noise, there should be one more mixing proportion than the number of Gaussian components.
meanThe mean for each component. If there is more than one component, this is a matrix whose kth column is the mean of the kth component of the mixture model.
varianceA list of variance parameters for the model. The components of this list depend on the model specification. See the help file for
mclustVariancefor details.VinvThe estimate of the reciprocal hypervolume of the data region used in the computation when the input indicates the addition of a noise component to the model.
- loglik
The log-likelihood for the estimated mixture model.
- bic
The BIC value for the estimated mixture model.
- Attributes:
"info"Information on the iteration."WARNING"An appropriate warning if problems are encountered in the computations.
Details
This is a more efficient version made available with mclust \(ge 6.1\) using Fortran code internally.
See also
me,
meE, ...,
meVVV,
em,
mstep,
estep,
priorControl,
mclustModelNames,
mclustVariance,
mclust.options
Examples
w = rexp(nrow(iris))
w = w/mean(w)
c(summary(w), sum = sum(w))
#> Min. 1st Qu. Median Mean 3rd Qu. Max.
#> 0.00206268 0.32520507 0.71293241 1.00000000 1.47179878 6.53900532
#> sum
#> 150.00000000
z = unmap(sample(1:3, size = nrow(iris), replace = TRUE))
MEW = me.weighted(data = iris[,-5], modelName = "VVV",
z = z, weights = w)
str(MEW,1)
#> List of 10
#> $ modelName : chr "VVV"
#> $ prior : NULL
#> $ n : int 150
#> $ d : int 4
#> $ G : int 3
#> $ z : num [1:150, 1:3] 2.89e-18 6.45e-12 7.86e-15 3.23e-12 2.31e-19 ...
#> ..- attr(*, "dimnames")=List of 2
#> $ parameters:List of 3
#> $ weights : num [1:150] 0.0267 0.0986 0.2817 0.029 0.429 ...
#> $ loglik : num -178
#> $ bic : num -275
#> - attr(*, "returnCode")= num 0