Questions tagged [model-based-clustering]
The model-based-clustering tag has no summary.
64 questions
2
votes
0
answers
65
views
Estimating number of clusters using Scikit Bayesian GMM
I am generating clustering data using the Bayesian mixture of Gaussian models described in Bishop's Pattern Recognition and Machine Learning textbook, with model parameters drawn from the following ...
0
votes
0
answers
148
views
Expectation Maximization on Multivariate Gaussian Mixture Model for clustering
I have a dataset with 1000 observations and two features that define those N=1000 data points. Hence it is 1000*2 input matrix. I need to cluster them into k clusters.
I am not understanding the E-M ...
1
vote
1
answer
68
views
Cluster uncertainty in Linear Regression
Is it possible to introduce cluster probabilities into a regression?
Consider the Old Faithful Geyser data set. Most clustering algorithms find 2 clusters when analysing eruption times and waiting ...
2
votes
1
answer
152
views
PCA : how to cluster data to differenciate my data the most while considering their groups
I have to do a PCA in R for a project, but I have 300 data in 15 differents groups, and I want to find the reduced space which gives me the most variability between the groups and cluster my data in ...
0
votes
1
answer
153
views
Model based clustering equivalent to K means?
Is it OK to say something like this: "A model-based clustering with a hard threshold is equivalent to a k means clustering"? One of my instructors stated this in his slides, I kind of doubt ...
1
vote
0
answers
64
views
Loss function definiton for relabelling
Taken from the appendix to the paper (Yongning Wang & Ruey S. Tsay) of this (2019) paper Clustering Multiple Time Series with Structural Breaks. Appendix to be downloaded her Appendix
To fix label ...
1
vote
0
answers
172
views
K-Means clustering technique for monthly data
I have an Unsupervised problem where user's Credit Card payment data is given for each month for various users for one year. One of the feature in the data having "User Id". For most of the ...
0
votes
0
answers
676
views
EM algorithm for multivariate gaussian with diagonal covariance matrix
Ok so quick question.
Say I need to use the EM-algorithm to estimate the parameters of a multivariate gaussian
$$
f_{k}\left(x ; \theta_{k}\right)=\frac{1}{(2 \pi)^{P / 2}|V|} \exp \left(-\frac{1}{2}\...
1
vote
0
answers
62
views
Which type of cluster analysis to perform?
I am trying to explain variability in the outcome, and understand factors that may be associated with different clusters? I am confused as to which clustering method to use to obtain 3 clusters as ...
1
vote
0
answers
126
views
Do Gaussian Mixture Models monotonically decrease the sum of squared distances when number of clusters increases?
I am comparing the clustering performance of two closely related machine learning methods: K-means and Gaussian Mixture Models (GMM). Part of this research is selecting the best number of clusters K. ...
1
vote
0
answers
566
views
Understanding and implementing the last step in the 3-Step Latent Class Analysis using covariates from Vermunt 2010
I am interested in implementing the 3-Step approach for LCA with covariates ($Z_i$) in R. According to Vermunt (2010), the "Standard" three-step approach would involve (mentioned in pages 5 ...
0
votes
0
answers
88
views
How to make inference on cluster-specific parameters in a Bayesian mixture model
Suppose I have a mixture model, for example of the kind
$$
y_i \mid w, \{\theta_h\}, H \sim \sum_{h=1}^H w_h f(y_i \mid \theta_h) \\
P(H=h) = q_h \\
w \mid H \sim Dirichlet(\alpha) \\
\theta_1, \ldots,...
1
vote
0
answers
645
views
Identifying inflexion point in elbow method (cluster analysis)
I am looking for the optimal number of clusters to conduct a cluster analysis and used the following code to determine it:
...
0
votes
0
answers
243
views
Clustering Highly Skewed and Segmented Data
So I am trying to cluster a dataset that looks like the following:
I have tried K-Means and GMM, which give me horrible results. I have tried DBSCAN, which was okay, but it is difficult to choose the ...
4
votes
0
answers
805
views
Can clustering with Gaussian mixture models be done based on cosine similarity?
Apologies if this has already been answered; I found some similar posts (here and here) but don't feel they answered the specific question I have. Please feel free to correct any misunderstandings in ...