Skip to main content

Questions tagged [model-based-clustering]

Filter by
Sorted by
Tagged with
2 votes
0 answers
65 views

Estimating number of clusters using Scikit Bayesian GMM

I am generating clustering data using the Bayesian mixture of Gaussian models described in Bishop's Pattern Recognition and Machine Learning textbook, with model parameters drawn from the following ...
PJB's user avatar
  • 21
0 votes
0 answers
148 views

Expectation Maximization on Multivariate Gaussian Mixture Model for clustering

I have a dataset with 1000 observations and two features that define those N=1000 data points. Hence it is 1000*2 input matrix. I need to cluster them into k clusters. I am not understanding the E-M ...
Oindri's user avatar
  • 1
1 vote
1 answer
68 views

Cluster uncertainty in Linear Regression

Is it possible to introduce cluster probabilities into a regression? Consider the Old Faithful Geyser data set. Most clustering algorithms find 2 clusters when analysing eruption times and waiting ...
29703461's user avatar
2 votes
1 answer
152 views

PCA : how to cluster data to differenciate my data the most while considering their groups

I have to do a PCA in R for a project, but I have 300 data in 15 differents groups, and I want to find the reduced space which gives me the most variability between the groups and cluster my data in ...
Marguerite's user avatar
0 votes
1 answer
153 views

Model based clustering equivalent to K means?

Is it OK to say something like this: "A model-based clustering with a hard threshold is equivalent to a k means clustering"? One of my instructors stated this in his slides, I kind of doubt ...
Zhili Qiao's user avatar
1 vote
0 answers
64 views

Loss function definiton for relabelling

Taken from the appendix to the paper (Yongning Wang & Ruey S. Tsay) of this (2019) paper Clustering Multiple Time Series with Structural Breaks. Appendix to be downloaded her Appendix To fix label ...
user773674's user avatar
1 vote
0 answers
172 views

K-Means clustering technique for monthly data

I have an Unsupervised problem where user's Credit Card payment data is given for each month for various users for one year. One of the feature in the data having "User Id". For most of the ...
Archaeolexicologist's user avatar
0 votes
0 answers
676 views

EM algorithm for multivariate gaussian with diagonal covariance matrix

Ok so quick question. Say I need to use the EM-algorithm to estimate the parameters of a multivariate gaussian $$ f_{k}\left(x ; \theta_{k}\right)=\frac{1}{(2 \pi)^{P / 2}|V|} \exp \left(-\frac{1}{2}\...
Susan-l3p's user avatar
1 vote
0 answers
62 views

Which type of cluster analysis to perform?

I am trying to explain variability in the outcome, and understand factors that may be associated with different clusters? I am confused as to which clustering method to use to obtain 3 clusters as ...
user13514792's user avatar
1 vote
0 answers
126 views

Do Gaussian Mixture Models monotonically decrease the sum of squared distances when number of clusters increases?

I am comparing the clustering performance of two closely related machine learning methods: K-means and Gaussian Mixture Models (GMM). Part of this research is selecting the best number of clusters K. ...
Rinze Bloem's user avatar
1 vote
0 answers
566 views

Understanding and implementing the last step in the 3-Step Latent Class Analysis using covariates from Vermunt 2010

I am interested in implementing the 3-Step approach for LCA with covariates ($Z_i$) in R. According to Vermunt (2010), the "Standard" three-step approach would involve (mentioned in pages 5 ...
srikanth's user avatar
0 votes
0 answers
88 views

How to make inference on cluster-specific parameters in a Bayesian mixture model

Suppose I have a mixture model, for example of the kind $$ y_i \mid w, \{\theta_h\}, H \sim \sum_{h=1}^H w_h f(y_i \mid \theta_h) \\ P(H=h) = q_h \\ w \mid H \sim Dirichlet(\alpha) \\ \theta_1, \ldots,...
mariob6's user avatar
  • 550
1 vote
0 answers
645 views

Identifying inflexion point in elbow method (cluster analysis)

I am looking for the optimal number of clusters to conduct a cluster analysis and used the following code to determine it: ...
Catarina Toscano's user avatar
0 votes
0 answers
243 views

Clustering Highly Skewed and Segmented Data

So I am trying to cluster a dataset that looks like the following: I have tried K-Means and GMM, which give me horrible results. I have tried DBSCAN, which was okay, but it is difficult to choose the ...
The Dude's user avatar
  • 111
4 votes
0 answers
805 views

Can clustering with Gaussian mixture models be done based on cosine similarity?

Apologies if this has already been answered; I found some similar posts (here and here) but don't feel they answered the specific question I have. Please feel free to correct any misunderstandings in ...
phamilton's user avatar

15 30 50 per page
1
2 3 4 5