Artykuł w czasopiśmie
Brak miniatury
Licencja

ClosedAccessDostęp zamknięty
 

Analysis of the Maximal a Posteriori Partition in the Gaussian Dirichlet Process Mixture Model

Uproszczony widok
cris.lastimport.scopus2024-02-12T20:46:26Z
dc.abstract.enMixture models are a natural choice in many applications, but it can be difficult to place an a priori upper bound on the number of components. To circumvent this, investigators are turning increasingly to Dirichlet process mixture models (DPMMs). It is therefore important to develop an understanding of the strengths and weaknesses of this approach. This work considers the MAP (maximum a posteriori) clustering for the Gaussian DPMM (where the cluster means have Gaussian distribution and, for each cluster, the observations within the cluster have Gaussian distribution). Some desirable properties of the MAP partition are proved: ‘almost disjointness’ of the convex hulls of clusters (they may have at most one point in common) and (with natural assumptions) the comparability of sizes of those clusters that intersect any fixed ball with the number of observations (as the latter goes to infinity). Consequently, the number of such clusters remains bounded. Furthermore, if the data arises from independent identically distributed sampling from a given distribution with bounded support then the asymptotic MAP partition of the observation space maximises a function which has a straightforward expression, which depends only on the within-group covariance parameter. As the operator norm of this covariance parameter decreases, the number of clusters in the MAP partition becomes arbitrarily large, which may lead to the overestimation of the number of mixture components.
dc.affiliationUniwersytet Warszawski
dc.contributor.authorRajkowski, Łukasz
dc.date.accessioned2024-01-24T16:34:07Z
dc.date.available2024-01-24T16:34:07Z
dc.date.issued2019
dc.description.financeNie dotyczy
dc.description.number2
dc.description.volume14
dc.identifier.doi10.1214/18-BA1114
dc.identifier.issn1936-0975
dc.identifier.urihttps://repozytorium.uw.edu.pl//handle/item/100638
dc.languageeng
dc.relation.ispartofBayesian Analysis
dc.relation.pages477-494
dc.rightsClosedAccess
dc.sciencecloudnosend
dc.subject.enDirichlet process mixture models Chinese Restaurant Process
dc.titleAnalysis of the Maximal a Posteriori Partition in the Gaussian Dirichlet Process Mixture Model
dc.typeJournalArticle
dspace.entity.typePublication