Dirichlet Process, Infinite Mixture Models, and Clustering (with program examples). $$ The Dirichlet process provides a very interesting approach to understand group assignments and models for clustering effects. Often time we encounter the k-means approach. However, it is necessary to have a fixed number of clusters. Often we encounter situations where we don’t know how many fixed clusters we need. Suppose we’re trying to identify groups of voters. We could use political partisanship (e.g. low/medium/high Democratic vote) but that may not necessary describe the data appropriately. If this is the case then we can turn to Bayesian nonparametrics and the Dirichlet Process and use some approaches there to solve this problem. Three in particular are commonly used as examples: the Chinese Restaurant Model, Pólya’s Urn, and Stick Breaking. http://statistical-research.com/dirichlet-process-infinite-mixture-models-and-clustering