• Neural computation · Mar 2019

    The Information Bottleneck and Geometric Clustering.

    • D J Strouse and David J Schwab.
    • Department of Physics, Princeton University, Princeton, NJ 08544, U.S.A. danieljstrouse@gmail.com.
    • Neural Comput. 2019 Mar 1; 31 (3): 596-612.

    AbstractThe information bottleneck (IB) approach to clustering takes a joint distribution http://www.w3.org/1998/Math/MathML">P(X,Y) and maps the data http://www.w3.org/1998/Math/MathML">X to cluster labels http://www.w3.org/1998/Math/MathML">T , which retain maximal information about http://www.w3.org/1998/Math/MathML">Y (Tishby, Pereira, & Bialek, 1999 ). This objective results in an algorithm that clusters data points based on the similarity of their conditional distributions http://www.w3.org/1998/Math/MathML">P(YX) . This is in contrast to classic geometric clustering algorithms such as http://www.w3.org/1998/Math/MathML">k -means and gaussian mixture models (GMMs), which take a set of observed data points http://www.w3.org/1998/Math/MathML">{xi}i=1:N and cluster them based on their geometric (typically Euclidean) distance from one another. Here, we show how to use the deterministic information bottleneck (DIB) (Strouse & Schwab, 2017 ), a variant of IB, to perform geometric clustering by choosing cluster labels that preserve information about data point location on a smoothed data set. We also introduce a novel intuitive method to choose the number of clusters via kinks in the information curve. We apply this approach to a variety of simple clustering problems, showing that DIB with our model selection procedure recovers the generative cluster labels. We also show that, in particular limits of our model parameters, clustering with DIB and IB is equivalent to http://www.w3.org/1998/Math/MathML">k -means and EM fitting of a GMM with hard and soft assignments, respectively. Thus, clustering with (D)IB generalizes and provides an information-theoretic perspective on these classic algorithms.

      Pubmed     Full text   Copy Citation     Plaintext  

      Add institutional full text...

    Notes

     
    Knowledge, pearl, summary or comment to share?
    300 characters remaining
    help        
    You can also include formatting, links, images and footnotes in your notes
    • Simple formatting can be added to notes, such as *italics*, _underline_ or **bold**.
    • Superscript can be denoted by <sup>text</sup> and subscript <sub>text</sub>.
    • Numbered or bulleted lists can be created using either numbered lines 1. 2. 3., hyphens - or asterisks *.
    • Links can be included with: [my link to pubmed](http://pubmed.com)
    • Images can be included with: ![alt text](https://bestmedicaljournal.com/study_graph.jpg "Image Title Text")
    • For footnotes use [^1](This is a footnote.) inline.
    • Or use an inline reference [^1] to refer to a longer footnote elseweher in the document [^1]: This is a long footnote..

    hide…

Want more great medical articles?

Keep up to date with a free trial of metajournal, personalized for your practice.
1,694,794 articles already indexed!

We guarantee your privacy. Your email address will not be shared.