Goldszmidt, Moises and Sahami, Mehran (1998) A Probabilistic Approach to Full-Text Document Clustering. Technical Report. Stanford InfoLab.