Cluster-Similarity version 0.0.1

Computes the similarity of two word clusterings using several clustering similarity measures.

Consider for eg. the following groupings:

clustering_1: { {a, b, c}, {d, e, f} }
clustering_2: { {a, b}, {c, d, e}, {f} }

Cluster similarity measures provide a numerical value helping to assess the alikeness of these two groupings.

All cluster similarity measures implemented in this module are based on the so-called contingency table of the two classifications (clusterings). The contingency table is a matrix with a cell for each pair of classes (one from each classification), containing the number of objects present in both classes.

The similarity measures are taken from Chapter 4 of Susanne Schulte im Walde's Phd thesis:

Sabine Schulte im Walde. Experiments on the Automatic Induction of German Semantic Verb Classes. PhD thesis, Institut für Maschinelle Sprachverarbeitung, Universität Stuttgart, 2003. Published as AIMS Report 9(2).

INSTALLATION

To install this module, run the following commands:

        perl Makefile.PL
        make
        make test
        make install

SUPPORT AND DOCUMENTATION

After installing, you can find documentation for this module with the perldoc command.

perldoc Cluster::Similarity

You can also look for information at:

RT, CPAN's request tracker

http://rt.cpan.org/NoAuth/Bugs.html?Dist=Cluster-Similarity

AnnoCPAN, Annotated CPAN documentation

http://annocpan.org/dist/Cluster-Similarity

CPAN Ratings

http://cpanratings.perl.org/d/Cluster-Similarity

Search CPAN

http://search.cpan.org/dist/Cluster-Similarity/

COPYRIGHT AND LICENCE

Copyright (C) 2008 Ingrid Falk

This program is free software; you can redistribute it and/or modify it under the same terms as Perl itself.