NAME

File::Find::Duplicates - Find duplicate files

SYNOPSIS

      use File::Find::Duplicates;

      my @dupes = find_duplicate_files('/basedir1', '/basedir2');

      foreach my $dupeset (@dupes) {
        printf "Files %s (of size %d) hash to %s\n",
          join(", ", @{$dupeset->files}), $dupeset->size, $dupeset->md5;
      }

DESCRIPTION

This module finds duplicate files under one or more directories on your system, reporting each set of identical files together with their size and MD5 sum.
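
Each result carries both a size and an MD5 sum. One common way to arrive at such sets is to group files by size first and then compare MD5 digests only within each size group. The sketch below illustrates that approach under stated assumptions; it is not necessarily how this module is implemented. The helper name sketch_find_duplicates is hypothetical, and only the core modules File::Find and Digest::MD5 are used.

```perl
use strict;
use warnings;
use File::Find ();
use Digest::MD5 ();

# Hypothetical helper, NOT part of File::Find::Duplicates.
# Returns a list of array references; each one holds the paths of
# files that share the same size and the same MD5 digest.
sub sketch_find_duplicates {
    my @dirs = @_;

    # Pass 1: group regular files by size. Files of different sizes
    # cannot be duplicates, so this avoids hashing most files.
    my %by_size;
    File::Find::find(
        {
            no_chdir => 1,    # keep $_ as the full path
            wanted   => sub {
                return if -l $_;        # skip symbolic links
                return unless -f $_;    # regular files only
                my $size = (-s _) || 0;
                push @{ $by_size{$size} }, $_;
            },
        },
        @dirs
    );

    # Pass 2: within each size group, hash contents and group by digest.
    my @sets;
    for my $candidates (values %by_size) {
        next if @$candidates < 2;

        my %by_md5;
        for my $path (@$candidates) {
            open my $fh, '<', $path or next;    # skip unreadable files
            binmode $fh;
            my $digest = Digest::MD5->new->addfile($fh)->hexdigest;
            close $fh;
            push @{ $by_md5{$digest} }, $path;
        }
        push @sets, grep { @$_ > 1 } values %by_md5;
    }
    return @sets;
}
```

Grouping by size first is a design choice that keeps the expensive step (reading and hashing file contents) limited to files that could plausibly match.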

FUNCTIONS

find_duplicate_files

      my @dupes = find_duplicate_files('/basedir1', '/basedir2');

When passed a base directory (or a list of such directories), it returns a list of objects, one per set of duplicate files, each with the following methods:

files
A listref of the names of the duplicate files.

size
The size of each file in the set (all duplicates in a set have the same size).

md5
The MD5 sum shared by the duplicate files.
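
As an illustration of how the files and size methods combine, the sketch below estimates how much space the redundant copies occupy. The helper name redundant_size is hypothetical and not part of this module, and it assumes that size reports a per-file size in bytes.

```perl
use strict;
use warnings;

# Hypothetical helper, NOT part of File::Find::Duplicates.
# Takes objects that respond to ->files (a listref of names) and
# ->size (assumed here to be the per-file size in bytes), and returns
# the total size of all copies beyond the first in each set.
sub redundant_size {
    my @dupesets = @_;
    my $total = 0;
    for my $dupeset (@dupesets) {
        my $copies = scalar @{ $dupeset->files };
        $total += $dupeset->size * ($copies - 1);    # keep one copy per set
    }
    return $total;
}
```

With the module loaded, this could be called as redundant_size(find_duplicate_files('/basedir1', '/basedir2')).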

TODO

Optionally check the contents of archives (tar files, zip files, etc.) to ensure that none of their members also exist elsewhere.

SEE ALSO

File::Find.

AUTHOR

Tony Bowden

BUGS and QUERIES

Please direct all correspondence regarding this module to: bug-File-Find-Duplicates@rt.cpan.org

COPYRIGHT AND LICENSE

Copyright (C) 2001-2005 Tony Bowden.

      This program is free software; you can redistribute it and/or modify it under
      the terms of the GNU General Public License; either version 2 of the License,
      or (at your option) any later version.

      This program is distributed in the hope that it will be useful, but WITHOUT
      ANY WARRANTY; without even the implied warranty of MERCHANTABILITY or FITNESS
      FOR A PARTICULAR PURPOSE.