Test::Spelling - check for spelling errors in POD files


Test-Spelling documentation Contained in the Test-Spelling distribution.

Index


Code Index:

NAME

Top

Test::Spelling - check for spelling errors in POD files

SYNOPSIS

Top

    use Test::More;
    BEGIN {
        plan skip_all => "Spelling tests only for authors"
            unless -d 'inc/.author';
    }

    use Test::Spelling;
    all_pod_files_spelling_ok();

DESCRIPTION

Top

Test::Spelling lets you check the spelling of a POD file, and report its results in standard Test::More fashion. This module requires a spellcheck program such as spell, aspell, ispell, or hunspell.

    use Test::Spelling;
    pod_file_spelling_ok('lib/Foo/Bar.pm', 'POD file spelling OK');

Note that it is a bad idea to run spelling tests during an ordinary CPAN distribution install, or in a package that will run in an uncontrolled environment. There is no way of predicting whether the word list or spellcheck program used will give the same results. You can include the test in your distribution, but be sure to run it only for authors of the module by guarding it in a skip_all unless -d 'inc/.author' clause, or by putting the test in your distribution's xt/ directory. Anyway, people installing your module really do not need to run such tests, as it is unlikely that the documentation will acquire typos while in transit. :-)

You can add your own stopwords, which are words that should be ignored by the spell check, like so:

    add_stopwords(qw(asdf thiswordiscorrect));

Adding stopwards in this fashion affects all files checked for the remainder of the test script. See Pod::Spell (which this module is built upon) for a variety of ways to add per-file stopwords to each .pm file.

If you have a lot of stopwords, it's useful to put them in your test file's DATA section like so:

    use Test::Spelling;
    add_stopwords(<DATA>);
    all_pod_files_spelling_ok();

    __END__
    folksonomy
    Jifty
    Zakirov

To maintain backwards compatibility, comment markers and some whitespace are ignored. In the near future, the preprocessing we do on the arguments to add_stopwords will be changed and documented properly.

FUNCTIONS

Top

all_pod_files_spelling_ok( [@files/@directories] )

Checks all the files for POD spelling. It gathers all_pod_files() on each file/directory, and declares a plan in Test::More for you (one test for each file), so you must not call plan yourself.

If @files is empty, the function finds all POD files in the blib directory if it exists, or the lib directory if it does not. A POD file is one that ends with .pod, .pl, .plx, or .pm; or any file where the first line looks like a perl shebang line.

If there is no working spellchecker (determined by has_working_spellchecker), this test will issue a "skip all" directive.

If you're testing a distribution, just create a t/pod-spell.t with the code in the SYNOPSIS.

Returns true if every POD file has correct spelling, or false if any of them fail. This function will show any spelling errors as diagnostics.

pod_file_spelling_ok( FILENAME[, TESTNAME ] )

pod_file_spelling_ok will test that the given POD file has no spelling errors.

When it fails, pod_file_spelling_ok will show any spelling errors as diagnostics.

The optional second argument TESTNAME is the name of the test. If it is omitted, pod_file_spelling_ok chooses a default test name "POD spelling for FILENAME".

all_pod_files( [@dirs] )

Returns a list of all the Perl files in each directory and its subdirectories, recursively. If no directories are passed, it defaults to blib if blib exists, or else lib if not. Skips any files in CVS or .svn directories.

A Perl file is:

* Any file that ends in .PL, .pl, .plx, .pm, .pod or .t.
* Any file that has a first line with a shebang and "perl" on it.

Furthermore, files for which the filter set by set_pod_file_filter return false are skipped. By default, this filter passes everything through.

The order of the files returned is machine-dependent. If you want them sorted, you'll have to sort them yourself.

add_stopwords(@words)

Add words that should be skipped by the spellcheck. Note that Pod::Spell already skips words believed to be code, such as everything in verbatim (indented) blocks and code marked up with ..., as well as some common Perl jargon.

has_working_spellchecker

has_working_spellchecker will return undef if there is no working spellchecker, or a true value (the spellchecker command itself) if there is. The module performs a dry-run to determine whether any of the spellcheckers it can will use work on the current system. You can use this to skip tests if there is no spellchecker. Note that all_pod_files_spelling_ok will do this for you.

set_spell_cmd($command)

If you want to force this module to use a particular spellchecker, then you can specify which one with set_spell_cmd. This is useful to ensure a more consistent lexicon between developers, or if you have an unusual environment. Any command that takes text from standard input and prints a list of misspelled words, one per line, to standard output will do.

set_pod_file_filter($code)

If your project has POD documents written in languages other than English, then obviously you don't want to be running a spellchecker on every Perl file. set_pod_file_filter lets you filter out files returned from all_pod_files (and hence, the documents tested by all_pod_files_spelling_ok).

    set_pod_file_filter(sub {
        my $filename = shift;
        return 0 if $filename =~ /_ja.pod$/; # skip Japanese translations
        return 1;
    });

SEE ALSO

Top

Pod::Spell

ORIGINAL AUTHOR

Top

Ivan Tubert-Brohman <itub@cpan.org>

Heavily based on Test::Pod by Andy Lester and brian d foy.

MAINTAINER

Top

Shawn M Moore <sartak@bestpractical.com>

COPYRIGHT

Top


Test-Spelling documentation Contained in the Test-Spelling distribution.

package Test::Spelling;
use 5.006;
use strict;
use warnings;

use base 'Exporter';
use Pod::Spell;
use Test::Builder;
use File::Spec;
use IPC::Open3;
use Symbol 'gensym';

our $VERSION = '0.14';

our @EXPORT = qw(
    pod_file_spelling_ok
    all_pod_files_spelling_ok
    add_stopwords
    set_spell_cmd
    all_pod_files
    set_pod_file_filter
    has_working_spellchecker
);

my $TEST = Test::Builder->new;

my $SPELLCHECKER;
my $FILE_FILTER = sub { 1 };

sub spellchecker_candidates {
    # if they've specified a spellchecker, use only that one
    return $SPELLCHECKER if $SPELLCHECKER;

    return (
        'spell', # for back-compat, this is the top candidate ...
        'aspell list -l en', # ... but this should become first soon
        'ispell -l',
        'hunspell -l',
    );
}

sub has_working_spellchecker {
    my $dryrun_results = _get_spellcheck_results("dry run", 1);

    if (ref $dryrun_results) {
        return;
    }

    return $SPELLCHECKER;
}

sub _get_spellcheck_results {
    my $document = shift;
    my $dryrun = shift;

    my @errors;

    for my $spellchecker (spellchecker_candidates()) {
        my @words;
        my $ok = eval {
            # IPC::Open3 says "If CHLD_ERR is false [...] then STDOUT and
            # STDERR of the child are on the same filehandle (this means that
            # an autovivified lexical cannot be used for the STDERR
            # filehandle [...])" - what a crummy API!
            my $child_error = gensym;

            my $pid = open3(my ($child_in, $spellcheck_results), $child_error, $spellchecker);

            print $child_in $document;

            # signal to spellchecker that we're done giving it words
            close $child_in or die $!;

            @words = <$spellcheck_results>;

            my $errors = do { local $/; <$child_error> };
            die "spellchecker had errors: $errors" if length $errors;

            # wait for spellchecker to clean up
            waitpid $pid, 0;

            1;
        };

        if ($ok) {
            # remember the one we used, so that it's consistent for all the files
            # this run, and we don't keep retrying the same spellcheckers that will
            # never work. also we need to expose the spellchecker we're using in
            # has_working_spellchecker
            set_spell_cmd($spellchecker)
                if !$SPELLCHECKER;
            return @words;
        }

        push @errors, "Unable to run '$spellchecker': $@";
    }

    # no working spellcheckers during a dry run
    return \"no spellchecker" if $dryrun;

    # no working spellcheckers; report all the errors
    require Carp;
    Carp::croak
        "Unable to find a working spellchecker:\n"
        . join("\n", map { "    $_\n" } @errors)
}

sub invalid_words_in {
    my $file = shift;

    my $document = '';
    open my $handle, '>', \$document;

    # save digested POD to the string $document
    my $checker = Pod::Spell->new;
    $checker->parse_from_file($file, $handle);

    my @words = _get_spellcheck_results($document);

    chomp for @words;
    return @words;
}

sub pod_file_spelling_ok {
    my $file = shift;
    my $name = shift || "POD spelling for $file";

    if (!-r $file) {
        $TEST->ok(0, $name);
        $TEST->diag("$file does not exist or is unreadable");
        return;
    }

    my @words = invalid_words_in($file);

    # remove stopwords, select unique errors
    @words = grep { !$Pod::Wordlist::Wordlist{$_} } @words;
    my %seen;
    @seen{@words} = ();
    @words = sort keys %seen;

    # emit output
    my $ok = @words == 0;
    $TEST->ok($ok, "$name");
    if (!$ok) {
        $TEST->diag("Errors:\n" . join '', map { "    $_\n" } @words);
    }

    return $ok;
}

sub all_pod_files_spelling_ok {
    my @files = all_pod_files(@_);

    if (!has_working_spellchecker()) {
        return $TEST->plan(skip_all => "no working spellchecker found");
    }

    $TEST->plan(tests => scalar @files);

    my $ok = 1;
    for my $file (@files) {
        pod_file_spelling_ok($file) or undef $ok;
    }
    return $ok;
}

sub all_pod_files {
    my @queue = @_ ? @_ : _starting_points();
    my @pod;

    while (@queue) {
        my $file = shift @queue;

        # recurse into subdirectories
        if (-d $file) {
            opendir(my $dirhandle, $file) or next;
            my @newfiles = readdir($dirhandle);
            closedir $dirhandle;

            @newfiles = File::Spec->no_upwards(@newfiles);
            @newfiles = grep { $_ ne "CVS" && $_ ne ".svn" } @newfiles;

            push @queue, map "$file/$_", @newfiles;
        }

        # add the file if it meets our criteria
        if (-f $file) {
            next unless _is_perl($file);
            next unless $FILE_FILTER->($file);
            push @pod, $file;
        }
    }

    return @pod;
}

sub _starting_points {
    return 'blib' if -d 'blib';
    return 'lib';
}

sub _is_perl {
    my $file = shift;

    return 1 if $file =~ /\.PL$/;
    return 1 if $file =~ /\.p(l|lx|m|od)$/;
    return 1 if $file =~ /\.t$/;

    open my $handle, '<', $file or return;
    my $first = <$handle>;

    return 1 if defined $first && ($first =~ /^#!.*perl/);

    return 0;
}

sub add_stopwords {
    for (@_) {
        # explicit copy so we don't modify constants as in add_stopwords("SQLite")
        my $word = $_;

        # XXX: the processing this performs is to support "perl t/spell.t 2>>
        # t/spell.t" which is bunk. in the near future the processing here will
        # become more modern
        $word =~ s/^#?\s*//;
        $word =~ s/\s+$//;
        next if $word =~ /\s/ or $word =~ /:/;
        $Pod::Wordlist::Wordlist{$word} = 1;
    }
}

sub set_spell_cmd {
    $SPELLCHECKER = shift;
}

sub set_pod_file_filter {
    $FILE_FILTER = shift;
}

1;

__END__