| Data-FormValidator-Constraints-Words documentation | Contained in the Data-FormValidator-Constraints-Words distribution. |
Data::FormValidator::Constraints::Words - Data constraints for word inputs.
use Data::FormValidator::Constraints::Words;

my $rv = Data::FormValidator->check(\%input, {
    real_name    => realname(),
    basic_words  => basicwords(),
    simple_words => simplewords(),
    print_safe   => printsafe(),
    paragraph    => paragraph(),
});

# or, use the regular functions
my $rv = Data::FormValidator->check(\%input, {
    comments => sub {
        my($dfv, $value) = @_;
        return $dfv->match_paragraph($value);
    }
});
Data::FormValidator::Constraints::Words provides several methods that
can be used to generate constraint closures for use with Data::FormValidator
for the purpose of validating textual input.
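The closure-generating pattern described above can be sketched without Data::FormValidator installed. This is a minimal sketch, not the module's code: My::MockResults is a hypothetical stand-in for the results object that a constraint closure receives, and realname_sketch and $NAME_CLASS are illustrative names with an assumed, reduced character set.

```perl
use strict;
use warnings;

# Hypothetical stand-in for the results object handed to a constraint
# closure; it only records the constraint name it is given.
package My::MockResults;

sub new {
    my ($class) = @_;
    return bless { name => undef }, $class;
}

sub set_current_constraint_name {
    my ($self, $name) = @_;
    $self->{name} = $name;
    return;
}

sub get_current_constraint_name {
    my ($self) = @_;
    return $self->{name};
}

package main;

# Assumed class body for illustration, kept as a plain string so it can be
# interpolated into [...] unchanged.
my $NAME_CLASS = q/\-\s\w.,\'/;

# Generator: returns a closure that names the constraint, then validates.
sub realname_sketch {
    return sub {
        my ($dfv, $value) = @_;
        $dfv->set_current_constraint_name('realname');
        return (defined $value && $value =~ /^[$NAME_CLASS]+$/) ? 1 : 0;
    };
}

my $results    = My::MockResults->new;
my $constraint = realname_sketch();

print $constraint->($results, "Anne-Marie O'Neil") ? "ok\n" : "rejected\n";
print $constraint->($results, 'Bob <script>')      ? "ok\n" : "rejected\n";
```

Returning a closure rather than a plain result lets the generator be called once in the profile while the validation itself runs later, once per submitted value.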
The methods below refer to several character code ranges. The following table is a quick guide to what those ranges represent:
Dec      Oct      Hex    Description
---------------------------------------------------------
32-47    040-057  20-2F  ASCII symbols
48-57    060-071  30-39  ASCII numerals
58-64    072-100  3A-40  ASCII symbols
65-90    101-132  41-5A  ASCII uppercase alphabetics
91-96    133-140  5B-60  ASCII symbols
97-122   141-172  61-7A  ASCII lowercase alphabetics
123-126  173-176  7B-7E  ASCII symbols
128-159  200-237  80-9F  Extended symbols (unsupported in HTML4 standard)
160-191  240-277  A0-BF  Extended symbols
192-255  300-377  C0-FF  Extended alphabetics
The above table is based on the ISO Latin 1 (ISO 8859-1) set of encodings. Characters in the range 128-159 have no corresponding HTML entity encodings and are considered control characters in the ISO Latin 1 character set. If you wish to override these settings, subclass this module and set the appropriate values for the following regular expression settings:
$REALNAME = qr/\-\s\w.,\'\xC0-\xFF/;
$BASICWORDS = qr/\-\s\w.,\'\"&;:\?\#\xC0-\xFF/;
$SIMPLEWORDS = qr/\-\s\w.,\'\"&;:\?\#~\+=\(\)\[\]\{\}<>\/!\xC0-\xFF/;
$PRINTSAFE = qr/\s\x20-\x7E\xA0-\xFF/;
$PARAGRAPH = qr/\s\x20-\x7E\xA0-\xFF/;
Note that these are used within a character class, so characters such as '-' must be escaped.
Although here PRINTSAFE and PARAGRAPH are the same, they may not be when subclassed.
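Because each setting is interpolated into a bracketed character class, both the escaping and the way the value is stored are worth checking when choosing override values. The following is a minimal sketch under stated assumptions, not the module's code: $class_body, $compiled and accepts are illustrative names. It keeps the class body as a plain string. The last lines show what happens if the same body is compiled with qr// first: the compiled pattern stringifies with a "(?" prefix and a ")" suffix, and those wrapper characters become members of the class when interpolated.

```perl
use strict;
use warnings;

# Illustrative class body: '-' is escaped so it is a literal hyphen rather
# than a range operator, while \xC0-\xFF is an intentional range.
my $class_body = q/\-\s\w.,\'\xC0-\xFF/;

# True when every character of $word belongs to the class built from $body.
sub accepts {
    my ($body, $word) = @_;
    return (defined $word && $word =~ /^[$body]+$/) ? 1 : 0;
}

print accepts($class_body, 'Jean-Luc'), "\n";   # hyphen is in the class
print accepts($class_body, 'a(b)'), "\n";       # parentheses are not

# The same body compiled with qr//: its stringified form carries wrapper
# characters that end up inside the brackets.
my $compiled = qr/\-\s\w.,\'\xC0-\xFF/;
print accepts($compiled, 'a(b)'), "\n";
```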
The realname methods allow characters commonly used within a person's name. Acceptable characters must match the $REALNAME regular expression. In addition, match_realname collapses runs of whitespace to a single space, strips unacceptable characters and truncates the result to 128 characters.
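The match_realname routine in the module's source normalises its input rather than simply accepting or rejecting it. A minimal standalone sketch of the same three steps follows; clean_realname and $NAME_BODY are illustrative names, and the class body is held as a plain string here as an assumption of this sketch.

```perl
use strict;
use warnings;

# Assumed class body, mirroring the documented $REALNAME setting.
my $NAME_BODY = q/\-\s\w.,\'\xC0-\xFF/;

# Normalise a name: collapse whitespace, drop disallowed characters,
# then cap the length at 128 characters.
sub clean_realname {
    my ($word) = @_;
    return unless defined $word;
    $word =~ s/\s+/ /g;             # runs of whitespace become one space
    $word =~ s/[^$NAME_BODY]+//g;   # remove anything outside the class
    return substr $word, 0, 128;    # keep at most 128 characters
}

print clean_realname("Anne-Marie   O'Neil!"), "\n";
print length(clean_realname('a' x 200)), "\n";
```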
The basicwords methods allow a restricted character set to match simple strings, such as reference codes. Acceptable characters must match the $BASICWORDS regular expression.
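As a rough illustration of the all-or-nothing matching used for basic words, the sketch below returns the whole string when every character is acceptable and undef otherwise, with a boolean form derived from that result. The names match_basic, valid_basic and $BASIC_BODY are hypothetical, and the class body is assumed to be a plain string mirroring the documented setting.

```perl
use strict;
use warnings;

# Assumed class body, mirroring the documented $BASICWORDS setting.
my $BASIC_BODY = q/\-\s\w.,\'\"&;:\?\#\xC0-\xFF/;

# All-or-nothing match: the whole string when every character is
# acceptable, otherwise undef.
sub match_basic {
    my ($word) = @_;
    return undef unless defined $word;
    return $word =~ /^([$BASIC_BODY]+)$/ ? $1 : undef;
}

# Boolean form derived from the match routine.
sub valid_basic {
    my ($word) = @_;
    return defined(match_basic($word)) ? 1 : 0;
}

print valid_basic('REF-2008: item #42'), "\n";   # only allowed characters
print valid_basic('a+b=c'), "\n";                # '+' and '=' are not allowed
```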
The simplewords methods allow commonly used characters within simple text box input, such as for titles. Acceptable characters must match the $SIMPLEWORDS regular expression.
The printsafe methods restrict characters to those non-control characters within the character set. Acceptable characters must match the $PRINTSAFE regular expression.
The paragraph methods allow for a larger range of characters that would be expected to appear in a textarea input, such as a news story or a review. Acceptable characters must match the $PARAGRAPH regular expression.
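The printsafe and paragraph settings share the same default range, so one sketch can illustrate both validating and filtering against it. The names is_printsafe, strip_unsafe and $SAFE_BODY are hypothetical, and stripping every disallowed run (the /g flag) is a choice made for this sketch rather than a statement about the module's behaviour.

```perl
use strict;
use warnings;

# Assumed class body, mirroring the documented $PRINTSAFE/$PARAGRAPH
# setting: whitespace, printable ASCII, and the Latin 1 range A0-FF.
my $SAFE_BODY = q/\s\x20-\x7E\xA0-\xFF/;

# Validation: accept only when every character is in the class.
sub is_printsafe {
    my ($text) = @_;
    return (defined $text && $text =~ /^[$SAFE_BODY]+$/) ? 1 : 0;
}

# Filtering: remove every run of characters outside the class.
sub strip_unsafe {
    my ($text) = @_;
    return unless defined $text;
    $text =~ s/[^$SAFE_BODY]+//g;
    return $text;
}

print is_printsafe("Line one.\nLine two: 100% <ok>"), "\n";
print is_printsafe("bell\x07"), "\n";
print strip_unsafe("a\x07b\x00c"), "\n";
```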
Although Data-FormValidator is not a dependency, it is expected that this module will be used as part of DFV's constraint framework.
This module was originally written as part of the Labyrinth website management tool.
Barbie, <barbie@missbarbell.co.uk> for Miss Barbell Productions, http://www.missbarbell.co.uk/
Copyright (C) 2002-2008 Barbie for Miss Barbell Productions All Rights Reserved. This module is free software; you can redistribute it and/or modify it under the same terms as Perl itself.
package Data::FormValidator::Constraints::Words;

use strict;
use warnings;

use vars qw($VERSION $AUTOLOAD $REALNAME $BASICWORDS $SIMPLEWORDS $PRINTSAFE $PARAGRAPH);
$VERSION = '0.04';
#----------------------------------------------------------------------------
# Exporter Settings

require Exporter;
use vars qw($VERSION @ISA @EXPORT);

@ISA = qw(Exporter);
@EXPORT = qw(
    realname     valid_realname     match_realname
    basicwords   valid_basicwords   match_basicwords
    simplewords  valid_simplewords  match_simplewords
    printsafe    valid_printsafe    match_printsafe
    paragraph    valid_paragraph    match_paragraph
);

#----------------------------------------------------------------------------
# Variables
$REALNAME    = qr/\-\s\w.,\'\xC0-\xFF/;
$BASICWORDS  = qr/\-\s\w.,\'\"&;:\?\#\xC0-\xFF/;
$SIMPLEWORDS = qr/\-\s\w.,\'\"&;:\?\#~\+=\(\)\[\]\{\}<>\/!\xC0-\xFF/;
$PRINTSAFE   = qr/\s\x20-\x7E\xA0-\xFF/;
$PARAGRAPH   = qr/\s\x20-\x7E\xA0-\xFF/;

#----------------------------------------------------------------------------
# Subroutines
sub realname {
    return sub {
        my ($self,$word) = @_;
        $self->set_current_constraint_name('realname');
        $self->valid_realname($word);
    }
}

sub valid_realname {
    my ($self,$word) = @_;
    $word =~ m< ^( [$REALNAME]+ )$ >x ? 1 : 0;
}

sub match_realname {
    my ($self,$word) = @_;
    return unless defined $word;
    $word =~ s/\s+/ /g;
    $word =~ s/[^$REALNAME]+//g;
    return substr $word, 0, 128;
}
sub basicwords {
    return sub {
        my ($self,$word) = @_;
        $self->set_current_constraint_name('basicwords');
        $self->valid_basicwords($word);
    }
}

sub match_basicwords {
    my ($self,$word) = @_;
    $word =~ m< ^( [$BASICWORDS]+ )$ >x ? $1 : undef;
}
sub simplewords {
    return sub {
        my ($self,$word) = @_;
        $self->set_current_constraint_name('simplewords');
        $self->valid_simplewords($word);
    }
}

sub match_simplewords {
    my ($self,$word) = @_;
    $word =~ m< ^( [$SIMPLEWORDS]+ )$ >x ? $1 : undef;
}
sub printsafe {
    return sub {
        my ($self,$word) = @_;
        $self->set_current_constraint_name('printsafe');
        $self->valid_printsafe($word);
    }
}

sub valid_printsafe {
    my ($self,$word) = @_;
    $word =~ m< ^( [$PRINTSAFE]+ )$ >x ? 1 : 0;
}

sub match_printsafe {
    my ($self,$word) = @_;
    return unless defined $word;
    $word =~ s/[^$PRINTSAFE]+//;
    return $word || undef;
}
sub paragraph {
    return sub {
        my ($self,$word) = @_;
        $self->set_current_constraint_name('paragraph');
        $self->valid_paragraph($word);
    }
}

sub match_paragraph {
    my ($self,$word) = @_;
    $word =~ m< ^( [$PARAGRAPH]+ )$ >x ? $1 : undef;
}

sub AUTOLOAD {
    my $name = $AUTOLOAD;

    no strict qw/refs/;

    $name =~ m/^(.*::)(valid_)(.*)/;
    my ($pkg,$prefix,$sub) = ($1,$2,$3);

    # All non-defined valid_* routines are essentially identical to their
    # match_* counterpart, we're going to generate them dynamically from
    # the appropriate match_* routine.
    if ((defined $prefix) and ($prefix eq 'valid_')) {
        return defined &{$pkg.'match_' . $sub}(@_);
    }
}

1;

__END__