CPAN
Home  Documentation  Recent  Preferences  Modules  Distributions    Authors   
Find    in      

Distributions     > >     H     > >     HTML     > >     HTML-Content-Extractor
Distribution HTML-Content-Extractor [Download]
Author JTAVERNI [ Jean Tavernier ]
Version 0.01
Abstract Perl module for extracting content from HTML documents.
Released 22 Aug 2005
Size 27.4 KB
MD5 Checksum 2226c2f443e96ed463f0dcc0f408fd7e
Additional Files README   |   Changes   |   Makefile.PL   |  
Links search.cpan.org   |   CPAN::Forum  |   AnnoCPAN  |   rt.cpan.org  |   Rating  | CPANTS  | CPAN testers | Dependencies | Testers matrix

Modules

HTML::Content::ContentExtractor  [source]   [v 0.01] Perl module for extracting content from HTML documents.
HTML::Content::HTMLTokenizer  [source] Perl module to tokenize HTML documents.
HTML::Content::TokeParserTokenizer [source]
HTML::WordTagRatio::ExponentialRatio  [source] Perl module for determining the ratio of words to tags in a range of tokens in an HTML document.
HTML::WordTagRatio::NormalizedRatio  [source] Perl module for determining the ratio of words to tags in a range of tokens in an HTML document.
HTML::WordTagRatio::Ratio  [source] Default module for determining the ratio of words to tags in a range of tokens in an HTML document.
HTML::WordTagRatio::RelativeRatio  [source] Perl module for determining the ratio of words to tags in a range of tokens in an HTML document.
HTML::WordTagRatio::SmoothedRatio  [source] Default module for determining the ratio of words to tags in a range of tokens in an HTML document.
HTML::WordTagRatio::WeightedRatio  [source] Perl module for determining the ratio of words to tags in a range of tokens in an HTML document.

Categories

World Wide Web HTML HTTP CGI    >>     HTML

Win32 PPM packages for "HTML-Content-Extractor"

ActiveState default Perl 5.8 repository   [  v 0.01   ]

Problems, suggestions, or comments to Randy Kobes. Questions? Check the FAQ.
Enable installations using PAR::WebStart.