| Lingua-EN-Keywords documentation | view source | Contained in the Lingua-EN-Keywords distribution. |
Lingua::EN::Keywords - Automatically extracts keywords from text
use Lingua::EN::Keywords; my @keywords = keywords($text);
This is a very simple algorithm which removes stopwords from a
summarized version of a text (generated with Lingua::EN::Summarize)
and then counts up what it considers to be the most important
"keywords". The keywords subroutine returns a list of five keywords
in order of relevance.
This is pretty dumb. Don't expect any clever document categorization algorithms here, because you won't find them. But it does a reasonable job.
keywords subroutine.
Simon Cozens, simon@cpan.org
perl(1).
| Lingua-EN-Keywords documentation | view source | Contained in the Lingua-EN-Keywords distribution. |