Plucene::SearchEngine::Index::HTML - Backend for simply parsing HTML


Plucene-SearchEngine documentation  | view source Contained in the Plucene-SearchEngine distribution.

Index


NAME

Top

Plucene::SearchEngine::Index::HTML - Backend for simply parsing HTML

DESCRIPTION

Top

This backend analysis a HTML file for the following Plucene fields:

text

The text part of the HTML

A list of links in the HTML

Additionally, any META tags are turned into Plucene fields.


Plucene-SearchEngine documentation  | view source Contained in the Plucene-SearchEngine distribution.