HTML::TableExtract is a module that simplifies the extraction of information contained in tables within HTML documents.
Tables of note may be specified using Headers, Depth, Count, Attributes, or some combination of the three. See the module documentation for details.
You install HTML-TableExtract, as you would install any perl module library, by running these commands:
See HTML/TableExtract.pm for the code. See Changes for recent changes. POD style documentation is included in the module. This is normally converted to a manual page and installed as part of the "make install" process. You should also be able to use the 'perldoc' utility to extract and read documentation from the module directly.
There is also an example page available:
There is a mailing list available. To register or examine archived messages, go here:
Questions and comments may also be directed to Matt Sisk <email@example.com>
The package is available from CPAN:
The latest version is also available at:
Thanks to the following people for their generous bug catching, fault analysis, and suggestions: Celeste Suliin Burris, Jeff Casey, David Finberg, Michael Fowler, Robert Goff, Klaus Gottschalk, Daniel Griscom, Jeremy Howard, Martin Joost, Jeff Lewwid, Nicholas R. Markham, Julian Mehnle, Michael S. Muegel, Patrick Naubert, Jani Ollikainen, Wilson Snyder, Volker Stuerzl, Steve Wong, and Matt Zip.
Copyright (c) 1999-2005 Matthew P. Sisk. All rights reserved. All wrongs revenged. This program is free software; you can redistribute it and/or modify it under the same terms as Perl itself.