$Header: /cvsroot/unac/Text-Unaccent/README,v 1.9 2004/10/17 19:00:36 loic Exp $
Text::Unaccent is a module that provides functions to remove accents from a string. For instance the string été will become ete. The charset of the input string is specified as an argument. The input is converted to UTF-16 using iconv(3), accents are stripped and the result is converted back to the original charset. The iconv -l command on GNU/Linux will show all charset supported.
Text-Unaccent-1.08 has the same unac.[ch] files than unac-1.8.0
Try man Text::Unaccent
For OS that are not GNU/Linux we recommend to use the iconv library provided by Bruno Haible <haible@clisp.cons.org> at ftp://ftp.gnu.org/pub/gnu/libiconv/libiconv-1.9.2.tar.gz
perl Makefile.PL
make all
make test
make install
use Text::Unaccent;
unac_string("ISO-8859-1", "été") => "ete"
The main distribution site is http://gna.org/projects/unac/
Isolate the problem, run a test case with unac_debug(UNAC_DEBUG_HIGH) and report the result to unac-devel@mail.freesoftware.fsf.org. If you feel like fixing the problem yourself, use the C library package located at http://www.senga.org/unac/. The unac.[ch] files are copied from the C library package but the tools used to generate part of their content (the translation tables) are missing in the Perl package.
Loic Dachary
loic@gnu.org