String::Downgrade::Amharic - Generate Acceptable Spellings from Canonical.


String-Downgrade documentation  | view source Contained in the String-Downgrade distribution.

Index


NAME

Top

String::Downgrade::Amharic - Generate Acceptable Spellings from Canonical.

SYNOPSIS

Top

  #
  #  OO Style:
  #
  use utf8;
  require String::Downgrade::Amharic;

  my $string = new String::Downgrade::Amharic;

  my @list = $string->downgrade ( "እግዚአብሔር" );

  my $count = 0;
  foreach (@list) {
      $count++;
      print "$count: $_\n";
  }




  #
  #  Functional Style:
  #
  use utf8;
  use String::Downgrade::Amharic;

  my @list = downgrade ( "እግዚአብሔር" );

  :
  :
  :

DESCRIPTION

Top

Under the "three levels of Amharic spelling" theory, the String::Downgrade::Amharic package will take a canonical word (level one) and generate level two words (the level of popular use). The first member of the returned array is the original string. The last member of the returned array is a regular expression that will match all renderings of the list.

The doc/index.html file presents a development of the downgrade rules applied.

The package is useful for some problems, it will produce orthographically "legal" simplification and avoids improbable naive simplifications. Text::Metaphone::Amharic of course over simplifies as it addresses a different problem. So while not to promote level 2 orthographies, in some instances it is useful to generate level 2 renderings given a canonical form.

You must start with the canonical spelling of a word as only downgrades can occur. Starting with a near canonical form and downgrading will generate a shorter word list than you would have starting from the top.

REQUIRES

Top

Regexp::Ethiopic (which rules btw).

COPYRIGHT

Top

BUGS

Top

None presently known.

AUTHOR

Top

Daniel Yacob, dyacob@cpan.org

SEE ALSO

Top

Text::Metaphone::Amharic


String-Downgrade documentation  | view source Contained in the String-Downgrade distribution.