| Lingua-ZH-ZhuYinWen documentation | view source | Contained in the Lingua-ZH-ZhuYinWen distribution. |
Lingua::ZH::ZhuYinWen - ZhuYinWen Bastardizer
use Lingua::ZH::ZhuYinWen qw(bastardize);
bastardize($text);
In Taiwan, Mandarin pronunciation is traditionally annotated with a system called ZhuYinFuHao, which consists of 37 phonetic symbols. It is not used in formal writing and serves mainly to help children learn Mandarin. Because of the spread of the internet, and because complex Chinese input methods frustrate people who are not adept at the keyboard, more and more people type these phonetic symbols in place of standard Han characters. This phenomenon is called ZhuYinWen. It leads to obfuscation, great reading difficulty, or even a darned mess.
This module converts Mandarin text to ZhuYinWen. However, there is no one-to-one correspondence between morphology and phonology in Chinese: many different characters share the same pronunciation. Moreover, each full pronunciation is replaced with a single phoneme, which discards even more information. Perfect two-way conversion is therefore impossible, and this module provides only the Mandarin-to-ZhuYinWen direction.
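The loss of information can be illustrated with a minimal sketch. It does not use this module or its data: the table %initial_of and the function toy_bastardize are hypothetical names invented for the illustration, and the sketch assumes that the "single phoneme" is the initial ZhuYin symbol of each syllable.

```perl
use strict;
use warnings;
use utf8;
binmode STDOUT, ':encoding(UTF-8)';

# Hypothetical toy table: Han character => initial ZhuYin symbol.
# This is NOT the module's data, only an illustration.
my %initial_of = (
    '的' => 'ㄉ',    # full reading: ㄉㄜ˙
    '得' => 'ㄉ',    # full reading: ㄉㄜˊ
    '嗎' => 'ㄇ',    # full reading: ㄇㄚ˙
    '媽' => 'ㄇ',    # full reading: ㄇㄚ
    '吧' => 'ㄅ',    # full reading: ㄅㄚ˙
    '爸' => 'ㄅ',    # full reading: ㄅㄚˋ
);

# Replace every character found in the table; leave all others untouched.
sub toy_bastardize {
    my ($text) = @_;
    return join '', map { $initial_of{$_} // $_ } split //, $text;
}

# Group the characters by symbol to show that the mapping is many-to-one.
my %candidates;
push @{ $candidates{ $initial_of{$_} } }, $_ for sort keys %initial_of;

print toy_bastardize('好嗎'), "\n";
print "$_ <- @{ $candidates{$_} }\n" for sort keys %candidates;
```

Even in this six-character table, every symbol has two possible source characters, so the original text cannot be recovered from the symbols alone.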
Although full backward conversion is not possible, AI or NLP methods could still be applied to reconstruct the most probable original text. Such methods may be incorporated in future versions.
Text::Bastardize
xern <xern@cpan.org>
This module is free software; you can redistribute it or modify it under the same terms as Perl itself.