# -*- coding:utf-8; mode:Text; fill-column:79 -*-
# Time-stamp: "2014-06-17 17:23:26 MDT sburke@cpan.org"
# (This page is in UTF-8!)
|··················································|

Module:
  Text::Unidecode-- make ASCII transliterations of Unicode text

Unidecode makes ASCII transliterations of Unicode text.
Sometimes it's dumb, but it's better than looking at "???"
or "\15BA\15A0\1610...".  If you have smarter text-handling
subroutines, Unidecode might be useful as a fallthrough
for them.

Example:
  print unidecode( "北亰\n" );
prints: Bei Jing

See more examples below.

For full documentation, run:
    perldoc Text::Unidecode
Or see:
    http://search.cpan.org/perldoc?Text::Unidecode

An article about how Unidecode runs:
    http://interglacial.com/tpj/22/

REQUIREMENTS

This module requires Perl 5.8.0 at the very least.
That's probably not a problem for you, since that release
is more than a decade old!

INSTALLATION

* To use the "CPAN Plus" system, read:
   perldoc cpanp
* For the old-style "make" interface, read:
   perldoc perlmodinstall

  ~~~ EXAMPLE UNIDECODE INPUT AND OUTPUT ~~~
  (Just two or three lines, from a few languages.)

La décennie voit le début des biotechnologies avec le premier clonage,
les organismes génétiquement modifiés, le début du séquençage du génome humain
=>
La decennie voit le debut des biotechnologies avec le premier clonage,
les organismes genetiquement modifies, le debut du sequencage du genome humain

Wśród nocnej ciszy głos się rozchodzi:
Wstańcie, pasterze, Bóg się nam rodzi!
=>
Wsrod nocnej ciszy glos sie rozchodzi:
Wstancie, pasterze, Bog sie nam rodzi!

Καθαίρονται δ᾽ ἄλλως αἵματι μιαινόμενοι οἷον εἴ τις εἰς πηλὸν ἐμβὰς πηλῷ ἀπονίζοντο.
μαίνεσθαι δ᾽ ἂν δοκοίη, εἴ τίς μιν ἀνθρώπων
=>
Kathairontai d' allos aimati miainomenoi oion ei tis eis pelon embas pelo aponizonto.
mainesthai d' an dokoie, ei tis min anthropon

На другой день к завтраку подавали очень вкусные пирожки,
раков и бараньи котлеты; и пока ели, приходил наверх повар Никанор справиться,
=>
Na drughoi dien' k zavtraku podavali ochien' vkusnyie pirozhki,
rakov i baran'i kotliety; i poka ieli, prikhodil navierkh povar Nikanor spravit'sia,

Nước trà (hay nước chè) là đồ uống phổ biến thứ hai trên thế giới (sau nước uống).
Nó làm bằng cách ngâm lá, chồi, hay cành của cây chè
=>
Nuoc tra (hay nuoc che) la do uong pho bien thu hai tren the gioi (sau nuoc uong).
No lam bang cach ngam la, choi, hay canh cua cay che

#### And Then Things Get A Bit Suboptimal
# But remember the Unidecode motto: "It's better than nothing!"

유자차(柚子茶)는 유자청을 찬물이나 더운 물에 희석하여 마시는 한국의 전통 차이다.
유자청은 얇게 자른 유자를 꿀이나 설탕과 섞은 뒤 3~4개월
=>
yujaca(You Zi Cha )neun yujaceongeul canmulina deoun mule hyiseoghayeo masineun hangugyi jeontong caida.
yujaceongeun yalbge jareun yujareul ggulina seoltanggwa seoggeun dwi 3~4gaeweol

* The Gayatri Mantra- Sanskrit
ॐ भूर्भुवः॒ स्वः ।
तत्स॑वितुर्वरे॑णियं ।
भ॒र्गो॑ दे॒वस्य॑ धीमहि। ।
धियो॒ यो नः॑ प्रचो॒दया॑त्॥ ।
=>
AUM bhuurbhuvH' svH /
tts'viturvre'nniyN /
bh'rgo' de'vsy' dhiimhi / /
dhiyo' yo nH' prco'dyaa't // /

道可道,非常道。名可名,非常名。無名天地之始;有名萬物之母。故常無欲,
以觀其妙;常有欲,以觀其徼。此兩者,同出而異名,同謂之玄。玄之又玄,衆
=>
Dao Ke Dao ,Fei Chang Dao . Ming Ke Ming ,Fei Chang Ming . Wu Ming Tian Di Zhi Shi ;You Ming Wan Wu Zhi Mu . Gu Chang Wu Yu ,
Yi Guan Qi Miao ;Chang You Yu ,Yi Guan Qi Jiao . Ci Liang Zhe ,Tong Chu Er Yi Ming ,Tong Wei Zhi Xuan . Xuan Zhi You Xuan ,Zhong

# Yiddish. Directionality and ligature might come out wrong in your browser/editor:
‏טיי איז א געטראנק וואס מען טרינקט איבער דער גארער וועלט.
טיי ווערט געמאכט דורך ווייקן די געטרוקנטע בלעטער אדער בלומען פון דעם פלאנץ‎
=>
tyy yz g`trnq vvs m`n trynqt yb`r d`r gr`r vv`lt.
tyy vv`rt g`mkt dvrk vvyyqn dy g`trvqnt` bl`t`r d`r blvm`n pvn d`m plnts

# Urdu. Directionality and ligature might come out wrong in your browser/editor:
‏چائے دنیا کی پسندیدہ مشروب ہے۔ یہ چاۓ کے پودے کی پتیوں کو چند منٹ گرم پانی میں ابالنے سے تیار ہوتی ہے۔‎
=>
chy'y dny khy psndydh mshrwb hy. yh chy' khy pwdy khy ptywN khw chnd mntt grm pny myN blny sy tyr hwty hy.
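
The "fallthrough" idea mentioned at the top of this page can be sketched
roughly as follows. This is only an illustration, not part of the module:
the %preferred table and the to_ascii() helper are hypothetical names made
up for this sketch. The only thing taken from Text::Unidecode is the
unidecode() function, which the module exports by default. The sketch also
assumes the script itself is saved as UTF-8, hence "use utf8".

```perl
use strict;
use warnings;
use utf8;              # the string literals below are written in UTF-8
use Text::Unidecode;   # exports unidecode() by default

# Hypothetical override table: spellings we prefer over Unidecode's defaults.
my %preferred = (
    'ä' => 'ae',
    'ö' => 'oe',
    'ü' => 'ue',
);

# Hypothetical helper: consult the table first, and fall through to
# unidecode() for every character the table does not cover.
sub to_ascii {
    my ($text) = @_;
    my $out = '';
    for my $char (split //, $text) {
        $out .= exists $preferred{$char} ? $preferred{$char} : unidecode($char);
    }
    return $out;
}

print to_ascii("Löwe"), "\n";    # prints "Loewe": the table handles the "ö"
print to_ascii("Wśród"), "\n";   # nothing in the table matches, so unidecode() does the work
```

Going character by character keeps the sketch short. A smarter routine
would probably look at whole words or at the language of the text, and
hand only the leftovers to unidecode().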