Introduction¶
wikt2pron is a Python toolkit converting pronunciation in enwiktionary xml dump to cmudict format. Support IPA and X-SAMPA format at present.
Features¶
- Extract pronunciation from Wiktionary XML dump.
- Lookup pronunciation for a word in Wiktionary.
- IPA -> X-SAMPA conversion.
Installation¶
# download the latest version
$ git clone https://github.com/abuccts/wikt2pron.git
$ cd enwiktionary
# install and run test
$ python setup.py install
$ python setup.py -q test
# make documents
$ make -C docs html