pyKanjidic2Parser

pyKanjidic2Parser is a free project written mainly in Python.

A Python SAX parser for Jim Breen's KANJIDIC2 Japanese dictionary.

In its current state, the code can be used for simple processing tasks, such as extracting JLPT lists from the KANJIDIC2 file available at http://www.csse.monash.edu.au/~jwb/kanjidic2/ .
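
As an illustration of this kind of task, here is a minimal sketch of a SAX handler that groups kanji by JLPT level. It is not the project's own code: the class name JlptListHandler, the helper jlpt_lists, and the small SAMPLE document are hypothetical, and the sketch assumes only that each character element carries a literal element and, optionally, a jlpt element, as in the KANJIDIC2 file.

```python
import xml.sax
from collections import defaultdict


class JlptListHandler(xml.sax.ContentHandler):
    """Hypothetical handler (not the project's Kanjidic2SaxHandler)."""

    def __init__(self):
        super().__init__()
        self.by_level = defaultdict(list)  # JLPT level -> list of kanji
        self._tag = None       # element whose text is being collected
        self._buf = []         # text chunks for that element
        self._literal = None   # kanji of the current <character>
        self._jlpt = None      # JLPT level of the current <character>

    def startElement(self, name, attrs):
        if name == "character":
            self._literal = None
            self._jlpt = None
        elif name in ("literal", "jlpt"):
            self._tag = name
            self._buf = []

    def characters(self, content):
        # SAX may deliver the text of one element in several chunks.
        if self._tag is not None:
            self._buf.append(content)

    def endElement(self, name):
        if name == self._tag:
            text = "".join(self._buf).strip()
            if name == "literal":
                self._literal = text
            else:
                self._jlpt = int(text)
            self._tag = None
        elif name == "character" and self._jlpt is not None:
            self.by_level[self._jlpt].append(self._literal)


# Hand-written sample in the shape of KANJIDIC2; values are illustrative.
SAMPLE = """<kanjidic2>
  <character><literal>亜</literal><misc><jlpt>1</jlpt></misc></character>
  <character><literal>一</literal><misc><jlpt>4</jlpt></misc></character>
  <character><literal>二</literal><misc><jlpt>4</jlpt></misc></character>
  <character><literal>丼</literal><misc></misc></character>
</kanjidic2>"""


def jlpt_lists(xml_text):
    """Parse a KANJIDIC2-style string and return {level: [kanji, ...]}."""
    handler = JlptListHandler()
    xml.sax.parseString(xml_text.encode("utf-8"), handler)
    return dict(handler.by_level)


lists = jlpt_lists(SAMPLE)
```

For the full dictionary, the same handler could be passed to xml.sax.parse together with the path of the downloaded file, so that the document is streamed rather than loaded into memory at once.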

Kanji Lists (KANJIDIC2)

Kanjidic2SaxHandler and KanjiListSaxWriter extract lists of kanji from the dictionary and write them out.
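
The writing half of such a pipeline could look roughly like the sketch below, which emits a kanji list as XML with the standard library's XMLGenerator. The output format of KanjiListSaxWriter is not documented here, so the function name write_kanji_list and the kanjilist/kanji element names are assumptions made for illustration only.

```python
import io
from xml.sax.saxutils import XMLGenerator
from xml.sax.xmlreader import AttributesImpl


def write_kanji_list(out, level, kanji):
    """Write one kanji list as XML to a text stream.

    Hypothetical output format, not that of KanjiListSaxWriter.
    """
    gen = XMLGenerator(out, encoding="utf-8")
    gen.startDocument()
    gen.startElement("kanjilist", AttributesImpl({"jlpt": str(level)}))
    for k in kanji:
        gen.startElement("kanji", AttributesImpl({}))
        gen.characters(k)  # text is escaped by the generator
        gen.endElement("kanji")
    gen.endElement("kanjilist")
    gen.endDocument()


buf = io.StringIO()
write_kanji_list(buf, 4, ["一", "二"])
xml_text = buf.getvalue()
```

Driving the output through SAX events keeps the writer symmetric with the reader, and the generator takes care of escaping special characters.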

Vocabulary Lists

HtmlTabSaxHandler extracts kanji and their attributes from an HTML table. The N1-N5 vocabulary table lists are taken from http://www.tanos.co.uk/jlpt/ .
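
The idea of pulling rows out of an HTML table can be sketched as follows. This example uses the standard library's html.parser rather than SAX, because real-world HTML is often not well-formed XML; it is therefore a stand-in for HtmlTabSaxHandler, not a copy of it. The class name TableRowParser, the helper parse_table, and the column layout of the sample table (Kanji, Hiragana, English) are assumptions.

```python
from html.parser import HTMLParser


class TableRowParser(HTMLParser):
    """Hypothetical parser that collects table rows as lists of cell text."""

    def __init__(self):
        super().__init__()
        self.rows = []
        self._row = None   # cells of the row being read, or None
        self._cell = None  # text chunks of the cell being read, or None

    def handle_starttag(self, tag, attrs):
        if tag == "tr":
            self._row = []
        elif tag in ("td", "th") and self._row is not None:
            self._cell = []

    def handle_data(self, data):
        # Text outside a cell (e.g. newlines between tags) is ignored.
        if self._cell is not None:
            self._cell.append(data)

    def handle_endtag(self, tag):
        if tag in ("td", "th") and self._cell is not None:
            self._row.append("".join(self._cell).strip())
            self._cell = None
        elif tag == "tr" and self._row is not None:
            self.rows.append(self._row)
            self._row = None
            self._cell = None


# Hand-written sample table; the column layout is an assumption.
SAMPLE_HTML = """<table>
<tr><th>Kanji</th><th>Hiragana</th><th>English</th></tr>
<tr><td>学校</td><td>がっこう</td><td>school</td></tr>
<tr><td>先生</td><td>せんせい</td><td>teacher</td></tr>
</table>"""


def parse_table(html_text):
    """Return a list of dicts, one per data row, keyed by the header row."""
    parser = TableRowParser()
    parser.feed(html_text)
    parser.close()
    header, *body = parser.rows
    return [dict(zip(header, row)) for row in body]


entries = parse_table(SAMPLE_HTML)
```

Each entry maps a column name to the cell text, so the kanji column can then be matched against the lists extracted from the dictionary.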