KanjiDB
Dictionary Search

KanjiDB Dictionary Search Documentation

The KanjiDB Dictionary search system provides a search interface for the wide array of EDICT-format dictionaries presently available. EDICT-format dictionary entries consist of a kana field indicating pronuciation in Japanese, a kanji spelling of the entry if available, and one or more english translations. An entry may be further marked as pertaining to a specific part of speech, or may have other tags associated with it depending on the subject of the dictionary. (for example: ENAMDICT, a dictionary of Japanese names, may tag entries as being family names, male/female/unclassified given names, place names, or full names of a specific personage). Not all aspects of the dictionaries are presently searchable.

This system would be completely impossible without the remarkable work of Dr. Jim Breen and his contributors to the KANJIDIC and EDICT datasets which form the backbone of KanjiDB. It would also be impossible without the generous licence for the use of that data provided by the Electronic Dictionary Research Group of Monash University.

At present, no mechanism has been implemented for easy entry of kanji characters. Unless you have a Japanese IME running on your computer in some capacity, your best bet is to copy and paste characters from some source, possibly the kanji pages or kanji search.

Search Terms:

  1. Kanji - Searches for dictionary entries containing all of the kanji entered in this field.
  2. Reading - Searches against kana fields for dictionary entries. This field will accept either roman characters or kana (the two Japanese syllabic alphabets) characters. When entering roman text, be aware that the database uses a version of revised Hepburn romanization when searching readings although, due to the difficulty of typing macrons, extended vowels are written by doubling the letter. (e.g. きいて = kiite, the -te form of きく - to hear)

    You should also be aware that the database only searches for one romanization per kana. In the case of kana such as (ha, but pronounced wa when used to denote the topic marker particle), or (he, but pronouced e when used to denote the destination particle), the more general pronunciation is the one the romanization engine will render. For example, the result - konnichi wa (hello, good day) will not turn up on a search for the string "konnichi wa" or "konnichiwa" because the romanization engine does not presently recognize the wa syllable as corresponding to the kana (it would instead match to which is always pronounced wa). Searching for "konnichi ha" or "konnichiha" will turn up the correct result. Of course, searching for the string "こんにちは" takes the romanization engine out of the loop and will return the desired result without having to accomodate the limitations of the engine. Future upgrades to the engine may be better able to handle kana with somewhat ambiguous readings.

    There is, naturally, an exception. The kana is technically pronounced wo but this pronuciation is obsolete (except occasionaly when it is used in names, which often have archaic spelling rules). It is only ever used, ignoring the name exception, to denote the direct object particle (pronounced o) so the romanization of the kana is not ambigious and a search for "o" will turn it up (and a search for "wo" will not). For example, the result - en o gaku (to draw a circle) will turn up on a search for the string "en o gaku" or "enogaku".

  3. English Terms - Searches for entries containing english translations (or romanized versions of the entry, as in the case of ENAMDICT) entered in this field.
  4. Dictionaries - A number of separate EDICT-format dictionaries may be loaded into the database at any one time, most focusing on specific topics. To search a specific dictionary, select it from this menu. Otherwise, select the 'All Dictionaries' option. Dictionary selection can prove especially useful in the case of ENAMDICT, the dictionary of Japanese names, which is extremely large and can return a large number of hits for many readings and combinations of kanji. If you are not looking for a name selecting "All Dictionaries except ENAMDICT" can greatly improve your search.
  5. Parts of Speech - This field allows you to restrict your search to words that have specific grammatical uses. Be warned that there are a large collection of different types of verbs. Select all parts of speech that you wish to search for (Multiple items can be selected by holding Shift and clicking) or leave this field blank to not restrict your search by parts of speech.

Dictionary Search

KanjiDB makes use of the EDICT and KANJIDIC datasets, as well as a number of EDICT-format dictionaries. EDICT, KANJIDIC, and related files and terms are the property of The Electronic Dictionary Research and Development Group at Monash University and are used in conformance with the Group's license. All EDICT-format dictionary files are the property of their respective owners and are used in conformance with license restrictions where applicable. All other content © 2006-2008 Martin Thorne.