Discover and explore top open-source AI tools and projects—updated daily.
kujirahandComprehensive English-Japanese dictionary dataset
Top 99.1% on SourcePulse
Summary
EJDict-hand provides a comprehensive English-Japanese dictionary dataset under a Public Domain (CC0) license. It offers easily downloadable and testable data for developers and researchers working with bilingual lexicographical resources, eliminating copyright concerns and facilitating integration into various applications.
How It Works
The dataset comprises text files organized alphabetically, with a consolidated and sorted version available. Each entry follows a EnglishWord\tMeaning format, employing specific notations for synonyms, multiple meanings, grammatical forms (e.g., {形}, {動}), regional variations (e.g., 《米》, 《英》), and countability (e.g., 〈C〉, 〈U〉). PHP scripts are included for merging files and converting the data to SQLite format.
Quick Start & Requirements
http://kujirahand.com/web-tools/EJDictFreeDL.php.https://kujirahand.com/web-tools/EJDict.php.Highlighted Details
Maintenance & Community
Licensing & Compatibility
Limitations & Caveats
The dataset explicitly warns of the inclusion of "discriminatory expressions" which users are advised to avoid. While AI corrections enhance data quality, potential for AI-introduced inaccuracies exists, though recent updates focused on minor typos.
5 months ago
Inactive
finic-ai