This repository was archived by the owner on Jul 15, 2020. It is now read-only.
hunspell port to Ruby #2
Open
Labels
- gem: Ruby gem
- has candidate: Somebody tries to start the work
- idea: Idea of the project
Description
Project
Port the open-source Hunspell spellchecker to pure Ruby.
Proposed code name: spelleology.
Plan
- Understand the Hunspell dictionary format.
- Create a Hunspell dictionary reader, using Hunspell's code and docs as a reference, along with its sample dictionaries.
- Create a simplistic spell-checking solution (split text into words → remove punctuation → run against the dictionary).
- Wrap it into a proper Ruby gem, with executable and library usage (ver. ~0.0.1).
- Further development directions:
  - profiling and optimization
  - CI-readiness (different output formats, Rake task)
  - supplementary tools (dictionary downloader from OO repository)
  - pluggable integration with Markdown parsers and other markup formats, so that the positions of spelling problems in marked-up files are reported correctly.
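
The first steps of the plan (dictionary reader plus simplistic check) could be sketched roughly as below. This is only an illustration under stated assumptions: the method names `read_dic` and `misspelled` are hypothetical, the reader assumes the common `.dic` layout (an entry count on the first line, then one `word` or `word/FLAGS` entry per line), and the affix flags are simply discarded. Real Hunspell uses those flags together with the `.aff` file to derive word forms, which this sketch does not attempt.

```ruby
require "set"

# Hypothetical helper: collects the stems listed in a Hunspell .dic file.
# Assumes the first line is an entry count and every other line is
# "word" or "word/FLAGS"; flags are dropped, so no affix expansion happens.
def read_dic(lines)
  words = Set.new
  lines.each_with_index do |line, index|
    entry = line.strip
    next if entry.empty?
    next if index.zero? && entry.match?(/\A\d+\z/) # skip the count header
    words << entry.split("/", 2).first.downcase
  end
  words
end

# Hypothetical helper: splits text into words (punctuation is left behind
# by the regexp) and returns the words that are not in the dictionary.
def misspelled(text, dictionary)
  text.scan(/[[:alpha:]]+(?:'[[:alpha:]]+)*/)
      .reject { |word| dictionary.include?(word.downcase) }
end

dictionary = read_dic(["3", "hello/S", "world", "ruby"])
p misspelled("Hello, wrold! Ruby rocks.", dictionary)
# prints ["wrold", "rocks"]
```

With a real dictionary the lines would come from something like `File.foreach(path)`, taking the file encoding into account. A `Set` is used so that each lookup is a hash lookup rather than a scan of the word list.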
Importance
Hunspell is currently the most popular open-source spellchecking tool, and most up-to-date dictionaries are available in its format. But the tool itself is a fairly complicated piece of C++ software that is hard to integrate and use from Ruby.
A pure-Ruby Hunspell port could be easily integrated with other Ruby tools, such as Markdown parsers (or even a Ruby parser: imagine being able to spellcheck your Rake task descriptions), Jekyll, CI tools, and so on.
Skills and domains
You'll need to be able to at least read the C++ in Hunspell's sources. Expect a lot of optimization practice as well.