Skip to content
This repository was archived by the owner on Jul 15, 2020. It is now read-only.
This repository was archived by the owner on Jul 15, 2020. It is now read-only.

hunspell port to Ruby #2

@zverok

Description

@zverok

Project

Port Hunspell opensource spellchecker to pure Ruby.

Proposed code name: spelleology.

Plan

  1. Understand hunspell dictionaries format.
  2. Create Hunspell dictionary reader, using Hunspell's code & docs as a reference and its dictionary samples
  3. Create simplistic spell-checking solution (split text into words → remove punctuations → run against dictionary)
  4. Wrap into proper Ruby gem, with executable and library usage (ver. ~0.0.1)
  5. Further development directions:
    • profiling and optimization
    • CI-readiness (different output formats, Rake task)
    • supplementary tools (dictionary downloader from OO repository)
    • pluggable integration with Markdown parsers and other markups, for proper reporting of spelling problems positions in marked files.

Importance

Hunspell is currently the most popular open source spellchecking tool, having most of the actual dictionaries in its format. But the tool itself is pretty complicated C++ software, that is hard to integrate and use from Ruby.

Pure-Ruby Hunspell port can be easily integrated with other Ruby tools, like Markdown parsers (or even Ruby parser, imagine you can spellcheck your Rake task descriptions?), Jekyll, CI tools and so on.

Skills and domains

You'll need to be able to at least read C++ of hunspell's sources. And expect a lot of optimization practice.

Metadata

Metadata

Assignees

No one assigned

    Labels

    gemRuby gemhas candidateSomebody tries to start the workideaIdea of the project

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions