picky

https://github.com/floere/picky

HTML

Picky is an easy to use and fast Ruby semantic search engine that helps your users find what they are looking for.

Picky::Tokenizer#preprocess

Default preprocessing hook.

Does:
1. Character substitution.
2. Remove illegal expressions.
3. Remove non-single stopwords. (Stopwords that occur with other words)

Source | Google | Stack overflow

Edit

git clone [email protected]:floere/picky.git

cd picky

open server/lib/picky/tokenizer.rb

Contribute

# Make a new branch

git checkout -b -your-name--update-docs-Picky--Tokenizer-preprocess-for-pr


# Commit to git

git add server/lib/picky/tokenizer.rbgit commit -m "better docs for Picky::Tokenizer#preprocess"


# Open pull request

gem install hub # on a mac you can `brew install hub`

hub fork

git push <your name> -your-name--update-docs-Picky--Tokenizer-preprocess-for-pr

hub pull-request


# Celebrate!