N-grams in dictionary #37
Comments
@CarlaFernandez according to this comment (wolfgarbe/SymSpell#54 (comment)), the only way to add multi-word phrases is through

@mammothb

You need to use

Thanks.

Any update on being able to use the lookup_compound with n-grams?

Hi @mammothb, great job with SymspellPy.
I recently saw Issue 15 on SymSpell's GitHub (wolfgarbe/SymSpell#15), and the last comment caught my attention. Apparently SymSpell supports N-grams in the dictionary file, but I ran a small test in SymspellPy and was not able to achieve the desired behavior. My approach was the following:
I added the following entry at the top of a custom frequency dictionary:
abc def ghi 116422658
(the highest frequency in the dictionary). I then requested suggestions for the sentence:
abc dff ghi
using both lookup and lookup_compound. The returned corrections were based on single words (1-grams) that I had previously defined in my dictionary, not on the newly inserted 3-gram:
abc off ghi
I would like to know if there is any way to reproduce the desired behavior in SymspellPy, that is, obtaining a prediction based on the N-gram counts, or if there are any plans to add it as a feature in the near future.
Thanks for your time!
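
To make the difference concrete, here is a minimal, library-independent sketch of the two behaviors described above. It does not use or reproduce SymspellPy's internals: the helpers parse_entry, edit_distance, and best_match are hypothetical, and the single-word counts are made up for illustration (only the 3-gram line is taken from the report). It only shows why correcting word by word yields "abc off ghi", while matching the whole input against entries that include the 3-gram would yield "abc def ghi".

```python
from typing import Dict, Optional, Tuple


def parse_entry(line: str) -> Tuple[str, int]:
    """Split on the LAST whitespace run so the term itself may contain spaces."""
    term, count = line.rsplit(None, 1)
    return term, int(count)


def edit_distance(a: str, b: str) -> int:
    """Plain Levenshtein distance (insert, delete, substitute)."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1, curr[j - 1] + 1, prev[j - 1] + cost))
        prev = curr
    return prev[-1]


def best_match(text: str, entries: Dict[str, int], max_distance: int = 2) -> Optional[str]:
    """Return the entry closest to `text`; ties go to the higher count."""
    best = None
    for term, count in entries.items():
        d = edit_distance(text, term)
        if d > max_distance:
            continue
        key = (d, -count)
        if best is None or key < best[0]:
            best = (key, term)
    return best[1] if best else None


# Hypothetical dictionary lines; only the 3-gram line comes from the report.
lines = [
    "abc def ghi 116422658",
    "abc 5000",
    "off 4000",
    "ghi 3000",
]
entries = dict(parse_entry(line) for line in lines)
unigrams = {t: c for t, c in entries.items() if " " not in t}

query = "abc dff ghi"

# Word-by-word correction never compares the input with the 3-gram entry.
per_word = " ".join(best_match(w, unigrams) or w for w in query.split())

# Whole-phrase correction compares the full input with every entry, n-grams included.
whole_phrase = best_match(query, entries)

print(per_word)
print(whole_phrase)
```

In this toy setup the per-word pass corrects "dff" to "off" because that is the nearest single word, regardless of how large the 3-gram's count is. The 3-gram only wins when the whole input is compared against it as one term.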