Persian support in Tesseract #3

Closed
ghost opened this issue Jul 3, 2015 · 7 comments

Comments

@ghost

ghost commented Jul 3, 2015

Would you be willing to add support for the Persian language?

There is an ongoing project here: https://github.com/reza1615/PersianOcr
I tested with "per.traineddata" and it works well on my end.

Thank you

@ghost changed the title from "Persian support tesseract" to "Persian support in Tesseract" on Jul 3, 2015
@zdenop
Contributor

zdenop commented Jul 3, 2015

The tessdata files are generated by Google, and they provide support for including langdata.
IMO community training data (i.e. data not supported by Google) should be kept separate from the Google data.

@jimregan

jimregan commented Jul 3, 2015

Persian data may be coming in the future (it was added to langdata).

@ghost
Author

ghost commented Jul 6, 2015

Good news, thank you @zdenop and @jimregan.

@theraysmith
Contributor

Persian traineddata and training data are included in tessdata and langdata, respectively.
The language code is fas (ISO 639-2T), NOT per, which uses the disfavored ISO 639-2B. (See https://en.wikipedia.org/wiki/Talk%3AISO_639-2).

Having said all that, the accuracy is probably appalling, but I would be happy to know if it works at all.
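
For anyone tripping over the code mismatch, here is a minimal sketch of how a script might translate the ISO 639-2/B code into the one Tesseract expects before building the command line. The helper names (`B_TO_T`, `tesseract_lang`, `build_command`) and the file names are hypothetical and not part of Tesseract; only the `fas`/`per` pair comes from this thread, and the command layout assumes the usual `tesseract <image> <outputbase> -l <lang>` form.

```python
# Hypothetical lookup table: ISO 639-2/B code -> ISO 639-2/T code.
# Only the Persian pair is taken from this thread; extend as needed.
B_TO_T = {"per": "fas"}


def tesseract_lang(code):
    """Return the code to pass to `-l`, translating a known 639-2/B code."""
    code = code.strip().lower()
    return B_TO_T.get(code, code)


def build_command(image_path, output_base, lang):
    """Build the argument list for: tesseract <image> <outputbase> -l <lang>."""
    return ["tesseract", image_path, output_base, "-l", tesseract_lang(lang)]


if __name__ == "__main__":
    # "page.png" and "page" are placeholder names for illustration only.
    print(" ".join(build_command("page.png", "page", "per")))
    # prints: tesseract page.png page -l fas
```

The resulting list can be handed to `subprocess.run` once the `fas.traineddata` file is available in the tessdata directory.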

@ghost
Author

ghost commented Jul 14, 2015

Awesome!

@ghost closed this as completed on Jul 14, 2015
@mafshin

mafshin commented Dec 4, 2015

Thank you @theraysmith, that was the key missing explanation: fas, not per :-)

@Shreeshrii
Contributor

Shreeshrii commented Aug 5, 2017

@PepLamb @mafshin @reza1615

Please test with the latest BEST 4.0alpha traineddata and provide feedback at #70

This issue was closed.