Skip to content

jf99/findSimilarLines

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

3 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

findSimilarLines

Find and extract similar lines in text files. The purpose of this is to find similar sentences in the Common Voice dataset - such as a wrong sentence and its corrected version.

Make sure to turn on compiler optimizations, especially if you are dealing with text files of > 1000 lines.

About

find and extract similar lines in text files

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published