Find and download the training sets for the encoder-decoder sentence simplification model #2

Open
wammar opened this issue May 3, 2024 · 1 comment


wammar commented May 3, 2024

Probably the Paraphrase Database (PPDB, http://paraphrase.org/#/download) together with the aligned pairs between Simple English Wikipedia entries and their corresponding English Wikipedia entries.

Kauchak, D.: Improving text simplification language modeling using unsimplified text data. In: 51st Annual Meeting of the Association for Computational Linguistics, Vol. 1: Long papers, pp. 1537–1546. ACL, Sofia, Bulgaria (2013)

Ganitkevitch, J., Van Durme, B., Callison-Burch, C.: PPDB: the paraphrase database. In: Proceedings of the 2013 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, pp. 758–764 (2013)


wammar commented May 3, 2024

After discussing alternative approaches in the design doc, we decided to archive this task.
