- Update convert_to_conll with hi4nlp rules
- Support validation tokens/tags
- Support BERT (maybe use SrlReader)
- Use one_hot features
- Use POS tags as features for the model
- Use F1 during training/validation
- Evaluation framework
- I train using the verb from spaCy, but the original uses the tag for the verb_indicator
- Test the dataset from https://www.kaggle.com/shankkumar/multilingualopenrelations15/
- https://github.com/princeton-nlp/PURE
- Test https://gaotianyu1350.github.io/assets/simcse/simcse.pdf
- Use GPT as a way to validate extractions
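
Note on the F1 item: one possible starting point is an exact-match, micro-averaged span F1 over BIO tag sequences. The sketch below is not taken from the project. The function names `bio_spans` and `span_f1` are hypothetical, and the tag labels (`ARG0`, `V`, `ARG1`) are assumed for illustration only; the real tag set may differ.

```python
from typing import List, Set, Tuple

# A span is (label, start index, exclusive end index).
Span = Tuple[str, int, int]


def bio_spans(tags: List[str]) -> Set[Span]:
    """Collect labeled spans from one BIO-tagged sentence.

    Assumption: an I- tag that does not continue the current span
    is treated leniently as the start of a new span.
    """
    spans: Set[Span] = set()
    label, start = None, 0
    # The trailing "O" is a sentinel that closes a span ending at the last token.
    for i, tag in enumerate(tags + ["O"]):
        if tag.startswith("I-") and label == tag[2:]:
            continue  # still inside the current span
        if label is not None:
            spans.add((label, start, i))
            label = None
        if tag.startswith("B-") or tag.startswith("I-"):
            label, start = tag[2:], i
    return spans


def span_f1(gold: List[List[str]], pred: List[List[str]]) -> Tuple[float, float, float]:
    """Return micro-averaged (precision, recall, F1) over exact-match spans."""
    tp = n_gold = n_pred = 0
    for gold_tags, pred_tags in zip(gold, pred):
        gold_spans, pred_spans = bio_spans(gold_tags), bio_spans(pred_tags)
        tp += len(gold_spans & pred_spans)
        n_gold += len(gold_spans)
        n_pred += len(pred_spans)
    precision = tp / n_pred if n_pred else 0.0
    recall = tp / n_gold if n_gold else 0.0
    total = precision + recall
    f1 = 2 * precision * recall / total if total else 0.0
    return precision, recall, f1


if __name__ == "__main__":
    gold = [["B-ARG0", "I-ARG0", "B-V", "O", "B-ARG1", "I-ARG1"]]
    pred = [["B-ARG0", "I-ARG0", "B-V", "O", "B-ARG1", "O"]]
    print(span_f1(gold, pred))
```

Calling `span_f1` once per epoch on the validation predictions would give a number to track next to the loss. A span only counts as correct when both its label and its boundaries match, so a truncated argument is scored as a miss.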