Learning to Extract Structured Entities Using Language Models

[🔥 Oral, top 7% of all accepted papers 🔥]

⚙️ This is the implementation of our collaboration between MSR and Mila, "Learning to Extract Structured Entities Using Language Models", accepted to EMNLP 2024 Main conference.

Abstract

Recent advances in machine learning have significantly impacted the field of information extraction, with Language Models (LMs) playing a pivotal role in extracting structured information from unstructured text. Prior works typically represent information extraction as triplet-centric and use classical metrics such as precision and recall for evaluation. We reformulate the task to be entity-centric, enabling the use of diverse metrics that can provide more insights from various perspectives. We contribute to the field by introducing Structured Entity Extraction and proposing the Approximate Entity Set OverlaP (AESOP) metric, designed to appropriately assess model performance. Later, we introduce a new Multi-stage Structured Entity Extraction (MuSEE) model that harnesses the power of LMs for enhanced effectiveness and efficiency by decomposing the extraction task into multiple stages. Quantitative and human side-by-side evaluations confirm that our model outperforms baselines, offering promising directions for future advancements in structured entity extraction.

Install Dependencies

conda create -n MuSEE python=3.8 --file requirements.txt
conda activate MuSEE

Directory structure

data/  # Dataset generation code will be released soon due to internal process to go through.
|-- GPT4-based/  # GPT4-based dataset
|-- Wikidata-based/  # Wikidata-based dataset
|-- nyt/  # New York Times Relation Extraction dataset
|-- conll04/  # CoNLL04 dataset
|-- REBEL/  # REBEL dataset
|-- TREX/  # T-REx dataset
|-- dataloader_musee.py  # Dataloader for MuSEE model
model/
|-- t5_with_t5decoder.py  # base model architecture for MuSEE
trainer/
|-- trainer_musee.py  # Trainer for MuSEE model
args.py  # Arguments for MuSEE model and running experiments
experiment_musee.py  # Main file to run experiments
metric.py  # Calculate different variants of the proposed AESOP metric
compute_metrics.py  # Calculate metrics for the entire dataset
requirements.txt  # Required packages
utils.py  # Utility functions

Run the code

python experiment_musee.py \
    --model_choice=musee \
    --dataset=gpt4 \
    --pretrained_model_name=t5-large \
    --batch_size=1 \
    --epochs=100 \
    --log_wandb=True \
    --use_lora=True \
    --lr=1e-4 \
    --weight_decay=1e-2 \
    --mode=train \
    --loss_mode=mean \
    --use_better_init=True

Citation and Contact

If you find this paper useful, please cite our work:

@inproceedings{wu2024structured,
    title={Structured Entity Extraction Using Large Language Models},
    author={Haolun Wu, Ye Yuan, Liana Mikaelyan, Alexander Meulemans, Xue Liu, James Hensman, and Bhaskar Mitra},
    booktitle = "Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing",
    month = nov,
    year = "2024",
    address = "Miami, USA",
    publisher = "Association for Computational Linguistics",
}

💬 If you have any questions, feel free to contact us through email ([email protected], [email protected]) or Github issues. Enjoy!

Name		Name	Last commit message	Last commit date
Latest commit History 43 Commits
.github		.github
data		data
img		img
model		model
trainer		trainer
.gitignore		.gitignore
CODE_OF_CONDUCT.md		CODE_OF_CONDUCT.md
LICENSE		LICENSE
README.md		README.md
SECURITY.md		SECURITY.md
SUPPORT.md		SUPPORT.md
args.py		args.py
compute_metrics.py		compute_metrics.py
experiment_musee.py		experiment_musee.py
metric.py		metric.py
requirements.txt		requirements.txt
utils.py		utils.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Learning to Extract Structured Entities Using Language Models

Abstract

Install Dependencies

Directory structure

Run the code

Citation and Contact

About

Releases

Packages

Contributors 5

Languages

License

microsoft/Structured-Entity-Extraction

Folders and files

Latest commit

History

Repository files navigation

Learning to Extract Structured Entities Using Language Models

Abstract

Install Dependencies

Directory structure

Run the code

Citation and Contact

About

Resources

License

Code of conduct

Security policy

Stars

Watchers

Forks

Releases

Packages 0

Contributors 5

Languages

Packages