Welcome to the repository for the News Event Detection (NED) Dataset. This dataset is a comprehensive collection designed for the advancement of news event detection algorithms within the field of computational journalism and social media analysis.
The NED dataset consists of 17,366 Twitter posts, each paired with corresponding images, meticulously annotated with 40 real-world events. NED encompasses a wide array of event themes such as political events (elections, referendums, political crises, protests), sports events (the Olympics, soccer matches), and natural disasters (hurricanes, floods), amongst others. With the inclusion of events with subtle differences (e.g., “2016 Summer Olympics” vs. “2018 Winter Olympics”), NED presents a challenging landscape for detection algorithms, simulating the complexity encountered in real-world scenarios.
For any inquiries regarding the dataset, please feel free to contact Mr. Zehang Lin at [email protected]. If you encounter any issues with the dataset links or have further questions about the data, do not hesitate to contact us.
Contributions to the dataset and the related benchmarks are welcome. If you have suggestions or updates, please contact the repository administrators.
We appreciate all contributors and collaborators who have made this dataset possible and look forward to seeing the innovations it will enable in the realm of news event detection.
Please note that the NED dataset is provided for research purposes only. Any commercial use is strictly prohibited without prior consent.