Your browser is not supported by DesignSafe. Please switch to Chrome or Firefox if you experience issues.
Opens in a new windowOpens an external siteOpens an external site in a new windowOpens an external site in a new window
PRJ-5748 | Enhancing the Fidelity of Social Media Image Datasets in Earthquake Disaster Assessment
PI
Project Type
Field research | Engineering
Natural Hazard Type(s)
Earthquake
Facilities
Keywords
Earthquake, Social Media, Deep Learning, Machine Learning, Disaster Reconnaissance
|
Description:
The development of artificial intelligence (AI) provides an opportunity for rapid and accurate assessment of earthquake-induced infrastructure damage using social media images. Nevertheless, data collection and labeling remain challenging due to limited expertise among annotators. This study introduces a novel 4-class Earthquake Infrastructure Damage (EID) assessment dataset compiled from a combination of images from several other social media image databases but with added emphasis on data quality. Unlike the previous datasets such as DAD and Crisis-Benchmark, the EID includes comprehensive labeling guidelines and a multi-class classification system aligned with established damage scales, such as HAZUS and EMS-98 to enhance the accuracy and utility of social media imagery for disaster response. By integrating detailed descriptions and clear labeling criteria, the labeling approach of EID reduces the subjective nature of image labeling and the inconsistencies found in existing datasets. The findings demonstrate a significant improvement in annotator agreement, reducing disagreement from 39.7% to 10.4%, thereby validating the efficacy of the refined labeling strategy. The EID, containing 13,513 high-quality images from five significant earthquakes, is designed to support community-level assessments and advanced computational research, paving the way for enhanced disaster response strategies through improved data utilization and analysis.
Document Collection | Earthquake Infrastructure Damage (EID) Assessment Dataset for Social Media Images
Cite This Data:
Huang, H., D. Zhang, A. Masalava, M. Roozbahani, N. Roy, D. Frost (2025). "Earthquake Infrastructure Damage (EID) Assessment Dataset for Social Media Images", in Enhancing the Fidelity of Social Media Image Datasets in Earthquake Disaster Assessment. DesignSafe-CI. https://doi.org/10.17603/ds2-yj8p-hs62
Hide Data
Author(s)
; ; ; ; ;
Facility
GEER - Geotechnical Extreme Event Reconnaissance
Date Published
2025-02-07
DOI
10.17603/ds2-yj8p-hs62
License
Open Data Commons Attribution
Description:
This project introduces a novel 4-class Earthquake Infrastructure Damage (EID) assessment dataset compiled from a combination of images from several other social media image databases but with added emphasis on data quality. Unlike the previous datasets such as DAD and Crisis-Benchmark, the EID includes comprehensive labeling guidelines and a multi-class classification system aligned with established damage scales, such as HAZUS and EMS-98 to enhance the accuracy and utility of social media imagery for disaster response. By integrating detailed descriptions and clear labeling criteria, the labeling approach of EID reduces the subjective nature of image labeling and the inconsistencies found in existing datasets. The findings demonstrate a significant improvement in annotator agreement, reducing disagreement from 39.7% to 10.4%, thereby validating the efficacy of the refined labeling strategy. The EID, containing 13,513 high-quality images from five significant earthquakes, is designed to support community-level assessments and advanced computational research, paving the way for enhanced disaster response strategies through improved data utilization and analysis.