Extracting Useful Emergency Information from Social Media: A Method Integrating Machine Learning and Rule-Based Classification

Hongzhou Shen, Yue Ju, Zhijing Zhu

Research output: Journal PublicationArticlepeer-review

3 Citations (Scopus)

Abstract

User-generated contents (UGCs) on social media are a valuable source of emergency information (EI) that can facilitate emergency responses. However, the tremendous amount and heterogeneous quality of social media UGCs make it difficult to extract truly useful EI, especially using pure machine learning methods. Hence, this study proposes a machine learning and rule-based integration method (MRIM) and evaluates its EI classification performance and determinants. Through comparative experiments on microblog data about the “July 20 heavy rainstorm in Zhengzhou” posted on China’s largest social media platform, we find that the MRIM performs better than pure machine learning methods and pure rule-based methods, and that its performance is influenced by microblog characteristics such as the number of words, exact address and contact information, and users’ attention. This study demonstrates the feasibility of integrating machine learning and rule-based methods to mine the text of social media UGCs and provides actionable suggestions for emergency information management practitioners.

Original languageEnglish
Article number1862
JournalInternational Journal of Environmental Research and Public Health
Volume20
Issue number3
DOIs
Publication statusPublished - Feb 2023

Keywords

  • emergency information
  • machine learning
  • microblog
  • rule-based classification
  • social media

ASJC Scopus subject areas

  • Pollution
  • Public Health, Environmental and Occupational Health
  • Health, Toxicology and Mutagenesis

Fingerprint

Dive into the research topics of 'Extracting Useful Emergency Information from Social Media: A Method Integrating Machine Learning and Rule-Based Classification'. Together they form a unique fingerprint.

Cite this