Extracting Useful Emergency Information from Social Media: A Method Integrating Machine Learning and Rule-Based Classification

Hongzhou Shen; Yue Ju; Zhijing Zhu

doi:10.3390/ijerph20031862

Extracting Useful Emergency Information from Social Media: A Method Integrating Machine Learning and Rule-Based Classification

Hongzhou Shen, Yue Ju, Zhijing Zhu

Department of International Business and Management

Research output: Journal Publication › Article › peer-review

3 Citations (Scopus)

Abstract

User-generated contents (UGCs) on social media are a valuable source of emergency information (EI) that can facilitate emergency responses. However, the tremendous amount and heterogeneous quality of social media UGCs make it difficult to extract truly useful EI, especially using pure machine learning methods. Hence, this study proposes a machine learning and rule-based integration method (MRIM) and evaluates its EI classification performance and determinants. Through comparative experiments on microblog data about the “July 20 heavy rainstorm in Zhengzhou” posted on China’s largest social media platform, we find that the MRIM performs better than pure machine learning methods and pure rule-based methods, and that its performance is influenced by microblog characteristics such as the number of words, exact address and contact information, and users’ attention. This study demonstrates the feasibility of integrating machine learning and rule-based methods to mine the text of social media UGCs and provides actionable suggestions for emergency information management practitioners.

Original language	English
Article number	1862
Journal	International Journal of Environmental Research and Public Health
Volume	20
Issue number	3
DOIs	https://doi.org/10.3390/ijerph20031862
Publication status	Published - Feb 2023

Keywords

emergency information
machine learning
microblog
rule-based classification
social media

ASJC Scopus subject areas

Pollution
Public Health, Environmental and Occupational Health
Health, Toxicology and Mutagenesis

UN SDGs

This output contributes to the following UN Sustainable Development Goals (SDGs)

Access to Document

10.3390/ijerph20031862

Cite this

@article{963b9ebc4246463b83a606414a988326,

title = "Extracting Useful Emergency Information from Social Media: A Method Integrating Machine Learning and Rule-Based Classification",

abstract = "User-generated contents (UGCs) on social media are a valuable source of emergency information (EI) that can facilitate emergency responses. However, the tremendous amount and heterogeneous quality of social media UGCs make it difficult to extract truly useful EI, especially using pure machine learning methods. Hence, this study proposes a machine learning and rule-based integration method (MRIM) and evaluates its EI classification performance and determinants. Through comparative experiments on microblog data about the “July 20 heavy rainstorm in Zhengzhou” posted on China{\textquoteright}s largest social media platform, we find that the MRIM performs better than pure machine learning methods and pure rule-based methods, and that its performance is influenced by microblog characteristics such as the number of words, exact address and contact information, and users{\textquoteright} attention. This study demonstrates the feasibility of integrating machine learning and rule-based methods to mine the text of social media UGCs and provides actionable suggestions for emergency information management practitioners.",

keywords = "emergency information, machine learning, microblog, rule-based classification, social media",

author = "Hongzhou Shen and Yue Ju and Zhijing Zhu",

note = "Publisher Copyright: {\textcopyright} 2023 by the authors.",

year = "2023",

month = feb,

doi = "10.3390/ijerph20031862",

language = "English",

volume = "20",

journal = "International Journal of Environmental Research and Public Health",

issn = "1661-7827",

publisher = "Multidisciplinary Digital Publishing Institute (MDPI)",

number = "3",

}

TY - JOUR

T1 - Extracting Useful Emergency Information from Social Media

T2 - A Method Integrating Machine Learning and Rule-Based Classification

AU - Shen, Hongzhou

AU - Ju, Yue

AU - Zhu, Zhijing

PY - 2023/2

Y1 - 2023/2

N2 - User-generated contents (UGCs) on social media are a valuable source of emergency information (EI) that can facilitate emergency responses. However, the tremendous amount and heterogeneous quality of social media UGCs make it difficult to extract truly useful EI, especially using pure machine learning methods. Hence, this study proposes a machine learning and rule-based integration method (MRIM) and evaluates its EI classification performance and determinants. Through comparative experiments on microblog data about the “July 20 heavy rainstorm in Zhengzhou” posted on China’s largest social media platform, we find that the MRIM performs better than pure machine learning methods and pure rule-based methods, and that its performance is influenced by microblog characteristics such as the number of words, exact address and contact information, and users’ attention. This study demonstrates the feasibility of integrating machine learning and rule-based methods to mine the text of social media UGCs and provides actionable suggestions for emergency information management practitioners.

AB - User-generated contents (UGCs) on social media are a valuable source of emergency information (EI) that can facilitate emergency responses. However, the tremendous amount and heterogeneous quality of social media UGCs make it difficult to extract truly useful EI, especially using pure machine learning methods. Hence, this study proposes a machine learning and rule-based integration method (MRIM) and evaluates its EI classification performance and determinants. Through comparative experiments on microblog data about the “July 20 heavy rainstorm in Zhengzhou” posted on China’s largest social media platform, we find that the MRIM performs better than pure machine learning methods and pure rule-based methods, and that its performance is influenced by microblog characteristics such as the number of words, exact address and contact information, and users’ attention. This study demonstrates the feasibility of integrating machine learning and rule-based methods to mine the text of social media UGCs and provides actionable suggestions for emergency information management practitioners.

KW - emergency information

KW - machine learning

KW - microblog

KW - rule-based classification

KW - social media

UR - http://www.scopus.com/inward/record.url?scp=85147849334&partnerID=8YFLogxK

U2 - 10.3390/ijerph20031862

DO - 10.3390/ijerph20031862

M3 - Article

C2 - 36767235

AN - SCOPUS:85147849334

SN - 1661-7827

VL - 20

JO - International Journal of Environmental Research and Public Health

JF - International Journal of Environmental Research and Public Health

IS - 3

M1 - 1862

ER -

Extracting Useful Emergency Information from Social Media: A Method Integrating Machine Learning and Rule-Based Classification

Abstract

Keywords

ASJC Scopus subject areas

UN SDGs

Access to Document

Other files and links

Fingerprint

Cite this