Classifications and characterization of safety hazard texts
Affiliation:

1. School of Management Engineering, Capital University of Economics and Business, Beijing 100070, P. R. China; 2. Editorial Department of Journal of Beijing University of Posts and Telecommunications (Nature Edition); 3. Social Network Information Research Center, School of Economics and Management, Beijing University of Posts and Telecommunications, Beijing 100876, P. R. China

Fund Project:

Supported by the Special Fund Project of the Society of China University Journals (CUJS2024-GJ-A01).

    Abstract:

    To improve the efficiency of organizing and retrieving safety hazard information and to support more complex information processing tasks, effective technical methods for automatic text classification and type analysis are required. Support vector machines (SVMs) can automatically classify unstructured text. However, their underlying principle focuses on identifying the optimal classification boundary within the training set and does not facilitate the extraction of representative features for each text category. To address this limitation, a normalized entropy model is proposed to search for typical category features, thereby improving on the traditional feature recognition method based on term frequency-inverse document frequency (TF-IDF). Using 2,534 law enforcement inspection records from a government emergency management bureau as a case study, an SVM was used for automatic text classification and achieved an accuracy of up to 97%. Meanwhile, the normalized entropy model was used to extract representative features for each category, providing decision support for formulating targeted rectification strategies in hazard investigation. Experimental results show that the combined use of SVM and the normalized entropy model effectively addresses both the text classification and the category feature recognition tasks.
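
The abstract does not give the formula behind the normalized entropy model, so the following is only a minimal sketch of one plausible reading: for each term, take its distribution of occurrences across categories, compute the Shannon entropy, and divide by the logarithm of the number of categories so that the score falls in [0, 1]. A term with a score near 0 is concentrated in one category and is treated here as a candidate typical feature. The function names `normalized_entropy` and `typical_terms`, the whitespace tokenizer, the `min_count` threshold, and the toy records are all assumptions made for illustration and are not taken from the paper.

```python
import math
from collections import Counter, defaultdict


def normalized_entropy(counts):
    """Shannon entropy of a count vector, divided by log(K) for K categories.

    Returns a value in [0, 1]: 0 when all occurrences fall in one category,
    1 when occurrences are spread evenly across all categories.
    """
    total = sum(counts)
    k = len(counts)
    if total == 0 or k < 2:
        return 0.0
    h = -sum((c / total) * math.log(c / total) for c in counts if c > 0)
    return h / math.log(k)


def typical_terms(docs, labels, top_n=3, min_count=2):
    """Rank terms per category by ascending normalized entropy (assumed criterion).

    Raw counts are used, so unbalanced category sizes would bias the scores;
    a real pipeline would likely normalize by category size first.
    """
    categories = sorted(set(labels))
    term_counts = defaultdict(Counter)  # term -> category -> occurrences
    for text, label in zip(docs, labels):
        for term in text.lower().split():  # naive whitespace tokenizer
            term_counts[term][label] += 1

    scored = defaultdict(list)
    for term, by_cat in term_counts.items():
        counts = [by_cat[c] for c in categories]
        if sum(counts) < min_count:  # skip terms too rare to judge
            continue
        dominant = max(categories, key=lambda c: by_cat[c])
        # Sort key: low entropy first, then high count in the dominant category.
        scored[dominant].append((normalized_entropy(counts), -by_cat[dominant], term))

    return {c: [t for _, _, t in sorted(scored[c])[:top_n]] for c in categories}


# Hypothetical toy records, not the inspection data used in the paper.
docs = [
    "fire extinguisher expired in workshop",
    "fire exit blocked in workshop",
    "electrical cable exposed in workshop",
    "electrical cabinet unlocked in warehouse",
]
labels = ["fire", "fire", "electrical", "electrical"]

result = typical_terms(docs, labels, top_n=1)
print(result)
```

In this toy run, a word such as "in", which appears equally often in both categories, receives the maximum score of 1 and is ranked last, while a word that occurs in only one category scores 0. The SVM classification step itself is not shown.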

Get Citation

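QIAO Jianfeng, LIU Xuan, AI Lisha, ZHANG Liwei, WANG Ting. Hazard text classification and category feature analysis based on SVM and a normalized entropy model[J]. Journal of Chongqing University, 2026, 49(2): 105-115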

History
  • Received: July 15, 2024
  • Online: February 3, 2026