[1] FANG Xiaodong, LIU Changhui*, WANG Liya, et al. Chinese Text Classification Based on BERT’s Composite Network Model[J]. Journal of Wuhan Institute of Technology, 2020, 42(06): 688-692. [doi:10.19843/j.cnki.CN42-1779/TQ.202002009]
Chinese Text Classification Based on BERT’s Composite Network Model
Article ID: 1674-2869(2020)06-0688-05
FANG Xiaodong; LIU Changhui*; WANG Liya; YIN Xing
School of Computer Science and Engineering, Wuhan Institute of Technology, Wuhan 430205, China
Keywords: BERT; BiGRU; attention mechanism; Chinese text classification; news classification
DOI: 10.19843/j.cnki.CN42-1779/TQ.202002009
- Abstract:
In natural language, the words of a sentence depend strongly on one another. This paper proposes a composite network model based on bidirectional encoder representations from transformers (BERT) for Chinese news classification. First, BERT's attention-based multi-layer bidirectional transformer was used as the feature extractor to obtain a global representation of the feature relationships between words and sentences. Then, these features were fed into a bidirectional gated recurrent unit (BiGRU) layer, whose simple gate structure enhances the features, reduces the time cost, and improves the accuracy of feature selection. Finally, the text features, assigned different weights, were fed into a softmax layer for classification. Experiments on the Sina news dataset cnews yielded an F1 score of 97.21%. The results show that the proposed feature fusion model classifies better than the other models compared.
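The final stage described above, weighting the text features and passing them to a softmax layer, can be sketched in a few lines. This is a minimal illustration under stated assumptions, not the authors' implementation: the abstract does not give the weighting formula, so dot-product scoring against a hypothetical `query` vector is assumed, the `hidden_states` are toy stand-ins for BiGRU outputs, and all function names and numbers are illustrative.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def attention_pool(hidden_states, query):
    """Weight each time step by softmax(dot(h_t, query)) and sum.

    hidden_states: list of T vectors (stand-ins for BiGRU outputs).
    query: hypothetical scoring vector of the same width (an assumption).
    """
    scores = [sum(h_i * q_i for h_i, q_i in zip(h, query))
              for h in hidden_states]
    weights = softmax(scores)
    width = len(hidden_states[0])
    pooled = [sum(w * h[d] for w, h in zip(weights, hidden_states))
              for d in range(width)]
    return pooled, weights


def classify(pooled, class_weights, class_bias):
    """Linear layer followed by softmax, giving class probabilities."""
    logits = [sum(w_i * x_i for w_i, x_i in zip(row, pooled)) + b
              for row, b in zip(class_weights, class_bias)]
    return softmax(logits)


# Toy values only: 3 time steps, hidden width 2, 2 classes.
hidden_states = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
query = [1.0, 1.0]
pooled, weights = attention_pool(hidden_states, query)
probs = classify(pooled, [[1.0, 0.0], [0.0, 1.0]], [0.0, 0.0])
```

In this toy run the third time step has the highest score against `query`, so it receives the largest weight in the pooled vector. In a trained model the scoring vector and the classifier parameters would be learned jointly with the BiGRU.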
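Received: 2020-02-15
Funding: National Natural Science Foundation of China (61103136); Wuhan Institute of Technology Education Innovation Program (CX2019238)
About the author: FANG Xiaodong, master's student. E-mail: [email protected]
*Corresponding author: LIU Changhui, Ph.D., associate professor. E-mail: [email protected]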
