A Hybrid Algorithm for Text Classification Based on CNN-BLSTM with Attention

Lei Fu, Zhao Xia Yin, Xin Wang, Yi Liu

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

5 Scopus citations

Abstract

We propose an effective text classification framework that concatenates character-level and word-level features with different weights, built on a convolutional neural network and bidirectional long short-term memory with attention (BACNN). In Chinese natural language processing, the first step is word segmentation or character segmentation. However, because of the varied semantic relations in Chinese, a sentence can usually be segmented into words in several ways, which leads to word segmentation ambiguity. Character segmentation is unambiguous, but the meaning carried by individual characters is neither precise nor rich enough. Moreover, the relative importance of characters and words differs from one situation to another. To overcome these problems, we propose combining word-level and character-level features with different weights so that each compensates for the other's shortcomings. The experimental results indicate that the proposed method achieves better classification performance than using word-level or character-level features alone.
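The weighted fusion described in the abstract can be illustrated with a small sketch. The abstract does not say how the weights are applied or how attention is scored, so everything specific here is an assumption made for illustration: the single mixing weight `alpha` (characters scaled by `alpha`, words by `1 - alpha`), the dot-product attention scoring, the `query` vector, and the toy per-position vectors that stand in for CNN-BLSTM outputs. It is not the authors' implementation.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def attention_pool(hidden_states, query):
    """Collapse per-position vectors into one vector.

    Assumption: scores are plain dot products with a query vector;
    the paper's actual attention scoring may differ.
    """
    scores = [sum(h_i * q_i for h_i, q_i in zip(h, query)) for h in hidden_states]
    weights = softmax(scores)
    dim = len(hidden_states[0])
    return [sum(w * h[d] for w, h in zip(weights, hidden_states)) for d in range(dim)]


def weighted_concat(char_vec, word_vec, alpha):
    """Scale character features by alpha and word features by (1 - alpha),
    then concatenate. The convex weighting is a hypothetical choice."""
    if not 0.0 <= alpha <= 1.0:
        raise ValueError("alpha must be in [0, 1]")
    return [alpha * x for x in char_vec] + [(1.0 - alpha) * x for x in word_vec]


# Toy stand-ins for encoder outputs: three character positions, two word positions.
char_states = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
word_states = [[2.0, 0.0], [0.0, 2.0]]
query = [1.0, 0.0]  # hypothetical attention query

char_vec = attention_pool(char_states, query)
word_vec = attention_pool(word_states, query)
fused = weighted_concat(char_vec, word_vec, alpha=0.4)
print(len(fused), fused)
```

In a trained model the fused vector would feed a classification layer, and the weighting could be tuned or learned; the fixed `alpha=0.4` above is only a placeholder.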

Original language: English
Title of host publication: Proceedings of the 2018 International Conference on Asian Language Processing, IALP 2018
Editors: Minghui Dong, Moch. Bijaksana, Herry Sujaini, Arif Bijaksana Putra Negara, Ade Romadhony, Fariska Z. Ruskanda, Elvira Nurfadhilah, Lyla Ruslana Aini
Publisher: Institute of Electrical and Electronics Engineers Inc.
Pages: 31-34
Number of pages: 4
ISBN (Electronic): 9781728111766
State: Published - 2 Jul 2018
Externally published: Yes
Event: 22nd International Conference on Asian Language Processing, IALP 2018 - Bandung, Indonesia
Duration: 15 Nov 2018 to 17 Nov 2018

Publication series

Name: Proceedings of the 2018 International Conference on Asian Language Processing, IALP 2018

Conference

Conference: 22nd International Conference on Asian Language Processing, IALP 2018
Country/Territory: Indonesia
City: Bandung
Period: 15/11/18 to 17/11/18

Keywords

  • Attention mechanism
  • Bidirectional long short-term memory
  • Convolutional Neural Network
  • Text classification
