TRENDS AND PATTERNS OF TEXT CLASSIFICATION TECHNIQUES: A SYSTEMATIC MAPPING STUDY

TRENDS AND PATTERNS OF TEXT CLASSIFICATION TECHNIQUES: A SYSTEMATIC MAPPING STUDY. The Malaysian Journal of Computer Science, 33 (2). pp. 102-117. ISSN 0127-9084 (2020)



Abstract

Due to the mass availability of textual data on Web, text classification (TC), classifying texts into predetermined sets becomes a spotlight for researchers. A number of TC applications have been proposed yet very few studies reported an overview of TC research area in a proper and systematic manner. This paper aims to provide an overview of TC research trends and gaps by structuring and analyzing research patterns, encountered problems and problem-solving methods in TC. In other words, this study highlights problem types, data sources, choice of language of text and types of applied techniques in TC. An intensive systematic study is conducted by applying guidelines proposed by Petersen and colleagues in 2007. In this paper, ninety-six literatures from five electronic databases from 2006 to 2017 were systematically reviewed and followed each and every step properly in accordance with systematic mapping study. Nine main problems in TC research area were identified and significant findings which highlighted the evolution of TC research within the past 12 years were investigated. Different from other review articles, this paper highlighted issues and technical gaps of TC area in a useful and effective manner.

Item Type: Article
Keywords: Machine learning techniques, Text classification, Text categorization, Natural language processing (NLP), Systematic mapping (SM)
Taxonomy: By Subject > Computer & Mathematical Sciences > Information Technology
Local Content Hub: Subjects > Computer and Mathematical Sciences
Depositing User: Eza Eliana Abdul Wahid
Date Deposited: 01 Aug 2021 13:33
Last Modified: 01 Aug 2021 13:33
Related URLs:

Actions (login required)

View Item View Item