Sorry, you need to enable JavaScript to visit this website.

facebooktwittermailshare

Importance Weighted Feature Selection Strategy for Text Classification

Abstract: 

Feature selection, which aims at obtaining a compact and effective feature subset for better performance and higher efficiency, has been studied for decades. The traditional feature selection metrics, such as Chi-square and information gain, fail to consider how important a feature is in a document. Features, no matter how much effective semantic information they hold, are treated equally. Intuitively, thus calculated feature selection metrics are very likely to introduce much noise. We, therefore, in this study, extend the work of Li et al. [1] on document frequency metric, propose a general importance weighted feature selection strategy for text classification, in which the importance value of a feature in a document is derived from its relative frequency in that document. Extensive experiments with two state-of-the-art feature selection metrics (Chi-square and information gain) on three text classification datasets demonstrate the effectiveness of the proposed strategy.

up
0 users have voted:

Paper Details

Authors:
Baoli Li
Submitted On:
27 November 2016 - 10:44am
Short Link:
Type:
Presentation Slides
Event:
Presenter's Name:
Baoli LI
Paper Code:
113
Document Year:
2016
Cite

Document Files

IALP2016-113-baoli-v0.2.pdf

(102 downloads)

Subscribe

[1] Baoli Li, "Importance Weighted Feature Selection Strategy for Text Classification", IEEE SigPort, 2016. [Online]. Available: http://sigport.org/1312. Accessed: Aug. 20, 2017.
@article{1312-16,
url = {http://sigport.org/1312},
author = {Baoli Li },
publisher = {IEEE SigPort},
title = {Importance Weighted Feature Selection Strategy for Text Classification},
year = {2016} }
TY - EJOUR
T1 - Importance Weighted Feature Selection Strategy for Text Classification
AU - Baoli Li
PY - 2016
PB - IEEE SigPort
UR - http://sigport.org/1312
ER -
Baoli Li. (2016). Importance Weighted Feature Selection Strategy for Text Classification. IEEE SigPort. http://sigport.org/1312
Baoli Li, 2016. Importance Weighted Feature Selection Strategy for Text Classification. Available at: http://sigport.org/1312.
Baoli Li. (2016). "Importance Weighted Feature Selection Strategy for Text Classification." Web.
1. Baoli Li. Importance Weighted Feature Selection Strategy for Text Classification [Internet]. IEEE SigPort; 2016. Available from : http://sigport.org/1312