Feature Selection through Visualisation for the Classification of Online Reviews
Abstract
The purpose of this work is to show that visualization is at least as powerful as the best automatic feature selection algorithms. We demonstrate this by applying our visualization technique to the classification of online reviews as fake or genuine. The technique uses a radial chart and color overlaps to explore feature selection for classification visually. Every review is drawn as a translucent red or blue radial membrane whose shape is determined by its dimensions. This work also shows how dimension ordering and combination affect the feature selection process. In brief, the idea is to give each text review a structure based on certain attributes, compare how similar or different the structures are within and across categories, and highlight the key features that contribute most to the classification. Colors and saturations aid in the feature selection process. Our visualization technique helps the user gain insight into high-dimensional data by providing the means to eliminate the worst features right away, pick some of the best features without statistical aids, and understand the behavior of the dimensions in different combinations. This work outlines the approaches explored, the results, and the analysis.
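As a rough illustration of the membrane idea, the sketch below places one review's feature values on evenly spaced radial axes and measures the area of the resulting polygon. It is not the thesis implementation: the function names, the four feature values, and the assumption that features are pre-scaled to [0, 1] are all hypothetical. It shows only the geometric point that reordering the dimensions changes the shape of a membrane.

```python
import math

def membrane_vertices(values, order=None):
    """Return (x, y) vertices of one review's radial membrane.

    values: feature values assumed to be pre-scaled to [0, 1].
    order:  optional permutation of feature indices giving the axis order.
    """
    if order is None:
        order = range(len(values))
    ordered = [values[i] for i in order]
    n = len(ordered)
    # Axis k sits at angle 2*pi*k/n; the feature value is the radius.
    return [
        (r * math.cos(2 * math.pi * k / n), r * math.sin(2 * math.pi * k / n))
        for k, r in enumerate(ordered)
    ]

def membrane_area(vertices):
    """Area enclosed by the membrane, via the shoelace formula."""
    n = len(vertices)
    total = 0.0
    for k in range(n):
        x0, y0 = vertices[k]
        x1, y1 = vertices[(k + 1) % n]
        total += x0 * y1 - x1 * y0
    return abs(total) / 2.0

# Hypothetical review with four made-up feature values.
review = [1.0, 0.2, 1.0, 0.2]
area_default = membrane_area(membrane_vertices(review))
area_reordered = membrane_area(membrane_vertices(review, order=[0, 2, 1, 3]))
print(area_default, area_reordered)  # same values, different shapes and areas
```

Filling each such polygon with a translucent red or blue, one per review and class, would produce the kind of color overlap the abstract describes. The plotting step is left out here because the rendering details are not given in this record.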
Degree
M.S.
Advisors
Fang, Purdue University.
Subject Area
Information science|Computer science