An Iterative Method of Sentiment Analysis for Reliable User Evaluation

Jingyi Hui, Purdue University

Abstract

Benefited from the booming social network, reading posts from other users over internet is becoming one of commonest ways for people to intake information. One may also have noticed that sometimes we tend to focus on users provide well-founded analysis, rather than those merely who vent their emotions. This thesis aims at finding a simple and efficient way to recognize reliable information sources among countless internet users by examining the sentiments from their past postsTo achieve this goal, the research utilized a dataset of tweets about Apples stock price retrieved from Twitter. Key features we studied include post-date, user name, number of followers of that user, and the sentiment of that tweet. Prior to making further use of the dataset, tweets from users who do not have sufficient posts are filtered out. To compare user sentiments and the derivative of Apples stock price, we use Pearson correlation between them for to describe how well each user performs. Then we iteratively increase the weight of reliable users and lower the weight of untrustworthy users, the correlation between overall sentiment and the derivative of stock price will finally converge. The final correlations for individual users are their performance scores. Due to the chaos of real world data, manual segmentation via data visualization is also proposed as a denoise method to improve performance. Besides our method, other metrics can also be considered as user trust index, such as numbers of followers of each user. Experiments are conducted to prove that our method out performs others. With simple input, this method can be applied on a wide range of topics including election, economy, and job market.

Degree

M.Sc.

Advisors

Fang, Purdue University.

Subject Area

Artificial intelligence|Computer science|Geographic information science|Information science|Information Technology|Web Studies

Off-Campus Purdue Users:
To access this dissertation, please log in to our
proxy server
.

Share

COinS