Type

Journal Article

Authors

Thomas Brendan Murphy
Michael Salter-Townshend

Subjects

Mathematics

Topics

sentiment analysis, mixture model, bias modelling, crowdsourcing, Irish online news sources, EM algorithm, mixtures

Mixtures of biased sentiment analysers (2013)

Abstract

Modelling bias is an important consideration when dealing with inexpert annotations. We are concerned with training a classifier to perform sentiment analysis on news media articles, some of which have been manually annotated by volunteers. The classifier is trained on the words in the articles and then applied to non-annotated articles. In previous work we found that a joint estimation of the annotator biases and the classifier parameters performed better than estimation of the biases followed by training of the classifier. An important question follows from this result: can the annotators be usefully clustered into either predetermined or data-driven clusters, based on their biases? If so, such a clustering could be used to select, drop or otherwise categorise the annotators in a crowdsourcing task. This paper presents work on fitting a finite mixture model to the annotators’ bias. We develop a model and an algorithm and demonstrate its properties on simulated data. We then demonstrate the clustering that exists in our motivating dataset, namely the analysis of potentially economically relevant news articles from Irish online news sources.

Collections

Ireland -> University College Dublin -> College of Science
Ireland -> University College Dublin -> Insight Research Collection
Ireland -> University College Dublin -> Mathematics and Statistics Research Collection
Ireland -> University College Dublin -> School of Mathematics and Statistics
Ireland -> University College Dublin -> Institutes and Centres
Ireland -> University College Dublin -> Insight Centre for Data Analytics
