TY - GEN
T1 - A case-based approach to cross domain sentiment classification
AU - Ohana, Bruno
AU - Delany, Sarah Jane
AU - Tierney, Brendan
PY - 2012
Y1 - 2012
N2 - This paper considers the task of sentiment classification of subjective text across many domains, in particular on scenarios where no in-domain data is available. Motivated by the more general applicability of such methods, we propose an extensible approach to sentiment classification that leverages sentiment lexicons and out-of-domain data to build a case-based system where solutions to past cases are reused to predict the sentiment of new documents from an unknown domain. In our approach the case representation uses a set of features based on document statistics, while the case solution stores sentiment lexicons employed on past predictions allowing for later retrieval and reuse on similar documents. The case-based nature of our approach also allows for future improvements since new lexicons and classification methods can be added to the case base as they become available. On a cross domain experiment our method has shown robust results when compared to a baseline single-lexicon classifier where the lexicon has to be pre-selected for the domain in question.
AB - This paper considers the task of sentiment classification of subjective text across many domains, in particular on scenarios where no in-domain data is available. Motivated by the more general applicability of such methods, we propose an extensible approach to sentiment classification that leverages sentiment lexicons and out-of-domain data to build a case-based system where solutions to past cases are reused to predict the sentiment of new documents from an unknown domain. In our approach the case representation uses a set of features based on document statistics, while the case solution stores sentiment lexicons employed on past predictions allowing for later retrieval and reuse on similar documents. The case-based nature of our approach also allows for future improvements since new lexicons and classification methods can be added to the case base as they become available. On a cross domain experiment our method has shown robust results when compared to a baseline single-lexicon classifier where the lexicon has to be pre-selected for the domain in question.
KW - case-based reasoning
KW - sentiment classification
KW - sentiment lexicons
UR - http://www.scopus.com/inward/record.url?scp=84866645830&partnerID=8YFLogxK
U2 - 10.1007/978-3-642-32986-9_22
DO - 10.1007/978-3-642-32986-9_22
M3 - Conference contribution
AN - SCOPUS:84866645830
SN - 9783642329852
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 284
EP - 296
BT - Case-Based Reasoning Research and Development - 20th International Conference, ICCBR 2012, Proceedings
T2 - 20th International Conference on Case-Based Reasoning, ICCBR 2012
Y2 - 3 September 2012 through 6 September 2012
ER -