Abstract
Social media repositories serve as a significant source of evidence when extracting information related to the reputation of a particular entity (e.g., a particular politician, singer or company). Reputation management experts manually mine the social media repositories (in particular Twitter) for monitoring the reputation of a particular entity. Recently, the online reputation management evaluation campaign known as RepLab at CLEF has turned attention to devising computational methods for facilitating reputation management experts. A quite significant research challenge related to the above issue is to classify the reputation dimension of tweets with respect to entity names. More specifically, finding various aspects of a brand's reputation is an important task which can help companies in monitoring areas of their strengths and weaknesses in an effective manner. To address this issue in this paper we use dominant Wikipedia categories related to a reputation dimension in a random forest classifier. Additionally we also use tweet-specific features, language-specific features and similarity-based features. The experimental evaluations show a significant improvement over the baseline accuracy.
| Original language | English |
|---|---|
| Pages (from-to) | 1512-1518 |
| Number of pages | 7 |
| Journal | CEUR Workshop Proceedings |
| Volume | 1180 |
| Publication status | Published - 2014 |
| Externally published | Yes |
| Event | 2014 Cross Language Evaluation Forum Conference, CLEF 2014 - Sheffield, United Kingdom Duration: 15 Sep 2014 → 18 Sep 2014 |