Abstract
The success of Youtube has attracted a lot of users, which results in an increase of the number of comments present on Youtube channels. By analyzing those comments we could provide insight to the Youtubers that would help them to deliver better quality. Youtube is very popular in India. A majority of the population in India speak and write a mixture of two languages known as Hinglish for casual communication on social media. Our study focuses on the sentiment analysis of Hinglish comments on cookery channels. The unsupervised learning technique DBSCAN was employed in our work to find the different patterns in the comments data. We have modelled and evaluated both parametric and non-parametric learning algorithms. Logistic regression with the term frequency vectorizer gave 74.01% accuracy in Nisha Madulika’s dataset and 75.37% accuracy in Kabita’s Kitchen dataset. Each classifier is statistically tested in our study.
Original language | English |
---|---|
Article number | 37 |
Pages (from-to) | 1-19 |
Number of pages | 19 |
Journal | Big Data and Cognitive Computing |
Volume | 3 |
Issue number | 3 |
DOIs | |
Publication status | Published - Sep 2019 |
Keywords
- Cookery channels
- Hinglish
- Machine learning
- Sentiment analysis