Detail publikace

Sentiment Analysis Based on Support Vector Machine and Big Data

Originální název

Sentiment Analysis Based on Support Vector Machine and Big Data

Anglický název

Sentiment Analysis Based on Support Vector Machine and Big Data

Jazyk

en

Originální abstrakt

This paper deals with sentiment analysis in text documents, especially text valence detection. The proposed solution is based on Support Vector Machines classifier. This classifier was trained with huge amount of data and complex word combinations were analysed. For this purpose distributed learning on 112 processors was used. Datasets used for training and testing were automatically obtained from real user feedback on products from different web pages (and different product segments). The proposed solution has been evaluated with different languages – English, German, Czech and Spanish. This paper improves accuracy achieved with the Big Data approach about 11%. The best accuracy achieved in this work was 95.31% for recognition of positive and negative text valence. The described learning is fully automatic, can be applied to any language and no complicated preprocessing is needed.

Anglický abstrakt

This paper deals with sentiment analysis in text documents, especially text valence detection. The proposed solution is based on Support Vector Machines classifier. This classifier was trained with huge amount of data and complex word combinations were analysed. For this purpose distributed learning on 112 processors was used. Datasets used for training and testing were automatically obtained from real user feedback on products from different web pages (and different product segments). The proposed solution has been evaluated with different languages – English, German, Czech and Spanish. This paper improves accuracy achieved with the Big Data approach about 11%. The best accuracy achieved in this work was 95.31% for recognition of positive and negative text valence. The described learning is fully automatic, can be applied to any language and no complicated preprocessing is needed.

Dokumenty

BibTex


@inproceedings{BUT127869,
  author="Lukáš {Povoda} and Radim {Burget} and Malay Kishore {Dutta}",
  title="Sentiment Analysis Based on Support Vector Machine and Big Data",
  annote="This paper deals with sentiment analysis in text documents, especially text valence detection. The proposed solution is based on Support Vector Machines classifier. This classifier was trained with huge amount of data and complex word combinations were analysed. For this purpose distributed learning on 112 processors was used. Datasets used for training and testing were automatically obtained from real user feedback on products from different web pages (and different product segments). The proposed solution has been evaluated with different languages – English, German, Czech and Spanish. This paper improves accuracy achieved with the Big Data approach about 11%. The best accuracy achieved in this work was 95.31% for recognition of positive and negative text valence. The described learning is fully automatic, can be applied to any language and no complicated preprocessing is needed.",
  booktitle="Proceedings of the 39th International Conference on Telecommunication and Signal Processing, TSP 2016",
  chapter="127869",
  doi="10.1109/TSP.2016.7760939",
  howpublished="online",
  year="2016",
  month="june",
  pages="543--545",
  type="conference paper"
}