Aspect Based Sentiment Analysis Dataset
ABSA systems are usually tested on the same dataset they’re developed on but how would a system trained on the Semeval2016 dataset perform on new data from different sources? To assess ABSA real-world performances, we manually annotated a completely new dataset from Foursquare comments of about 215K user reviews in English of restaurants all over the world. From these reviews, we randomly selected 585 samples, which contain 1006 sentences, and annotated them with the SemEval2016 annotation guidelines for the restaurant domain.
By accessing the Aspect Based Sentiment Analysis Dataset (“ABSA Dataset”), you agree to be bound by the Foursquare Terms of Use. In addition, you are granted a limited, non-exclusive, non-transferable, non-sublicensable, revocable license to access and use the ABSA Dataset for purposes of academic research. You may not resell, republish, or make any commercial use of the ABSA Dataset or its contents.
Please contact Gianluca Monaci for any legal question regarding the data and Caroline Brun and Vassilina Nikoulina for any technical question.
If you use the data please cite the following paper:
Aspect Based Sentiment Analysis into the Wild, Caroline Brun and Vassilina Nikoulina, 9th Workshop on Computational Approaches to Subjectivity, Sentiment & Social Media Analysis
WASSA 2018 will be held in conjunction with 2018 Conference on Empirical Methods in Natural Language Processing EMNLP 2018, Brussels, Belgium, 2nd November – 4th November 2018
NAVER LABS Europe is a platinum sponsor at EMNLP. Come and meet us at booth number 5! Booth Demos: Machine Reading; Sentiment Aware Maps Our researchers in Natural Language Processing and Machine Learning and Optimization will be there to discuss research, career opportunities and research collaborations.