Review data sets for "Latent Aspect Rating Analysis"

  1. TripAdvisor Data Set (JSON, Text, Processed, Readme)

  2. Amazon MP3 Data Set (Text, Readme)

  3. Six Categories of Amazon Product Reviews (JSON, Readme)

When you are using above data sets in your research, please consider to cite the following papers:
  • Hongning Wang, Yue Lu and ChengXiang Zhai. Latent Aspect Rating Analysis without Aspect Keyword Supervision. The 17th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD'2011), P618-626, 2011.
  • Hongning Wang, Yue Lu and Chengxiang Zhai. Latent Aspect Rating Analysis on Review Text Data: A Rating Regression Approach. The 16th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD'2010), p783-792, 2010.

Forum data sets for "Online forum mining and opinion Networks analysis"

  1. Apple Discussion (Download, Readme)

  2. Google Earth (Download)

  3. CNET (Download)

When you are using above data sets in your research, please consider to cite the follow paper:
  • Hongning Wang, Chi Wang, ChengXiang Zhai and Jiawei Han. Learning Online Discussion Structures by Conditional Random Fields. The 34th Annual International ACM SIGIR Conference (SIGIR'2011), P435-444, 2011.
  1. Military.com (Download, Readme)

When you are using above data sets in your research, please consider to cite the follow paper:
  • Yue Lu, Hongning Wang, ChengXiang Zhai and Dan Roth. Unsupervised Discovery of Opposing Opinion Networks From Forum Discussions. The 21st ACM International Conference on Information and Knowledge Management (CIKM'2012), p1642-1646, 2012.

Review data sets for "Hidden Topic Sentiment Model"

  1. NewEgg Data Set (JSON, Readme)

  2. Amazon Data Set (JSON, Readme)

  3. Prior and Seed Words (ZIP)

When you are using above data sets in your research, please consider to cite the following papers:
  • Md Mustafizur Rahman and Hongning Wang. Hidden Topic Sentiment Model. The 25th International World-Wide Web Conference (WWW'2016). (to appear)