This document discusses building large multi-domain resources for Arabic sentiment analysis. It describes the problem of existing resources being small, domain-specific and not publicly available. The authors built multi-domain datasets from online reviews totaling over 33,000 reviews across multiple domains. They also built multi-domain sentiment lexicons containing around 2,000 entries using machine learning techniques. Experiments were conducted on the datasets to evaluate their effectiveness for sentiment analysis and provide benchmarking. The results and resources are made publicly available.
Related topics: