Efficient Extraction of Domain Specific Sentiment Lexicon with Active Learning
- categorize
- Machine Learning
- Month
- Apr
- Journal Name
- Pattern Recognition Letters
- Volume
- 56
- Page
- 38 - 44
- File
- 1-s2.0-S0167865515000227-main 1.pdf (1.2M) 30회 다운로드 DATE : 2023-11-09 22:12:23
Park, S.R., Lee, W.S., & Moon, I.C. (2015). Efficient Extraction of Domain Specific Sentiment Lexicon with Active Learning. Pattern Recognition Letters, 56(4), 38–44
Abstract
Recent research indicates that a sentiment lexicon focusing on a specific domain leads to better sentiment analyses compared to a general-purpose sentiment lexicon, such as SentiWordNet. In spite of this potential improvement, the cost of building a domain-specific sentiment lexicon hinders its wider and more practical applications. To compensate for this difficulty, we propose extracting a sentiment lexicon from a domain-specific corpus by annotating an intelligently selected subset of documents in the corpus. Specifically, the subset is selected by an active learner with initializations from diverse text analytics, i.e. latent Dirichlet allocation and our proposed lexicon coverage algorithm. This active learning produces a better domain-specific sentiment lexicon which results in a higher accuracy of the sentiment classification. Subsequently, we evaluate extracted sentiment lexicons by observing (1) the increased F1 measure in sentiment classifications and (2) the increased similarity to the sentiment lexicon with the full annotation. We expect that this contribution will enable more accurate sentiment classification by domain-specific sentiment lexicons with less sentiment tagging efforts.
@article{Park201538,
title = {Efficient extraction of domain specific sentiment lexicon with active learning},
journal = {Pattern Recognition Letters},
volume = {56},
pages = {38 - 44},
year = {2015},
issn = {0167-8655},
doi = {http://dx.doi.org/10.1016/j.patrec.2015.01.004},
url = {http://www.sciencedirect.com/science/article/pii/S0167865515000227},
author = {Sungrae Park and Wonsung Lee and Il-Chul Moon},
keywords = {Sentiment analysis, Active learning, Sentiment lexicon}
}
Source Website:
http://www.sciencedirect.com/science/article/pii/S0167865515000227