Crowdsourcing a word-emotion association lexicon

Par Conseil national de recherches du Canada

Téléchargement	Voir le manuscrit accepté : Crowdsourcing a word-emotion association lexicon (PDF, 747 Kio)
DOI	Trouver le DOI : https://doi.org/10.1111/j.1467-8640.2012.00460.x
Auteur	Rechercher : Mohammad, Saif M.¹; Rechercher : Turney, Peter D.¹
Affiliation	Conseil national de recherches du Canada. Technologies de l'information et des communications
Format	Texte, Article
Sujet	affect; Crowdsourcing; emotion lexicon; emotions; Mechanical turks; polarity lexicon; Semantic orientation; Sentiment analysis; Artificial intelligence; Computational methods; Semantics
Résumé	Even though considerable attention has been given to the polarity of words (positive and negative) and the creation of large polarity lexicons, research in emotion analysis has had to rely on limited and small emotion lexicons. In this paper, we show how the combined strength and wisdom of the crowds can be used to generate a large, high-quality, word-emotion and word-polarity association lexicon quickly and inexpensively. We enumerate the challenges in emotion annotation in a crowdsourcing scenario and propose solutions to address them. Most notably, in addition to questions about emotions associated with terms, we show how the inclusion of a word choice question can discourage malicious data entry, help to identify instances where the annotator may not be familiar with the target term (allowing us to reject such annotations), and help to obtain annotations at sense level (rather than at word level). We conducted experiments on how to formulate the emotion-annotation questions, and show that asking if a term is associated with an emotion leads to markedly higher interannotator agreement than that obtained by asking if a term evokes an emotion.
Date de publication	2012-09-04
Dans	Computational Intelligence 29, nº 3 (4 septembre 2012) : 436–465.
Langue	anglais
Publications évaluées par des pairs	Oui
Numéro NPARC	21270400
Exporter la notice	Exporter en format RIS
Signaler une correction	Signaler une correction (s'ouvre dans un nouvel onglet)
Identificateur de l’enregistrement	4d626836-c350-421f-9775-f549024097fd
Enregistrement créé	2014-02-07
Enregistrement modifié	2020-06-04

Date de modification :: 2025-05-09