Ampliación de lexicones de opinión específicos de dominio usando representaciones continuas de palabras

Fermín Cruz Mata; Fernando Enríquez de Salamanca Ros; Tomás López Solaz

Ayuda

Ampliación de lexicones de opinión específicos de dominio usando representaciones continuas de palabras

Autores: Fermín Cruz Mata , Fernando Enríquez de Salamanca Ros , Tomás López Solaz
Localización: Procesamiento del lenguaje natural, ISSN 1135-5948, Nº. 57, 2016, págs. 49-56
Idioma: español
Títulos paralelos:
- Expansion of domain-specific opinion lexicons using word embeddings
Enlaces
- Texto completo
Resumen
- español
  En este trabajo abordamos la ampliación de lexicones de opinión específicos de dominio a partir de textos del dominio elegido. El método se basa en la construcción de clasificadores que catalogan las palabras de entrada como positivas, negativas o neutras, y en un criterio estricto de selección de las palabras que pretende garantizar la precisión de las nuevas incorporaciones al lexicón. Se utilizan representaciones continuas de palabras (word embeddings) como espacio de características de los clasificadores. Los resultados confirman que dichas representaciones contienen información relativa a la polaridad de las palabras, obteniéndose una precisión en la selección de los candidatos y en la estimación de su polaridad de alrededor del 94% para los tres dominios analizados, con una cobertura en torno al 50% de las palabras de opinión contenidas en los textos de partida.
- English
  In this work we present a domain-specific opinion lexicon expansion method. The method is based on classifiers which categorize words as positive, negative or neutral, and a strict selection criteria of words intended to ensure the precision of the new additions to the lexicon. We use word embeddings as the feature space of the classifiers. The results confirm that these representations contain information on the polarity of the words, obtaining a precision in the selection of candidates and the estimation of its polarities of about 94% for the three domains analyzed, covering around 50% of the opinion words contained in the initial texts.
Referencias bibliográficas
- Baccianella, Stefano, Andrea Esuli, y Fabrizio Sebastiani. 2010. Sentiwordnet 3.0: An enhanced lexical resource for sentiment analysis and...
- Cardellino, Cristian. 2016. Spanish Billion Words Corpus and Embeddings, March.
- Cerini, S., V. Compagnoni, A. Demontis, M. Formentelli, y G. Gandini, 2007. Language resources and linguistic theory: Typology, second language...
- Cruz, Fermín L, José A Troyano, Fernando Enríquez, F Javier Ortega, y Carlos G Vallejo. 2013. Long autonomy or long delay? the importance...
- Cruz, Fermín L, José A Troyano, F Javier Ortega, y Fernando Enríquez. 2011. Automatic expansion of feature-level opinion lexicons. En Proceedings...
- Cruz, Fermín L, Carlos G Vallejo, Fernando Enrı, José A Troyano, y others. 2012. Polarityrank: Finding an equilibrium between followers and...
- Cruz, Fermín L., José A. Troyano, Beatriz Pontes, y F. Javier Ortega. 2014. Building layered, multilingual sentiment lexicons at synset and...
- Dumais, Susan T. 1995. Latent semantic indexing (lsi): Trec-3 report. Nist Special Publication SP, páginas 219–219.
- Dumais, Susan T. 2004. Latent semantic analysis. Annual review of information science and technology, 38(1):188–230.
- Esuli, Andrea y Fabrizio Sebastiani. 2006. Determining term subjectivity and term orientation for opinion mining. En Proceedings of the European...
- Hatzivassiloglou, Vasileios y Kathleen R. McKeown. 1997. Predicting the semantic orientation of adjectives. En Proceedings of the eighth conference...
- Hu, Minqing y Bing Liu. 2004. Mining and summarizing customer reviews. En Proceedings of the ACM SIGKDD Conference on Knowledge Discovery...
- Huang, Eric H, Richard Socher, Christopher D Manning, y Andrew Y Ng. 2012. Improving word representations via global context and multiple...
- Kamps, Jaap, Maarten Marx, Robert J. Mokken, y Maarten De Rijke. 2004. Using wordnet to measure semantic orientation of adjectives. En National...
- Kanayama, Hiroshi y Tetsuya Nasukawa. 2006. Fully automatic lexicon expansion for domain-oriented sentiment analysis. En EMNLP, páginas 355–363,...
- Kim, Joo-Kyung y Marie-Catherine de Marneffe. 2013. Deriving adjectival scales from continuous space word representations. En EMNLP, páginas...
- Mikolov, Tomas, Kai Chen, Greg Corrado, y Jeffrey Dean. 2013a. Efficient estimation of word representations in vector space. arXiv preprint...
- Mikolov, Tomas, Ilya Sutskever, Kai Chen, Greg S Corrado, y Jeff Dean. 2013b. Distributed representations of words and phrases and their compositionality....
- Molina-González, M Dolores, Eugenio Martínez-Cámara, M Teresa MartínValdivia, y L Alfonso Ure˜na-L´opez. 2015. A spanish semantic orientation...
- Molina-González, M Dolores, Eugenio Martínez-Cámara, María-Teresa Martín Valdivia, y José M Perea-Ortega. 2013. Semantic orientation for polarity...
- Pablos, Aitor García, Montse Cuadros, y German Rigau. 2015. Unsupervised word polarity tagging by exploiting continuous word representations....
- Pavlopoulos, John y Ion Androutsopoulos. 2014. Aspect term extraction for sentiment analysis: New datasets, new evaluation measures and an...
- Pedregosa, F., G. Varoquaux, A. Gramfort, V. Michel, B. Thirion, O. Grisel, M. Blondel, P. Prettenhofer, R. Weiss, V. Dubourg, J. Vanderplas,...
- Qiu, Guang, Bing Liu, Jiajun Bu, y Chun Chen. 2011. Opinion word expansion and target extraction through double propagation. Computational...
- Socher, Richard, Alex Perelygin, Jean Y Wu, Jason Chuang, Christopher D Manning, Andrew Y Ng, y Christopher Potts. 2013. Recursive deep models...
- Stone, Philip J. 1966. The General Inquirer: A Computer Approach to Content Analysis. The MIT Press.
- Tang, Duyu, Furu Wei, Nan Yang, Ming Zhou, Ting Liu, y Bing Qin. 2014. Learning sentiment-specific word embedding for twitter sentiment classification....
- Turian, Joseph, Lev Ratinov, y Yoshua Bengio. 2010. Word representations: a simple and general method for semi-supervised learning. En Proceedings...
- Turney, Peter D. y Michael L. Littman. 2003. Measuring praise and criticism: Inference of semantic orientation from association. ACM Transactions...
- Yu, Hong y Vasileios Hatzivassiloglou. 2003. Towards answering opinion questions: Separating facts from opinions and identifying the polarity...