Iulia Nica
Universitatea Alexandru Ioan Cuza Iasi, Litere, Faculty Member
- University of Salamanca, Departamento de Lengua española, Faculty Member
•
Abstract. We propose an alternative method for the automatic acquisition of sense-tagged corpora, based on the interaction between syntagmatic and paradigmatic information. The labeling process starts from an ambiguous occurrence integrated into a syntagmatic pattern. The strategy presupposes only a prior morphological analysis, uses no statistical information, and has a wide disambiguating potential. We present a case study and discuss the results obtained. Key words: sense tagging, semantic annotated ...
Publication Date: 2003
•
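Colaboración entre información paradigmática y sintagmática en la Desambiguación Semántica Automática (Collaboration between Paradigmatic and Syntagmatic Information in Automatic Word Sense Disambiguation)
by Iulia Nica and Andres Montoyo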
•
Word Sense Disambiguation (WSD) faces a lack of syntagmatic information associated with word senses. In the present work we propose a method for enriching EuroWordNet with syntagmatic information by means of the WSD process itself. We consider that the ambiguity of an occurrence is drastically reduced when the occurrence is considered together with the words with which it establishes syntactic relations in the sentence: the claim of “quasi one sense per syntactic relation”. On this hypothesis, we obtain sense-tagged syntactic ...
Publication Name: lrec-conf.org
•
Word Sense Disambiguation (WSD) faces a lack of syntagmatic information associated with word senses: the “gap” between lexicon (here EuroWordNet, EWN) and corpus. In the present work we propose to fill this gap by applying different strategies: on the one hand, we extract from the corpus paradigmatic information related to the ambiguous occurrence in a syntactic pattern and incorporate it into the WSD process; on the other hand, we derive discriminatory sets of senses from EWN for the ambiguous word and so ...
Publisher: hnk.ffzg.hr
Publication Date: Apr 20, 2004
Publication Name: Workshop Programme
•
Abstract. We propose an alternative method for the automatic acquisition of sense-tagged corpora, based on the interaction between syntagmatic and paradigmatic information. The labeling process starts from an ambiguous occurrence integrated into a syntagmatic pattern. The strategy presupposes only a prior morphological analysis, uses no statistical information, and has a wide disambiguating potential. We present a case study and discuss the results obtained. Key words: sense tagging, semantic annotated ...
Publisher: nats-www.informatik.uni-hamburg.de
Publication Date: 2003
Publication Name: Proceedings of RANLP’03
•
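Colaboración entre información paradigmática y sintagmática en la Desambiguación Semántica Automática (Collaboration between Paradigmatic and Syntagmatic Information in Automatic Word Sense Disambiguation)
by Iulia Nica and Andres Montoyo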
Abstract. We propose an alternative method for automatic word sense disambiguation, centered on the interaction between syntagmatic and paradigmatic information. The unit of the disambiguation process is an ambiguous occurrence integrated into a syntagmatic pattern. The strategy needs no sense-tagged corpus, presupposes only a prior morphosyntactic analysis and chunking, uses no statistical information, and has a wide disambiguating potential. We illustrate the two ...
Publisher: sepln.org
Publication Date: Sep 1, 2003
Publication Name: XIX Congreso de la SEPLN 2003
•
In this paper we propose a mixed method for Word Sense Disambiguation, which combines lexical knowledge from EuroWordNet with corpora. The method tries to give a partial solution to the problem of the gap between lexicon and corpus by bringing the corpus closer to the lexicon. On the basis of the interaction that holds in natural language between the syntagmatic and the paradigmatic axes, we extract from the corpus implicit information of a paradigmatic type. On the information thus obtained we work ...
Publication Date: 2004
Publication Name: Lecture Notes in Computer Science
•
The DLSI-UA team is currently working on several word sense disambiguation approaches, both supervised and unsupervised. These approaches are based on different ways of using both annotated and unannotated data, and several resources generated from or exploiting WordNet (Miller et al., 1993), WordNet Domains, EuroWordNet (EWN) and additional corpora. This paper presents an overview of different system results for Word Sense Disambiguation in different tasks of SENSEVAL-3.
•
Publisher: Berlin: Springer-Verlag, 1973-
Publication Date: 2004
Publication Name: Lecture Notes in Computer Science
•
Abstract: The paper addresses the issue of how to use linguistic information in Word Sense Disambiguation (WSD). We introduce a knowledge-driven and unsupervised WSD method that requires only a large corpus previously tagged with POS and very little grammatical knowledge. The WSD process takes into account the syntactic patterns in which the ambiguous occurrence appears, relying on the hypothesis of “almost one sense per syntactic pattern”. This integration allows us to obtain, from corpora, paradigmatic and ...
Publisher: sepln.org
Publication Date: Sep 1, 2004
Publication Name: Congreso de la SEPLN
•
The increasing flow of information requires advanced free text filtering. An important part of this task consists in eliminating word occurrences with an inappropriate sense, which corresponds to a Word Sense Disambiguation (WSD) operation. In this paper we propose a completely automatic WSD method for Spanish, restricted to nouns, to be used as a module in a Natural Language Processing system for unrestricted text. We call it the Commutative Test. This method exploits an adaptation of EuroWordNet, Sense Discriminators, that implicitly keeps all lexical-semantic relations of its nominal hierarchy. The only requirement is the availability of a large corpus and a part-of-speech tagger, with no need for prior sense tagging. The method has been evaluated on the Senseval test corpus. It can be easily adapted to other languages that have a corpus, a WordNet component and a part-of-speech tagger.
Publication Date: 2004
•
Publication Name: Procesamiento Del Lenguaje Natural
•
In this paper we propose a mixed method for Word Sense Disambiguation, which combines lexical knowledge from EuroWordNet with corpora. The method tries to give a partial solution to the problem of the gap between lexicon and corpus by bringing the corpus closer to the lexicon. On the basis of the interaction that holds in natural language between the syntagmatic and the paradigmatic axes, we extract from the corpus implicit information of a paradigmatic type. We then process the information thus obtained using the information, also paradigmatic, contained in EWN. We evaluate the method and interpret the results.