
    Iulia Nica

    Abstract. We propose an alternative method for the automatic acquisition of sense-tagged corpora, based on the interaction between syntagmatic and paradigmatic information. The labeling process starts from an ambiguous occurrence integrated into a syntagmatic pattern. The strategy presupposes only a prior morphological analysis, makes no use of statistical information, and has a wide disambiguating potential. We present a case study and discuss the results obtained. Key words: sense tagging, semantic annotated ...
    Word Sense Disambiguation (WSD) is confronted with a lack of syntagmatic information associated with word senses. In the present work we propose a method for enriching EuroWordNet with syntagmatic information by means of the WSD process itself. We consider that the ambiguity of an occurrence is drastically reduced when it is considered together with the words with which it establishes syntactic relations in the sentence: the claim of “quasi one sense per syntactic relation”. On this hypothesis, we obtain sense-tagged syntactic ...
    Word Sense Disambiguation (WSD) is confronted with a lack of syntagmatic information associated with word senses: the “gap” between lexicon (here EuroWordNet, EWN) and corpus. In the present work we propose to fill this gap by applying different strategies: on the one hand, we extract from the corpus paradigmatic information related to the ambiguous occurrence in a syntactic pattern and incorporate it into the WSD process; on the other hand, we derive discriminatory sets of senses from EWN for the ambiguous word and so ...
    Abstract. We propose an alternative method for automatic word sense disambiguation, centered on the interaction between syntagmatic and paradigmatic information. The unit of the disambiguation process is an ambiguous occurrence integrated into a syntagmatic pattern. The strategy does not need a sense-tagged corpus; it presupposes only a prior morphosyntactic analysis and chunking, uses no statistical information, and has a wide disambiguating potential. We illustrate the two ...
    The DLSI-UA team is currently working on several word sense disambiguation approaches, both supervised and unsupervised. These approaches are based on different ways to use both annotated and unannotated data, and several resources generated from or exploiting WordNet (Miller et al., 1993), WordNet Domains, EuroWordNet (EWN) and additional corpora. This paper presents a view of different system results for Word Sense Disambiguation in different tasks of SENSEVAL-3.
    Abstract: The paper addresses the issue of how to use linguistic information in Word Sense Disambiguation (WSD). We introduce a knowledge-driven and unsupervised WSD method that requires only a large corpus previously tagged with POS and very little grammatical knowledge. The WSD process takes into account the syntactic patterns in which the ambiguous occurrence appears, relying on the hypothesis of “almost one sense per syntactic pattern”. This integration allows us to obtain, from corpora, paradigmatic and ...
    The increasing flow of information requires advanced free text filtering. An important part of this task consists of eliminating word occurrences with an inappropriate sense, which corresponds to a Word Sense Disambiguation (WSD) operation. In this paper we propose a completely automatic WSD method for Spanish, restricted to nouns, to be used as a module in a Natural Language Processing system for unrestricted text. We call it the Commutative Test. This method exploits an adaptation of EuroWordNet, Sense Discriminators, that implicitly keeps all lexical-semantic relations of its nominal hierarchy. The only requirements are a large corpus and a part-of-speech tagger; no previous sense tagging is needed. The method has been evaluated on the Senseval test corpus. It can be easily adapted to other languages for which a corpus, a WordNet component and a part-of-speech tagger are available.
    In this paper we propose a mixed method for Word Sense Disambiguation, which combines lexical knowledge from EuroWordNet (EWN) with corpora. The method tries to give a partial solution to the problem of the gap between lexicon and corpus by bringing the corpus closer to the lexicon. On the basis of the interaction that holds in natural language between the syntagmatic and the paradigmatic axes, we extract implicit paradigmatic information from the corpus. We then process the information thus obtained using the information, also paradigmatic, contained in EWN. We evaluate the method and interpret the results.