Specification of a general linguistic annotation framework and its use in a real context

Artola Zubillaga, Xabier; Díaz de Ilarraza Sánchez, Arantza; Sologaistoa Fresno, Aitor; Soroa Etxabe, Aitor

Specification of a general linguistic annotation framework and its use in a real context

Por favor, use este identificador para citar o enlazar este ítem: http://hdl.handle.net/10045/2996

Información del item - Informació de l'item - Item information
Título:	Specification of a general linguistic annotation framework and its use in a real context
Autor/es:	Artola Zubillaga, Xabier \| Díaz de Ilarraza Sánchez, Arantza \| Sologaistoa Fresno, Aitor \| Soroa Etxabe, Aitor
Palabras clave:	Modelo de anotación \| Arquitectura para la integración \| TEI-P4 \| Annotation model \| Integration architecture
Fecha de publicación:	sep-2007
Editor:	Sociedad Española para el Procesamiento del Lenguaje Natural
Cita bibliográfica:	ARTOLA ZUBILLAGA, Xabier, et al. "Specification of a general linguistic annotation framework and its use in a real context". Procesamiento del lenguaje natural. N. 39 (sept. 2007). ISSN 1135-5948, pp. 157-164
Resumen:	AWA es una arquitectura general para representar información lingüística producida por procesadores lingüísticos. Nuestro objetivo es definir un esquema de representación coherente y flexible que sea la base del intercambio de información entre herramientas lingüísticas de cualquier tipo. Los análisis lingüísticos se representan por medio de estructuras de rasgos según las directrices de TEI-P4. Estas estructuras y su relación con los demás elementos que componen el análisis forman parte de un modelo de datos diseñado bajo el paradigma de orientación a objetos. AWA se encarga de la representación de la información dentro de una arquitectura más amplia para gestionar todo el proceso de análisis de un corpus. Como ejemplo de la utilidad del modelo presentado explicaremos cómo se ha aplicado dicho modelo en el procesamiento de dos corpus. \| In this paper we present AWA, a general architecture for representing the linguistic information produced by diverse linguistic processors. Our aim is to establish a coherent and flexible representation scheme that will be the basis for the exchange of information. We use TEI-P4 conformant feature structures as a representation schema for linguistic analyses. A consistent underlying data model, which captures the structure and relations contained in the information to be manipulated, has been identified and implemented by a set of classes following the object-oriented paradigm. As an example of the usefulness of the model, we will show the usage of the framework in a real context: two corpora have been annotated by means of an application which aim is to exploit and manipulate the data created by the linguistic processors developed so far.
URI:	http://hdl.handle.net/10045/2996
ISSN:	1135-5948
Idioma:	eng
Tipo:	info:eu-repo/semantics/article
Aparece en las colecciones:	Procesamiento del Lenguaje Natural - Nº 39 (septiembre 2007)

Archivos en este ítem:

Archivos en este ítem:
Archivo	Descripción	Tamaño	Formato
PLN_39_19.pdf		183,42 kB	Adobe PDF	Abrir Vista previa Cerrar vista previa

Ver citas en Google Académico

Muestra el registro completo