Describing biomedical document sets in terms of its most distinctive facts

Ramírez Cruz, Yunior; Berlanga Llavori, Rafael; Pons Porrata, Aurora

Describing biomedical document sets in terms of its most distinctive facts

Por favor, use este identificador para citar o enlazar este ítem: http://hdl.handle.net/10045/11710

Información del item - Informació de l'item - Item information
Título:	Describing biomedical document sets in terms of its most distinctive facts
Título alternativo:	Descripción de conjuntos de documentos biomédicos a través de sus hechos más distintivos
Autor/es:	Ramírez Cruz, Yunior \| Berlanga Llavori, Rafael \| Pons Porrata, Aurora
Palabras clave:	Minería de textos \| Recuperación de información \| Aplicaciones biomédicas \| Text mining \| Information retrieval \| Biomedical applications
Área/s de conocimiento:	Lenguajes y Sistemas Informáticos
Fecha de publicación:	sep-2009
Editor:	Sociedad Española para el Procesamiento del Lenguaje Natural
Cita bibliográfica:	RAMÍREZ CRUZ, Yunior; BERLANGA LLAVORI, Rafael; PONS PORRATA, Aurora. “Describing biomedical document sets in terms of its most distinctive facts”. Procesamiento del lenguaje natural. N. 43 (sept. 2009). ISSN 1135-5948, pp. 159-167
Resumen:	En este artículo proponemos un método para describir un conjunto de documentos biomédicos, conceptualmente indexados, a través de sus hechos más distintivos. Estos documentos han sido recuperados como soporte de un concepto foco, el cual representa una necesidad de información. Los hechos utilizados para la descripción son unidades de información concisas, representadas mediante tripletas con la forma entidad-verbo-entidad. Estos se presentan ordenados por su relevancia con respecto al concepto foco, la cual se calcula usando modelos de lenguajes. Los resultados experimentales, obtenidos sobre tres conjuntos de documentos de una colección extraída de MEDLINE, son prometedores. \| In this paper, we propose a method to describe a set of conceptually indexed biomedical documents in terms of its most distinctive facts. These documents are retrieved to support the occurrence of a focus concept, which expresses an information need. The facts used for description are concise information units, represented as triples of the form entity-verb-entity. These are presented as a ranked list, ordered by their relevance with respect to the focus concept, which is determined using a language modeling approach. Experimental results, obtained on three document sets over a collection extracted from MEDLINE, are promising.
Patrocinador/es:	This work has been partially funded by the CICYT Project TIN2008-01825/TIN and the Research Promotion Program 2008 of Universitat Jaume I, Spain.
URI:	http://hdl.handle.net/10045/11710
ISSN:	1135-5948
Idioma:	eng
Tipo:	info:eu-repo/semantics/article
Revisión científica:	si
Aparece en las colecciones:	Procesamiento del Lenguaje Natural - Nº 43 (septiembre 2009)

Archivos en este ítem:

Archivos en este ítem:
Archivo	Descripción	Tamaño	Formato
PLN_43_18.pdf		219,2 kB	Adobe PDF	Abrir Vista previa Cerrar vista previa

Ver citas en Google Académico

Muestra el registro completo