Research Interests:
For the participation of the University of Alicante in the first cross-language Geographic Information Retrieval, we have developed a system made up of three modules. One of them is an Information Retrieval module and the others are Named... more
For the participation of the University of Alicante in the first cross-language Geographic Information Retrieval, we have developed a system made up of three modules. One of them is an Information Retrieval module and the others are Named Entity Recognition modules based on machine learning and based on knowledge. We have carried out several runs with different combinations of these modules for resolving the monolingual and bilingual tasks. The system obtained better result in monolingual task achieving an improvement between 48 % and 69 % above the average. The results are shown and discussed in the paper.
Research Interests:
Research Interests:
Research Interests:
Research Interests:
Research Interests:
Research Interests:
Research Interests:
Research Interests:
Research Interests:
Research Interests:
Research Interests:
Research Interests:
Research Interests:
The aim of GeoCLEF 2005 is to retrieve relevant documents by using geographic tags [2]. Nowadays, the fast development of Geographic Information Systems (GIS) involves the need of Geographic Information Retrieval Systems (GIR) that help... more
The aim of GeoCLEF 2005 is to retrieve relevant documents by using geographic tags [2]. Nowadays, the fast development of Geographic Information Systems (GIS) involves the need of Geographic Information Retrieval Systems (GIR) that help GIS systems to obtain ...
Research Interests:
Research Interests:
For our participation in the second edition of GeoCLEF, we have researched the incorporation of geographic knowledge into Geographic Information Retrieval (GIR). Our system is made up of an IR module (IR-n) and a Geographic Knowledge... more
For our participation in the second edition of GeoCLEF, we have researched the incorporation of geographic knowledge into Geographic Information Retrieval (GIR). Our system is made up of an IR module (IR-n) and a Geographic Knowledge module (Geonames). The results show that the addition of geographic knowledge has a negative impact on the precision. However, the fact that for some topics the obtained results are better, makes us conclude that the addition of this knowledge could be useful but further research is needed in order to determine how.
Research Interests:
De-identification allows faster and more collaborative clinical research while protecting patient confidentiality. Clinical narrative de-identification is a tedious process that can be alleviated by automated natural language processing... more
De-identification allows faster and more collaborative clinical research while protecting patient confidentiality. Clinical narrative de-identification is a tedious process that can be alleviated by automated natural language processing methods. The goal of this research is the development of an automated text de-identification system for Veterans Health Administration (VHA) clinical documents. We devised a novel stepwise hybrid approach designed to improve the current strategies used for text de-identification. The proposed system is based on a previous study on the best de-identification methods for VHA documents. This best-of-breed automated clinical text de-identification system (aka BoB) tackles the problem as two separate tasks: (1) maximize patient confidentiality by redacting as much protected health information (PHI) as possible; and (2) leave de-identified documents in a usable state preserving as much clinical information as possible. We evaluated BoB with a manually annotated corpus of a variety of VHA clinical notes, as well as with the 2006 i2b2 de-identification challenge corpus. We present evaluations at the instance- and token-level, with detailed results for BoB's main components. Moreover, an existing text de-identification system was also included in our evaluation. BoB's design efficiently takes advantage of the methods implemented in its pipeline, resulting in high sensitivity values (especially for sensitive PHI categories) and a limited number of false positives. Our system successfully addressed VHA clinical document de-identification, and its hybrid stepwise design demonstrates robustness and efficiency, prioritizing patient confidentiality while leaving most clinical information intact.