Ir al contenido

Documat


Applying a text mining framework to the extraction of numerical parameters from scientific literature in the biotechnology domain

  • SANTOS, André [1] ; NOGUEIRA, Regina [1] ; LOURENÇO, Anália [1]
    1. [1] Centre of Biological Engineering
  • Localización: ADCAIJ: Advances in Distributed Computing and Artificial Intelligence Journal, ISSN-e 2255-2863, Vol. 1, Nº. 1, 2012, págs. 1-8
  • Idioma: inglés
  • Enlaces
  • Resumen
    • Scientific publications are the main vehicle to disseminate information in the field of biotechnology for wastewater treatment. Indeed, the new research paradigms and the application of high-throughput technologies have increased the rate of publication considerably. The problem is that manual curation becomes harder, prone-to-errors and time-consuming, leading to a probable loss of information and inefficient knowledge acquisition. As a result, research outputs are hardly reaching engineers, hampering the calibration of mathematical models used to optimize the stability and performance of biotechnological systems. In this context, we have developed a data curation workflow, based on text mining techniques, to extract numerical parameters from scientific literature, and applied it to the biotechnology domain. A workflow was built to process wastewater-related articles with the main goal of identifying physico-chemical parameters mentioned in the text. This work describes the implementation of the workflow, identifies achievements and current limitations in the overall process, and presents the results obtained for a corpus of 50 full-text documents.

  • Referencias bibliográficas
    • Ceccaroni, L., Cortés, U., and Sànchez-Marrè , M. OntoWEDSS: augmenting environmental decision-support systems with ontologies. Environmental...
    • Dionisi, H., Layton, A., Robinson, K., Brown, J., Gregory, I., Parker, J., and Sayler, G. Quantification of nitrosomonas oligotropha and nitrospira...
    • Gerner, M., Nenadic, G., and Bergman, C. Linnaeus: a species name identification system for biomedical literature. BMC Bioinformatics, 11(1)(2010)...
    • Hamouda, M., Anderson, W., Huck, P., et al. Decision support systems in water and wastewater treatment process selection and design: a review....
    • Koegst, T., Tränckner, J., Blumensaat, F., Eichhorn, J., and Mayer-Eichberger, V. On the use of an ontology for the identification of degrees...
    • Krallinger, M., Leitner, F., and Valencia, A. Analysis of biological processes and diseases using text mining approaches. Methods in Molecular...
    • Limpiyakorn, T., Kurisu, F., and Yagi, O. Development and application of real-time pcr for quantification of specific ammonia-oxidizing bacteria...
    • Nogueira, R. and Melo, L. Competition between nitrospira spp. and nitrobacter spp. in nitrite-oxidizing bioreactors. Biotechnology and bioengineering,...
    • Nogueira, R., Melo, L., Purkhold, U., Wuertz, S., and Wagner, M. Nitrifying and heterotrophic population dynamics in biofilm reactors: effects...
    • Tamames, J. and De Lorenzo, V. Envmine: A text-mining system for the automatic extraction of contextual information. BMC bioinformatics, 11(1)(2010)...

Fundación Dialnet

Mi Documat

Opciones de artículo

Opciones de compartir

Opciones de entorno