Ir al contenido

Documat


Automated Web Information Collection System for Analyzing ICT Innovations in Companies

  • Francisco Hermo [1] ; Ángel Gómez [1] ; Carlos Dafonte [1] Árbol académico
    1. [1] Universidade da Coruña

      Universidade da Coruña

      A Coruña, España

  • Localización: Proceedings XoveTIC 2024: Impulsando el talento científico / coord. por Manuel Lagos Rodríguez, Tirso Varela Rodeiro, Javier Pereira-Loureiro Árbol académico, Manuel Penedo Árbol académico, 2024, págs. 19-24
  • Idioma: inglés
  • Enlaces
  • Resumen
    • For companies, having ICT innovation capabilities on their websites is an essential factor for staying competitive in the current market. Additionally, collecting this type of public information can be very useful for analytical and statistical purposes. Our project focuses on developing a system that allows us to automatically collect information about the technological innovations companies add to their websites, search for it, extract it, and turn it into exploitable data. To achieve this, we are going to use the well-known technique of Web Scraping, which allows us to track and extract data from the various websites we are interested in. The extracted information will be processed and exported into JSON format files, ensuring its future exploitation.


Fundación Dialnet

Mi Documat

Opciones de artículo

Opciones de compartir

Opciones de entorno