Ir al contenido

Documat


Enabling efficient distributed spatial join on large scale vector-raster data lakes

  • Sebastián Villarroya [1] ; José R. R. Viqueira [1] Árbol académico ; José Manuel Cotos [1] Árbol académico ; José Ángel Taboada [1] Árbol académico
    1. [1] Universidade de Santiago de Compostela

      Universidade de Santiago de Compostela

      Santiago de Compostela, España

  • Localización: Actas de las XXVII Jornadas de Ingeniería del Software y Bases de Datos (JISBD 2023) / coord. por Amador Durán Toro Árbol académico, 2023
  • Idioma: inglés
  • Texto completo no disponible (Saber más ...)
  • Resumen
    • Both the increasing number of GPS-enabled mobile devices and the geographic crowd-sourcing initiatives, such as Open Street Map, are determinants for the large amount of vector spatial data that is currently being produced. On the other hand, the automatic generation of raster data by remote sensing devices and environmental modeling processes was always leading to very large datasets. Currently, huge data generation rates are reached by improved sensor observation systems and data processing infrastructures. As an example, the Sentinel Data Access System of the Copernicus Program of the European Space Agency (ESA) was publishing 38.71 TB of data per day during 2020. This paper shows how the assumption of a new spatial data model that includes multi-resolution parametric spatial data types, enables achieving an efficient implementation of a large scale distributed spatial analysis system for integrated vector-raster data lakes. In particular, the proposed implementation outperforms the state-of-the-art Spark-based spatial analysis systems by more than one order of magnitude during vector raster spatial join evaluation.


Fundación Dialnet

Mi Documat

Opciones de artículo

Opciones de compartir

Opciones de entorno