
A quadtree approach based on European geographic grids: reconciling data privacy and accuracy

  • Raymond Lagonigro [1] ; Ramon Oller [1] ; Joan Carles Martori [1]
    1. [1] Departament d’Economia i Empresa, Universitat de Vic - Universitat Central de Catalunya
  • Published in: SORT: Statistics and Operations Research Transactions, ISSN 1696-2281, Vol. 41, No. 1, 2017, pp. 139-158
  • Language: English
  • Abstract
    • Methods that preserve confidentiality when publishing geographic information conflict with the need to publish accurate data. The goal of this paper is to create a European geographic grid framework for disseminating statistical data over maps. We propose a methodology based on quadtree hierarchical geographic data structures. We create a variable-size grid adapted to local area densities. Highly populated zones are disaggregated into small squares to allow dissemination of accurate data, whereas information on sparsely populated zones is published in large squares to avoid identification of individual data. The methodology has been applied to the 2014 population register data for Catalonia.
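
The variable-size grid described in the abstract can be illustrated with a minimal quadtree sketch. This is not the authors' implementation: the splitting rule (divide a square only when none of its four quadrants would hold between 1 and `threshold - 1` residents), the names `Cell` and `build_grid`, and the parameters `threshold` and `min_size` are all assumptions made for illustration. The caller is also assumed to pass a top-level square that is itself safe to publish.

```python
from dataclasses import dataclass
from typing import List, Tuple

Point = Tuple[float, float]


@dataclass
class Cell:
    """A published grid square: lower-left corner, side length, resident count."""
    x: float
    y: float
    size: float
    count: int


def build_grid(points: List[Point], x: float, y: float, size: float,
               min_size: float, threshold: int) -> List[Cell]:
    """Recursively partition the square [x, x+size) x [y, y+size).

    Assumed rule (hypothetical, for illustration only): a square is split
    into four quadrants only if no quadrant would end up with a non-zero
    count below `threshold`, and quadrants are never smaller than `min_size`.
    """
    inside = [(px, py) for px, py in points
              if x <= px < x + size and y <= py < y + size]
    if not inside:
        # Empty squares carry no disclosure risk and need no refinement.
        return [Cell(x, y, size, 0)]

    half = size / 2
    if half < min_size:
        # Finest allowed resolution reached.
        return [Cell(x, y, size, len(inside))]

    corners = [(x, y), (x + half, y), (x, y + half), (x + half, y + half)]
    children = [
        [(px, py) for px, py in inside
         if cx <= px < cx + half and cy <= py < cy + half]
        for cx, cy in corners
    ]
    if any(0 < len(pts) < threshold for pts in children):
        # Splitting would expose a sparsely populated quadrant: keep the big square.
        return [Cell(x, y, size, len(inside))]

    leaves: List[Cell] = []
    for (cx, cy), pts in zip(corners, children):
        leaves.extend(build_grid(pts, cx, cy, half, min_size, threshold))
    return leaves
```

Called on a set of geocoded residences, the function returns small squares where the population is dense and large squares where it is sparse, so every non-empty published square meets the assumed minimum count.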

