Ir al contenido

Documat


A Study of Convex Hulls Intersection in Clustering with Cardinality Constraints

  • Autores: Agusti Solanas Árbol académico
  • Localización: XXX Congreso Nacional de Estadística e Investigación Operativa y de las IV Jornadas de Estadística Pública: actas, 2007, ISBN 978-84-690-7249-3
  • Idioma: inglés
  • Texto completo no disponible (Saber más ...)
  • Resumen
    • Statistical disclosure control (SDC) seeks to transform data in such a way that they can be publicly released whilst preserving data utility and statistical confidentiality. Controlling statistical disclosure is specially important when the protected data belong to individual respondents or entities whose privacy could be put in jeopardy. This kind of data is known as micro-data and one of the most popular techniques for protecting micro-data is micro-aggregation.

      Given a data set D, the micro-aggregation problem consists of two steps: (i) generate subsets of D such as the homogeneity of the subsets is maximised and the cardinality of each subset is at least k, (ii) compute the centroid of each subset and replace the original elements in D by the centroid of the subset to which they belong. Multivariate micro-aggregation can be seen as a clustering problem with constraints in the size of the clusters. Each cluster (i.e. a set of points in Rn) can be wrapped by a convex hull.

      In this article we study the intersection of the convex hulls that wrap the elements of D. We observe that the existence of such intersections leads to a poor within-group homogeneity in terms of sum of square errors (SSE).


Fundación Dialnet

Mi Documat

Opciones de artículo

Opciones de compartir

Opciones de entorno