Search
 
Home| Contact Us| Join Our Mailing List| New Journals| Browse Journals| Journal Prices| For Authors| Advanced Search
HOME > JOURNALS BY SUBJECT > COMPUTER SCIENCE > IJUFKS
International Journal of Uncertainty, Fuzziness and Knowledge-Based Systems (IJUFKS)
Current Issue | 2009 | 2008 | 2007 | All Volumes (1993-2009)

Volume: 16, Supplementary Issue 1(2008) pp. 121-138     DOI: 10.1142/S0218488508005285
Abstract | Full Text (PDF, 243KB) | References
Title: HOW TO GROUP ATTRIBUTES IN MULTIVARIATE MICROAGGREGATION
Author(s):
JORDI NIN
IIIA, Artificial Intelligence Research Institute, CSIC, Spanish National Research Council, Campus UAB s/n, 08193 Bellaterra, Catalonia, Spain

JAVIER HERRANZ
IIIA, Artificial Intelligence Research Institute, CSIC, Spanish National Research Council, Campus UAB s/n, 08193 Bellaterra, Catalonia, Spain

VICENÇ TORRA
IIIA, Artificial Intelligence Research Institute, CSIC, Spanish National Research Council, Campus UAB s/n, 08193 Bellaterra, Catalonia, Spain
History:
Received 12 October 2007
Revised 31 January 2008
Accepted 15 February 2008
Abstract:
Microaggregation is one of the most employed microdata protection methods. It builds clusters of at least k original records, and then replaces these records with the centroid of the cluster. When the number of attributes of the dataset is large, one usually splits the dataset into smaller blocks of attributes, and then applies microaggregation to each block, successively and independently. In this way, the effect of the noise introduced by microaggregation is reduced, at the cost of losing the k-anonymity property.

In this work we show that, besides the specific microaggregation method, the value of the parameter k and the number of blocks in which the dataset is split, there exists another factor which influences the quality of the microaggregation: the way in which the attributes are grouped to form the blocks. When correlated attributes are grouped in the same block, the statistical utility of the protected dataset is higher. In contrast, when correlated attributes are dispersed into different blocks, the achieved anonymity is higher, and so, the disclosure risk is lower. We present quantitative evaluations of such statements based on different experiments on real datasets.
Keywords:
Microaggregation; attribute selection; statistical disclosure control

Imperial College Press  |  Global Publishing  |  Asia-Pacific Biotech News  |  Innovation Magazine
Labcreations Co  |  Meeting Matters  |  National Academies Press

World Scientific is a Member of CrossRef

Copyright © 2010 World Scientific Publishing Co. All rights reserved.