Publication Type:
Conference Proceedings
Source:
6th International Summer School on Aggregation Operators-AGOP2011, Lulu.com, Univ. of Sannio, Benevento, Italy, p.223--228 (2011)
ISBN:
978-1-4477-7019-0
URL:
http://agop2011.ciselab.org/proceedings
Keywords:
data privacy;
record linkage;
disclosure risk;
Mahalanobis distance;
fuzzy measure;
Choquet integral
Abstract:
In data privacy, record linkage is a well known technique used to evaluate the disclosure risk of protected data. Mainly, the idea is the linkage between records of different databases, which make reference to the same individuals. In this paper we introduce a new parametrized variation of record linkage relying on the Mahalanobis distance, and a supervised learning method to determine the optimum simulated covariance matrix for the linkage process. We evaluate and compare our proposal with other studied parametrized and not parametrized variations of record linkage, such as weighted mean or the Choquet integral, which determines the optimal fuzzy measure.