Published on IIIA (http://www.iiia.csic.es)

Home > Publications > Content

Improving record linkage with supervised learning for disclosure risk assessment

  • data privacy
  • record linkage

Publication Type:

Journal Article

Authors:

Daniel Abril [1]; Guillermo Navarro-Arribas [2]; Vicenç Torra [3]

Source:

Information Fusion, Volume 13, Issue 4, p.274-284 (2012)

URL:

http://www.sciencedirect.com/science/article/pii/S1566253511000352 [4]

Abstract:

In data privacy, record linkage can be used as an estimator of the disclosure risk of protected data. To model the worst case scenario one normally attempts to link records from the original data to the protected data. In this paper we introduce a parametrization of record linkage in terms of a weighted mean and its weights, and provide a supervised learning method to determine the optimum weights for the linkage process. That is, the parameters yielding a maximal record linkage between the protected and original data. We compare our method to standard record linkage with data from several protection methods widely used in statistical disclosure control, and study the results taking into account the performance in the linkage process, and its computational effort.

  • Tagged [5]
  • XML [6]
  • BibTex [7]
Projects: 
ARES [8]
IIIA-CSIC
Campus de la UAB, E-08193 Bellaterra, Catalonia (Spain)
Tel: (+34) 93 580 9570 - Fax: (+34) 93 580 9661

Source URL: http://www.iiia.csic.es/en/publications/improving-record-linkage-supervised-learning-disclosure-risk-assessment

Links:
[1] http://www.iiia.csic.es/en/individual/daniel-abril
[2] http://www.iiia.csic.es/en/individual/guillermo-navarro-arribas
[3] http://www.iiia.csic.es/en/individual/vicenc-torra
[4] http://www.sciencedirect.com/science/article/pii/S1566253511000352
[5] http://www.iiia.csic.es/en/publications/export/tagged/4759
[6] http://www.iiia.csic.es/en/publications/export/xml/4759
[7] http://www.iiia.csic.es/en/publications/export/bib/4759
[8] http://www.iiia.csic.es/en/project/ares