Remember Me
Or use your Academic/Social account:


Or use your Academic/Social account:


You have just completed your registration at OpenAire.

Before you can login to the site, you will need to activate your account. An e-mail will be sent to you with the proper instructions.


Please note that this site is currently undergoing Beta testing.
Any new content you create is not guaranteed to be present to the final version of the site upon release.

Thank you for your patience,
OpenAire Dev Team.

Close This Message


Verify Password:
Verify E-mail:
*All Fields Are Required.
Please Verify You Are Human:
fbtwitterlinkedinvimeoflicker grey 14rssslideshare1
Schnell, R.; Borgs, C. (2015)
Languages: English
Types: Unknown
Subjects: RG, QA75
To assess the quality of hospital care, national databases of standard medical procedures are common. A widely known example are national databases of births. If unique personal identification numbers are available (as in Scandinavian countries), the construction of such databases is trivial from a computational point of view. However, due to privacy legislation, such identifiers are not available in all countries. Given such constraints, the construction of a national perinatal database has to rely on other patient identifiers, such as names and dates of birth. These kind of identifiers are prone to errors. Furthermore, some jurisdictions require the encryption of personal identifiers. The resulting problem is therefore an example of Privacy Preserving Record Linkage (PPRL). This contribution describes the design considerations for a national perinatal database using data of about 600,000 births in about 1,000 hospitals. Based on simulations, recommendations for parameter settings of Bloom filter based PPRL are given for this real world application.
  • The results below are discovered through our pilot algorithms. Let us know how we are doing!

    • T. Bachteler and J. Reiher, “Verkn u¨pfung der Geburtshilfe- und Neonatalogiedaten des Jahres 2011 im Auftrag des AQUA-Instituts,” German Record Linkage Center, Tech. Rep., 2012.
    • X.-H. Zhou, C. Zhou, D. Liu, and X. Ding, Applied Missing Data Analysis in the Health Sciences. Hoboken: Wiley, 2014.
    • [3] J. K. Leiss, D. Giles, K. M. Sullivan, R. Mathews, G. Sentelle, and K. M. Tomashek, “U.S. maternally linked birth records may be biased for Hispanics and other population groups,” Annals of Epidemiology, vol. 20, no. 1, pp. 23-31, Jan. 2010.
    • [4] J. T. Lariscy, “Differential record linkage by Hispanic ethnicity and age in linked mortality studies: implications for the epidemiologic paradox,” Journal of Aging and Health, vol. 23, no. 8, pp. 1263-1284, Dec. 2011.
    • [5] I. Baldi, A. Ponti, R. Zanetti, G. Ciccone, F. Merletti, and D. Gregori, “The impact of record-linkage bias in the Cox model,” Journal of Evaluation in Clinical Practice, vol. 16, no. 1, pp. 92-96, 2010.
    • K. Harron, A. Wade, R. Gilbert, B. Muller-Pebody, and H. Goldstein, “Evaluating bias due to data linkage error in electronic healthcare records,” BMC Medical Research Methodology, vol. 14, no. 1, p. 36, 2014.
    • [7] J. B. Ford, C. L. Roberts, and L. K. Taylor, “Characteristics of unmatched maternal and baby records in linked birth records and hospital discharge data,” Paediatric and Perinatal Epidemiology, vol. 20, no. 4, pp. 329-337, 2006.
    • [8] J. P. Bentley, J. B. Ford, L. K. Taylor, K. A. Irvine, and C. L. Roberts, “Investigating linkage rates among probabilistically linked birth and hospitalization records,” BMC Medical Research Methodology, vol. 12, p. 149, 2012.
    • [9] [10] [11] [12] [13] [14] [15] [16] [17] [18] [19] [20] R. Schnell, C. Borgs, G. Heller, and F. Niedermeyer, “Pseudonymisierte Verknu¨ pfung der Perinatal- und Neonatalerhebung mit Bloom-Filtern,” German Record Linkage Center, Duisburg, unpublished Technical Report, 2015.
    • C. Quantin, C. Binquet, K. Bourquard, R. Pattisina, B. Gouyon-Cornet, C. Ferdynus, J.-B. Gouyon, and F.-A. Allaert, “Which are the best identifiers for record linkage?” Medical Informatics and the Internet in Medicine, vol. 29, no. 3-4, pp. 221-227, 2004.
    • W. E. Winkler, “Record linkage,” in Handbook of Statistics 29A, Sample Surveys: Design, Methods and Applications, D. Pfeffermann and C. Rao, Eds. Amsterdam: Elsevier, North-Holland, 2009, pp.
    • M. A. Hernandez and S. S. Stolfo, “Real-world data is dirty: data cleansing and the merge/purge problem,” Data Mining and Knowledge Discovery, vol. 2, no. 1, pp. 9-37, 1998.
    • W. Stallings, Cryptography and Network Security: Principles and Practice, 6th ed. New Jersey: Pearson, 2014.
    • D. Vatsalan, P. Christen, and V. S. Verykios, “A taxonomy of privacypreserving record linkage techniques,” Information Systems, vol. 38, no. 6, pp. 946-969, 2013.
    • T. N. Herzog, F. J. Scheuren, and W. E. Winkler, Data Quality and Record Linkage Techniques. New York: Springer, 2007.
    • F. Borst, F.-A. Allaert, and C. Quantin, “The Swiss solution for anonymous chaining patient files,” in Proceedings of the 10th World Congress on Medical Informatics: 2-5 September 2001; London, V. Patel, R. Rogers, and R. Haux, Eds. Amsterdam: IOS Press, 2001, pp.
    • R. Karmel, P. Anderson, D. Gibson, A. Peut, S. Duckett, and Y. Wells, “Empirical aspects of record linkage across multiple data sets using statistical linkage keys: the experience of the PIAC cohort study,” BMC Health Services Research, vol. 10, no. 41, 2010.
    • R. Schnell, A. Richter, and C. Borgs, “Performance of different methods for privacy preserving record linkage with large scale medical data sets,” University of Duisburg-Essen, 2014.
    • B. H. Bloom, “Space/time trade-offs in hash coding with allowable errors,” Communications of the ACM, vol. 13, no. 7, pp. 422-426, 1970.
    • R. Schnell, T. Bachteler, and J. Reiher, “Privacy-preserving record [40] J. M. Hancock and M. J. Zvelebil, Eds., Concise Encyclopaedia of Bioinformatics and Computational Biology, 2nd ed. Chichester: Wiley, 2014.
    • R Core Team, R: A Language and Environment for Statistical Computing, R Foundation for Statistical Computing, Vienna, Austria, 2015. [Online]. Available: http://www.R-project.org/ M. Kuzu, M. Kantarcioglu, E. Durham, and B. Malin, “A constraint satisfaction cryptanalysis of Bloom filters in private record linkage,” in Privacy Enhancing Technologies, 11th International Symposium, PETS 2011. Waterloo, Canada: Springer Berlin Heidelberg, 2011, pp. 226- 245.
    • M. Kuzu, M. Kantarcioglu, E. A. Durham, C. Toth, and B. Malin, “A practical approach to achieve private medical record linkage in light of public resources,” Journal of the American Medical Informatics Association, vol. 20, no. 2, pp. 285-292, 2013.
    • F. Niedermeyer, S. Steinmetzer, M. Kroll, and R. Schnell, “Cryptanalysis of basic bloom filters used for privacy preserving record linkage,” Journal of Privacy and Confidentiality, vol. 6, no. 2, pp. 59-79, 2014.
    • A. Kirsch and M. Mitzenmacher, “Less hashing same performance: building a better Bloom filter,” in Algorithms-ESA 2006. Proceedings of the 14th Annual European Symposium: 11-13 September 2006; Z u¨rich, Switzerland, Y. Azar and T. Erlebach, Eds. Berlin: Springer, 2006, pp.
    • M. Kroll and S. Steinmetzer, “Automated cryptanalysis of bloom filter encryptions of health records,” in 8th International Conference on Health Informatics, 2015.
  • No related research data.
  • No similar publications.
  • BioEntity Site Name

Share - Bookmark

Download from

Cite this article