Privacy Preserving Data Mining

Home  |  People  |  Research Projects  |  Publications  |  Tutorials  |  Workshops  |   Contact


  Publications (year-based list)

 

2004

J. Vaidya and C. Clifton. Privacy-Preserving Outlier Detection. In Proceedings of the Fourth IEEE International Conference on Data Mining (ICDM 2004), Brighton, UK, November 2004.

K. Wang, P. Yu, and S. Chakraborty. Botton-Up Generalization: A Data Mining Solution to Privacy Protection. In Proceedings of the Fourth IEEE International Conference on Data Mining (ICDM 2004), Brighton, UK, November 2004.

D. Meng and K. Sivakumar. Privacy Sensitive Bayesian Network Parameter Learning. In Proceedings of the Fourth IEEE International Conference on Data Mining (ICDM 2004), Brighton, UK, November 2004.

N. Zang, S. Wang, and W. Zhao. A New Scheme on Privacy Preserving Association Rule Mining. In Proceedings of the 15th European Conference on Machine Learning (ECML) and the 8th European Conference on Principles and Practice of Knowledge Discovery in Databases (PKDD), Pisa, Italy, September 2004.

S. R. M. Oliveira and O. R. Zaïane. Achieving Privacy Preservation When Sharing Data For Clustering. In Proceedings of the International Workshop on Secure Data Management in a Connected World (SDM'04) in conjunction with VLDB 2004, Toronto, Canada, August, 2004.

A. Sanil, A. Karr, X. Lin, and J. Reiter. Privacy Preserving Regression Modelling Via Distributed Computation. In Proceedings of the 10th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD 2004), Seattle, WA, USA, August 2004.

Y. Zhu and L. Liu. Optimal Randomazation for Privacy Preserving Data Mining. In Proceedings of the 10th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD 2004), Seattle, WA, USA, August 2004.

R. Wright and Z. Yang. Privacy-Preserving Bayesian Network Structure Computation on Distributed Heterogeneous Data. In Proceedings of the 10th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD 2004), Seattle, WA, USA, August 2004.

B. Gilburd, A. Schuster, and R. Wolff. A New Privacy Model and Association-Rule MIning Algorithm for Large-Scale Distributed Environments. In Proceedings of the 10th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD 2004), Seattle, WA, USA, August 2004.

M. Kantarcioglu, J. Jin, and C. Clifton. When Do Data MIning Results Violate Privacy? In Proceedings of the 10th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD 2004), Seattle, WA, USA, August 2004.

S. R. M. Oliveira and O. R. Zaïane. Toward Standardization in Privacy-Preserving Data Mining. In Proceeding of the 3rd. Workshop on Data Mining Standards (DM-SSP 2004), in conjunction with KDD 2004, Seattle, WA, USA, August, 2004.

W. Du, Y. S. Han, and S. Chen. Privacy-Preserving Multivariate Statistical Analysis: Linear Regression and Classification. In Proceedings of the 2004 SIAM Conference on Data Mining, Lake Buena Vista, Floria, USA, April 2004.

J. Vaidya and C. Clifton. Privacy Preserving Naïve Bayes Classifier for Vertically Partitioned Data. In Proceedings of the 2004 SIAM Conference on Data Mining, Lake Buena Vista, Floria, USA, April 2004.

V. S. Verykios, E. Bertino, I. N. Fovino, L. P. Provenza, Y. Saygin, Y. Theodoridis. State-of-the-art in Privacy Preserving Data Mining. In SIGMOD Record, 33(1): 50-57, March 2004.

S. Agrawal, V. Krishnan and  J. R. Haritsa. On Addressing Efficiency Concerns in Privacy-Preserving Mining. In Proceedings of the 9th International Conference on Database Systems for Advanced Applications (DASFAA-2004), Jeju Island, Korea, March 2004.

Md. Z. Islan and L. Brankovic. A Framewor for Privacy Preserving Data Mining. In Proceedings of the Australasian Workshop on Data Mining and Web Intelligence (DMWI 2004), Dunedin, New Zealand, January 2004, pp. 163-168.

2003

Md. Z. Islan, P. M. Barnaghi, and L. Brankovic. Measuring Data Quality: Predictive Accuracy vs. Similarity of Decision Trees. In Proceedings of the 6th International Conference on Computer and Information Technology (ICCIT 2003), Dhaka, Bangladesh, December 2003.

Md. Z. Islan and L. Brankovic. Noise Addition for Protecting Privacy in Data Mining. In Proceedings of the 6th Engineering Mathematics and Applications Conference (EMAC 2003), Sydney, Australia, 2003.

B. Brumen, I. Golob, T. Welzer, I. Rozman, M. Druzovec, and H. Jaakkola. An Algorithm for Protecting Knowledge Discovery Data. In INFORMATICA, 14(3): 277-288, December 2003.

H. Kargupta, S. Datta, Q. Wang,and K. Sivakumar. On the Privacy Preserving Properties of Random Data Perturbation Techniques. In Proceedings of the Third IEEE International Conference on Data Mining (ICDM'03), Melbourne, Florida, USA, November 2003, pp. 99-106.

S. R. M. Oliveira and O. R. Zaïane. Protecting Sensitive Knowledge By Data Sanitization. In Proceedings of the Third IEEE International Conference on Data Mining (ICDM'03), Melbourne, Florida, USA, November 2003, pp. 613-616.

S. Merugu and J. Ghosh. Privacy-Preserving Distributed Clustering Using Generative Models. In Proceedings of the Third IEEE International Conference on Data Mining (ICDM'03), Melbourne, Florida, USA, November 2003, pp. 211-218.

H. Polat and W. Du. Privacy-Preserving Collaborative Filtering Using Randomized Perturbation Techniques. In Proceedings of the Third IEEE International Conference on Data Mining (ICDM'03), Melbourne, Florida, USA, November 2003, pp. 625-628.

M. Kantarcoglu and J. Vaidya. Privacy Preserving Naive Bayes Classifier for Horizontally Pertitioned Data. In IEEE ICDM Workshop on Privacy Preserving Data Mining, Melbourne, Florida, USA, November 2003, pp. 3-9.

C. W. Wu. Privacy Preserving Data Mining: A Signal Processing Perspective And A Simple Data Perturbation Protocol. In IEEE ICDM Workshop on Privacy Preserving Data Mining, Melbourne, Florida, USA, November 2003, pp. 10-17.

T. Mielikainen. On inverse Frequent Set Mining. In IEEE ICDM Workshop on Privacy Preserving Data Mining, Melbourne, Florida, USA, November 2003, pp. 18-23.

Y. Di, H. Liu, A. Ramineni, and A. Sen. Detecting Hidden Information in Images: A Comparative Study. In IEEE ICDM Workshop on Privacy Preserving Data Mining, Melbourne, Florida, USA, November 2003, pp. 24-30.

A. A. Veloso, Wagner Meira Jr., S. Parthasarathy and M. B. Carvalho. Efficient, Accurate and Privacy-Preserving Data Mining for Frequent Itemsets in  Distributed Databases. In Proceedings of the 18th Brazilian Symposium on Databases, Manaus, Amazonas, Brazil, October 2003, pp.281-292.

S. R. M. Oliveira and O. R. Zaïane. Privacy Preserving Clustering By Data Transformation. In Proceedings of the 18th Brazilian Symposium on Databases, Manaus, Amazonas, Brazil, October 2003, pp.304-318.

J. Vaidya and C. Clifton. Privacy-Preserving K-Means Clustering over Vertically Partitioned Data. In Proceedings of the 9th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Washington, DC, USA, August 2003, pp.206-215.

W. Du and Z. Zhan. Using Randomized Response Techniques for Privacy-Preserving Data Mining. In Proceedings of the 9th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Washington, DC, USA, August 2003, pp.505-510.

S. R. M. Oliveira and O. R. Zaïane. Algorithms for Balancing Privacy and Knowledge Discovery in Association Rule Mining. In Proceedings of the 7th International Database Engineering and Applications Symposium (IDEAS 2003), Hong Kong, China, July  2003, pp.54-63.

A. Evfimievski, J. E. Gehrke, and R. Srikant. Limiting Privacy Breaches in Privacy Preserving Data Mining. In Proceedings of the 22nd ACM SIGACT-SIGMOD-SIGART Symposium on Principles of Database Systems (PODS 2003).  San Diego, CA, June 2003.

2002

B. Thuraisingham. Data Mining, National Security, Privacy and Civil Liberties. In SIGKDD Explorations, 4(2): 1-5, December 2002.

C. Farkas and S. Jajodia.  The Inference Problem: A Survey. In SIGKDD Explorations, 4(2): 6-11, December 2002.

B. Pinkas. Cryptographic Techniques for Privacy-Preserving Data Mining. In SIGKDD Explorations, 4(2): 12-19, December 2002.

M. Olivier. Database Privacy. In SIGKDD Explorations, 4(2): 20-27, December 2002.

C. Clifton, M. Kantarcioglu, J. Vaidya, X. Lin and M. Y. Zhu.  Tools for Privacy Preserving Distributed Data Mining. In SIGKDD Explorations, 4(2): 28-34 December 2002.

W. Lee.  Applying Data Mining to Intrusion Detection: The Quest for Automation, Efficiency, and Credibility. In SIGKDD Explorations, 4(2): 35-42, December 2002.

A. Evfimievski. Randomization in Privacy-Preserving Data Mining. In SIGKDD Explorations, 4(2): 43-48, December 2002.

T. Johnsten and V. V. Raghavan.  A Methodology for Hiding Knowledge in Databases. In Proceedings of the IEEE ICDM Workshop on Privacy, Security and Data Mining, Maebashi City, Japan, December 2002, pp.9-17.

C. Boyens, O. Günther and M.Teltzrow. Privacy Conflicts in CRM Services for Online Shops: A Case Study. In Proceedings of the IEEE ICDM Workshop on Privacy, Security and Data Mining, Maebashi City, Japan, December 2002, pp.27-35.

M. Kantarcioglu and J. Vaidya. An Architecture for Privacy-preserving Mining of Client Information. In Proceedings of the IEEE ICDM Workshop on Privacy, Security and Data Mining, Maebashi City, Japan, December 2002, pp.37-42.

G. Schadow, S. J. Grannis and C. J. McDonald. Privacy-Preserving Distributed Queries for a Clinical Case Research Network. In Proceedings of the IEEE ICDM Workshop on Privacy, Security and Data Mining, Maebashi City, Japan, December 2002, pp.55-65.

W. Du and Z. Zhan. Building Decision Tree Classifier on Private Data. In Proceedings of the IEEE ICDM Workshop on Privacy, Security and Data Mining, Maebashi City, Japan, December 2002, pp.1-8.

S. R. M. Oliveira and O. R. Zaïane. Foundations for an Access Control Model for Privacy Preservation in Multi-Relational Association Rule Mining. In Proceedings of the IEEE ICDM Workshop on Privacy, Security and Data Mining, Maebashi City, Japan, December 2002, pp.19-26.

S. R. M. Oliveira and O. R. Zaïane.  Privacy Preserving Frequent Itemset Mining. In Proceedings of the IEEE ICDM Workshop on Privacy, Security and Data Mining, Maebashi City, Japan, December 2002, pp.43-54.

S. J. Rizvi and J. R. Haritsa. Privacy-Preserving Association Rule Mining. In Proceedings of 28th International Conference on Very Large Data Bases. VLDB, Hong Kong, China, August 2002.

V. S. Iyengar. Transforming Data to Satisfy Privacy Constraints. In Proceedings of the 8th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Edmonton, AB, Canada, July 2002, pp.279-288.

J. Vaidya and C. Clifton. Privacy Preserving Association Rule Mining in Vertically Partitioned Data. In Proceedings of the 8th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Edmonton, AB, Canada, July 2002, pp639-644.

A. Evfimievski, R. Srikant, R. Agrawal, and J. Gehrke. Privacy Preserving Mining of Association Rules. In Proceedings of the 8th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Edmonton, AB, Canada, July 2002, pp217-228.

M. Kantarcioglu and C.Clifton. Privacy-preserving Distributed Mining of Association Rules on Horizontally Partitioned Data. In Proceedings of the ACM SIGMOD Workshop on Research Issues on Data Mining and Knowledge Discovery (DMKD'02), June 2002.

Y. Saygin, V. S. Verykios, and A. K. Elmagarmid.  Privacy Preserving Association Rule Mining. In Proceedings of the 12th International Workshop on Research Issues in Data Engineering: Engineering E-Commerce/E-Business Systems (RIDE'02),  San Jose, CA, USA, February  2002.

2001

O. De Vel, A. Anderson, M. Corney, and G. Mohay. Mining Email Content for Author Identification Forensics. In SIGMOD Record, v.30, n.4, December 2001.

Y. Saygin, V.S. Verykios and C. Clifton. Using Unknowns to Prevent Discovery of Association Rules. In SIGMOD Record, v.30, n.4, December 2001.

S. J. Stolfo, W. Lee, P. K. Chan, W. Fan, and E. Eskin. Data Mining-based Intrusion Detectors: An Overview of the Columbia IDS Project. In SIGMOD Record, v.30, n.4, December 2001, pp.45-54.

D. Barbara, J. Couto, S. Jajodia, and N. Wu. ADAM: A Testbed for Exploring the Use of Data Mining in Intrusion Detection. In SIGMOD Record, v.30, n.4, December 2001.

J. B. D. Cabrera, L. Lewis, and R. K. Mehra. Detection and Classification of Intrusions and Faults using Sequences of System Calls. In SIGMOD Record, v.30, n.4, December 2001.

W. Lee and W. Fan. Mining System Audit Data: Opportunities and Challenges. In SIGMOD Record, v.30, n.4, December 2001.

L. Mé and C. Michel. Intrusion Detection: A Bibliography. In Technical Report SSIR-2001-01, Sup'elec, Rennes, France, September 2001.

T. Johnsten, andV. V. Raghavan. Security Procedures for Classification Mining Algorithms. In Proceedings of the 15th Annual IFIP WG 11.3 Working Conference on Database and Applications Security, Niagara on the Lake, ON, Canada, July 2001, pp.293-309.

W. Du and M. J. Atallah.  Privacy-Preserving Cooperative Scientific Computations. In Proceedings of the 14th IEEE Computer Security Foundations Workshop (CSFW'01), Cape Breton, Novia Scotia, Canada, June 2001, pp.273-.285.

W. Lee, S. J. Stolfo, P. K. Chan, E. Eskin, W. Fan, M. Miller, S. Hershkop, and J. Zhang. Real Time Data Mining-based Intrusion Detection. In Proceedings of DARPA Information Survivability Conference and Exposition (DISCEX-II 2001), Anaheim, CA, USA, June 2001, pp.85-100.

W. Lee and D. Xiang. Information-Theoretic Measures for Anomaly Detection. In Proceedings of the IEEE Symposium on Security and Privacy, Oakland, CA, USA, May 2001, pp.130-143.

W. Lee, S. J. Stolfo, and K. Mok. Adaptive Intrusion Detection: A data Mining Approach. Articial Intelligence Review, v.14, 2001, pp.533-567.

D. Agrawal and C. C. Aggarwal.  On the Design and Quantification of Privacy Preserving Data Mining Algorithms. In Proceedings of the  20th ACM SIGACT-SIGMOD-SIGART Symposium on Principles of Database Systems, Santa Barbara, California, USA, May 2001, pp.247-255.

E. Dasseni,  V. S. Verykios,  A. K. Elmagarmid, and Elisa Bertino. Hiding Association Rules by Using Confidence and Support. In Proceedings of the 4th International Information Hiding Workshop (IHW), Pittsburg, PA, April 2001, pp.369-383.

2000

C. Clifton. Using Sample Size to Limit Exposure to Data Mining. In Journal of Computer Security, v.8, n.4, IOS Press, November 2000, pp.281-307 (Invited paper).

W. Lee and S. J. Stolfo. A Framework for Constructing Features and Models for Intrusion Detection Systems. In ACM Transations on Information and System Security, v.3, n.4, November 2000, pp.227-261.

C. Clifton and G. Gengo. Developing Custom Intrusion Detection Filters Using Data Mining. In 2000 Military Communications International Symposium (MILCOM2000), Los Angeles, California, October 2000.

Y. Lindell and B. Pinkas. Privacy Preserving Data Mining. In Proceedings of  CRYPTO 2000, LNCS 1880, Springer-Verlag, Santa Barbara, CA, August 2000, pp.36-54.

R. Agrawal and R. Srikant. Privacy-Preserving Data Mining. In Proceedings of the ACM SIGMOD Conference on Management of Data, Dallas, Texas, May 2000, pp.439-450.

W. Fan, W. Lee, S. J. Stolfo, and M. Miller. A Multiple Model Cost-Sensitive Approach for Intrusion Detection. In Proceedings of the 11th European Conference on Machine Learning (ECML00), Barcelona Spain, May 2000, pp.148-156.

W. Lee, W. Fan, M. Miller, S. J. Stolfo, and E. Zadok. Toward Cost-Sensitive Modeling for Intrusion Detection and Response. In Proceedings of the 1st ACM Workshop on Intrusion Detection Systems, 2000. Also available as Technical Report CUCS-002-00, Computer Science, Columbia University, 2000.

R. Mukkamala, J. Gagnon, and S. Jajodia. Integrating Data Mining Techniques with Intrusion Detection. In V. Atluri and J. Hale, editors, Research Advances in Database and Information Systems Security, Kluwer Publishers, 2000, pp.33-46.

1999

M. E. Meaney. Data Mining, Dataveillance, and Medical Information Privacy. In Biomedical Ethics Reviews, November 1999, pp.145-164.

M. Atallah, E. Bertino, A. Elmagarmid, M. Ibrahim and V. Verykios. Disclosure Limitation of Sensitive Rules. In Proceedings of the IEEE Knowledge and Data Engineering Exchange Workshop (KDEX'99),  November 1999, Chicago, IL, pp. 45-52.

V. Estivill-Castro and L. Brankovic. Data Swapping: Balancing Privacy against Precision in Mining for Logic Rules. In Proceedings of the First International Data Warehousing and Knowledge Discovery (DaWaK'99:),  Florence, Italy, August 30 - September 1999, pp.389-398.

V. Estivill-Castro, L. Brankovic and D. L. Dowe. Privacy in Data Mining.  In Privacy Law and Policy Reporter, v.6, n.3, September 1999, pp.33-35.

A. J. Broder. Data Mining, the Internet, and Privacy. In B. M. Masand and M. Spiliopoulou (Eds.): Web Usage Analysis and User Profiling, International WEBKDD'99 Workshop, San Diego, California, USA, August 1999, pp.56-73.

T. Johnsten, andV. V. Raghavan. Impact of Decision-Region Based Classification Mining Algorithms on Database Security. In Proceedings of the 13th Annual IFIP WG 11.3 Working Conference on Database Security,. Seattle, WA, USA, July 1999, pp.177-191.

C. Clifton. Protecting Against Data Mining Through Samples. In Proceedings of the 13th Annual IFIP WG 11.3 Working Conference on Database Security,. Seattle, WA, USA, July 1999.

L. Brankovic and V. Estivill-Castro.  Privacy Issues  in Knowledge Discovery and Data Mining. In Proceedings of the Australian Institute of Computer Ethics Conference (AICEC99), Melbourne, Australia, July 1999, pp.89-99.

G.  H. John. Behind-the-Scenes Data Mining. Newsletter of ACM Special Interest Group on Knowledge Discovery & Data Mining, v.1., n.1, June 1999, pp.9-11.

W. Lee. A Data Mining Framework for Constructing Features and Models for Intrusion Detection Systems. PhD thesis, Computer Science Department, Columbia University, June 1999.

W. Lee, S. J. Stolfo, and K. W. Mok. A Data Mining Framework for Building Intrusion Detection Models. In Proceedings of the 1999 IEEE Symposium on Security and Privacy, May 1999, pp. 120-132.

W. Lee, S. J. Stolfo, and K. W. Mok. Algorithms For Mining System Audit Data. In Lin, T. Y. and Cercone, N., editors, Data Retrieval and Data Mining. Kluwer Academic Publishers, 1999.

1998

W. Lee, S. J. Stolfo and K. W. Mok. Mining Audit Data to Build Intrusion Detection Models. In Proceedings of the 4th International Conference on Knowledge Discovery and Data Mining (KDD'98), New York, NY, August 1998, pp66-72.

W. Lee and S. Stolfo. Data Mining Approaches for Intrusion Detection. In Proceedings of the 7th USENIX Security Symposium, January 1998, pp 79-93.

Office of the Information and Privacy Commissioner. Data Mining: Staking a Claim on Your Privacy, Toronto, Ontario, January 1998.

1996

K. C. Laudon. Markets and privacy, Communications of the ACM, v.39 n.9, September 1996, pp.92-104.

C. Clifton and D. Marks. Security and Privacy Implications of Data Mining. In Proceedings of the 1996 ACM SIGMOD Workshop on Data Mining and Knowledge Discovery, Montreal, Canada, June 1996, pp.15-19.

1995

Willi Klösgen. Anonymization Techniques for Knowledge Discovery in Databases. In  Proceedings of the First International Conference on Knowledge Discovery and Data Mining (KDD-95), Montreal, Canada, August 1995. AAAI Press, ISBN 0-929280-82-2, pp.186-191.

G. Piatetsky-Shapiro. Knowledge Discovery in Personal Data vs. Privacy: A mini-symposium. In IEEE Expert,  v.10, n.2,  pp.46-47, April 1995,.

D. E. O'Leary. Some Privacy Issues in Knowledge Discovery: The OECD Personal Privacy Guidelines. In IEEE Expert,  v.10, n.2,  pp.48-52, April 1995.

Willi Klösgen. KDD: Public and Private Concerns . In IEEE Expert,  v.10, n.2,  pp.55-57, April 1995.

1991

D. E. O'Leary. Knowledge Discovery as a Threat to Database Security. In G. Piatetsky-Shapiro and W. J. Frawley (eds.): Knowledge Discovery in Databases. AAAI/MIT Press, pp.507-516, Menlo Park, CA, 1991.