Anonymizing k-NN classification on mapreduce

Sibghat Ullah Bazai, Julian Jang-Jaccard, Ruili Wang

Research output: Chapter in Book/Conference proceedingConference contributionpeer-review

8 Citations (Scopus)

Abstract

Data analytics scenario such as a classification algorithm plays an important role in data mining to identify a category of a new observation and is often used to drive new knowledge. However, classification algorithm on a big data analytics platform such as MapReduce and Spark, often runs on plain text without an appropriate privacy protection mechanism. This leaves user’s data to be vulnerable from unauthorized access and puts the data at a great privacy risk. To address such concern, we propose a new novel k-NN classifier which can run on an anonymized dataset on MapReduce platform. We describe new Map and Reduce algorithms to produce different anonymized datasets for k-NN classifier. We also illustrate the details of experiments we performed on the multiple anonymized data sets to understand the effects between the level of privacy protection (data privacy) and the high-value insights (data utility) trade-off before and after data anonymization.

Original languageEnglish
Title of host publicationMobile Networks and Management - 9th International Conference, MONAMI 2017, Proceedings
EditorsSheng Wen, Jiankun Hu, Ibrahim Khalil, Zahir Tari
PublisherSpringer Verlag
Pages364-377
Number of pages14
ISBN (Print)9783319907741
DOIs
Publication statusPublished - 2018
Externally publishedYes
Event9th International Conference on Mobile Networks and Management, MONAMI 2017 - Melbourne, Australia
Duration: 13 Dec 201715 Dec 2017

Publication series

NameLecture Notes of the Institute for Computer Sciences, Social-Informatics and Telecommunications Engineering, LNICST
Volume235
ISSN (Print)1867-8211

Conference

Conference9th International Conference on Mobile Networks and Management, MONAMI 2017
Country/TerritoryAustralia
CityMelbourne
Period13/12/1715/12/17

Keywords

  • Data anonymization
  • K-anonymity
  • k-NN classification
  • MapReduce

ASJC Scopus subject areas

  • Computer Networks and Communications

Fingerprint

Dive into the research topics of 'Anonymizing k-NN classification on mapreduce'. Together they form a unique fingerprint.

Cite this