A Bottom-up k-anonymization approach for big data publishing

Abderrahmane Saidi; Salheddine Kabou; Imad Eddine Kimmi; Laid Gasmi

doi:10.5935/jetia.v12i58.3239

Abderrahmane Saidi Higher Normal School of Bechar, Algeria https://orcid.org/0009-0007-0611-0445
Salheddine Kabou Higher Normal School of Bechar, Algeria http://orcid.org/0000-0002-1423-7215
Imad Eddine Kimmi Higher Normal School of Bechar, Algeria http://orcid.org/0009-0005-6997-9194
Laid Gasmi Ahmed Draia University -Adrar, Algeria https://orcid.org/0000-0001-8925-0089

DOI: https://doi.org/10.5935/jetia.v12i58.3239

Abstract

As governments and other organizations share larger datasets, keeping individual information private has become increasingly difficult to solve. When publishing the data, data anonymization models like k-anonymity and l-diversity are employed to ensure the trade-off between privacy and data utility. This paper presents a method called Bottom-Up k-anonymization (BU-K), implemented on Apache Spark. It improves efficiency by applying the Bottom-Up Generalization (BUG) approach. BU-KC performs better than Top-Down Specialization (TDS) in terms of scalability, and data privacy, while still keeping the data useful. Moreover, using Apache Spark’s distributed computing architecture significantly improves processing time compared to traditional MapReduce approaches. This work fills a gap in distributed anonymization on Spark by offering a new, efficient, and scalable solution

Downloads

Download data is not yet available.

JETIA Journal Data
Available:	2015 - 2026
Volumes:	12
Issues:	58
Articles:	1.110
Article Processing Charges (APC):	PAID