Question
How to remove personal identifiers from datasets to protect privacy, while keeping the datasets still valuable for processing?
Context
Cloud services are often used to process large datasets, potentially containing primary or secondary private data, or such data can be inferred by correlating different datasets.
Solution
The identity of data owner needs to be stripped off from the records in such a way so that the data owner cannot be identified directly or indirectly from that anonymised data.
References
Data anonymization and integrity checking in cloud computing - ieeexplore.ieee.org
Anonymization: A Method To Protect Sensitive Data In Cloud - www.ijser.org
Enhancing Cloud Security Using Data Anonymization - www.scribd.com
How to Enhance the Security of Sensitive Customer Data by Using Amazon CloudFront Field-Level Encryption - aws.amazon.com