Skip to main navigation Skip to search Skip to main content

Towards an Efficient Log Data Protection in Software Systems through Data Minimization and Anonymization

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

IT infrastructures of companies generate large amounts of log data every day. These logs are typically analyzed by software engineers to gain insights about activities occurring within a company (e.g., to debug issues exhibited by the production systems). To facilitate this process, log data management is often outsourced to cloud providers. However, logs may contain information that is sensitive by nature and considered personal identifiable under most of the new privacy protection laws, such as the European General Data Protection Regulation (GDPR). To ensure that companies do not violate regulatory compliance, they must adopt, in their software systems, appropriate data protection measures. Such privacy protection laws also promote the use of anonymization techniques as possible mechanisms to operationalize data protection. However, companies struggle to put anonymization in practice due to the lack of integrated, intuitive, and easy-to-use tools that accommodate effectively with their log management systems. In this paper, we propose an automatic approach (SafeLog) to filter out information and anonymize log streams to safeguard the confidentiality of sensitive data and prevent its exposure and misuse from third parties. Our results show that atomic anonymization operations can be effectively applied to log streams to preserve the confidentiality of information, while still allowing to conduct different types of analysis tasks such as users behavior, and anomaly detection. Our approach also reduces the amount of data sent to cloud vendors, hence decreasing the financial costs and the risk of overexposing information.

Original languageEnglish
Title of host publicationProceedings - 2019 7th International Conference in Software Engineering Research and Innovation, CONISOFT 2019
EditorsReyes Juarez-Ramirez, Carlos Alberto Fernandez y Fernandez, Samantha Paulina Jimenez Calleros, Alan David Ramirez-Noriega, Hector G. Perez-Gonzalez, Guillermo Licea Sandoval, Cesar Arturo Guerra-Garcia
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages107-115
Number of pages9
ISBN (Electronic)9781728125244
DOIs
Publication statusPublished - Oct 2019
Externally publishedYes
Event7th International Conference in Software Engineering Research and Innovation, CONISOFT 2019 - Mexico City, Mexico
Duration: 23 Oct 201925 Oct 2019

Publication series

NameProceedings - 2019 7th International Conference in Software Engineering Research and Innovation, CONISOFT 2019

Conference

Conference7th International Conference in Software Engineering Research and Innovation, CONISOFT 2019
Country/TerritoryMexico
CityMexico City
Period23/10/1925/10/19

Keywords

  • Anonymization
  • Privacy
  • Security
  • Software Engineering

Fingerprint

Dive into the research topics of 'Towards an Efficient Log Data Protection in Software Systems through Data Minimization and Anonymization'. Together they form a unique fingerprint.

Cite this