Repository logo
 
Loading...
Thumbnail Image
Publication

Errors of identifiers in anonymous databases: impact on data quality

Use this identifier to reference this record.
Name:Description:Size:Format: 
paper SOCO.pdf409.78 KBAdobe PDF Download

Advisor(s)

Abstract(s)

Data quality is essential for a correct understanding of the concepts they represent. Data mining is especially relevant when data with inferior quality is used in algorithms that depend on correct data to create accurate models and predictions. In this work, we introduce the issue of errors of identifiers in an anonymous database. The work proposes a quality evaluation approach that considers individual attributes and a contextual analysis that allows additional quality evaluations. The proposed quality analysis model is a robust means of minimizing anonymization costs.

Description

Keywords

Data pre-processing Anonymized data Data quality

Citation

Research Projects

Organizational Units

Journal Issue