Publication
Errors of identifiers in anonymous databases: impact on data quality
dc.contributor.author | Pombinho, Paulo | |
dc.contributor.author | Cavique, Luís | |
dc.contributor.author | Correia, Luís | |
dc.date.accessioned | 2023-01-03T10:03:19Z | |
dc.date.available | 2023-01-03T10:03:19Z | |
dc.date.issued | 2023 | |
dc.description.abstract | Data quality is essential for a correct understanding of the concepts they represent. Data mining is especially relevant when data with inferior quality is used in algorithms that depend on correct data to create accurate models and predictions. In this work, we introduce the issue of errors of identifiers in an anonymous database. The work proposes a quality evaluation approach that considers individual attributes and a contextual analysis that allows additional quality evaluations. The proposed quality analysis model is a robust means of minimizing anonymization costs. | pt_PT |
dc.description.sponsorship | The authors would like to thank the FCT Projetct of Scientific Research and Technological Development in Data Science and Artificial Intelligence in Public Administration, 2018–2022 (DSAIPA/DS/0039/2018), for its support, and also acknowledge support by BioISI (UID/MULTI/04046/2103) and LASIGE Research Unit (UIDB/00408/2020, UIDP/00408/2020) center grants. | pt_PT |
dc.description.sponsorship | The authors would like to thank the FCT Projetct of Scientific Research and Technological Development in Data Science and Artificial Intelligence in Public Administration, 2018–2022 (DSAIPA/DS/0039/2018), for its support, and also acknowledge support by BioISI (UID/MULTI/04046/2103) and LASIGE Research Unit (UIDB/00408/2020, UIDP/00408/2020) center grants. | |
dc.description.version | info:eu-repo/semantics/publishedVersion | pt_PT |
dc.identifier.doi | 10.1007/978-3-031-18050-7_53 | pt_PT |
dc.identifier.uri | http://hdl.handle.net/10400.2/12904 | |
dc.language.iso | eng | pt_PT |
dc.peerreviewed | yes | pt_PT |
dc.relation | LASIGE - Extreme Computing | |
dc.relation | LASIGE - Extreme Computing | |
dc.subject | Data pre-processing | pt_PT |
dc.subject | Anonymized data | pt_PT |
dc.subject | Data quality | pt_PT |
dc.title | Errors of identifiers in anonymous databases: impact on data quality | pt_PT |
dc.type | conference object | |
dspace.entity.type | Publication | |
oaire.awardTitle | LASIGE - Extreme Computing | |
oaire.awardTitle | LASIGE - Extreme Computing | |
oaire.awardURI | info:eu-repo/grantAgreement/FCT/6817 - DCRRNI ID/UIDB%2F00408%2F2020/PT | |
oaire.awardURI | info:eu-repo/grantAgreement/FCT/6817 - DCRRNI ID/UIDP%2F00408%2F2020/PT | |
oaire.citation.endPage | 556 | pt_PT |
oaire.citation.startPage | 547 | pt_PT |
oaire.citation.title | SOCO 2020. 17th International Conference on Soft Computing Models in Industrial and Environmental Applications) | pt_PT |
oaire.citation.volume | 531 | pt_PT |
oaire.fundingStream | 6817 - DCRRNI ID | |
oaire.fundingStream | 6817 - DCRRNI ID | |
person.familyName | Pombalinho | |
person.familyName | Cavique | |
person.familyName | Correia | |
person.givenName | Paulo | |
person.givenName | Luís | |
person.givenName | Luís | |
person.identifier | F-3440-2016 | |
person.identifier.ciencia-id | AF18-066F-60F6 | |
person.identifier.ciencia-id | 911E-84AC-3956 | |
person.identifier.ciencia-id | CC18-5389-6CBA | |
person.identifier.orcid | 0000-0001-7583-6791 | |
person.identifier.orcid | 0000-0002-5590-1493 | |
person.identifier.orcid | 0000-0003-2439-1168 | |
person.identifier.rid | M-3656-2013 | |
person.identifier.scopus-author-id | 56865595100 | |
project.funder.identifier | http://doi.org/10.13039/501100001871 | |
project.funder.identifier | http://doi.org/10.13039/501100001871 | |
project.funder.name | Fundação para a Ciência e a Tecnologia | |
project.funder.name | Fundação para a Ciência e a Tecnologia | |
rcaap.rights | openAccess | pt_PT |
rcaap.type | conferenceObject | pt_PT |
relation.isAuthorOfPublication | d7cf5c07-3cfa-4ff3-9e05-7bbcdccf0e1b | |
relation.isAuthorOfPublication | 40906a16-46a2-42f1-b26d-7db7012294ee | |
relation.isAuthorOfPublication | 527c9b62-536d-45b2-bce6-ac856844f41e | |
relation.isAuthorOfPublication.latestForDiscovery | 40906a16-46a2-42f1-b26d-7db7012294ee | |
relation.isProjectOfPublication | 01a99baa-a025-45a3-bc2c-f4528aeae605 | |
relation.isProjectOfPublication | 2109bfea-cc21-4c47-9950-99898d92a041 | |
relation.isProjectOfPublication.latestForDiscovery | 01a99baa-a025-45a3-bc2c-f4528aeae605 |