Abstract
The effective functioning of data-intensive applications usually requires that the dataset should be of high quality. The quality depends on the task they will be used for. However, it is possible to identify task-independent data quality dimensions which are solely related to data themselves and can be extracted with the help of rule mining/pattern mining. In order to assess and improve data quality, we propose an ontological approach to report data quality violated triples. Our goal is to provide data stakeholders with a set of methods and techniques to guide them in assessing and improving data quality.
Original language | English |
---|---|
Journal | CEUR Workshop Proceedings |
Volume | 2956 |
DOIs | |
Publication status | Published - 2021 |
Event | 15th International Rule Challenge, 7th Industry Track, and 5th Doctoral Consortium @ RuleML+RR 2021, RuleML+RR-Companion 2021 - Virtual, Leuven, Belgium Duration: 8 Sep 2021 → 15 Sep 2021 |
Keywords
- Data quality assessment
- Data quality improvement
- Linked data
- Root cause analysis