Abstract
The effective functioning of data-intensive applications usually requires that the dataset should be of high quality. The quality depends on the task they will be used for. However, it is possible to identify task-independent data quality dimensions which are solely related to data themselves and can be extracted with the help of rule mining/pattern mining. In order to assess and improve data quality, we propose an ontological approach to report data quality violated triples. Our goal is to provide data stakeholders with a set of methods and techniques to guide them in assessing and improving data quality.
| Original language | English |
|---|---|
| Number of pages | 8 |
| Journal | CEUR Workshop Proceedings |
| Volume | 2956 |
| DOIs | |
| Publication status | Published - 2021 |
| Event | 15th International Rule Challenge, 7th Industry Track, and 5th Doctoral Consortium @ RuleML+RR 2021, RuleML+RR-Companion 2021 - Virtual, Leuven, Belgium Duration: 8 Sep 2021 → 15 Sep 2021 |
Keywords
- Data quality assessment
- Data quality improvement
- Linked data
- Root cause analysis