Project

Constraint-based data cleaning

Code
bof/baf/4y/2024/01/019
Duration
01 January 2024 → 31 December 2025
Funding
Regional and community funding: Special Research Fund
Research disciplines
  • Natural sciences
    • Database theory
Keywords
data cleaning relational databases data quality
 
Project description

In this project, we investigate how integrity constraints can be used in data cleaning. In particular, we investigate:

- a balance between expressivity of constraints and complexity of the fundamental algorithms (e.g., implication)

- the repair problem: how to turn inconsistent (dirty) data into consistent (clean) data in a cost-optimal manner

- the role of cost models in the generation of explainable consistent data