This review paper examines the implications of uploading unverified datasets on data banking sites, specifically using Kaggle as a case study. It highlights that poor data quality arises from issues like incompleteness and inconsistency, leading to erroneous outcomes and poor decision-making. The authors recommend implementing strict data quality measures before uploading datasets to ensure they are high-quality and fit for use.
Related topics: