Tips for Eliminating Poor Data
The Best Approach To Handling Poor Data
There are many ways to evaluate poor data, but the following approach has proved effective and widely applicable in practice.
To weed out poor data, you need to:
- Clearly define criteria for poor data
- Perform data analysis against these criteria
- Identify the sources of the poor data
- Fix poor data
- Fix poor data sources
Criteria can include whether the data matches an expected type or format, falls within an allowed range, is complete, and is free of duplicates, among others. Data that fails any of these checks counts as poor.
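As a rough illustration, the criteria above can be written as explicit checks. This is a minimal sketch in Python, not a definitive implementation: the records, the field names (`id`, `email`, `age`), the email pattern, and the 0 to 120 age range are all assumptions made up for the example.

```python
import re
from collections import Counter

# Hypothetical records; field names and rules are illustrative assumptions.
records = [
    {"id": 1, "email": "ann@example.com", "age": 34},
    {"id": 2, "email": "not-an-email", "age": 29},
    {"id": 3, "email": "bob@example.com", "age": 240},
    {"id": 4, "email": None, "age": 41},
    {"id": 1, "email": "ann@example.com", "age": 34},
]

# Deliberately simple pattern: something@something.something
EMAIL_PATTERN = re.compile(r"^[^@\s]+@[^@\s]+\.[^@\s]+$")


def find_violations(rows):
    """Return (row_index, criterion) pairs for every failed check."""
    violations = []
    id_counts = Counter(row["id"] for row in rows)
    for index, row in enumerate(rows):
        # Completeness: every field must have a value.
        if any(value is None for value in row.values()):
            violations.append((index, "completeness"))
        # Format: a present email must look like an address.
        email = row["email"]
        if email is not None and not EMAIL_PATTERN.match(email):
            violations.append((index, "format"))
        # Type, then range: a present age must be an integer from 0 to 120.
        age = row["age"]
        if age is not None:
            if not isinstance(age, int):
                violations.append((index, "type"))
            elif not 0 <= age <= 120:
                violations.append((index, "range"))
        # Duplicates: the same id must not appear more than once.
        if id_counts[row["id"]] > 1:
            violations.append((index, "duplicate"))
    return violations


for index, criterion in find_violations(records):
    print(f"row {index}: failed {criterion} check")
```

Keeping each criterion as a separate, named check makes it easier to see which rule a record failed, which in turn helps when tracing the problem back to its source.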
Next, check all of the data, or a portion of it, for compliance with these criteria.
If the volume of data is large, it makes sense to check only a sample in the initial stages, since most sources of errors can be identified and corrected even on a small sample.
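One way to check only part of a large dataset is to draw a reproducible random sample and measure the error rate on it. The sketch below assumes the data fits in a Python list. The helper names (`sample_rows`, `error_rate`), the 5% fraction, the 100-row minimum, and the generated dataset are hypothetical choices for illustration.

```python
import random


def sample_rows(rows, fraction=0.1, minimum=100, seed=0):
    """Return a reproducible random sample of the rows.

    The sample holds `fraction` of the rows, but at least `minimum`
    of them, and never more than the dataset contains.
    """
    size = min(len(rows), max(minimum, int(len(rows) * fraction)))
    return random.Random(seed).sample(rows, size)


def error_rate(rows, is_valid):
    """Return the share of rows that fail the `is_valid` check."""
    if not rows:
        return 0.0
    return sum(1 for row in rows if not is_valid(row)) / len(rows)


# Hypothetical dataset in which every 20th value is out of range.
data = [{"value": -1 if i % 20 == 0 else i} for i in range(10_000)]

sample = sample_rows(data, fraction=0.05)
rate = error_rate(sample, lambda row: row["value"] >= 0)
print(f"checked {len(sample)} of {len(data)} rows, error rate {rate:.1%}")
```

Fixing the seed keeps the sample stable between runs, so the same records can be rechecked after a fix. Once the sources of errors found this way are corrected, the same checks can be run against the full dataset.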