Reporting and Validating Clean Data
Using Spreadsheet
COUNTA: A spreadsheet function that counts the total number of values within a specified range.
Find and replace: A tool that finds a specified search term and replaces it with something else.
Using SQL
Use the CASE
function
CASE: A SQL statement that returns record that meet conditions by including an if/then statement in a query.
SELECT
variable_name1
CASE
WHEN variable_name2 = 'wrong_value' THEN 'correct_value'
ELSE variable_name2
END AS new_variable_name
FROM
table_name
WHERE
The Change Log
Change log: A file containing a chronologically ordered list of modifications made to a project.
Verification: A process to confirm that a data-cleaning effort was well executed and the resulting data is accurate and reliable.