Clean Data Using R
Anscombe’s quartet: Four datasets that have nearly identical summary statistics but look very different when plotted.
Log file: A computer-generated file that records events from operating systems and other software programs.
FWF (fixed-width file): A text file in which each column occupies a fixed number of characters, so fields are identified by position rather than by a delimiter.
TSV (tab-separated values file): A text file that stores a data table by separating columns of data with tabs.
Tidy data: A way of standardizing the organization of data within R, in which each variable is a column, each observation is a row, and each value is a cell.
Tibble: A streamlined variation of the data frame, used throughout the tidyverse.
head(): An R function that returns a preview of the column names and the first few rows of a dataset.
mutate(): An R function from the dplyr package that adds new columns to a data frame or changes existing ones, typically by computing them from other columns.
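
Several of the terms above can be tried together in a short session. The following is a minimal sketch, not a prescribed workflow: it uses the `anscombe` data frame that ships with base R, and it assumes the dplyr package may or may not be installed, so the mutate() step is guarded. The column name `y1_centered` is made up for illustration.

```r
# Anscombe's quartet is bundled with base R as the data frame `anscombe`
# (columns x1..x4 and y1..y4, one x/y pair per dataset).
data(anscombe)

# head() previews the column names and the first rows (six by default).
print(head(anscombe))

# The four y columns have nearly identical means ...
y_means <- sapply(anscombe[, c("y1", "y2", "y3", "y4")], mean)

# ... and each x/y pair has nearly the same correlation,
# even though the four scatterplots look very different.
xy_cor <- sapply(1:4, function(i) {
  cor(anscombe[[paste0("x", i)]], anscombe[[paste0("y", i)]])
})

print(round(y_means, 2))
print(round(xy_cor, 2))

# mutate() adds a column computed from existing ones. This part runs only
# if dplyr is available; `y1_centered` is a hypothetical column name.
if (requireNamespace("dplyr", quietly = TRUE)) {
  centered <- dplyr::mutate(anscombe, y1_centered = y1 - mean(y1))
  print(head(centered))
}
```

Plotting each pair, for example with plot(anscombe$x1, anscombe$y1), is what reveals the differences that the summary statistics hide.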