This course will guide you through the basics of using ChatGPT for automating the tedious tasks of data cleaning and formatting.
First, what is data cleaning? Data cleaning involves detecting and correcting (or removing) corrupt or inaccurate records from a dataset. For instance, it sometimes happen that a field is missing (empty cell on excel) or has an error (email address ending without any domain extension e.g .com or .net)
In this tutorial, you will learn how to:
- Upload your data files
- Remove duplicate data
- Remove blanks
- Insert placeholder values
- Standardize formatting
- Label data based on specific rules
