# your solution
Data Exploration and Cleaning Exercise
- Load demo.xlsx dataset
Rename the columns as suggested below
Old name New name Age age Gender gender Marital Status marital_status Address address Income income Income Category income_category Job Category job_category
# your solution
- Display all the columns in the dataset
# your solution
- Display some basic statistics about the numeric variables in the dataset
# your solution
- Display some basic statistics about the categorical variables in the dataset
# your solution
- What are the unique observations under gender?
# your solution
- Can you fix any problems observed under the gender, give brief explanations why and how
# your solution
- How many observations have ‘no answer’ for marital status?
# your solution
- Write some piece of code to return only numeric variables from the dataset
# your solution
- Are there any missing values in the dataset?
# your solution
- Are there any outliers in the income variable?
# your solution
- Investigate the relationship between age and income
# your solution
- How many people earn more than 300 units?
# your solution
- What data type is the marital status?
# your solution
- Create dummy variables for gender
# your solution
END