Complete the Developing Intimacy with your Data Exercise located at the following link:
Working With Data (Click chapter 4 and then exercises)
Submit a brief paper discussing:
- Why you selected your data set?
- What are the physical properties of the data set?
- What could you do/would you need to do to clean or modify the existing data to create new values to work with?
- What other data could you imagine would be valuable to consolidate the existing data?
Include a screenshot showing your using R, SQL, or Python to perform a manipulation of your data.
Exercises
DEVELOPING INTIMACY WITH YOUR DATA
This exercise involves you working with a dataset of your choosing. Visit the Kaggle website, browse through the options and find a dataset of interest, then follow the simple instructions to download it. With acquisition completed, work through the remaining key steps of examining, transforming and exploring your data to develop a robust familiarisation with its potential offering:
Examination: Thoroughly examine the physical properties (type, size, condition) of your dataset, noting down useful observations or descriptions where relevant.
Transformation: What could you do/would you need to do to clean or modify the existing data to create new values to work with? What other data could you imagine would be valuable to consolidate the existing data?
Exploration: Using a tool of your choice (such as Excel, Tableau, R) to visually explore the dataset in order to deepen your appreciation of the physical properties and their discoverable qualities (insights) to help you cement your understanding of their respective value. If you don’t have scope or time to use a tool, use your imagination to consider what angles of analysis you might explore if you had the opportunity? What piques your interest about this subject?
(You can, of course, repeat this exercise on any subject and any dataset of your choice, not just those on Kaggle.)