Unit 5 data literacy: data collection of data analysis
Data:-data can be defined as a representation of facts or instruction about some entity
(students, school, sports etc) that can be processed by humans or machines.
Data literacy:-data literacy means being able to find and use data effectively this include skills
like collecting data, organising it, checking its quality, analysing it, understanding the results and
using it ethically.
There are three types of data:
structured, semi structured, unstructured
Data collection
It allows you to capture a record of past events so that we can use data analysis to find
recurring patterns. For those pattern you built predictive models using machine learning
algorithm that look for trends and product future changes.
It means pulling data by scrapping, capturing and loading it from multiple sources, including
offline and online sources.
There are mainly two sources of data collection primary and secondary.
Primary sources are sources which are created to collect the data for analysis sum of the
examples are survey interview observation experiment, marketing campaign and questionnaire.
Secondary data sources are where data is already stored and ready for use data given in
books generals, newspaper internal transaction database can be reused for data analysis.
Some methods of collecting secondary data are social media data tracking, web scraping,
satellite data tracking and online data platforms.
Exploring data:-Exploring data means to get an opportunity to identify and correct any problem
in the explore data that would affect the conclusions drawn in anyway during analysis.