Assignment 2
Unit 2
1. Discuss nominal, ordinal and categorical data with suitable examples.
2. What are structured, semi-structured and unstructured data? Explain with suitable examples.
3. Explain the sources of data: time series, transactional data, biological data, spatial data and
social network data. Give examples.
4. Observe the following table.
Loan_ID  | Gender | Marital status | Dependents | Education    | Self_Employed | ApplicantIncome (in $) | CoapplicantIncome | LoanAmount (in $ thousands) | Loan_Amount_Term (in months) | Credit_History | Property_Area | Loan_Status
LP001002 | Male   | Single         | 0          | Graduate     | No            | 5849                   | 0                 |                             | 360                          | 1              | Urban         | Y
LP001003 | Male   | Divorced       | 1          | Graduate     | No            | 4583                   | 1508              | 128                         | 360                          | 1              | Rural         | N
LP001005 | Male   | Married        | 0          | Graduate     | Yes           | 3000                   | 0                 | 66                          | 240                          | 1              | Urban         | Y
LP001006 | Male   | Single         | 0          | Not Graduate | No            | 2583                   | 2358              | 120                         | 360                          | 1              | Urban         | Y
LP001008 | Male   | Married        | 0          | Graduate     | No            | 6000                   | 0                 | 141                         | 240                          | 1              | Urban         | Y
LP001011 | Female | Married        | 2          | Graduate     | Yes           | 5417                   | 4196              | 267                         | 360                          | 1              | Urban         | Y
LP001013 | Male   | Married        | 0          | Not Graduate | No            | 2333                   | 1516              | 95                          | 360                          | 1              | Urban         | Y
LP001014 | Male   | Widow          | 3          | Graduate     | No            | 3036                   | 2504              | 158                         | 360                          | 0              | Semiurban     | N
LP001018 | Male   | Divorced       | 2          | Graduate     | No            | 4006                   | 1526              | 168                         | 360                          | 1              | Urban         | Y
LP001020 | Male   | Married        | 1          | Graduate     | No            | 12841                  | 10968             | 349                         | 240                          | 1              | Semiurban     | N
LP001024 | Male   | Married        | 2          | Graduate     | No            | 3200                   | 700               | 70                          | 360                          | 1              | Urban         | Y
LP001027 | Female | Single         | 2          | Graduate     |               | 2500                   | 1840              | 109                         | 360                          | 1              | Urban         | Y
LP001028 | Male   | Widow          | 2          | Graduate     | No            | 3073                   | 8106              | 200                         | 360                          | 1              | Urban         | Y
LP001029 | Male   | Single         | 0          | Graduate     | No            | 1853                   | 2840              | 114                         | 360                          | 1              | Rural         | N
LP001030 | Female | Married        | 2          | Graduate     | No            | 1299                   | 1086              | 17                          | 120                          | 1              | Urban         | Y
Identify each field's type:
a) Qualitative or quantitative (numerical)
b) If qualitative: nominal, ordinal, categorical or binomial; if quantitative: continuous or discrete
c) Whether the given data is structured, semi-structured or unstructured
d) Which scale can be used for each field (nominal, ordinal, interval or ratio)
(All of the above can be presented in tabular form.)
e) Is it high-dimensional data? Justify.
5. Classify the source of each of the following data as transactional, time series, social network,
spatial or biological, with one sentence of reasoning.
1. Library Book issue/return data
2. Structure of proteins and nucleic acids data
3. Twitter data
4. Genome sequence of Viruses
5. Sales data of a grocery store on a day-to-day basis
6. Terrain info of a planet with longitude and latitude
7. Stock information of a Grocery store
8. Sun position observed over a period of time.
9. Friends and friends-of-friends information on Facebook
10. Information of a place over a map
6. Write a short note on data evolution.