0% found this document useful (0 votes)
42 views2 pages

Data Analysis Q&A

The document discusses various concepts related to data analysis including data modeling, manipulation, management, cleansing, mining, profiling, visualization, validation, outliers, normal distribution, and tools used for analysis such as Tableau, Power BI, Google Fusion Table, Node XL, and Python libraries like NumPy, Matplotlib, Pandas, and scikit-learn. It also covers responsibilities and skills of data analysts, the data analysis process, challenges faced, and concepts like hash tables, collisions, pivot tables, logistic regression, and data lakes.
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
42 views2 pages

Data Analysis Q&A

The document discusses various concepts related to data analysis including data modeling, manipulation, management, cleansing, mining, profiling, visualization, validation, outliers, normal distribution, and tools used for analysis such as Tableau, Power BI, Google Fusion Table, Node XL, and Python libraries like NumPy, Matplotlib, Pandas, and scikit-learn. It also covers responsibilities and skills of data analysts, the data analysis process, challenges faced, and concepts like hash tables, collisions, pivot tables, logistic regression, and data lakes.
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 2

1. What is Data Analysis?

Data analysis is a process of analysing, modelling & interpreting the data to draw meaningful insights.
2. What is Data Modelling?
The process of creating conceptual representation of data & their relationship is known as Data Modelling.
3. What is Data Manipulation?
The process of organizing data in a specific order to make it easier for interpretation is known as data
manipulation.
4. What is Data Management?
The process of collecting, keeping and use of data in a secure and efficient way is known as data
management.
5. What is Data cleansing?
A process of identifying & modifying the incorrect, incomplete & missing data is known as data cleansing.
6. What is Data Mining?
The Process of exploring rule, patterns and undiscovered relation between the data is known as data
mining.
7. What is Data Profiling?
The process of analysing individual attributes of the data is known as data profiling.
8. What is Data Visualization?
The representation of information in the data with visual elements like graphs, Charts and Maps is known
as Data visualization.
9. What are the Responsibilities of a Data Analyst?
• Collect and analyse the data using statistical techniques and represent the report.
• Establishing the needs with the teams.
• Find opportunities for improvement in the existing area.
10. What are the key skills required a data analyst?
• Knowledge of reporting packages, Coding Language, database & BI Tools.
• Ability to collect, analyse & organize the data.
• Efficient in writing queries, Make reports and presentation.
11. What is the Process of analysing the data?
There are 3 stages of data analysis. 1stly collect the data from various source of information. 2ndly analyse
the data for improvement as per the requirement. Lastly present the analysed data to the end user in the
form of report or dashboard.
12. What are the Challenges faced by a data analyst?
Duplicate entries, incomplete data, insufficient architecture, represent the data from multiple source & un-
realistic timeline are some challenges.
13. What are the tools used for data analysis?
Tableau, Power BI, Google Fusion table, Node Xl are some uses during analysis.
14. What is Data Validation?
The process of check the accuracy and quality of data before implementing it, is known as data validation.
15. What is Outlier?
Outlier is the values that differ from the characteristics of the datasets.
16. What is Normal Distribution?
Normal distribution defines and measure how their values differs mean and standard deviation.
17. Which python libraries are used in data analysis?
Numpy, Matplotlib, Pandas and sklearn are used in data analysis.
18. What is Hash Table?
Hash table is a data structure that store the data in associative manner. It store the data generally in array.
19. What is Collision?
When 2 keys having same index then the situation called collision.
20. What is Pivot Table?
Pivot table is a tool used for summarize the large data set.
21. What is Logistic Regression?
Logistic method is a mathematical model used to study the data with one or more independent variables.
22. What is Data Lake?
Data Lakes are the largest storage devices which can store the raw data in original format.

You might also like