0% found this document useful (0 votes)

48 views34 pages

Python and Libraries For AI

The document provides a comprehensive overview of Python programming, focusing on its application in AI, machine learning, and data science. It covers Python basics, control structures, data structures, object-oriented programming, and essential libraries like NumPy, pandas, and Matplotlib. Additionally, it discusses machine learning concepts, data manipulation techniques, and practical coding questions, making it a valuable resource for learners and practitioners in the field.

Uploaded by

dnaresh2323

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

48 views34 pages

Python and Libraries For AI

Uploaded by

dnaresh2323

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 34

Python and Libraries for AI,ML & Data Science

1. Python Basics

Q1. What is Python and why is it popular in AI and Data Science?

A:
Python is a high-level, versatile programming language known for its simplicity and
readability. It's popular in AI and Data Science because of its extensive libraries (like
NumPy, pandas, scikit-learn), strong community support, and ease of integrating with
other tools, making data analysis and machine learning tasks more efficient.

Q2. What are the key data types in Python?

 Integers (int): Whole numbers (e.g., 5, -3).

 Floating-point numbers (float): Decimal numbers (e.g., 3.14).

 Strings (str): Text enclosed in quotes (e.g., "Hello").

 Booleans (bool): True or False values.

 Lists (list): Ordered, mutable collections (e.g., [1, 2, 3]).

 Tuples (tuple): Ordered, immutable collections (e.g., (1, 2, 3)).

 Dictionaries (dict): Key-value pairs (e.g., {'a':1, 'b':2}).

 Sets (set): Unordered collections of unique elements (e.g., {1, 2, 3}).

Q3. How do you write a function in Python?

A:
Use the def keyword followed by the function name and parameters. For example:

def greet(name):

return f"Hello, {name}!"

# Usage

print(greet("Alice")) # Output: Hello, Alice!

Q4. What is a Python list comprehension?

A:
List comprehension is a concise way to create lists. It combines loops and conditional
statements in a single line. For example, to create a list of squares:

squares = [x**2 for x in range(5)]

print(squares) # Output: [0, 1, 4, 9, 16]

Q5. Explain the difference between append() and extend() methods in lists.

 append(element): Adds a single element to the end of the list.

lst = [1, 2]

lst.append(3) # lst becomes [1, 2, 3]

 extend(iterable): Adds each element from an iterable (like another list) to the
end.

lst = [1, 2]

lst.extend([3, 4]) # lst becomes [1, 2, 3, 4]

2. Control Structures

Q6. How do you write an if-else statement in Python?

A:
Use indentation to define blocks. For example:

x = 10

if x > 5:

print("x is greater than 5")

else:

print("x is 5 or less")

Q7. What is a for loop in Python? Provide an example.

A:
A for loop iterates over elements of a sequence (like a list).

fruits = ['apple', 'banana', 'cherry']

for fruit in fruits:

print(fruit)

Output:

apple

banana

cherry
Q8. How do you handle exceptions in Python?

A:
Use try and except blocks to catch and handle errors.

try:

result = 10 / 0

except ZeroDivisionError:

print("Cannot divide by zero.")

Output:

(Csharp-code)

Cannot divide by zero.

3. Data Structures

Q9. What is a dictionary in Python? How is it different from a list?

A:
A dictionary is a collection of key-value pairs, allowing fast access to values via keys.
Unlike lists, which are ordered and accessed by index, dictionaries are unordered
(prior to Python 3.7) and accessed by unique keys.

# Dictionary

student = {'name': 'Alice', 'age': 25}

# List

student_list = ['Alice', 25]

Q10. How do you iterate over key-value pairs in a dictionary?

A:
Use the .items() method.

student = {'name': 'Alice', 'age': 25}

for key, value in student.items():

print(f"{key}: {value}")

Output:

(Makefile-code)

name: Alice

age: 25
Q11. Explain the difference between a tuple and a list.

 List:

o Mutable (can be changed).

o Defined with square brackets [].

o Example: [1, 2, 3]

 Tuple:

o Immutable (cannot be changed).

o Defined with parentheses ().

o Example: (1, 2, 3)

4. Object-Oriented Programming (OOP)

Q12. What is a class in Python?

A:
A class is a blueprint for creating objects. It defines attributes (data) and methods
(functions) that the objects created from the class can have.

class Dog:

def init(self, name):

self.name = name

def bark(self):

return f"{self.name} says woof!"

# Creating an object

my_dog = Dog("Buddy")

print(my_dog.bark()) # Output: Buddy says woof!

Q13. What is inheritance in Python?

A:
Inheritance allows a class (child) to inherit attributes and methods from another class
(parent), promoting code reuse.

class Animal:

def speak(self):
return "Some sound"

class Dog(Animal):

def speak(self):

return "Woof!"

my_dog = Dog()

print(my_dog.speak()) # Output: Woof!

Q14. What is the init method in Python classes?

A:
The __init__ method is a constructor that initializes an object's attributes when the
object is created.

class Person:

def init(self, name, age):

self.name = name

self.age = age

# Creating an object

person = Person("Alice", 30)

print(person.name) # Output: Alice

print(person.age) # Output: 30

5. Python Libraries for AI, ML, and Data Science

Q15. What is NumPy and why is it important?

A:
NumPy is a library for numerical computing in Python. It provides support for large,
multi-dimensional arrays and matrices, along with a collection of mathematical
functions to operate on these arrays efficiently. It's fundamental for data manipulation
and is widely used in AI and ML projects.

Q16. What is pandas in Python?

A:
Pandas is a powerful library for data manipulation and analysis. It introduces two main
data structures: Series (1D) and DataFrame (2D), which make it easy to handle
structured data like CSV files, SQL tables, and Excel spreadsheets. Pandas is essential
for data cleaning, transformation, and exploratory data analysis.

Q17. How do you install a Python library, for example, pandas?

A:
Use the pip package manager in the terminal or command prompt.

(Bash-code)

pip install pandas

Q18. What is Matplotlib?

A:
Matplotlib is a plotting library for creating static, interactive, and animated
visualizations in Python. It's widely used for generating graphs, charts, and plots to
visualize data, which is crucial for data analysis and reporting.

Q19. What is scikit-learn?

A:
Scikit-learn is a library for machine learning in Python. It provides simple and efficient
tools for data mining and data analysis, including various algorithms for classification,
regression, clustering, and dimensionality reduction, as well as tools for model
selection and evaluation.

Q20. Explain the difference between NumPy arrays and pandas DataFrames.

 NumPy Arrays (ndarray):

o Structure: Multi-dimensional, homogeneous data (all elements must be

the same type).

o Usage: Efficient numerical computations, mathematical operations.

 Pandas DataFrames:

o Structure: 2D, heterogeneous data (different data types in each column).

o Usage: Data manipulation, cleaning, analysis, and handling structured

data like tables.

Example:

import numpy as np

import pandas as pd
# NumPy array

np_array = np.array([[1, 2], [3, 4]])

print("NumPy Array:\n", np_array)

# Pandas DataFrame

df = pd.DataFrame({'A': [1, 3], 'B': [2, 4]})

print("\nPandas DataFrame:\n", df)

Output:

(Lua-code)

NumPy Array:

[[1 2]

[3 4]]

Pandas DataFrame:

A B

0 1 2

1 3 4

6. Data Manipulation and Cleaning

Q21. How do you read a CSV file using pandas?

A:
Use the read_csv() function.

import pandas as pd

# Read CSV file

df = pd.read_csv('data.csv')

print(df.head()) # Display first 5 rows

Q22. How do you handle missing values in pandas?

A:
Common methods include:
 Removing Missing Values:

df.dropna(inplace=True)

 Filling Missing Values:

df.fillna(value=0, inplace=True)

 Forward Fill:

df.fillna(method='ffill', inplace=True)

Q23. How can you filter rows in a pandas DataFrame based on a condition?

A:
Use boolean indexing.

import pandas as pd

# Sample DataFrame

data = {'Name': ['Alice', 'Bob', 'Charlie'],

'Age': [25, 30, 35]}

df = pd.DataFrame(data)

# Filter rows where Age > 28

filtered_df = df[df['Age'] > 28]

print(filtered_df)

Output:

(Markdown-code)

Name Age

1 Bob 30

2 Charlie 35

Q24. How do you merge two pandas DataFrames?

A:
Use the merge() function.

import pandas as pd

# Sample DataFrames

df1 = pd.DataFrame({'ID': [1, 2, 3],

'Name': ['Alice', 'Bob', 'Charlie']})

df2 = pd.DataFrame({'ID': [1, 2, 4],

'Age': [25, 30, 40]})

# Merge on 'ID'

merged_df = pd.merge(df1, df2, on='ID', how='inner')

print(merged_df)

Output:

ID Name Age

0 1 Alice 25

1 2 Bob 30

7. Data Visualization

Q25. How do you create a simple line plot using Matplotlib?

A:
Use the plot() function.

import matplotlib.pyplot as plt

# Sample data

x = [1, 2, 3, 4, 5]

y = [2, 3, 5, 7, 11]

# Create line plot

plt.plot(x, y)

plt.xlabel('X-axis')

plt.ylabel('Y-axis')

plt.title('Simple Line Plot')

plt.show()

Q26. How do you create a bar chart using Matplotlib?

A:
Use the bar() function.

import matplotlib.pyplot as plt

# Sample data

categories = ['A', 'B', 'C']

values = [10, 20, 15]

# Create bar chart

plt.bar(categories, values)

plt.xlabel('Categories')

plt.ylabel('Values')

plt.title('Bar Chart Example')

plt.show()

Q27. How can you visualize a histogram using Matplotlib?

A:
Use the hist() function.

import matplotlib.pyplot as plt

import numpy as np

# Sample data

data = np.random.randn(1000)

# Create histogram

plt.hist(data, bins=30, edgecolor='black')

plt.xlabel('Value')

plt.ylabel('Frequency')

plt.title('Histogram Example')

plt.show()

8. Machine Learning Basics

Q28. What is machine learning?

A:
Machine Learning is a subset of artificial intelligence that enables computers to learn
from data and make predictions or decisions without being explicitly programmed for
specific tasks. It involves algorithms that improve their performance as they are
exposed to more data.

Q29. What is the difference between supervised and unsupervised learning?

 Supervised Learning:

o Definition: Learns from labeled data (input-output pairs).

o Examples: Classification, Regression.

o Use Cases: Spam detection, house price prediction.

 Unsupervised Learning:

o Definition: Learns from unlabeled data to find hidden patterns.

o Examples: Clustering, Dimensionality Reduction.

o Use Cases: Customer segmentation, anomaly detection.

Q30. What is overfitting in machine learning?

A:
Overfitting occurs when a model learns the training data too well, including its noise
and outliers, leading to poor performance on new, unseen data. It means the model is
too complex and doesn't generalize well.

Prevention Techniques:

 Use simpler models.

 Gather more training data.

 Apply regularization.

 Use cross-validation.

Q31. What is a confusion matrix?

A:
A confusion matrix is a table used to evaluate the performance of a classification
model. It shows the number of correct and incorrect predictions broken down by each
class.

Components:

 True Positives (TP): Correctly predicted positive class.

 True Negatives (TN): Correctly predicted negative class.

 False Positives (FP): Incorrectly predicted positive class.

 False Negatives (FN): Incorrectly predicted negative class.

Example:

(Yaml-code)

Predicted

Yes No

Actual Yes TP FN

No FP TN

Q32. What is cross-validation?

A:
Cross-validation is a technique to assess how well a machine learning model
generalizes to an independent dataset. It involves splitting the data into multiple
subsets, training the model on some subsets, and validating it on others.

Common Methods:

 k-Fold Cross-Validation: Splits data into k equal parts and iterates training
and validation k times.

 Leave-One-Out Cross-Validation (LOOCV): Each sample is used once as a

validation set while the rest form the training set.

Q33. What is the purpose of the train_test_split function in scikit-learn?

A:
The train_test_split function splits a dataset into training and testing subsets. This
allows you to train a model on one set of data and evaluate its performance on
another, ensuring that the model generalizes well to new data.

from sklearn.model_selection import train_test_split

# X: Features, y: Labels

X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2, random_state=42)

Q34. What is a feature in machine learning?

A:
A feature is an individual measurable property or characteristic of the data used as
input for a machine learning model. Features are used by algorithms to make
predictions or classifications.

Example: In a dataset predicting house prices:

 Features: Number of bedrooms, size in square feet, location.

 Label: House price.

Q35. Explain the concept of regularization in machine learning.

A:
Regularization is a technique used to prevent overfitting by adding a penalty to the
model's complexity. It discourages the model from fitting the noise in the training
data.

Common Types:

 L1 Regularization (Lasso): Adds absolute value of coefficients.

 L2 Regularization (Ridge): Adds squared value of coefficients.

Example in scikit-learn:

from sklearn.linear_model import Ridge

model = Ridge(alpha=1.0)

model.fit(X_train, y_train)

9. Practical Coding Questions

Q36. How do you import a library in Python?

A:
Use the import statement.

import numpy as np

import pandas as pd

Q37. Write a Python function to calculate the factorial of a number.

A:
Using a loop:

def factorial(n):

result = 1

for i in range(1, n + 1):

result *= i

return result

print(factorial(5)) # Output: 120

Using recursion:
def factorial(n):

if n == 0:

return 1

else:

return n * factorial(n - 1)

print(factorial(5)) # Output: 120

Q38. How do you handle missing values in a pandas DataFrame?

A:
You can remove or fill missing values using dropna() or fillna().

import pandas as pd

# Sample DataFrame with missing values

data = {'A': [1, 2, None], 'B': [4, None, 6]}

df = pd.DataFrame(data)

# Remove rows with missing values

df_cleaned = df.dropna()

# Fill missing values with a specific value

df_filled = df.fillna(0)

print("Cleaned DataFrame:\n", df_cleaned)

print("\nFilled DataFrame:\n", df_filled)

Q39. How do you calculate the mean of a NumPy array?

A:
Use the numpy.mean() function.

import numpy as np

arr = np.array([1, 2, 3, 4, 5])

mean = np.mean(arr)
print("Mean:", mean) # Output: 3.0

Q40. Write a Python program to check if a number is prime.

def is_prime(n):

if n <= 1:

return False

for i in range(2, int(n**0.5) + 1):

if n % i == 0:

return False

return True

# Test the function

print(is_prime(7)) # Output: True

print(is_prime(10)) # Output: False

10. Advanced Topics

Q41. What is list slicing in Python?

A:
List slicing allows you to access a subset of a list by specifying a start and end index.

my_list = [0, 1, 2, 3, 4, 5]

subset = my_list[2:5] # [2, 3, 4]

Q42. How do you concatenate two lists in Python?

A:
Use the + operator or the extend() method.

# Using +

list1 = [1, 2]

list2 = [3, 4]

concatenated = list1 + list2

print(concatenated) # Output: [1, 2, 3, 4]

# Using extend()
list1 = [1, 2]

list1.extend([3, 4])

print(list1) # Output: [1, 2, 3, 4]

Q43. What is a lambda function in Python?

A:
A lambda function is an anonymous, small function defined using the lambda
keyword. It's useful for short, simple functions.

# Lambda function to add two numbers

add = lambda x, y: x + y

print(add(3, 5)) # Output: 8

Q44. How do you handle exceptions in Python?

A:
Use try, except, else, and finally blocks to catch and handle errors.

try:

result = 10 / 0

except ZeroDivisionError:

print("Cannot divide by zero.")

else:

print("Division successful.")

finally:

print("Execution completed.")

Output:

(Csharp-code)

Cannot divide by zero.

Execution completed.

Q45. What is the purpose of the self keyword in Python classes?

A:
self refers to the instance of the class. It's used to access attributes and methods
within the class.

class Car:

def init(self, model):

self.model = model

def display_model(self):

print(f"Model: {self.model}")

my_car = Car("Tesla")

my_car.display_model() # Output: Model: Tesla

Q46. Explain the difference between deepcopy and shallow copy.

 Shallow Copy (copy.copy()):

o Creates a new object but inserts references into it.

o Changes in nested objects affect both copies.

 Deep Copy (copy.deepcopy()):

o Creates a new object and recursively copies all nested objects.

o Changes in nested objects do not affect the original copy.

Example:

import copy

original = [[1, 2], [3, 4]]

# Shallow copy

shallow = copy.copy(original)

shallow[0][0] = 'a'

print("Original after shallow copy modification:", original) # [['a', 2], [3, 4]]

# Deep copy

original = [[1, 2], [3, 4]]

deep = copy.deepcopy(original)

deep[0][0] = 'a'

print("Original after deep copy modification:", original) # [[1, 2], [3, 4]]
Q47. What is the Global Interpreter Lock (GIL) in Python?

A:
The GIL is a mutex that protects access to Python objects, preventing multiple native
threads from executing Python bytecodes simultaneously. It simplifies memory
management but can limit the performance of CPU-bound multi-threaded programs.

Q48. How do you optimize Python code for better performance?

 Use Built-in Functions and Libraries: They are optimized and faster.

 Avoid Using Loops When Possible: Utilize vectorized operations with NumPy
or pandas.

 Use List Comprehensions: They are faster than traditional loops.

 Profile Your Code: Identify bottlenecks using profiling tools like cProfile.

 Leverage Multi-processing: For CPU-bound tasks, use the multiprocessing

module.

 Use Just-In-Time Compilers: Tools like Numba can speed up numerical

computations.

Q49. What is the purpose of the __str__ and __repr__ methods in Python?

 str: Defines the human-readable string representation of an object, used by

the print() function.

 repr: Defines the official string representation of an object, used in

debugging and by the repr() function. It's meant to be unambiguous.

Example:

class Point:

def init(self, x, y):

self.x = x

self.y = y

def __str__(self):

return f"Point({self.x}, {self.y})"

def __repr__(self):

return f"Point(x={self.x}, y={self.y})"

p = Point(1, 2)

print(p) # Uses str: Point(1, 2)

print(repr(p)) # Uses repr: Point(x=1, y=2)

Q50. How do you create a virtual environment in Python?

A:
Use the venv module to create an isolated Python environment.

# Create a virtual environment named 'env'

python -m venv env

# Activate the virtual environment

# On Windows:

env\Scripts\activate

# On macOS/Linux:

source env/bin/activate

11. Working with Data

Q51. How do you drop a column from a pandas DataFrame?

A:
Use the drop() method with axis=1.

import pandas as pd

# Sample DataFrame

data = {'Name': ['Alice', 'Bob'], 'Age': [25, 30], 'City': ['NY', 'LA']}

df = pd.DataFrame(data)

# Drop the 'City' column

df = df.drop('City', axis=1)

print(df)

Output:

(Markdown-code)
Name Age

0 Alice 25

1 Bob 30

Q52. How do you handle categorical variables in machine learning?

A:
Convert categorical variables into numerical formats using techniques like:

 Label Encoding: Assigns a unique integer to each category.

 One-Hot Encoding: Creates binary columns for each category.

Example using pandas:

import pandas as pd

# Sample DataFrame

data = {'Color': ['Red', 'Blue', 'Green', 'Blue']}

df = pd.DataFrame(data)

# One-Hot Encoding

df_encoded = pd.get_dummies(df, columns=['Color'])

print(df_encoded)

Output:

Copy code

Color_Blue Color_Green Color_Red

0 0 0 1

1 1 0 0

2 0 1 0

3 1 0 0

Q53. What is feature scaling and why is it important?

A:
Feature scaling normalizes the range of independent variables (features) to ensure
that each feature contributes equally to the result. It's important because many
machine learning algorithms perform better or converge faster when features are on a
similar scale.

Common Techniques:
 Min-Max Scaling: Scales features to a range of [0, 1].

 Standardization (Z-score): Centers features around the mean with a standard

deviation of 1.

Q54. How do you handle imbalanced datasets in machine learning?

A:
Techniques to handle imbalanced datasets include:

 Resampling:

o Oversampling: Increase the number of minority class samples (e.g.,

SMOTE).

o Undersampling: Decrease the number of majority class samples.

 Using Different Algorithms:

o Algorithms like Random Forest, Gradient Boosting can handle imbalance

better.

 Changing Evaluation Metrics:

o Use metrics like Precision, Recall, F1-Score instead of Accuracy.

 Cost-Sensitive Learning:

o Assign higher costs to misclassifying the minority class.

Q55. What is Principal Component Analysis (PCA)?

A:
PCA is a dimensionality reduction technique that transforms high-dimensional data
into a lower-dimensional form while preserving as much variance as possible. It
identifies the principal components (directions of maximum variance) in the data,
which can help in reducing noise and improving model performance.

Usage in scikit-learn:

from sklearn.decomposition import PCA

import numpy as np

# Sample data

X = np.random.rand(100, 5)

# Apply PCA to reduce to 2 dimensions

pca = PCA(n_components=2)
X_reduced = pca.fit_transform(X)

print("Reduced Data Shape:", X_reduced.shape) # Output: (100, 2)

Q56. What is the difference between fit and transform methods in scikit-
learn?

 fit(): Learns the parameters from the data (e.g., mean and variance for scaling).

 transform(): Applies the learned parameters to transform the data.

 fit_transform(): Combines both steps for convenience.

Example:

from sklearn.preprocessing import StandardScaler

import numpy as np

scaler = StandardScaler()

# Sample data

X = np.array([[1, 2], [3, 4], [5, 6]])

# Fit the scaler to the data and transform it

X_scaled = scaler.fit_transform(X)

print("Scaled Data:\n", X_scaled)

Q57. How do you evaluate a classification model's performance?

A:
Common evaluation metrics for classification models include:

 Accuracy: Proportion of correct predictions.

 Precision: Proportion of positive identifications that were actually correct.

 Recall (Sensitivity): Proportion of actual positives correctly identified.

 F1-Score: Harmonic mean of Precision and Recall.

 Confusion Matrix: Table showing correct and incorrect predictions.

Example using scikit-learn:

from sklearn.metrics import accuracy_score, precision_score, recall_score, f1_score,
confusion_matrix

# True labels and predicted labels

y_true = [1, 0, 1, 1, 0]

y_pred = [1, 0, 1, 0, 0]

# Calculate metrics

accuracy = accuracy_score(y_true, y_pred)

precision = precision_score(y_true, y_pred)

recall = recall_score(y_true, y_pred)

f1 = f1_score(y_true, y_pred)

cm = confusion_matrix(y_true, y_pred)

print("Accuracy:", accuracy)

print("Precision:", precision)

print("Recall:", recall)

print("F1-Score:", f1)

print("Confusion Matrix:\n", cm)

Q58. What is cross-validation and why is it used?

A:
Cross-validation is a technique to assess how a machine learning model will generalize
to an independent dataset. It involves partitioning the data into subsets, training the
model on some subsets, and validating it on others. It helps in:

 Preventing Overfitting: Ensures the model performs well on unseen data.

 Reliable Performance Estimates: Provides a more accurate measure of

model performance.

Common Method:

 k-Fold Cross-Validation: Divides data into k equal parts and iterates training
and validation k times.

Q59. How do you handle categorical data in machine learning?

A:
Convert categorical data into numerical format using encoding techniques:

 Label Encoding: Assigns a unique integer to each category.

 One-Hot Encoding: Creates binary columns for each category.

Example using pandas:

import pandas as pd

# Sample DataFrame

data = {'Color': ['Red', 'Blue', 'Green']}

df = pd.DataFrame(data)

# One-Hot Encoding

df_encoded = pd.get_dummies(df, columns=['Color'])

print(df_encoded)

Output:

Color_Blue Color_Green Color_Red

0 0 0 1

1 1 0 0

2 0 1 0

Q60. What is the purpose of the random_state parameter in scikit-learn

functions?

A:
The random_state parameter ensures reproducibility by controlling the randomness of
processes like data splitting or algorithm initialization. Setting a specific random_state
value allows you to get the same results every time you run the code.

Example:

from sklearn.model_selection import train_test_split

# Split data with a fixed random state

X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2, random_state=42)

12. Practical Tips for Interviews

 Understand the Basics: Make sure you have a solid grasp of Python
fundamentals.

 Practice Coding: Solve coding problems on platforms like LeetCode or

HackerRank.

 Know Your Libraries: Familiarize yourself with essential libraries like NumPy,
pandas, Matplotlib, and scikit-learn.

 Work on Projects: Practical experience through projects can help you

understand real-world applications.

 Prepare for Behavioral Questions: Be ready to discuss your projects,

challenges faced, and how you overcame them.

 Stay Updated: Keep up with the latest trends and updates in AI, ML, and Data
Science.

13. Additional Common Interview Questions

Q61. What is the difference between a list and a tuple in Python?

 List:

o Mutable: Can be changed after creation.

o Syntax: Defined with square brackets [].

o Example: [1, 2, 3]

 Tuple:

o Immutable: Cannot be changed after creation.

o Syntax: Defined with parentheses ().

o Example: (1, 2, 3)

Use Cases:
Use tuples for fixed collections of items and lists for collections that may change.

Q62. Why is NumPy faster than Python lists?

 Uniform Data Types: NumPy arrays store elements of the same type, enabling
optimized memory usage and faster computations.

 Optimized C Implementation: NumPy operations are executed in compiled C

code, which is faster than Python's interpreted loops.

 Vectorized Operations: Perform operations on entire arrays without explicit

Python loops, enhancing speed.
 Memory Efficiency: Uses contiguous memory blocks, improving cache locality
and access speed.

Example:

import numpy as np

import time

# NumPy array

np_array = np.arange(1000000)

# Python list

py_list = list(range(1000000))

# NumPy addition

start_time = time.time()

np_result = np_array + 1

print("NumPy Time:", time.time() - start_time, "seconds")

# Python list addition using list comprehension

start_time = time.time()

py_result = [x + 1 for x in py_list]

print("Python List Time:", time.time() - start_time, "seconds")

Output:

(Less-code)

NumPy Time: 0.025 seconds

Python List Time: 0.25 seconds

Q63. How do you check for an empty (zero-element) array in Python?

A:
Use the .size attribute. If size is 0, the array is empty.

import numpy as np

# Create an empty array

empty_array = np.zeros((1, 0))

print("Empty Array:", empty_array)

print("Size:", empty_array.size) # Output: 0

# Check if the array is empty

if empty_array.size == 0:

print("The array is empty.")

else:

print("The array is not empty.")

Output:

(Sql-code)

Empty Array: []

Size: 0

The array is empty.

Q64. How do you count the number of times a given value appears in an
array of integers in NumPy?

A:
Use the numpy.bincount() function for non-negative integers.

import numpy as np

# Create an array of integers

arr = np.array([0, 5, 4, 0, 4, 4, 3, 0, 0, 5, 2, 1, 1, 9])

# Count the occurrences

counts = np.bincount(arr)

print("Counts of each integer:", counts)

Output:

(Sql-code)

Counts of each integer: [4 2 1 1 3 2 0 0 0 1]

Explanation:

 0 appears 4 times.

 1 appears 2 times.
 2 appears 1 time.

 3 appears 1 time.

 4 appears 3 times.

 5 appears 2 times.

 9 appears 1 time.

Q65. How can you sort an array in NumPy?

A:
Use the .sort() method for in-place sorting or numpy.sort() for returning a sorted copy.

In-place Sorting:

import numpy as np

# Create an unsorted array

arr = np.array([3, 2, 1])

# Sort the array in ascending order

arr.sort()

print(arr) # Output: [1 2 3]

Creating a Sorted Copy:

import numpy as np

# Create an unsorted array

original = np.array([10, 7, 8, 9, 1])

# Sort the array and create a new sorted array

sorted_copy = np.sort(original)

print("Original Array:", original)

print("Sorted Copy:", sorted_copy)

Output:

(Less-code)

Original Array: [10 7 8 9 1]

Sorted Copy: [ 1 7 8 9 10]

Sorting in Descending Order:

import numpy as np

# Create an array

arr = np.array([3, 1, 4, 2, 5])

# Sort in ascending order and then reverse

sorted_desc = np.sort(arr)[::-1]

print("Sorted in Descending Order:", sorted_desc) # Output: [5 4 3 2 1]

Q66. How can you find the maximum or minimum value of an array in
NumPy?

A:
Use numpy.max() and numpy.min() functions.

import numpy as np

# Create an array

arr = np.array([3, 2, 1])

# Find the maximum value

max_value = np.max(arr)

print("Maximum Value:", max_value) # Output: 3

# Find the minimum value

min_value = np.min(arr)

print("Minimum Value:", min_value) # Output: 1

For Multi-dimensional Arrays:

import numpy as np

# Create a 2D array

matrix = np.array([[3, 2, 1],

[5, 4, 6]])
# Find the maximum value in each column

max_cols = np.max(matrix, axis=0)

print("Max of each column:", max_cols) # Output: [5 4 6]

# Find the minimum value in each row

min_rows = np.min(matrix, axis=1)

print("Min of each row:", min_rows) # Output: [1 4]

Q67. How can slicing and indexing be used for data cleaning in NumPy?

A:
Indexing and slicing allow you to access and modify specific parts of an array based
on conditions, which is useful for data cleaning.

Example:

import numpy as np

# Sample NumPy array with negative values

data = np.array([1, 2, -1, 4, 5, -2, 7])

# Indexing: Replace negative values with zeros

data[data < 0] = 0

print("Data after replacing negatives with zeros:", data) # Output: [1 2 0 4 5 0 7]

# Slicing: Extract elements greater than 2

subset = data[data > 2]

print("Elements greater than 2:", subset) # Output: [4 5 7]

Explanation:

 Indexing: Applies a condition to replace specific elements.

 Slicing: Extracts a subset of the array based on a condition.

Q68. What is the difference between using the shape and size attributes of a
NumPy array?

 shape:
o Definition: A tuple that describes the dimensions of the array.

o Example: For a 3x4 array, shape is (3, 4).

o Usage: Helps understand the structure of the array (number of rows and
columns).

 size:

o Definition: An integer representing the total number of elements in the

array.

o Example: For a 3x4 array, size is 12.

o Usage: Useful for knowing how much data is stored, regardless of its
shape.

Example:

import numpy as np

# Create a 2D NumPy array

arr = np.array([[1, 2, 3, 4],

[5, 6, 7, 8],

[9, 10, 11, 12]])

# Get the shape of the array

shape = arr.shape

print("Shape:", shape) # Output: (3, 4)

# Get the size of the array

size = arr.size

print("Size:", size) # Output: 12

Q69. What is a NumPy array and how is it different from a NumPy matrix?

 NumPy Array (ndarray):

o Definition: A versatile N-dimensional array object used for storing and

manipulating numerical data.

o Features:

 Multidimensional: Supports 1D, 2D, 3D, etc.

 Element-wise Operations: Operations are performed element by
element.

 Flexible: Can handle various data types.

 NumPy Matrix:

o Definition: A specialized 2-dimensional array subclass for linear algebra.

o Features:

 Always 2D: Strictly two-dimensional.

 Matrix Multiplication: The * operator performs matrix

multiplication instead of element-wise.

 Built-in Linear Algebra Methods: Provides methods like .I for

inverse and .T for transpose.

Example:

import numpy as np

# NumPy array

array = np.array([[1, 2, 3],

[4, 5, 6]])

print("NumPy Array:\n", array)

# NumPy matrix

matrix = np.matrix([[1, 2],

[3, 4]])

print("\nNumPy Matrix:\n", matrix)

# Matrix multiplication

result = matrix * matrix

print("\nMatrix Multiplication:\n", result)

Output:

(Lua-code)

NumPy Array:

[[1 2 3]

[4 5 6]]
NumPy Matrix:

[[1 2]

[3 4]]

Matrix Multiplication:

[[ 7 10]

[15 22]]

Note:
While matrices can be useful for linear algebra, ndarray is more flexible and widely
used in the NumPy ecosystem. Many developers prefer using ndarray with functions
from numpy.linalg for linear algebra operations.

Q70. How can you find the unique elements in an array in NumPy?

A:
Use the numpy.unique() function to identify unique elements in an array. It can also
return the counts of each unique element.

Example:

import numpy as np

# Create an array with duplicate elements

array = np.array([1, 2, 3, 1, 2, 3, 3, 4, 5, 6, 7, 5])

# Find unique elements

unique_elements = np.unique(array)

print("Unique Elements:", unique_elements) # Output: [1 2 3 4 5 6 7]

# Find unique elements and their counts

unique, counts = np.unique(array, return_counts=True)

print("Unique Elements:", unique) # Output: [1 2 3 4 5 6 7]

print("Counts:", counts) # Output: [2 2 3 1 2 1 1]

Explanation:

 unique_elements: Contains all unique values in the array, sorted.

 counts: Shows how many times each unique element appears.

Finding Unique Rows in a 2D Array:

import numpy as np

# Create a 2D array with duplicate rows

array_2d = np.array([[1, 2],

[3, 4],

[1, 2],

[5, 6]])

# Find unique rows

unique_rows = np.unique(array_2d, axis=0)

print("Unique Rows:\n", unique_rows)

Output:

(Lua-code)

Unique Rows:

[[1 2]

[3 4]

[5 6]]

14. Conclusion

Preparing for Python interviews in AI, Machine Learning, and Data Science involves
understanding both Python programming concepts and how they apply to data-related
tasks. Focus on practicing coding problems, understanding library functionalities, and
applying concepts to real-world scenarios. Remember to work on projects and build a
portfolio to showcase your skills to potential employers.

Good luck with your interview preparations!

Python 1
No ratings yet
Python 1
14 pages
Top 50 Python Interview Questions
No ratings yet
Top 50 Python Interview Questions
8 pages
Ct3 QB Answers
No ratings yet
Ct3 QB Answers
8 pages
100 Python Interview Questions
100% (1)
100 Python Interview Questions
68 pages
Assignment
No ratings yet
Assignment
6 pages
Python Important
No ratings yet
Python Important
35 pages
Pyhton Potential Interview Questions
No ratings yet
Pyhton Potential Interview Questions
34 pages
Python Imp 001
No ratings yet
Python Imp 001
16 pages
Python Exam Paper Solve
No ratings yet
Python Exam Paper Solve
7 pages
? Python Interview Questions With Detailed Answers
No ratings yet
? Python Interview Questions With Detailed Answers
6 pages
Interview Questions
No ratings yet
Interview Questions
99 pages
Top 100 Python Interview Questions For Data Analyst
No ratings yet
Top 100 Python Interview Questions For Data Analyst
10 pages
Python Unit 1 To 5
No ratings yet
Python Unit 1 To 5
10 pages
2024 Summer Model Answer Paper
No ratings yet
2024 Summer Model Answer Paper
28 pages
Python Developer Interview
No ratings yet
Python Developer Interview
9 pages
Ip File Library Stock
No ratings yet
Ip File Library Stock
36 pages
Python Interview QA DataScience GenAI
No ratings yet
Python Interview QA DataScience GenAI
4 pages
4 Weeks Session 2 DA Fundamentals
No ratings yet
4 Weeks Session 2 DA Fundamentals
36 pages
Python Interviews Question
No ratings yet
Python Interviews Question
47 pages
Python Programming Essentials Guide
No ratings yet
Python Programming Essentials Guide
11 pages
Unit 3 Python Class 11 Ai Subject
No ratings yet
Unit 3 Python Class 11 Ai Subject
9 pages
Iii Sem Python Solved QP 2023
No ratings yet
Iii Sem Python Solved QP 2023
10 pages
Python Question Bank
No ratings yet
Python Question Bank
12 pages
Elaborate All Questions From First To Last On Firs...
No ratings yet
Elaborate All Questions From First To Last On Firs...
4 pages
Data Science Using Python
No ratings yet
Data Science Using Python
7 pages
Python Assignment
No ratings yet
Python Assignment
10 pages
Real Python Interview Questions American Express
No ratings yet
Real Python Interview Questions American Express
7 pages
Python Concepts Explained with Examples
No ratings yet
Python Concepts Explained with Examples
11 pages
Class X Practical Notes
No ratings yet
Class X Practical Notes
17 pages
Final Ga
No ratings yet
Final Ga
152 pages
9th - PartB - UNIT 7 - QA
No ratings yet
9th - PartB - UNIT 7 - QA
6 pages
PFP Theory Ques (2) .3736
No ratings yet
PFP Theory Ques (2) .3736
18 pages
Python Basics & Data Structures
No ratings yet
Python Basics & Data Structures
47 pages
Complete Python Interview QA
No ratings yet
Complete Python Interview QA
6 pages
Week 1
No ratings yet
Week 1
7 pages
Python2 Materials
No ratings yet
Python2 Materials
27 pages
BASIC - FUNCTIONALITIES - OF - PYTHON (1) Vikas
No ratings yet
BASIC - FUNCTIONALITIES - OF - PYTHON (1) Vikas
52 pages
Python For Machine Learning
No ratings yet
Python For Machine Learning
78 pages
Python Interview Questions & Answers
No ratings yet
Python Interview Questions & Answers
62 pages
Python Graphics Assignment Set 2 Q1-10 Full
No ratings yet
Python Graphics Assignment Set 2 Q1-10 Full
1 page
Python QB Ans
No ratings yet
Python QB Ans
3 pages
Python - Data Science Lecture 1
No ratings yet
Python - Data Science Lecture 1
55 pages
Python Numpy Pandas Interview Questions
100% (1)
Python Numpy Pandas Interview Questions
8 pages
1.1 (Co1, Co2)
No ratings yet
1.1 (Co1, Co2)
25 pages
Python Lab Viva Que
No ratings yet
Python Lab Viva Que
15 pages
CDFH
No ratings yet
CDFH
18 pages
Machine Larning
No ratings yet
Machine Larning
4 pages
PWP QB1 Answers
No ratings yet
PWP QB1 Answers
9 pages
1.4 PWP 12345
No ratings yet
1.4 PWP 12345
28 pages
Python Programming Question Bank
No ratings yet
Python Programming Question Bank
50 pages
Bcse206l Fds Module-5 Smsatapathy
No ratings yet
Bcse206l Fds Module-5 Smsatapathy
74 pages
m6 Answer 2025
No ratings yet
m6 Answer 2025
10 pages
Python Papper
No ratings yet
Python Papper
43 pages
Imp Python
No ratings yet
Imp Python
4 pages
60 Python Interview Qs Every Data Analyst Must Know
No ratings yet
60 Python Interview Qs Every Data Analyst Must Know
11 pages
Python Interview Questions for Freshers
No ratings yet
Python Interview Questions for Freshers
6 pages
Python & Libraries Internship Report
No ratings yet
Python & Libraries Internship Report
9 pages
Python Theory Question and Answer 1
No ratings yet
Python Theory Question and Answer 1
15 pages
Automated Auth Quranic Verset Using Bert
No ratings yet
Automated Auth Quranic Verset Using Bert
8 pages
Qi Li Dissertation
No ratings yet
Qi Li Dissertation
116 pages
02jul2024 StaticMedia AI UNIT 2-CAPSTONE PROJECT NOTES 6759955093464609405
No ratings yet
02jul2024 StaticMedia AI UNIT 2-CAPSTONE PROJECT NOTES 6759955093464609405
6 pages
Forecasting: Categories of Forecasting Methods
No ratings yet
Forecasting: Categories of Forecasting Methods
1 page
CS8075 DWDM Unit 5
No ratings yet
CS8075 DWDM Unit 5
24 pages
DWDM Lab Manual Using Weka-For MIC
No ratings yet
DWDM Lab Manual Using Weka-For MIC
42 pages
Identification and Prediction of Chronic Diseases Using Machine
No ratings yet
Identification and Prediction of Chronic Diseases Using Machine
9 pages
Deep Learning for ADR Prediction
No ratings yet
Deep Learning for ADR Prediction
36 pages
Machine Learning (Important QS) - Young Researchers
No ratings yet
Machine Learning (Important QS) - Young Researchers
81 pages
Trend Surface Analysis
No ratings yet
Trend Surface Analysis
42 pages
DMW MCQ
No ratings yet
DMW MCQ
388 pages
Theses 16
No ratings yet
Theses 16
78 pages
Traffic Management System
100% (2)
Traffic Management System
42 pages
Tsumoto 1998
No ratings yet
Tsumoto 1998
18 pages
HW4 Solution
No ratings yet
HW4 Solution
13 pages
8759 23290 1 PB
No ratings yet
8759 23290 1 PB
8 pages
Uber Data Analysis
100% (4)
Uber Data Analysis
37 pages
Class Notes ML 1
No ratings yet
Class Notes ML 1
108 pages
Predicting Future TV Ratings with ML
No ratings yet
Predicting Future TV Ratings with ML
13 pages
An Effective Hybrid Learning System For
No ratings yet
An Effective Hybrid Learning System For
13 pages
Effective Model Validation Using Machine Learning
No ratings yet
Effective Model Validation Using Machine Learning
4 pages
MAE for Predicting Article Views
No ratings yet
MAE for Predicting Article Views
141 pages
Detecting Political Ideology in Youtube Comments Using Machine Learning
No ratings yet
Detecting Political Ideology in Youtube Comments Using Machine Learning
11 pages
Paper Dynamic Pricing KNN - Aaaa
No ratings yet
Paper Dynamic Pricing KNN - Aaaa
15 pages
Jntuk R20 ML MANUAL
100% (1)
Jntuk R20 ML MANUAL
53 pages
AI for Short-Term Water Demand Forecasting
No ratings yet
AI for Short-Term Water Demand Forecasting
22 pages
Big Data: New Tricks For Econometrics: Hal R. Varian
No ratings yet
Big Data: New Tricks For Econometrics: Hal R. Varian
55 pages
Classifying The Supervised Machine Learning and Comparing The Performances of The Algorithms
No ratings yet
Classifying The Supervised Machine Learning and Comparing The Performances of The Algorithms
17 pages
Revised PROOFREAD Thesis Document
No ratings yet
Revised PROOFREAD Thesis Document
74 pages
Approximate Statistical Tests For Comparing Supervised Classification Learning Algorithms
No ratings yet
Approximate Statistical Tests For Comparing Supervised Classification Learning Algorithms
30 pages

Python and Libraries For AI

Uploaded by

Python and Libraries For AI

Uploaded by

Python and Libraries for AI,ML & Data Science

Q1. What is Python and why is it popular in AI and Data Science?

Q2. What are the key data types in Python?

 Integers (int): Whole numbers (e.g., 5, -3).

 Floating-point numbers (float): Decimal numbers (e.g., 3.14).

 Strings (str): Text enclosed in quotes (e.g., "Hello").

 Booleans (bool): True or False values.

 Lists (list): Ordered, mutable collections (e.g., [1, 2, 3]).

 Tuples (tuple): Ordered, immutable collections (e.g., (1, 2, 3)).

 Dictionaries (dict): Key-value pairs (e.g., {'a':1, 'b':2}).

 Sets (set): Unordered collections of unique elements (e.g., {1, 2, 3}).

Q3. How do you write a function in Python?

return f"Hello, {name}!"

print(greet("Alice")) # Output: Hello, Alice!

Q4. What is a Python list comprehension?

squares = [x**2 for x in range(5)]

 append(element): Adds a single element to the end of the list.

lst.append(3) # lst becomes [1, 2, 3]

lst.extend([3, 4]) # lst becomes [1, 2, 3, 4]

Q6. How do you write an if-else statement in Python?

print("x is greater than 5")

Q7. What is a for loop in Python? Provide an example.

fruits = ['apple', 'banana', 'cherry']

for fruit in fruits:

print("Cannot divide by zero.")

Cannot divide by zero.

Q9. What is a dictionary in Python? How is it different from a list?

student = {'name': 'Alice', 'age': 25}

student_list = ['Alice', 25]

Q10. How do you iterate over key-value pairs in a dictionary?

student = {'name': 'Alice', 'age': 25}

for key, value in student.items():

o Mutable (can be changed).

o Defined with square brackets [].

o Immutable (cannot be changed).

o Defined with parentheses ().

4. Object-Oriented Programming (OOP)

Q12. What is a class in Python?

def __init__(self, name):

return f"{self.name} says woof!"

print(my_dog.bark()) # Output: Buddy says woof!

Q13. What is inheritance in Python?

print(my_dog.speak()) # Output: Woof!

Q14. What is the __init__ method in Python classes?

def __init__(self, name, age):

person = Person("Alice", 30)

print(person.name) # Output: Alice

5. Python Libraries for AI, ML, and Data Science

Q15. What is NumPy and why is it important?

Q16. What is pandas in Python?

Q17. How do you install a Python library, for example, pandas?

pip install pandas

Q18. What is Matplotlib?

Q19. What is scikit-learn?

 NumPy Arrays (ndarray):

o Structure: Multi-dimensional, homogeneous data (all elements must be

o Usage: Efficient numerical computations, mathematical operations.

o Structure: 2D, heterogeneous data (different data types in each column).

o Usage: Data manipulation, cleaning, analysis, and handling structured

np_array = np.array([[1, 2], [3, 4]])

print("NumPy Array:\n", np_array)

df = pd.DataFrame({'A': [1, 3], 'B': [2, 4]})

print("\nPandas DataFrame:\n", df)

6. Data Manipulation and Cleaning

Q21. How do you read a CSV file using pandas?

# Read CSV file

print(df.head()) # Display first 5 rows

Q22. How do you handle missing values in pandas?

 Filling Missing Values:

data = {'Name': ['Alice', 'Bob', 'Charlie'],

'Age': [25, 30, 35]}

# Filter rows where Age > 28

filtered_df = df[df['Age'] > 28]

Q24. How do you merge two pandas DataFrames?

df1 = pd.DataFrame({'ID': [1, 2, 3],

df2 = pd.DataFrame({'ID': [1, 2, 4],

def init(self, name):

Q14. What is the init method in Python classes?

def init(self, name, age):