SQL Basics and Indexing

Uploaded by

nthumar

📍 Section 1: The Structured Query Language (SQL)

1.1 Introduction to SQL

● SQL (Structured Query Language) is used to communicate with relational databases.

● Declarative language: you state what data you want, not how to get it.
● Example: SELECT name FROM students;

1.2 Basic SQL Operations

● SELECT, FROM, WHERE – retrieving data

● INSERT INTO students VALUES (1, 'John');
● UPDATE students SET name='Mike' WHERE id=1;
● DELETE FROM students WHERE id=1;

1.3 Filtering and Sorting

● WHERE age > 18

● ORDER BY name DESC
● LIMIT 5 OFFSET 10

1.4 Aggregation & Grouping

● SELECT COUNT(*) FROM students;

● GROUP BY class HAVING AVG(score) > 80
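
The aggregation fragments above can be tried end to end with Python's built-in sqlite3 module. This is only a sketch: the column layout of students and the sample rows are assumptions made up for illustration.

```python
import sqlite3

# In-memory database; the schema and rows are illustrative assumptions.
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE students (id INTEGER PRIMARY KEY, name TEXT, class TEXT, score REAL)"
)
conn.executemany(
    "INSERT INTO students (name, class, score) VALUES (?, ?, ?)",
    [("Asha", "A", 90), ("Ben", "A", 80), ("Chen", "B", 70), ("Dia", "B", 75)],
)

# COUNT(*) collapses the whole table into a single row.
total = conn.execute("SELECT COUNT(*) FROM students").fetchone()[0]

# GROUP BY forms one group per class; HAVING filters the groups, not the rows.
high_classes = conn.execute(
    "SELECT class, AVG(score) FROM students "
    "GROUP BY class HAVING AVG(score) > 80 ORDER BY class"
).fetchall()

print(total)         # 4
print(high_classes)  # [('A', 85.0)]
```

WHERE runs before grouping and HAVING runs after it, which is why the average can only be tested in HAVING.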

1.5 Joins in SQL

● INNER JOIN: only matching rows

● LEFT JOIN: all left + matched right
● Example:

SELECT s.name, m.marks FROM students s
JOIN marks m ON s.id = m.student_id;
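
To see the difference between the two join types, here is a small sketch using sqlite3. The table contents are assumptions: one student has a marks row and one does not.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE students (id INTEGER PRIMARY KEY, name TEXT);
CREATE TABLE marks (student_id INTEGER, marks INTEGER);
INSERT INTO students VALUES (1, 'John'), (2, 'Mike');
INSERT INTO marks VALUES (1, 88);  -- Mike has no marks row
""")

# INNER JOIN keeps only students that have a matching marks row.
inner_rows = conn.execute(
    "SELECT s.name, m.marks FROM students s "
    "JOIN marks m ON s.id = m.student_id ORDER BY s.id"
).fetchall()

# LEFT JOIN keeps every student; missing marks come back as NULL (None).
left_rows = conn.execute(
    "SELECT s.name, m.marks FROM students s "
    "LEFT JOIN marks m ON s.id = m.student_id ORDER BY s.id"
).fetchall()

print(inner_rows)  # [('John', 88)]
print(left_rows)   # [('John', 88), ('Mike', None)]
```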

1.6 Subqueries and Nested Queries

● Scalar:

SELECT name FROM students WHERE age = (SELECT MAX(age) FROM students);

● Correlated: the inner query references a column of the outer query, so it is logically re-evaluated for each outer row

1.7 Set Operations

● UNION, INTERSECT, EXCEPT – combine query results

1.8 Views and Indexes

● CREATE VIEW top_students AS SELECT * FROM students WHERE marks > 90;
● CREATE INDEX idx_name ON students(name);

1.9 Transactions in SQL

BEGIN;
UPDATE account SET balance = balance - 100 WHERE id = 1;
UPDATE account SET balance = balance + 100 WHERE id = 2;
COMMIT;
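
A runnable sketch of the same transfer with sqlite3 follows. The CHECK constraint, the starting balances and the transfer helper are assumptions added so that a failure can be provoked. The credit is issued before the debit here only so that the rollback visibly undoes an earlier statement.

```python
import sqlite3

# isolation_level=None: no implicit transactions, we issue BEGIN/COMMIT ourselves.
conn = sqlite3.connect(":memory:", isolation_level=None)
conn.execute(
    "CREATE TABLE account (id INTEGER PRIMARY KEY, "
    "balance INTEGER CHECK (balance >= 0))"  # assumed rule: no overdraft
)
conn.execute("INSERT INTO account VALUES (1, 150), (2, 50)")

def transfer(conn, src, dst, amount):
    """Move amount from src to dst; return True on commit, False on rollback."""
    conn.execute("BEGIN")
    try:
        conn.execute(
            "UPDATE account SET balance = balance + ? WHERE id = ?", (amount, dst)
        )
        conn.execute(
            "UPDATE account SET balance = balance - ? WHERE id = ?", (amount, src)
        )
    except sqlite3.IntegrityError:
        conn.execute("ROLLBACK")  # the credit above is undone as well
        return False
    conn.execute("COMMIT")
    return True

ok_first = transfer(conn, 1, 2, 100)   # succeeds
ok_second = transfer(conn, 1, 2, 100)  # debit would go negative, so rolled back
balances = conn.execute("SELECT id, balance FROM account ORDER BY id").fetchall()
print(ok_first, ok_second, balances)   # True False [(1, 50), (2, 150)]
```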

📍 Section 2: Storing and Indexing Data

2.1 Storage Basics

● Data in databases is stored in blocks called pages.

● Each page contains multiple rows (tuples).

● Pages are transferred between disk and main memory using a buffer manager.

Example:
Imagine a page as a folder with 100 paper forms (tuples). When the folder is full, a new one is
created.

2.2 Indexing Techniques

● Index = data structure for fast retrieval

● B+ Tree: Balanced tree used in most RDBMS

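○ Internal nodes store only separator keys that guide the search

○ Leaf nodes store the keys with pointers to the actual records, and are linked together for range scans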

● Example B+ Tree for student IDs:

[20 | 40]
/ | \
[5 10] [21 35] [42 50]

● Hash Index: Uses hash function on key; fast for equality lookups

○ Student ID 105 hashed to bucket 2
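
The two-level B+ tree drawn above can be searched with a few lines of Python. This is a simplified sketch, not a full B+ tree: the tree is hard-coded, nothing is inserted or split, and the function names are made up for illustration.

```python
from bisect import bisect_right

# The example tree: one root with separators 20 and 40, three leaves.
root_keys = [20, 40]
leaves = [[5, 10], [21, 35], [42, 50]]

def find_leaf(key):
    """Index of the leaf that may hold key: the number of separators <= key."""
    return bisect_right(root_keys, key)

def contains(key):
    """Point lookup: descend to one leaf, then search inside it."""
    return key in leaves[find_leaf(key)]

def range_scan(lo, hi):
    """Find the leaf for lo, then walk the leaves left to right until past hi."""
    result = []
    for leaf in leaves[find_leaf(lo):]:
        for k in leaf:
            if k > hi:
                return result
            if k >= lo:
                result.append(k)
    return result

print(contains(35))        # True
print(contains(20))        # False (20 is only a separator in this tree)
print(range_scan(10, 42))  # [10, 21, 35, 42]
```

The range scan is the reason B+ trees suit range queries, while a hash index only helps with equality lookups.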

2.3 Clustered vs. Non-clustered Index

● Clustered: Index decides the physical order of records

● Non-clustered: Stores pointers to actual data

Example:

● Clustered index on student_id: rows are sorted on student_id

● Non-clustered index on name: index holds name and points to rows

2.4 Index Maintenance

● Insert/Delete operations in B+ Trees may require rebalancing

● Hash collisions are handled using chaining or open addressing

2.5 Bitmap Indexes

● For categorical fields with few values

● Example: Gender Index

M: 1 0 1 0
F: 0 1 0 1
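
A minimal sketch of how such bitmaps are built and combined. The gender column matches the example above; the second bitmap (passed) is a hypothetical column added only to show a bitwise AND.

```python
def build_bitmaps(values):
    """One bit list per distinct value; bit i is 1 if row i holds that value."""
    bitmaps = {}
    for row, value in enumerate(values):
        bitmaps.setdefault(value, [0] * len(values))[row] = 1
    return bitmaps

gender = ["M", "F", "M", "F"]          # rows 0..3, as in the example
bitmaps = build_bitmaps(gender)

passed = [1, 1, 0, 1]                  # hypothetical second bitmap

# "gender = 'M' AND passed" becomes a bitwise AND of two bitmaps.
male_and_passed = [a & b for a, b in zip(bitmaps["M"], passed)]
matching_rows = [i for i, bit in enumerate(male_and_passed) if bit]

print(bitmaps["M"], bitmaps["F"])  # [1, 0, 1, 0] [0, 1, 0, 1]
print(matching_rows)               # [0]
```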

2.6 Compression and Storage Formats

● Row-based: All attributes of one record stored together

● Column-based: All values of one attribute stored together

● RLE (Run-Length Encoding):

○ Data: AAAAABBB → A5B3

○ Useful when same value repeats consecutively
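
Run-length encoding as described above, sketched in Python. The symbol-then-count output format follows the A5B3 example. The decoder assumes that symbols are never digits.

```python
import re
from itertools import groupby

def rle_encode(text):
    """Replace each run of equal characters with the character and run length."""
    return "".join(f"{ch}{len(list(run))}" for ch, run in groupby(text))

def rle_decode(encoded):
    """Inverse of rle_encode; assumes symbols are non-digit characters."""
    return "".join(ch * int(n) for ch, n in re.findall(r"(\D)(\d+)", encoded))

print(rle_encode("AAAAABBB"))  # A5B3
print(rle_decode("A5B3"))      # AAAAABBB
print(rle_encode("ABAB"))      # A1B1A1B1 (longer than the input: no runs)
```

The last line shows why RLE pays off mainly on sorted or low-cardinality columns, which is the usual case in column-based storage.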

2.7 File Organization

● Heap File: Unordered, best for inserts

● Sorted File: Sorted by key, good for range queries

● Hashed File: Good for equality searches

📍 Section 3: Relational Data Processing
3.1 Relational Algebra and Query Execution

● σ (selection): Filters rows — σ(age > 25)(Students)

● π (projection): Chooses columns — π(name, dept)(Students)

● ⨝ (join): Combines tables

Execution Plan: Logical query → Relational algebra → Physical plan → Execution
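
The σ and π operators can be mimicked on a list of dictionaries. This is an illustrative sketch with made-up rows. The function names are select_rows and project_rows to avoid clashing with Python built-ins.

```python
students = [  # sample relation, assumed for illustration
    {"name": "Asha", "age": 30, "dept": "CS"},
    {"name": "Ben", "age": 22, "dept": "EE"},
    {"name": "Chen", "age": 27, "dept": "CS"},
]

def select_rows(rows, predicate):
    """sigma: keep the rows that satisfy the predicate."""
    return [r for r in rows if predicate(r)]

def project_rows(rows, columns):
    """pi: keep only the given columns and drop duplicate results."""
    seen, out = set(), []
    for r in rows:
        key = tuple(r[c] for c in columns)
        if key not in seen:
            seen.add(key)
            out.append(dict(zip(columns, key)))
    return out

older = select_rows(students, lambda r: r["age"] > 25)  # sigma(age > 25)
depts = project_rows(older, ["dept"])                   # pi(dept)

print([r["name"] for r in older])  # ['Asha', 'Chen']
print(depts)                       # [{'dept': 'CS'}]
```

Projection removes duplicates because relational algebra works on sets. SQL keeps duplicates unless DISTINCT is used.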

3.2 Query Optimization

● Goal: minimize I/O and computation cost

● Use indexes, reduce intermediate data

● Join order matters: joining the smaller or more selective inputs first keeps intermediate results small

3.3 Join Algorithms

● Nested Loop Join:

for each row in R:
    for each row in S:
        if match: output

● Hash Join:

○ Build hash table on smaller input

○ Probe for matches in second input

● Sort-Merge Join:

○ Sort both inputs, then merge linearly
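
Hash join and sort-merge join can be sketched in memory as follows. Inputs are assumed to be lists of (key, value) pairs and the sample data is made up. A real engine works page by page and spills to disk when an input does not fit in memory.

```python
from collections import defaultdict

def hash_join(build, probe):
    """Build a hash table on the smaller input, then probe it with the other."""
    table = defaultdict(list)
    for key, value in build:
        table[key].append(value)
    return [(key, b, p) for key, p in probe for b in table.get(key, [])]

def sort_merge_join(left, right):
    """Sort both inputs on the key, then advance two cursors in step."""
    left = sorted(left, key=lambda t: t[0])
    right = sorted(right, key=lambda t: t[0])
    out, i, j = [], 0, 0
    while i < len(left) and j < len(right):
        lk, rk = left[i][0], right[j][0]
        if lk < rk:
            i += 1
        elif lk > rk:
            j += 1
        else:
            # Find the run of equal keys on each side and pair them up.
            i_end, j_end = i, j
            while i_end < len(left) and left[i_end][0] == lk:
                i_end += 1
            while j_end < len(right) and right[j_end][0] == lk:
                j_end += 1
            out += [(lk, lv, rv) for _, lv in left[i:i_end] for _, rv in right[j:j_end]]
            i, j = i_end, j_end
    return out

students = [(1, "John"), (2, "Mike"), (3, "Sara")]
marks = [(1, 88), (3, 95), (3, 70), (4, 60)]

print(hash_join(students, marks))
# [(1, 'John', 88), (3, 'Sara', 95), (3, 'Sara', 70)]
print(sort_merge_join(students, marks))
# [(1, 'John', 88), (3, 'Sara', 95), (3, 'Sara', 70)]
```

Both do far less work than the nested loop, which compares every pair of rows.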

3.4 Buffer Management

● Manages which pages are kept in memory

● Replacement Policy: LRU (Least Recently Used)

● Pinning: mark a page as in use so that it cannot be evicted from the buffer until it is unpinned
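
A toy buffer pool that combines LRU replacement with pinning. The class, its method names and the read_page callback are hypothetical. Real buffer managers also track dirty pages and write them back before eviction.

```python
from collections import OrderedDict

class BufferPool:
    def __init__(self, capacity, read_page):
        self.capacity = capacity
        self.read_page = read_page   # hypothetical "load page from disk" callback
        self.frames = OrderedDict()  # page_id -> [data, pin_count], LRU first

    def fetch(self, page_id):
        """Return the page, loading it if needed, and pin it."""
        if page_id in self.frames:
            self.frames.move_to_end(page_id)      # now most recently used
        else:
            if len(self.frames) >= self.capacity:
                self._evict()
            self.frames[page_id] = [self.read_page(page_id), 0]
        self.frames[page_id][1] += 1              # pin
        return self.frames[page_id][0]

    def unpin(self, page_id):
        self.frames[page_id][1] -= 1

    def _evict(self):
        """Drop the least recently used page that is not pinned."""
        victim = next(
            (pid for pid, (_, pins) in self.frames.items() if pins == 0), None
        )
        if victim is None:
            raise RuntimeError("all pages are pinned")
        del self.frames[victim]

pool = BufferPool(capacity=2, read_page=lambda pid: f"page-{pid}")
pool.fetch(1)              # page 1 stays pinned
pool.fetch(2)
pool.unpin(2)              # page 2 may now be evicted
pool.fetch(3)              # pool is full: evicts 2, because 1 is pinned
print(list(pool.frames))   # [1, 3]
```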

3.5 Parallel and Distributed Processing

● Partitioned Parallelism: Divide input data

● Pipelined Parallelism: Output of one operation fed into next

● Example: Big SQL query run across 4 servers, each handling a data slice
📍 Section 4: Transaction Processing
4.1 ACID Properties in Detail

● Atomicity: transaction is all-or-nothing

● Consistency: DB always moves from one valid state to another

● Isolation: concurrent transactions don’t interfere

● Durability: changes persist despite crashes

Example: when transferring ₹100 between two accounts, either both the debit and the credit happen or neither does.

4.2 Concurrency Control Techniques

● Two-Phase Locking (2PL):

○ Growing phase: locks acquired

○ Shrinking phase: locks released

● Prevents non-serializable execution

4.3 Isolation Levels (with anomalies)

Isolation Level     Dirty Read   Non-repeatable Read   Phantom Read

Read Uncommitted    ✅           ✅                    ✅

Read Committed      ❌           ✅                    ✅

Repeatable Read     ❌           ❌                    ✅

Serializable        ❌           ❌                    ❌

(✅ = the anomaly can occur, ❌ = the anomaly is prevented)

4.4 Write-Ahead Logging (WAL)

● The log record for a change must reach stable storage before the changed data page is written to disk

● REDO: for committed transactions

● UNDO: for uncommitted ones
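
The REDO/UNDO idea can be shown with a heavily simplified recovery routine. The log record layout (UPDATE with old and new values, COMMIT) and the crash scenario are assumptions. There are no LSNs or checkpoints, so this is not ARIES.

```python
def recover(disk, log):
    """Rebuild a consistent state from the on-disk data and the log."""
    committed = {rec[1] for rec in log if rec[0] == "COMMIT"}
    db = dict(disk)
    # REDO: replay updates of committed transactions in log order.
    for rec in log:
        if rec[0] == "UPDATE" and rec[1] in committed:
            _, _txn, key, _old, new = rec
            db[key] = new
    # UNDO: roll back updates of uncommitted transactions, newest first.
    for rec in reversed(log):
        if rec[0] == "UPDATE" and rec[1] not in committed:
            _, _txn, key, old, _new = rec
            db[key] = old
    return db

log = [
    ("UPDATE", "T1", "A", 100, 0),   # (type, txn, key, old value, new value)
    ("UPDATE", "T1", "B", 50, 150),
    ("COMMIT", "T1"),
    ("UPDATE", "T2", "A", 0, 70),    # T2 never committed
]
# Assumed state at the crash: T1's write to B never reached disk,
# while T2's uncommitted write to A did.
disk_at_crash = {"A": 70, "B": 50}

print(recover(disk_at_crash, log))   # {'A': 0, 'B': 150}
```

The old value in each record is what makes UNDO possible, and the new value is what makes REDO possible.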

4.5 Recovery Techniques

● ARIES (Analysis, Redo, Undo)

● Checkpoints reduce log scanning during recovery

4.6 Serializability

● Conflict Serializability: the schedule can be turned into a serial one by swapping adjacent non-conflicting operations

● Precedence Graph: Nodes = transactions; edge = dependency

○ Cycle ⇒ not serializable
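
The precedence-graph test can be sketched as follows. A schedule is assumed to be a list of (transaction, operation, item) triples with operation "R" or "W". The two sample schedules are made up.

```python
def precedence_edges(schedule):
    """Edge Ti -> Tj if an operation of Ti conflicts with a later one of Tj."""
    edges = set()
    for i, (t1, op1, x1) in enumerate(schedule):
        for t2, op2, x2 in schedule[i + 1:]:
            # Conflict: different transactions, same item, at least one write.
            if t1 != t2 and x1 == x2 and "W" in (op1, op2):
                edges.add((t1, t2))
    return edges

def has_cycle(edges):
    """Depth-first search; reaching a node still on the stack means a cycle."""
    graph = {}
    for a, b in edges:
        graph.setdefault(a, set()).add(b)
        graph.setdefault(b, set())
    state = {n: "new" for n in graph}

    def visit(n):
        state[n] = "active"
        for m in graph[n]:
            if state[m] == "active" or (state[m] == "new" and visit(m)):
                return True
        state[n] = "done"
        return False

    return any(state[n] == "new" and visit(n) for n in graph)

def is_conflict_serializable(schedule):
    return not has_cycle(precedence_edges(schedule))

good = [("T1", "R", "A"), ("T1", "W", "A"), ("T2", "R", "A"), ("T2", "W", "A")]
bad = [("T1", "R", "A"), ("T2", "W", "A"), ("T1", "W", "A")]

print(is_conflict_serializable(good))  # True  (only T1 -> T2)
print(is_conflict_serializable(bad))   # False (T1 -> T2 and T2 -> T1)
```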

📍 Section 5: Database Design
5.1 ER Modeling Example

● Entities: Student, Course

● Attributes: name, id, title

● Relationships: takes (Student ↔ Course)

5.2 Mapping to Relational Schema

● Each entity → table

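● Each many-to-many relationship → its own table with foreign keys to both entities (a one-to-many relationship can instead be a foreign key column on the "many" side)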

5.3 Normalization

● 1NF: atomic values

● 2NF: no partial dependency

● 3NF: no transitive dependency

● BCNF: stronger than 3NF; the left side of every non-trivial functional dependency must be a superkey

Example:

● Unnormalized:

Student(id, name, courses)

● 1NF:
Student(id, name), Enroll(student_id, course_id)

5.4 Denormalization

● Speeds up reads at cost of redundancy

● Example: merge student and enrollment tables

5.5 Schema Design Tips

● Use natural primary keys where possible

● Enforce constraints using CHECK, NOT NULL, UNIQUE

● Example: CHECK (salary > 0)
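
A quick sqlite3 sketch of the three constraints in action. The employee table, its columns and the try_insert helper are assumptions for illustration.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("""
    CREATE TABLE employee (
        id     INTEGER PRIMARY KEY,
        email  TEXT NOT NULL UNIQUE,
        salary REAL CHECK (salary > 0)
    )
""")

def try_insert(email, salary):
    """Return 'ok' if the row is accepted, 'rejected' if a constraint fails."""
    try:
        with conn:  # commits on success, rolls back on error
            conn.execute(
                "INSERT INTO employee (email, salary) VALUES (?, ?)", (email, salary)
            )
        return "ok"
    except sqlite3.IntegrityError:
        return "rejected"

results = [
    try_insert("a@example.com", 5000),  # valid row
    try_insert("a@example.com", 6000),  # violates UNIQUE
    try_insert("b@example.com", -1),    # violates CHECK (salary > 0)
    try_insert(None, 100),              # violates NOT NULL
]
print(results)  # ['ok', 'rejected', 'rejected', 'rejected']
```

Because the database enforces the rules, every application that writes to the table gets the same guarantees.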

📍 Section 6: Beyond Relational Data
6.1 NoSQL Models

● Key-Value: simple, fast (e.g., Redis)

● Document: nested data (e.g., MongoDB)

● Column-Family: large-scale writes (e.g., Cassandra)

● Graph: relationships (e.g., Neo4j)

6.2 CAP Theorem

● A distributed system can guarantee at most 2 of these 3 at the same time:

○ Consistency: all nodes same data

○ Availability: always responds

○ Partition Tolerance: survives network splits

6.3 Use Case Mapping

Model            Best Use Case

Key-Value        Caching sessions

Document         Flexible schemas

Column-Family    Analytics on time-series

Graph            Social networks, fraud detection

6.4 NoSQL Query Examples

// MongoDB
{ "age": { "$gt": 25 } }

// Neo4j
MATCH (a)-[:FRIEND]->(b) RETURN b.name;

6.5 Scaling

● Horizontal: add servers (scale out)

● Vertical: upgrade server (scale up)

● Sharding: split by key ranges (user_id < 1000 → node1)
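
Range sharding as in the example above, with hash sharding shown next to it for comparison. The boundary values, node names and node count are assumptions. Hash sharding is not mentioned in the bullet and is included only as a common alternative.

```python
import hashlib

def range_shard(user_id, boundaries=(1000, 2000)):
    """user_id < 1000 -> node1, < 2000 -> node2, everything else -> node3."""
    for i, upper in enumerate(boundaries, start=1):
        if user_id < upper:
            return f"node{i}"
    return f"node{len(boundaries) + 1}"

def hash_shard(user_id, num_nodes=3):
    """Spread keys evenly by hashing; neighbouring ids land on unrelated nodes."""
    digest = hashlib.sha256(str(user_id).encode()).digest()
    return f"node{int.from_bytes(digest[:8], 'big') % num_nodes + 1}"

print(range_shard(999))   # node1
print(range_shard(1000))  # node2
print(range_shard(5000))  # node3
print(hash_shard(42) == hash_shard(42))  # True (same key, same node)
```

Range sharding keeps neighbouring keys together, which helps range queries but can create hot spots. Hashing spreads the load evenly but scatters ranges across nodes.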

6.6 Consistency Models

● Strong Consistency: read = latest write (RDBMS)

● Eventual Consistency: updates converge over time (e.g., DynamoDB's default reads; Amazon S3 has offered strong read-after-write consistency since late 2020)

● Causal Consistency: preserve cause-effect order (some NoSQLs)
