0% found this document useful (0 votes)

1K views74 pages

2025 EDITION: The Illustrated Guidebook

The document introduces the Model Context Protocol (MCP), a standardized framework that facilitates seamless interaction between AI models and external tools, resources, and environments. It outlines the architecture of MCP, including the roles of Host, Client, and Server, and discusses the benefits of using MCP to simplify integrations and enhance AI capabilities. Additionally, the document provides an overview of various projects and tools that leverage MCP for improved functionality in data science applications.

Uploaded by

khansamaira395

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

1K views74 pages

2025 EDITION: The Illustrated Guidebook

Uploaded by

khansamaira395

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 74

FR

2025 EDITION EE

MCP THE ILLUSTRATED

GUIDEBOOK

Daily Dose of Avi Chawla & Akshay Pachaar

Data Science DailyDoseofDS.com
DailyDoseofDS.com

How to make the most out of

this book and your time?
The reading time of this book is about 3 hours. But not all chapters will be of
relevance to you. This 2-minute assessment will test your current expertise and
recommend chapters that will be most useful to you.

Scan the QR code below or open this link to start the assessment. It will only take
2 minutes to complete.

https://bit.ly/mcp-assessment

1
DailyDoseofDS.com

Table of contents

Section #1) Model Context Protocol…………….3

1.1) What is MCP?..................................................................................4-5
Introduction……………………..……………………………………………………………………………4-5

1.2) Why was MCP created?...............................................................6-8

The problem…………………………………………………………...………………………………………6-7
The solution……………………………………………………………………………………………………7-8
1.3) MCP Architecture Overview........................................................9-11
Host…………………………………………………………………………………………………………………….9
Client…………………………………………………………………………………………………………………10
Server…………………………………………………………………………………………………………………11
1.4) Tools, Resources and Prompts.................................................12-18
Tools………………………………………………………………………………………………………………....12
Resources………………………………………………………………………………………………………….14
Prompts……………………………………………………………………………………………………………..15

Section #2) MCP Projects……..……………………..19

2.1) 100% local MCP client………………………………………………………………………………20
2.2) MCP-powered Agentic RAG…………………………………………………………………….25
2.3) MCP-powered Financial Analyst…………………………………………………………….29
2.4) MCP-powered Voice Agent………………………………………………………………………34
2.5) A uniﬁed MCP server……………………………………………………………………...………..39
2.6) MCP-powered shared memory for Claude Desktop and Cursor……………43
2.7) MCP-powered RAG over complex docs…………………………………………………..47
2.8) MCP-powered Synthetic Data Generator…………………………………..…………..51
2.9) MCP-powered Deep Researcher…………………...………………………………………….57
2.10) MCP RAG over videos………………………...…………………………………………………..63
2.11) MCP-powered Audio Analysis Toolkit…………………………………………………….69

2
DailyDoseofDS.com

Model Context
Protocol
(MCP)

3
DailyDoseofDS.com

What is MCP?
Imagine you only know English. To get info from a person who only knows:

● French, you must learn French.

● German, you must learn German.
● And so on.

In this setup, learning even 5 languages will be a nightmare for you.

But what if you add a translator that understands all languages?

4
DailyDoseofDS.com

This is simple, isn't it?

The translator is like an MCP!

It lets you (Agents) talk to other people (tools or other capabilities) through a
single interface.

To formalize, while LLMs possess impressive knowledge and reasoning skills,

which allow them to perform many complex tasks, their knowledge is limited to
their initial training data.

If they need to access real-time information, they must use external tools and
resources on their own.

Model context protocol (MCP) is a standardized interface and framework that

allows AI models to seamlessly interact with external tools, resources, and
environments.

MCP acts as a universal connector for AI systems to capabilities (tools, etc.),

similar to how USB-C standardizes connections between electronic devices.

5
DailyDoseofDS.com

Why was MCP created?

Without MCP, adding a new tool or integrating a new model was a headache.

If you had three AI applications and three external tools, you might end up
writing nine diﬀerent integration modules (each AI x each tool) because there
was no common standard. This doesn’t scale.

Developers of AI apps were essentially reinventing the wheel each time, and tool
providers had to support multiple incompatible APIs to reach diﬀerent AI
platforms.

Let’s understand this in detail.

6
DailyDoseofDS.com

The problem
Before MCP, the landscape of connecting AI to external data and actions looked
like a patchwork of one-oﬀ solutions.

Either you hard-coded logic for each tool, managed prompt chains that were not
robust, or you used vendor-speciﬁc plugin frameworks.

This led to the infamous M×N integration problem.

Essentially, if you have M diﬀerent AI applications and N diﬀerent tools/data

sources, you could end up needing M × N custom integrations.

The diagram below illustrates this complexity: each AI (each “Model”) might
require unique code to connect to each external service (database, ﬁlesystem,
calculator, etc.), leading to spaghetti-like interconnections.

The solution
MCP tackles this by introducing a standard interface in the middle. Instead of M
× N direct integrations, we get M + N implementations: each of the M AI

7
DailyDoseofDS.com

applications implements the MCP client side once, and each of the N tools
implements an MCP server once.

Now everyone speaks the same “language”, so to speak, and a new pairing doesn’t
require custom code since they already understand each other via MCP.

The following diagram illustrates this shift.

● On the left (pre-MCP), every model had to wire into every tool.
● On the right (with MCP), each model and tool connects to the MCP layer,
drastically simplifying connections. You can also relate this to the
translator example we discussed earlier.

8
DailyDoseofDS.com

MCP Architecture Overview

At its heart, MCP follows a client-server architecture (much like the web or other
network protocols).

However, the terminology is tailored to the AI context. There are three main
roles to understand: the Host, the Client, and the Server.

Host
The Host is the user-facing AI application, the environment where the AI model
lives and interacts with the user.

This could be a chat application (like OpenAI’s ChatGPT interface or Anthropic’s

Claude desktop app), an AI-enhanced IDE (like Cursor), or any custom app that
embeds an AI assistant like Chainlit.

Host is the one that initiates connections to the available MCP servers when the
system needs them. It captures the user's input, keeps the conversation history,
and displays the model’s replies.

9
DailyDoseofDS.com

Client
The MCP Client is a component within the Host that handles the low-level
communication with an MCP Server.

Think of the Client as the adapter or messenger. While the Host decides what to
do, the Client knows how to speak MCP to actually carry out those instructions
with the server.

10
DailyDoseofDS.com

Server
The MCP Server is the external program or service that actually provides the
capabilities (tools, data, etc.) to the application.

An MCP Server can be thought of as a wrapper around some functionality, which

exposes a set of actions or resources in a standardized way so that any MCP
Client can invoke them.

Servers can run locally on the same machine as the Host or remotely on some
cloud service since MCP is designed to support both scenarios seamlessly. The
key is that the Server advertises what it can do in a standard format (so the client
can query and understand available tools) and will execute requests coming from
the client, then return results.

11
DailyDoseofDS.com

Tools, Resources and Prompts

Tools, prompts and resources form the three core capabilities of the MCP
framework. Capabilities are essentially the features or functions that the server
makes available.
● Tools: Executable actions or functions that the AI (host/client) can invoke
(often with side effects or external API calls).
● Resources: Read-only data sources that the AI (host/client) can query for
information (no side effects, just retrieval).
● Prompts: Predefined prompt templates or workflows that the server can
supply.

Tools
Tools are what they sound like: functions that do something on behalf of the AI
model. These are typically operations that can have eﬀects or require
computation beyond the AI’s own capabilities.

Importantly, Tools are usually triggered by the AI model’s choice, which means
the LLM (via the host) decides to call a tool when it determines it needs that
functionality.

Suppose we have a simple tool for weather. In an MCP server’s code, it might
look like:

12
DailyDoseofDS.com

This Python function, registered with @mcp.tool(), can be invoked by the AI via
MCP.

When the AI calls tools/call with name "get_weather" and {"location": "San
Francisco"} as arguments, the server will execute get_weather("San Francisco")
and return the dictionary result.

The client will get that JSON result and make it available to the AI. Notice the
tool returns structured data (temperature, conditions), and the AI can then use or
verbalize (generate a response) that info.

Since tools can do things like ﬁle I/O or network calls, an MCP implementation
often requires that the user permit a tool call.

13
DailyDoseofDS.com

For example, Claude’s client might pop up “The AI wants to use the ‘get_weather’
tool, allow yes/no?” the ﬁrst time, to avoid abuse. This ensures the human stays in
control of powerful actions.

Tools are analogous to “functions” in classic function calling, but under MCP,
they are used in a more ﬂexible, dynamic context. They are model-controlled but
developer/governance-approved in execution.

Resources
Resources provide read-only data to the AI model.

These are like databases or knowledge bases that the AI can query to get
information, but not modify.

Unlike tools, resources typically do not involve heavy computation or side eﬀects,
since they are often just information lookup.

Another key diﬀerence is that resources are usually accessed under the host
application’s control (not spontaneously by the model). In practice, this might
mean the Host knows when to fetch a certain context for the model.

14
DailyDoseofDS.com

For instance, if a user says, “Use the company handbook to answer my question,”
the Host might call a resource that retrieves relevant handbook sections and
feeds them to the model.

Resources could include a local ﬁle’s contents, a snippet from a knowledge base
or documentation, a database query result (read-only), or any static data like
conﬁguration info.

Essentially anything the AI might need to know as context. An AI research

assistant could have resources like “ArXiv papers database,” where it can retrieve
an abstract or reference when asked.

A simple resource could be a function to read a ﬁle:

Here we use a decorator @mcp.resource("ﬁle://{path}") which might indicate a

template for resource URIs.

The AI (or Host) could ask the server for resources.get with a URI like
ﬁle://home/user/notes.txt, and the server would
callread_ﬁle("/home/user/notes.txt") and return the text.

Notice that resources are usually identiﬁed by some identiﬁer (like a URI or
name) rather than being free-form functions.

15
DailyDoseofDS.com

They are also often application-controlled, meaning the app decides when to
retrieve them (to avoid the model just reading everything arbitrarily).

From a safety standpoint, since resources are read-only, they are less dangerous,
but still, one must consider privacy and permissions (the AI shouldn’t read ﬁles
it’s not supposed to).

The Host can regulate which resource URIs it allows the AI to access, or the
server might restrict access to certain data.

In summary, Resources give the AI knowledge without handing over the keys to
change anything.

They’re the MCP equivalent of giving the model reference material when needed,
which acts like a smarter, on-demand retrieval system integrated through the
protocol.

Prompts
Prompts in the MCP context are a special concept: they are predeﬁned prompt
templates or conversation ﬂows that can be injected to guide the AI’s behavior.

Essentially, a Prompt capability provides a canned set of instructions or an

example dialogue that can help steer the model for certain tasks.

But why have prompts as a capability?

Think of recurring patterns: e.g., a prompt that sets up the system role as “You
are a code reviewer,” and the user’s code is inserted for analysis.

Rather than hardcoding that in the host application, the MCP server can supply
it.

Prompts can also represent multi-turn workﬂows.

For instance, a prompt might deﬁne how to conduct a step-by-step diagnostic

interview with a user. By exposing this via MCP, any client can retrieve and use

16
DailyDoseofDS.com

these sophisticated prompts on demand.

As far as control is concerned, Prompts are usually user-controlled or

developer-controlled.

The user might pick a prompt/template from a UI (e.g., “Summarize this

document” template), which the host then fetches from the server.

The model doesn’t spontaneously decide to use prompts the way it does tools.

Rather, the prompt sets the stage before the model starts generating. In that
sense, prompts are often fetched at the beginning of an interaction or when the
user chooses a speciﬁc “mode”.

Suppose we have a prompt template for code review. The MCP server might have:

This prompt function returns a list of message objects (in OpenAI format) that
set up a code review scenario.

When the host invokes this prompt, it gets those messages and can insert the
actual code to be reviewed into the user content.

Then it provides these messages to the model before the model’s own answer.
Essentially, the server is helping to structure the conversation.

While we have personally not seen much applicability of this yet, common use
cases for prompt capabilities include things like “brainstorming guide,”
“step-by-step problem solver template,” or domain-speciﬁc system roles.

17
DailyDoseofDS.com

By having them on the server, they can be updated or improved without changing
the client app, and different servers can offer different specialized prompts.

An important point to note here is that prompts, as a capability, blur the line
between data and instructions.

They represent best practices or predeﬁned strategies for the AI to use.

In a way, MCP prompts are similar to how ChatGPT plugins can suggest how to
format a query, but here it’s standardized and discoverable via the protocol.

18
DailyDoseofDS.com

MCP Projects

19
DailyDoseofDS.com

#1) 100% local MCP Client

An MCP client is a component in an AI app (like Cursor) that establishes
connections to external tools. Learn how to build it 100% locally.

Tech stack:

● Llamaindex to build the MCP-powered Agent

● Ollama to locally serve Deepseek-R1.
● LightningAI for development and hosting

Workﬂow:

● User submits a query.

● Agent connects to the MCP server to discover tools.
● Based on the query, agent invokes the right tool and get context
● Agent returns a context-aware response.

20
DailyDoseofDS.com

Let’s implement this!

#1) Build an SQLite MCP Server

For this demo, we've built a simple SQLite server with two tools:

● add data
● fetch data

This is done to keep things simple, but the client we're building can connect to
any MCP server out there.

#2) Set Up LLM

We'll use a locally served Deepseek-R1 via Ollama as the LLM for our
MCP-powered agent.

21
DailyDoseofDS.com

#3) Deﬁne system prompt

We deﬁne our agent’s guiding instructions to use tools before answering user
queries.

Feel free to tweak this on a need basis.

#4) Deﬁne the Agent

We deﬁne a function that builds a typical LlamaIndex agent with its appropriate
arguments.

The tools passed to the agent are MCP tools, which llama_index wraps as native
tools that can be easily used by our FunctionAgent.

22
DailyDoseofDS.com

#5) Deﬁne Agent Interaction

We pass user messages to our FunctionAgent with a shared Context for memory,
stream tool calls and return its reply. We manage all the chat history and tool
calls here.

#6) Initialize MCP Client and the Agent

Launch the MCP client, load its tools, and wrap them as native tools for
function-calling agents in LlamaIndex. Then, pass these tools to the agents and
add the context manager.

23
DailyDoseofDS.com

#7) Run the Agent:

Finally, we start interacting with our agent and get access to the tools from our
SQLite MCP server.

The code is available here:

https://www.dailydoseofds.com/p/bui
lding-a-100-local-mcp-client/

24
DailyDoseofDS.com

#2) MCP-powered Agentic RAG

Learn how to create an MCP-powered Agentic RAG that searches a vector
database and falls back to web search if needed.

Tech stack:

● Bright Data to scrape the web at scale.

● Qdrant as the vector DB.
● Cursor as the MCP client.

Workﬂow:

● The user inputs a query through the MCP client (Cursor).

● The client contacts the MCP server to select a relevant tool.
● The tool output is returned to the client to generate a response.

Let’s implement this!

#1) Launch an MCP server

25
DailyDoseofDS.com

First, we deﬁne an MCP server with the host URL and port.

#2) Vector DB MCP tool

A tool exposed through an MCP server has two requirements:

● It must be decorated with the "tool" decorator.

● It must have a clear docstring.

Below, we have an MCP tool to query a vector DB. It stores ML-related FAQs.

26
DailyDoseofDS.com

#3) Web search MCP tool

If query is unrelated to ML, we resort to web search using Bright Data's SERP
API to scrape data at scale across several sources to get relevant context.

#4) Integrate MCP server with Cursor

Go to Settings → MCP → Add new global MCP server. In the JSON ﬁle, add
what's shown below

27
DailyDoseofDS.com

Done!

Your local MCP server is live and connected to Cursor. It has two MCP tools:

● Bright Data web search tool to scrape data at scale.

● Vector DB search tool to query the relevant documents.

Next, we interact with the MCP server.

● When we ask an ML-related query, it invokes the vector DB tool.

● But when we ask a general query, it invokes the Bright Data web search
tool to gather web data at scale from various sources.

That's Agentic behavior!

The code is available here:

https://www.dailydoseofds.com/p/mc
p-powered-agentic-rag/

28
DailyDoseofDS.com

#3) MCP-powered Financial Analyst

Build an MCP-powered AI agent that fetches, analyzes & generates insights on
stock market trends, right from Cursor or Claude Desktop.

Tech stack:

● CrewAI for multi-agent orchestration

● Ollama to locally serve DeepSeek-R1 LLM
● Cursor as the MCP host

Workﬂow:

● User submits a query.

● The MCP agent kicks oﬀ the ﬁnancial analyst crew.
● The crew conducts research and creates an executable script.
● The agent runs the script to generate an analysis plot.

29
DailyDoseofDS.com

#1) Setup LLM

We will use Deepseek-R1 as the LLM, served locally using Ollama.

Let's setup the Crew now

#2) Query Parser Agent

This agent accepts a natural language query and extracts structured output using
Pydantic. This guarantees clean and structured inputs for further processing!

30
DailyDoseofDS.com

#3) Code Writer Agent

This agent writes Python code to visualize stock data using Pandas, Matplotlib,
and Yahoo Finance libraries.

#4) Code Executor Agent

This agent reviews and executes the generated Python code for stock data
visualization.

It uses the code interpreter tool by CrewAI to execute the code in a secure
sandbox environment.

31
DailyDoseofDS.com

#5) Setup Crew and Kickoﬀ

We set up and kick oﬀ our ﬁnancial analysis crew to get the result shown below!

#6) Create MCP Server

Now, we encapsulate our ﬁnancial analyst within an MCP tool and add two more
tools to enhance the user experience.

● save_code -> Saves generated code to local directory

● run_code_and_show_plot -> Executes the code and generates a plot

32
DailyDoseofDS.com

#7) Integrate MCP server with Cursor

Go to: File → Preferences → Cursor Settings → MCP → Add new global MCP
server. In the JSON ﬁle, add what's shown below

Done! Our ﬁnancial analyst MCP server is live and connected to Cursor.

The code is available here:

https://www.dailydoseofds.com/p/hands-o
n-building-an-mcp-powered-ﬁnancial-an
alyst/

33
DailyDoseofDS.com

#4) MCP-powered Voice Agent

This project teaches you how to build an MCP-driven voice Agent that queries a
database and falls back to web search if needed.

Tech Stack

● AssemblyAI for Speech‐to‐Text.

● Firecrawl for web search.
● Supabase for a database.
● Livekit for orchestration.
● Qwen3 as the LLM.

Workﬂow:

● User's speech query is transcribed to text with AssemblyAI.

● Agent discovers DB & web tools.
● LLM invokes the right tool, fetches data & generates a response.
● The app delivers the response via text-to-speech.

34
DailyDoseofDS.com

Let’s implement this!

#1) Initialize Firecrawl & Supabase

We instantiate Firecrawl to enable web searches and start our MCP server to
expose Supabase tools to our Agent.

#2) Deﬁne web search tool

We fetch live web search results using Firecrawl search endpoint. This gives our
agent up-to-date online information.

35
DailyDoseofDS.com

#3) Get Supabase MCP Tools

We list our Supabase tools via the MCP server and wrap each of them as LiveKit
tools for our Agent.

36
DailyDoseofDS.com

#4) Build the Agent

We set up our Agent with instructions on how to handle user queries. We also
give it access to the Firecrawl web search and Supabase tools deﬁned earlier.

#5) Conﬁgure Speech-to-Response ﬂow

● We transcribe user speech with AssemblyAI Speech-to-Text.

● Qwen 3 LLM, served locally with Ollama, invokes the right tool.
● A voice output is generated via TTS.

37
DailyDoseofDS.com

#6) Launch the Agent

We connect to LiveKit and start our session with a greeting. Then continuously
listen and respond until the user stops.

Done!

Our MCP-powered Voice Agent is ready.

● If the query is related to a database, it queries Supabase via MCP tools.

● Otherwise, it performs a web search via Firecrawl.

The code is available here:

https://www.dailydoseofds.com/p/an
-mcp-powered-voice-agent/

38
DailyDoseofDS.com

39
DailyDoseofDS.com

#5) A Uniﬁed MCP server

This project builds an MCP server to query and chat with over 200+ data sources
using natural language through a uniﬁed interface powered by MindsDB and
Cursor IDE.

Tech stack

● MindsDB to power our uniﬁed MCP server

● Cursor as the MCP host
● Docker to self-host the server

Workﬂow

● User submits a query

● Agent connects to the MindsDB MCP server to ﬁnd tools
● Selects the appropriate tool based on the user query and calls it
● Finally, returns a contextually relevant response

Let’s implement this!

40
DailyDoseofDS.com

#1) Docker Setup

MindsDB provides Docker images that can be run in Docker containers.

Install MindsDB locally using the Docker image by running the command in your
terminal.

#2) Start MindsDB GUI

After installing the Docker image, go to 127.0.0.1:47334 in your browser to access

the MindsDB editor.

Through this interface, you can connect to over 200 data sources and run SQL
queries against them.

#3) Integrate Data Sources

Let's start building our federated query engine by connecting our data sources to
MindsDB.

We use Slack, Gmail, GitHub and Hacker News as our federated data sources.

41
DailyDoseofDS.com

#4) Integrate MCP Server with Cursor

After building the federated query engine, let's unify our data sources by
connecting them to MindsDB's MCP server.

Go to: File → Preferences → Cursor Settings → MCP → Add new global MCP
server. In the JSON ﬁle, add the following

42
DailyDoseofDS.com

Done! Our MindsDB MCP server is live and connected to Cursor!

The MCP server oﬀers two tools:

● list_databases: Lists all data sources connected to MindsDB.

● query: Answers user queries on the federated data.

Apart from Claude and Cursor, MindsDB MCP server also works with the new
OpenAI MCP integration.

The code is available here:

https://www.dailydoseofds.com/p/buil
d-an-mcp-server-to-connect-to-200-d
ata-sources/

43
DailyDoseofDS.com

#6) MCP-powered shared memory for Claude

Desktop and Cursor
Devs use Claude Desktop and Cursor independently with no context sharing.
Learn how to add a common memory layer to cross-operate without losing
context.

Tech Stack

● Zep’s Graphiti MCP as a memory layer for AI Agents.

● Cursor and Claude as the MCP hosts.

Workﬂow

● User submits a query to Cursor & Claude.

● Facts/Info are stored in a common memory layer using Graphiti MCP.
● Memory is queried if context is required in any interaction.
● Graphiti shares memory across multiple hosts.

44
DailyDoseofDS.com

#1) Docker Setup

Deploy the Graphiti MCP server using Docker Compose. This setup starts the
MCP server with Server-Sent Events (SSE) transport.

The Docker setup above includes a Neo4j container, which launches the database
as a local instance.

This conﬁguration lets you query and visualize the knowledge graph using the
Neo4j browser preview.

45
DailyDoseofDS.com

#2) Connect MCP server to Cursor

With tools and our server ready, let's integrate it with our Cursor IDE!

Go to: File → Preferences → Cursor Settings → MCP → Add new global MCP
server. In the JSON ﬁle, add what's shown below

#3) Connect MCP server with Claude

Go to File → Settings → Developer → Edit Conﬁg, add what's shown below

Done!

46
DailyDoseofDS.com

Our Graphiti MCP server is live and connected to Cursor & Claude!

Now you can chat with Claude Desktop, share facts/info, store the response in
memory, and retrieve them from Cursor, and vice versa.

This way, you can pipe Claude’s insights straight into Cursor, all via a single
MCP.

The code is available here:

https://www.dailydoseofds.com/p/build-a
-shared-memory-for-claude-desktop-and
-cursor/

47
DailyDoseofDS.com

#7) MCP-powered RAG over complex docs

Learn how to use MCP to power an RAG app over complex documents with
tables, charts, images, complex layouts, and whatnot.

Tech Stack

● Cursor as the MCP client

● EyelevelAI's GroundX to build an MCP server that can process complex
docs

Workﬂow

● User interacts with the MCP client (Cursor IDE)

● Client connects to the MCP server and selects a tool.
● Tools leverage GroundX to do an advanced search over docs
● Search results are used by Client to generate response

48
DailyDoseofDS.com

Let’s implement this!

#1) Setup server

First we setup a local MCP server, using FastMCP and provide it a name

#2) Create GroundX Client

GroundX oﬀers capabilities document search and retrieval capabilities for

complex real-world documents.

Here's how to set up a client:

49
DailyDoseofDS.com

#3) Create Ingestion tool

This tool is used to ingest new documents into the knowledge base. User just
needs to provide a path to the document to be ingested:

#4) Create Search tool

This tool leverages GroundX's advanced capabilities to do search and retrieval

from complex real world documents. Here's how to implement it:

50
DailyDoseofDS.com

#5) Start the server

Starts an MCP server using stdio as the transport mechanism:

#6) Connect to Cursor

Inside you Cursor IDE follow this: Cursor → Settings → Cursor Settings → MCP
Then add and start your server like this:

The code is available here:

https://www.dailydoseofds.com/p/mcp-
powered-rag-over-complex-docs/

51
DailyDoseofDS.com

#8) MCP-powered synthetic data generator

Learn how to build an MCP server that can generate any type of synthetic
dataset. It uses Cursor as the MCP host and SDV to generate realistic tabular
synthetic data.

Tech Stack

● Cursor as the MCP host

● Datacebo's SDV to generate realistic tabular synthetic data

Workﬂow

● User submits a query

● Agent connects to MCP server to ﬁnd tools
● Agent uses appropriate tool based on query
● Returns response on synthetic data creation, eval, or visualization

52
DailyDoseofDS.com

Here’s an overview of our MCP server, which includes three tools:

● SDV Generate
● SDV Evaluate
● SDV Visualise

We have kept the actual implementation of these tools using the SDV SDK in a
separate ﬁle, tools[.]py, that is imported here.

Now let's look at each tool in more details.

53
DailyDoseofDS.com

#1) SDV Generate Tool

This tool creates synthetic data from real data using the SDV Synthesizer.

SDV oﬀers a variety of synthesizers, each utilizing diﬀerent algorithms to

produce synthetic data.

#2) SDV Evaluate Tool

This tool evaluates the quality of synthetic data in comparison to real data.

We will assess statistical similarity to determine which real data patterns are
captured by the synthetic data.

54
DailyDoseofDS.com

#3) SDV Visualize Tool

This tool generates a visualization to compare real and synthetic data for a
speciﬁc column.

Use this function to visualize a real column alongside its corresponding synthetic
column.

55
DailyDoseofDS.com

With tools and server ready, lets integrate it with our Cursor IDE! Go to: File →
Preferences → Cursor Settings → MCP → Add new global MCP server. In the
JSON ﬁle, add what's shown below

Done! Your synthetic data generator MCP server is live and connected to Cursor.

56
DailyDoseofDS.com

The code is available here:

https://www.dailydoseofds.com/p/hands-on
-mcp-powered-synthetic-data-generator/

57
DailyDoseofDS.com

#9) MCP-powered deep researcher

ChatGPT has a deep research feature. It helps you get detailed insights on any
topic. Learn how you can build a 100% local alternative to it.

Tech Stack

● Linkup platform for deep web research

● CrewAI for multi-agent orchestration
● Ollama to locally serve DeepSeek
● Cursor as MCP host

Workﬂow

● User submits a query

● Web search agent runs deep web search via Linkup
● Research analyst veriﬁes and deduplicates results
● Technical writer crafts a coherent response with citations

58
DailyDoseofDS.com

#1) Setup LLM

We'll use a locally served DeepSeek-R1 using Ollama.

#2) Deﬁne Web Search Tool

We'll use Linkup platform's powerful search capabilities, which rival Perplexity
and OpenAI, to power our web search agent. This is done by deﬁning a custom
tool that our agent can use.

59
DailyDoseofDS.com

#3) Deﬁne Web Search Agent

The web search agent gathers up-to-date information from the internet based on
user query. The linkup tool we deﬁned earlier is used by this agent.

#4) Deﬁne Research Analyst Agent

This agent transforms raw web search results into structured insights, with
source URLs. It can also delegate tasks back to the web search agent for
veriﬁcation and fact-checking.

60
DailyDoseofDS.com

#5) Deﬁne Technical Writer Agent

It takes the analyzed and veriﬁed results from the analyst agent and drafts a
coherent response with citations for the end user.

#6) Setup Crew

Finally, once we have all the agents and tools deﬁned we set up and kickoﬀ our
deep researcher crew.

61
DailyDoseofDS.com

#7) Create MCP Server

Now, we'll encapsulate our deep research team within an MCP tool. With just a
few lines of code, our MCP server will be ready.

Let's see how to connect it with Cursor.

#8) Integrate MCP server with Cursor

Go to: File → Preferences → Cursor Settings → MCP → Add new global MCP
server

In the JSON ﬁle, add what's shown below

62
DailyDoseofDS.com

Done! Your deep research MCP server is live and connected to Cursor.

The code is available here:

https://www.dailydoseofds.com/p/hands-
on-mcp-powered-deep-researcher/

63
DailyDoseofDS.com

#10) MCP-powered RAG over videos

We have an MCP-driven video RAG that ingests a video and lets you chat with it.
It also fetches the exact video chunk where an event occurred.

Tech Stack

● RagieAI for video ingestion and retrieval.

● Cursor as the MCP host.

Workﬂow

● User speciﬁes video ﬁles and a query.

● An Ingestion tool indexes the videos in Ragie.
● A Query tool retrieves info from Ragie Index with citations.
● Show-video tool returns the video chunk that answers the query

64
DailyDoseofDS.com

Let’s implement this!

#1) Ingest data

We implement a method to ingest video ﬁles into the Ragie index.

We also specify the audio-video mode to load both audio and video channels
during ingestion.

#2) Retrieve data

We retrieve the relevant chunks from the video based on the user query.

65
DailyDoseofDS.com

Each chunk has a start time, an end time, and a few more details that correspond
to the video segment.

#3) Create MCP Server

We integrate our RAG pipeline into an MCP server with 3 tools:

● ingest_data_tool: Ingests data into Ragie index

● retrieve_data_tool: Retrieves data based on the user query
● show_video_tool: Creates video chunks from the original video

66
DailyDoseofDS.com

#4) Integrate MCP server with Cursor

To integrate the MCP server with Cursor, go to Settings → MCP → Add new
global MCP server.

67
DailyDoseofDS.com

Done!

Your local Ragie MCP server is live and connected to Cursor!

68
DailyDoseofDS.com

Next, we interact with the MCP server through Cursor.

Based on the query, it can:

● Ingest a new video into the Ragie Index.

● Fetch detailed information about an existing video.
● Retrieve the video segment where a speciﬁc event occurred.

And that was your MCP-powered video RAG.

The code is available here:

https://www.dailydoseofds.com/p/build-
an-mcp-powered-rag-over-videos/

69
DailyDoseofDS.com

#11) MCP-powered Audio Analysis Toolkit

We have an MCP-driven audio analysis toolkit that accepts an audio ﬁle and lets
you transcribe it and extract insights such as sentiment analysis, speaker labels,
summary and topic detection. It also lets you chat with audio.

Tech stack

● AssemblyAI for transcription and audio analysis.

● Claude Desktop as the MCP host.
● Streamlit for the UI

Workﬂow

● User's audio input is sent to AssemblyAI via a local MCP server.

● AssemblyAI transcribes it while providing the summary, speaker labels,
sentiment, and topics.
● Post-transcription, the user can also chat with audio.

70
DailyDoseofDS.com

#1) Transcription MCP tool

This tool accepts an audio input from the user and transcribes it using
AssemblyAI. We also store the full transcript to use in the next tool.

#2) Audio analysis tool

Next, we have a tool that returns speciﬁc insights from the transcript, like
speaker labels, sentiment, topics, and summary.

71
DailyDoseofDS.com

#3) Create MCP Server

Now, we’ll set up an MCP server to use the tools we created above.

#4) Integrate MCP server with Claude Desktop

Go to File → Settings → Developer → Edit Conﬁg and add the following code.

72
DailyDoseofDS.com

Once the server is conﬁgured, Claude Desktop will show the two tools we built
above in the tools menu:

● transcribe_audio
● get_audio_data

And that was our MCP-powered audio analysis toolkit!

For accessibility, we have created a Streamlit UI for the audio analysis app.

You can upload the audio, extract insights, and chat with it using AssemblyAI’s
LeMUR. Find the code below.

The code is available here:

https://www.dailydoseofds.com/p/hands-o
n-build-an-mcp-powered-audio-analysis-
toolkit/

Simplified: The USB-C For AI Integrations
No ratings yet
Simplified: The USB-C For AI Integrations
15 pages
Anthropic MCP Server
100% (2)
Anthropic MCP Server
10 pages
Lang Graph
100% (2)
Lang Graph
113 pages
LangChain in Action v5 MEAP
100% (1)
LangChain in Action v5 MEAP
372 pages
Build LLM Apps with LangChain Guide
100% (5)
Build LLM Apps with LangChain Guide
12 pages
Dokumen - Pub Building Agentic Ai Systems Create Intelligent Autonomous Ai Agents That Can Reason Plan and Adapt 9781803238753
100% (4)
Dokumen - Pub Building Agentic Ai Systems Create Intelligent Autonomous Ai Agents That Can Reason Plan and Adapt 9781803238753
288 pages
Patterns For Building LLM-based Systems & Products
50% (2)
Patterns For Building LLM-based Systems & Products
31 pages
100 Generative AI Use Cases Examples For Industries
100% (10)
100 Generative AI Use Cases Examples For Industries
63 pages
Building LLM Powered Applications With Langchain
100% (1)
Building LLM Powered Applications With Langchain
11 pages
AI Agent Engineering Syllabus
100% (1)
AI Agent Engineering Syllabus
9 pages
How To Build AI Agent Cheat Sheet by Dr. Maryam Miradi
100% (2)
How To Build AI Agent Cheat Sheet by Dr. Maryam Miradi
2 pages
RAG Architecture
100% (11)
RAG Architecture
52 pages
Introduction To LLMS: Transformers Types of Llms Configuration Settings
100% (2)
Introduction To LLMS: Transformers Types of Llms Configuration Settings
7 pages
Build vs. Buy: ML Observability Guide
No ratings yet
Build vs. Buy: ML Observability Guide
31 pages
Whitepaper Emebddings Vectorstores v2
100% (1)
Whitepaper Emebddings Vectorstores v2
64 pages
LLM Applications in Production Guide
100% (12)
LLM Applications in Production Guide
254 pages
Zero To Production AI Agent Guide
100% (1)
Zero To Production AI Agent Guide
30 pages
AI Integration in React Apps Explained
No ratings yet
AI Integration in React Apps Explained
9 pages
Vector Database Essentials
No ratings yet
Vector Database Essentials
26 pages
AI Agents by Google
100% (11)
AI Agents by Google
42 pages
Llama3, LangGraph and Elasticsearch - Build A Local Agent For Vector Search - Search Labs
100% (3)
Llama3, LangGraph and Elasticsearch - Build A Local Agent For Vector Search - Search Labs
48 pages
Nerative AI Agents B0F9KK7N2H
100% (5)
Nerative AI Agents B0F9KK7N2H
254 pages
Agentic AI Red Teaming Guide JUN 2025
No ratings yet
Agentic AI Red Teaming Guide JUN 2025
62 pages
Software Architecture in An AI World
100% (2)
Software Architecture in An AI World
25 pages
Generative AI Usecases - A Comprehensive Guide - Dummies
100% (1)
Generative AI Usecases - A Comprehensive Guide - Dummies
19 pages
GenerativeAI Projects
100% (4)
GenerativeAI Projects
46 pages
Large Language Models (LLM)
100% (3)
Large Language Models (LLM)
139 pages
Top Agentic AI Architecture Design Patterns
100% (6)
Top Agentic AI Architecture Design Patterns
8 pages
ML Deployment & MLOps Guide
No ratings yet
ML Deployment & MLOps Guide
56 pages
Guide To Building AI Agents From Scratch
100% (9)
Guide To Building AI Agents From Scratch
17 pages
Building AI Agents With LLMS, RAG, and Knowledge Graphs
100% (8)
Building AI Agents With LLMS, RAG, and Knowledge Graphs
560 pages
An Illustrated Guide To AI Agents
100% (11)
An Illustrated Guide To AI Agents
117 pages
Agentic Design Patterns Clearly Explained 1737225219
No ratings yet
Agentic Design Patterns Clearly Explained 1737225219
7 pages
EY Generative AI Use Cases Repository
100% (4)
EY Generative AI Use Cases Repository
267 pages
Generative AI On AWS
100% (11)
Generative AI On AWS
208 pages
Context Engineering Guide
No ratings yet
Context Engineering Guide
12 pages
AI Engineer Resume
No ratings yet
AI Engineer Resume
2 pages
1GitHub - Modelcontextprotocol - Python-Sdk - The Official Python SDK For Model Context Protocol Servers and Clients
No ratings yet
1GitHub - Modelcontextprotocol - Python-Sdk - The Official Python SDK For Model Context Protocol Servers and Clients
9 pages
Agents Companion v2
100% (3)
Agents Companion v2
76 pages
Practical Guide To Using LLMs by Andrej Karpathy Feb 29 2025
No ratings yet
Practical Guide To Using LLMs by Andrej Karpathy Feb 29 2025
8 pages
Best Practices For Fine-Tuning and Prompt Engineering LLMs - Weights & Biases LLM Whitepaper
50% (2)
Best Practices For Fine-Tuning and Prompt Engineering LLMs - Weights & Biases LLM Whitepaper
21 pages
Agentic AI Projects
50% (4)
Agentic AI Projects
9 pages
Agents White Paper
100% (2)
Agents White Paper
21 pages
Diffusion
100% (6)
Diffusion
62 pages
Sheffield R. Generative AI Development With Langchain. The Ultimate Guide 2023
100% (3)
Sheffield R. Generative AI Development With Langchain. The Ultimate Guide 2023
134 pages
LLMs and Generative AI For (Z-Library)
100% (5)
LLMs and Generative AI For (Z-Library)
58 pages
LLM Mesh: A Practical Guide To Using Generative AI in The Enterprise
100% (3)
LLM Mesh: A Practical Guide To Using Generative AI in The Enterprise
27 pages
Retrieval-Augmented LMs Overview
No ratings yet
Retrieval-Augmented LMs Overview
120 pages
How Anthropic Teams Use Claude Code v2
No ratings yet
How Anthropic Teams Use Claude Code v2
23 pages
GitHub Trainings
No ratings yet
GitHub Trainings
5 pages
Build An LLM Application From Scratch MEAP 2 - Hamza Farooq
No ratings yet
Build An LLM Application From Scratch MEAP 2 - Hamza Farooq
161 pages
Fine-Tuning AI Models for Developers
100% (2)
Fine-Tuning AI Models for Developers
19 pages
Guide To Planning AI Agents
100% (1)
Guide To Planning AI Agents
12 pages
RAG - A Simple Introduction
100% (6)
RAG - A Simple Introduction
75 pages
300 LangChain Projects
100% (2)
300 LangChain Projects
17 pages
A Taxonomy of Retrieval Augmented Generation
100% (5)
A Taxonomy of Retrieval Augmented Generation
56 pages
Advanced Retrieval-Augmented Generation (RAG) With LangChain, LangGraph, and AI Agents - by Manoj Mukherjee - Oct, 2024 - Medium
No ratings yet
Advanced Retrieval-Augmented Generation (RAG) With LangChain, LangGraph, and AI Agents - by Manoj Mukherjee - Oct, 2024 - Medium
15 pages
MCP 1752911458
No ratings yet
MCP 1752911458
32 pages
MCP Concept
No ratings yet
MCP Concept
6 pages
MCP Beginners Guide v2
No ratings yet
MCP Beginners Guide v2
62 pages
Equity Reasrch
No ratings yet
Equity Reasrch
2 pages
Sahara Newsletter Sudoku
No ratings yet
Sahara Newsletter Sudoku
1 page
Daily Challenge Curriculum - 100 Days of Machine Learning
No ratings yet
Daily Challenge Curriculum - 100 Days of Machine Learning
15 pages
Open CV Notes
No ratings yet
Open CV Notes
30 pages
Material Approval Document Checklist-230619
No ratings yet
Material Approval Document Checklist-230619
3 pages
Cyb 102
No ratings yet
Cyb 102
44 pages
Mitel 5320 Quick Reference Guide
No ratings yet
Mitel 5320 Quick Reference Guide
1 page
Appdomain: Identifer Name Example Class Pascal
No ratings yet
Appdomain: Identifer Name Example Class Pascal
14 pages
F01U342426
No ratings yet
F01U342426
20 pages
Esther Virtual Assistant Resume 04 04 2025
No ratings yet
Esther Virtual Assistant Resume 04 04 2025
2 pages
Kitec Price List
100% (1)
Kitec Price List
10 pages
Businessware Technologies - Adempiere Presentation
No ratings yet
Businessware Technologies - Adempiere Presentation
32 pages
MSc in Electrical Engineering Aspirations
No ratings yet
MSc in Electrical Engineering Aspirations
2 pages
Human Activity Recogniton Using Machine Learning IJERTV10IS040236
No ratings yet
Human Activity Recogniton Using Machine Learning IJERTV10IS040236
5 pages
SDGDSGDSG
No ratings yet
SDGDSGDSG
31 pages
Wireless World 1984 12
0% (1)
Wireless World 1984 12
108 pages
DivyaVenkataramu Resume
No ratings yet
DivyaVenkataramu Resume
1 page
Mini Project Final
No ratings yet
Mini Project Final
18 pages
DP-300 Exam: Administering Azure Databases
No ratings yet
DP-300 Exam: Administering Azure Databases
6 pages
NV3029datasheet V1.9
No ratings yet
NV3029datasheet V1.9
99 pages
David Tastekin Harry Krötz Clemens Gerlach Jörg Roth-Stielow
No ratings yet
David Tastekin Harry Krötz Clemens Gerlach Jörg Roth-Stielow
7 pages
Hacking The Xbox An Introduction To Reverse Engineering Andrew Huang 2025 Easy Download
No ratings yet
Hacking The Xbox An Introduction To Reverse Engineering Andrew Huang 2025 Easy Download
171 pages
Cloudera Hive
No ratings yet
Cloudera Hive
132 pages
Library Manual Final
No ratings yet
Library Manual Final
13 pages
Mixmax - Backend Engineer
No ratings yet
Mixmax - Backend Engineer
3 pages
Resound Nexia Connectivity Brochure
No ratings yet
Resound Nexia Connectivity Brochure
4 pages
SharePoint 2013 - Showing List Data in Jquery Datatable With Advanced Feature PDF
No ratings yet
SharePoint 2013 - Showing List Data in Jquery Datatable With Advanced Feature PDF
12 pages
RTW Ref
No ratings yet
RTW Ref
786 pages
Hrusikesh Bisoyi
No ratings yet
Hrusikesh Bisoyi
3 pages
Installation & Cabling Manual: Osdr / PTP Node / PTMP Terminal / PTMP Hub
No ratings yet
Installation & Cabling Manual: Osdr / PTP Node / PTMP Terminal / PTMP Hub
75 pages
Bhasker Sony: Strabag Oman LLC Post Box No-444 Postal Code-100-Muscat Sultanate of Oman
No ratings yet
Bhasker Sony: Strabag Oman LLC Post Box No-444 Postal Code-100-Muscat Sultanate of Oman
2 pages
Orbit hf680 Datasheet en
No ratings yet
Orbit hf680 Datasheet en
2 pages
MCQ Exam Sys and Source Code
No ratings yet
MCQ Exam Sys and Source Code
48 pages
SCC Manual
No ratings yet
SCC Manual
564 pages