Fletcher (Optimized, Modular)

A faster, cleaner, modular rewrite of your Fletcher assistant, with streaming TTS while the model is still generating (near-zero delay), microphone input, screen analysis, Spotify controls (Windows), app launch/close, and optional macOS Calendar reading.

1) Folder structure
Create this layout (names matter):

fletcher/                 ← project root (run commands from here)
└─ fletcher/              ← the package itself
   ├─ __init__.py
   ├─ main.py
   ├─ utils.py
   ├─ config.py
   ├─ ai.py
   ├─ tts.py
   ├─ vision.py
   └─ …                   ← remaining modules (mic input, app/Spotify control, Calendar, UI)

You can place everything inside a single top-level folder (named fletcher) as shown. The module names match the file headers in section 6.

2) Install prerequisites
Use a new venv if possible to avoid version conflicts.

# Windows (PowerShell)
python -m venv .venv
.\.venv\Scripts\Activate.ps1
pip install --upgrade pip
pip install pyqt5 openai pygame SpeechRecognition psutil pillow
pip install "elevenlabs<1"   # TTS; the code below uses the pre-1.0 generate/save API

# PyAudio (for mic) – Windows tips:
pip install pipwin
pipwin install pyaudio

# macOS/Linux (zsh/bash)
python3 -m venv .venv
source .venv/bin/activate
pip install --upgrade pip
pip install pyqt5 openai pygame SpeechRecognition psutil pillow pyaudio
pip install "elevenlabs<1"

If PyAudio gives trouble on macOS: brew install portaudio, then pip install pyaudio.
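
Before wiring up the UI, you can confirm the mic stack works with a quick SpeechRecognition + PyAudio check (a standalone sanity test, not part of Fletcher itself):

import speech_recognition as sr

r = sr.Recognizer()
with sr.Microphone() as source:  # raises OSError if PyAudio is missing or broken
    r.adjust_for_ambient_noise(source, duration=0.5)
    print("Say something...")
    audio = r.listen(source, timeout=5)
print(f"Captured {len(audio.get_raw_data())} bytes of audio")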

3) Set your API keys (no hardcoding)
Set environment variables before running:

# Windows (PowerShell)
$env:OPENAI_API_KEY = "sk-..."
$env:ELEVENLABS_API_KEY = "eleven-..."
$env:ELEVENLABS_VOICE_ID = "NOpBlnGInO9m6vDvFkFC" # or your voice id

# macOS/Linux (zsh/bash)
export OPENAI_API_KEY="sk-..."
export ELEVENLABS_API_KEY="eleven-..."
export ELEVENLABS_VOICE_ID="NOpBlnGInO9m6vDvFkFC"
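
A quick way to confirm the variables are visible to Python in the same shell:

python -c "import os; print('OPENAI key set:', bool(os.getenv('OPENAI_API_KEY')))"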

4) Run it
From the project root:

python -m fletcher.main

On first run, macOS may ask you to grant Calendar and Microphone access. For Calendar, allow Terminal (or your IDE) under System Settings → Privacy & Security.

5) Usage (commands)
Type in the chat:

• open notepad, open chrome, close notepad (Windows examples)
• next song, previous song, pause, play (Windows Spotify)
• what is on my screen (takes a screenshot and describes it)
• any plans for this week? (macOS Calendar)
• fletcher exit to quit

Click the 🎤 button to speak.
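
The command-routing module itself isn't included in this excerpt, but the commands above map onto the code in section 6 roughly like this sketch (handle_command and the inlined registry are illustrative assumptions, not the original code):

import subprocess
import psutil

# Mirrors config.APP_REGISTRY from section 6 (trimmed for the example)
APP_REGISTRY = {"notepad": "notepad.exe", "chrome": "chrome.exe"}

def handle_command(text: str) -> bool:
    """Return True if the text was handled as a local command."""
    t = text.lower().strip()
    if t.startswith("open "):
        exe = APP_REGISTRY.get(t.removeprefix("open "))
        if exe:
            subprocess.Popen(exe)  # fire-and-forget launch
            return True
    if t.startswith("close "):
        exe = APP_REGISTRY.get(t.removeprefix("close "))
        if exe:
            for p in psutil.process_iter(["name"]):  # terminate matching processes
                if (p.info["name"] or "").lower() == exe:
                    p.terminate()
            return True
    return False  # not a local command; fall through to the AI

Anything that returns False here would be forwarded to stream_chat (section 6).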

6) Code — files

fletcher/__init__.py

# Empty on purpose; makes this a package.

fletcher/utils.py

import os
import platform

def is_windows() -> bool:
    return platform.system() == "Windows"

def is_macos() -> bool:
    return platform.system() == "Darwin"

def desktop_path() -> str:
    # ~/Desktop on every platform
    return os.path.join(os.path.expanduser("~"), "Desktop")

fletcher/config.py

import os
from .utils import is_windows

# Models
OPENAI_MODEL_TEXT = os.getenv("OPENAI_MODEL_TEXT", "gpt-4o-mini")
OPENAI_MODEL_VISION = os.getenv("OPENAI_MODEL_VISION", "gpt-4o")

# Keys (required)
OPENAI_API_KEY = os.getenv("OPENAI_API_KEY", "")
ELEVEN_API_KEY = os.getenv("ELEVENLABS_API_KEY", "")
ELEVEN_VOICE_ID = os.getenv("ELEVENLABS_VOICE_ID", "NOpBlnGInO9m6vDvFkFC")
ELEVEN_MODEL_ID = os.getenv("ELEVENLABS_MODEL_ID", "eleven_monolingual_v1")

# TTS
TTS_STABILITY = float(os.getenv("TTS_STABILITY", 0.4))
TTS_SIMILARITY = float(os.getenv("TTS_SIMILARITY", 0.92))
TTS_MAX_CHARS = int(os.getenv("TTS_MAX_CHARS", 220))

# App registry for quick open
APP_REGISTRY = {
    "notepad": "notepad.exe",
    "calc": "calc.exe",
    "calculator": "calc.exe",
    "firefox": "firefox.exe",
    "chrome": "chrome.exe",
    "word": "winword.exe",
    "excel": "excel.exe",
    "paint": "mspaint.exe",
}

# Windows media keys (VK_MEDIA_NEXT_TRACK, VK_MEDIA_PREV_TRACK, VK_MEDIA_PLAY_PAUSE)
MEDIA_KEYS = {"next": 0xB0, "prev": 0xB1, "playpause": 0xB3} if is_windows() else {}
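
The module that actually sends these key codes isn't included in this excerpt. On Windows they can be tapped with ctypes, roughly like the sketch below (press_media_key is an assumed name, not the original code):

import ctypes
from . import config

KEYEVENTF_EXTENDEDKEY = 0x0001
KEYEVENTF_KEYUP = 0x0002

def press_media_key(name: str) -> None:
    """Tap a media key by registry name, e.g. press_media_key("playpause")."""
    vk = config.MEDIA_KEYS.get(name)
    if vk is None:
        return  # not on Windows, or unknown key name
    user32 = ctypes.windll.user32
    user32.keybd_event(vk, 0, KEYEVENTF_EXTENDEDKEY, 0)                    # key down
    user32.keybd_event(vk, 0, KEYEVENTF_EXTENDEDKEY | KEYEVENTF_KEYUP, 0)  # key up

Because these are system-wide media keys, Spotify responds to them without any Spotify API integration.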

fletcher/ai.py

import threading
from typing import Callable, List, Dict
from openai import OpenAI
from . import config

# One client per process
_client = OpenAI(api_key=config.OPENAI_API_KEY) if config.OPENAI_API_KEY else None

def stream_chat(
    history: List[Dict[str, str]],
    user_text: str,
    on_chunk: Callable[[str], None],
    on_done: Callable[[str], None],
    on_error: Callable[[str], None],
    tts_streamer=None,
):
    """Stream a reply and forward chunks to the UI and the TTS streamer."""

    def _job():
        if _client is None:
            on_error("OpenAI key missing. Set OPENAI_API_KEY.")
            return
        history.append({"role": "user", "content": user_text})
        reply = ""
        try:
            stream = _client.chat.completions.create(
                model=config.OPENAI_MODEL_TEXT,
                messages=history,
                stream=True,
            )
            for chunk in stream:
                delta = chunk.choices[0].delta
                piece = getattr(delta, "content", None)
                if not piece:
                    continue
                reply += piece
                on_chunk(piece)  # UI
                if tts_streamer:
                    tts_streamer.feed(piece)  # speak as it arrives
            history.append({"role": "assistant", "content": reply})
            on_done(reply)
        except Exception as e:
            on_error(f"AI error: {e}")

    threading.Thread(target=_job, daemon=True).start()
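
A minimal way to exercise this module from a plain console, outside the PyQt UI (the module name ai.py is inferred from the content above; adjust the import if yours differs):

import time
from fletcher import ai

history = [{"role": "system", "content": "You are Fletcher, a concise assistant."}]
ai.stream_chat(
    history,
    "Say hello in five words.",
    on_chunk=lambda piece: print(piece, end="", flush=True),  # UI hook
    on_done=lambda full: print("\n[done]"),
    on_error=print,
)
time.sleep(10)  # the work happens on a daemon thread; keep the process alive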

fletcher/tts.py

import os
import re
import time
import queue
import tempfile
import threading
from typing import Optional

import pygame

from . import config
from .utils import is_windows, is_macos

# Try ElevenLabs lazily so the app can still run without it
try:
    from elevenlabs import generate, save, set_api_key, VoiceSettings
    _HAVE_ELEVEN = True
except Exception:
    _HAVE_ELEVEN = False

class TTSStreamer:
    """Aggregates tiny stream chunks into sentence-sized audio with minimal delay."""

    def __init__(self):
        self._q: "queue.Queue[str]" = queue.Queue()
        self._buf = ""
        self._stop = threading.Event()
        self._lock = threading.Lock()

        if _HAVE_ELEVEN and config.ELEVEN_API_KEY:
            set_api_key(config.ELEVEN_API_KEY)
        else:
            # No TTS; the streamer becomes a no-op
            pass

        # Configure the pygame audio driver per OS (helps startup reliability)
        if is_windows():
            os.environ.setdefault("SDL_AUDIODRIVER", "directsound")
        elif is_macos():
            os.environ.setdefault("SDL_AUDIODRIVER", "coreaudio")
        else:
            os.environ.setdefault("SDL_AUDIODRIVER", "alsa")

        self._th = threading.Thread(target=self._loop, daemon=True)
        self._th.start()

    def feed(self, text: str):
        if text:
            self._q.put(text)

    def say(self, text: str):
        """Queue a full sentence immediately (bypasses aggregation)."""
        for part in self._split_sentences(text):
            self._q.put(part)

    def shutdown(self):
        self._stop.set()
        self._q.put("")  # wake the worker so it can exit

    # ---- internals ----

    def _loop(self):
        while not self._stop.is_set():
            try:
                piece = self._q.get(timeout=0.2)
            except queue.Empty:
                # If the buffer holds a pending sentence, flush after a short idle
                if self._buf:
                    self._flush_if_ready(force=True)
                continue
            self._buf += piece
            self._flush_if_ready()

    def _flush_if_ready(self, force: bool = False):
        if not self._buf:
            return
        # Heuristic: speak when we hit a sentence end, the max size, or a short idle
        sentence_done = bool(re.search(r"[.!?…](\s|$)", self._buf))
        too_long = len(self._buf) >= config.TTS_MAX_CHARS
        if force or sentence_done or too_long:
            chunk = self._take_speakable_chunk(self._buf)
            self._buf = self._buf[len(chunk):]
            if chunk.strip():
                self._speak(chunk.strip())

    @staticmethod
    def _split_sentences(text: str):
        return re.split(r"(?<=[.!?…])\s+", text)

    @staticmethod
    def _take_speakable_chunk(text: str) -> str:
        # Grab up to a sentence or TTS_MAX_CHARS
        sentences = re.split(r"(?<=[.!?…])\s+", text)
        out = ""
        for s in sentences:
            if len(out) + len(s) + 1 > config.TTS_MAX_CHARS:
                break
            out = (out + " " + s).strip()
            if re.search(r"[.!?…]$", s):
                break
        return out or text[: config.TTS_MAX_CHARS]

    def _speak(self, text: str):
        if not _HAVE_ELEVEN or not config.ELEVEN_API_KEY:
            return  # no-op if TTS is not configured
        tmp_path: Optional[str] = None
        try:
            audio = generate(
                text=text,
                voice=config.ELEVEN_VOICE_ID,
                model=config.ELEVEN_MODEL_ID,
                voice_settings=VoiceSettings(
                    stability=config.TTS_STABILITY,
                    similarity_boost=config.TTS_SIMILARITY,
                ),
            )
            with tempfile.NamedTemporaryFile(delete=False, suffix=".mp3") as tmp:
                save(audio, tmp.name)
                tmp_path = tmp.name

            with self._lock:
                if not pygame.mixer.get_init():
                    pygame.mixer.init()
                pygame.mixer.music.load(tmp_path)
                pygame.mixer.music.play()
                while pygame.mixer.music.get_busy():
                    time.sleep(0.05)
        except Exception:
            # Stay silent on TTS failures to avoid spamming the UI
            pass
        finally:
            if tmp_path and os.path.exists(tmp_path):
                try:
                    os.remove(tmp_path)
                except Exception:
                    pass
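
Because playback is serialized behind self._lock on a background thread, feed() can be called straight from the streaming callback with no extra coordination. A quick standalone check (needs ELEVENLABS_API_KEY set; the module name tts.py is inferred from the content above):

import time
from fletcher.tts import TTSStreamer

streamer = TTSStreamer()
for piece in ["Hello", " there.", " This is", " a streaming", " test."]:
    streamer.feed(piece)  # tiny chunks; spoken once a sentence completes
streamer.say("This sentence is queued whole.")
time.sleep(15)            # audio plays on the worker thread
streamer.shutdown()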

fletcher/vision.py

import os
import base64
import tempfile
from PIL import ImageGrab
from openai import OpenAI
from . import config

_client = OpenAI(api_key=config.OPENAI_API_KEY) if config.OPENAI_API_KEY else None

def analyze_screen() -> str:
    if _client is None:
        return "OpenAI key missing. Set OPENAI_API_KEY."
    try:
        img = ImageGrab.grab()
    except Exception as e:
        return f"Couldn't capture screen: {e}"

    tmp = os.path.join(tempfile.gettempdir(), "fletcher_screen.png")
    try:
        img.save(tmp)
        with open(tmp, "rb") as f:
            img_b64 = base64.b64encode(f.read()).decode("utf-8")
        resp = _client.chat.completions.create(
            model=config.OPENAI_MODEL_VISION,
            messages=[{
                "role": "user",
                "content": [
                    {"type": "text",
                     "text": "Describe and explain what is visible in this screenshot."},
                    # (The source was cut off here; the remainder follows the
                    # standard OpenAI image_url request shape.)
                    {"type": "image_url",
                     "image_url": {"url": f"data:image/png;base64,{img_b64}"}},
                ],
            }],
        )
        return resp.choices[0].message.content or ""
    except Exception as e:
        return f"Vision error: {e}"
    finally:
        try:
            os.remove(tmp)
        except Exception:
            pass

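With the keys set, the screen analyzer can be tried on its own (the module name vision.py is inferred from the content above):

from fletcher.vision import analyze_screen
print(analyze_screen())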