Transform casual voice recordings into clean, structured context data for AI applications.
Context Cruncher extracts structured context data from voice recordings using Gemini AI's multimodal capabilities. It processes audio directly, cleaning up natural speech patterns and organizing information into useful context data that AI systems can use for personalization.
Context data refers to specific information about users that grounds AI inference for more personalized results. This tool achieves that by:
- Removing irrelevant information and tangents
- Eliminating duplicates and redundancy
- Reformatting from first person to third person
- Organizing information hierarchically
- Outputting both Markdown and JSON formats
Check out the demo page to see real results from processing example audio about movie preferences.
- 🎤 Flexible Audio Input: Record directly in your browser or upload audio files (MP3, WAV, OPUS)
- 🤖 AI-Powered Extraction: Uses Gemini 2.0 Flash for intelligent audio understanding and context extraction
- 📝 Dual Output Formats: Get both human-readable Markdown and machine-readable JSON
- 👤 Customizable Identification: Choose how you're referred to in the context data (by name or as "the user")
- 📋 Easy Export: Download files or copy directly to clipboard
- Python 3.12+
- A Gemini API key
- Clone the repository:
```bash
git clone https://github.com/danielrosehill/Context-Cruncher.git
cd Context-Cruncher
```
- Create a virtual environment and install dependencies:
```bash
# Using uv (recommended)
uv venv
source .venv/bin/activate
uv pip install -r requirements.txt

# Or using standard venv
python -m venv .venv
source .venv/bin/activate  # On Windows: .venv\Scripts\activate
pip install -r requirements.txt
```
- Create a `.env` file with your Gemini API key:
```bash
cp .env.example .env
# Edit .env and add your API key:
# GEMINI_API="your_api_key_here"
```
- Run the application:
Option A: Using the launch script (easiest)
```bash
./run.sh
```
Option B: Manual launch
```bash
source .venv/bin/activate
python app.py
```
The app will launch in your browser at http://localhost:7860.
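The app reads your `GEMINI_API` key from `.env`. For illustration, here is a minimal stdlib-only sketch of how such a file could be parsed; the real app may rely on a library such as python-dotenv instead, so treat this `load_env` helper as an assumption, not the project's actual code:

```python
import os

def load_env(path: str = ".env") -> dict:
    """Parse simple KEY=VALUE lines from a .env file, skipping comments and blanks.

    Hypothetical helper for illustration; the app itself may load .env differently.
    """
    env = {}
    if not os.path.exists(path):
        return env
    with open(path) as fh:
        for line in fh:
            line = line.strip()
            if not line or line.startswith("#") or "=" not in line:
                continue
            key, _, value = line.partition("=")
            # Strip surrounding quotes, as in GEMINI_API="your_api_key_here"
            env[key.strip()] = value.strip().strip('"').strip("'")
    return env

api_key = load_env().get("GEMINI_API")
```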
- Configure: Enter your Gemini API key (or load it from `.env`)
- Choose Identification: Select whether to be referred to by name or as "the user"
- Provide Audio: Either:
- Record directly in the browser using your microphone
- Upload an audio file (MP3, WAV, or OPUS)
- Extract: Click "Extract Context" to process your audio
- Download: Get your structured context data as Markdown or JSON
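The steps above can also be sketched programmatically with the `google-generativeai` SDK, which supports uploading audio files directly. The prompt wording and function names below are illustrative assumptions, not the app's actual code:

```python
def build_prompt(identification: str = "the user") -> str:
    """Assemble extraction instructions; the exact wording here is illustrative."""
    return (
        "Extract structured context data from this audio. "
        "Remove tangents and duplicates, rewrite first person as third person, "
        f"refer to the speaker as '{identification}', and organize the facts "
        "under Markdown headings with bullet points."
    )

def extract_context(audio_path: str, api_key: str,
                    identification: str = "the user") -> str:
    """Upload an audio file (MP3, WAV, or OPUS) to Gemini and return Markdown."""
    import google.generativeai as genai  # requires the google-generativeai package

    genai.configure(api_key=api_key)
    audio = genai.upload_file(audio_path)
    model = genai.GenerativeModel("gemini-2.0-flash")
    response = model.generate_content([build_prompt(identification), audio])
    return response.text
```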
Raw Audio Input:
"Okay so... let's document my health problems and the meds I take for this AI project... ehm.. where do I start... well, I've had asthma since I was a kid. I take a daily inhaler called Relvar for that. I also take Vyvanse for ADHD which is a stimulant medication. Oh.. hey Jay! What's up, man! Yeah see you at the gym. Okay, where was I. Note to self, pick up the laundry later. Oh yeah.. I've been on Vyvanse for three years and think it's great. I get bloods every 3 months."
Structured Output:
```markdown
## Medical Conditions
- the user has had asthma since childhood
- the user has adult ADHD

## Medication List
- the user takes Relvar, daily, for asthma
- the user takes Vyvanse 70mg, daily, for ADHD
```
To regenerate the demo results with the example audio:
```bash
python generate_demo.py
```
This will process the `example-data/movie-prefs.opus` file and save the results to `demo-results/`.
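The Markdown and JSON outputs carry the same information in two forms. As a stdlib-only sketch (an assumption, not the app's actual converter), the Markdown sections above could map to JSON like this:

```python
import json

def markdown_to_json(markdown: str) -> dict:
    """Map '## Heading' sections with '- item' bullets to {heading: [items]}.

    Hypothetical converter for illustration; the app may build its JSON directly.
    """
    data: dict[str, list[str]] = {}
    current = None
    for line in markdown.splitlines():
        line = line.strip()
        if line.startswith("## "):
            current = line[3:]
            data[current] = []
        elif line.startswith("- ") and current is not None:
            data[current].append(line[2:])
    return data

demo = "## Medical Conditions\n- the user has adult ADHD"
print(json.dumps(markdown_to_json(demo), indent=2))
```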
Your audio is processed using the Gemini API. Review Google's privacy policies before using this tool with sensitive information.
- AI Assistant Personalization: Provide context to chatbots and AI assistants
- Knowledge Management: Convert verbal notes into structured information
- Preference Mapping: Document likes, dislikes, and preferences
- Medical History: Organize health information (note privacy considerations)
- Project Context: Capture project requirements and preferences
- Frontend: Gradio web interface
- AI Model: Gemini 2.0 Flash (with multimodal audio understanding)
- Audio Processing: Direct audio file upload to Gemini API
- Output Formats: Markdown and JSON
```
Context-Cruncher/
├── app.py               # Main Gradio application
├── gemini_processor.py  # Gemini API integration
├── generate_demo.py     # Demo generation script
├── run.sh               # Launch script
├── requirements.txt     # Python dependencies
├── .env.example         # Environment variable template
├── demo.html            # Demo results page
├── example-data/        # Example audio files
└── demo-results/        # Generated demo outputs
```
Contributions welcome! Please feel free to submit issues or pull requests.
MIT License - See LICENSE file for details
Daniel Rosehill
- Website: danielrosehill.com
- GitHub: @danielrosehill