GraspIt is a revolutionary tool that turns any educational concept into a complete explainer video — all in under 2 minutes. Just enter a topic like "IP Address", and watch as GraspIt generates a 5-scene educational video with narration, visuals, and smooth transitions, fully automated using cutting-edge AI.
-
💬 GPT-4 Script Generation via OpenRouter
Converts any concept into a 5-scene educational script, including narration and visual cues. -
🔊 Realistic Narration using Google Cloud Text-to-Speech
Produces natural, human-like voiceovers for each scene. -
🖼️ AI Image Generation with DeepAI
Creates custom illustrations for each scene based on the script. -
🎬 Video Assembly with MoviePy
Combines images and audio into a polished, high-quality MP4 video.
Install all required packages using pip:
pip install google-cloud-texttospeech moviepy pillow requests python-dotenvYou’ll need API keys and credentials from the following platforms:
- Create a project at: https://console.cloud.google.com/
- Enable Text-to-Speech API
- Create a Service Account → Download JSON key
- Save your key as
service-account.json
- Visit: https://openrouter.ai
- Generate an API key
- Visit: https://deepai.org
- Create an account → Get API key
Create a .env file in your project root:
GOOGLE_APPLICATION_CREDENTIALS=/path/to/service-account.json
OPENROUTER_API_KEY=your_openrouter_key
DEEPAI_API_KEY=your_deepai_key
VIDEO_WIDTH=900
VIDEO_HEIGHT=600
VIDEO_FPS=24
DEFAULT_SCENE_DURATION=4.0
---
From your terminal or Google Colab, run:
python main.pyWhen prompted, type your concept (e.g., IP Address, Photosynthesis, Blockchain).
GraspIt will:
- Generate a 5-scene script
- Convert narration to audio
- Create relevant visuals
- Assemble the video
- Output:
final_explainer_video.mp4
Concept: "Photosynthesis"
🎞️ Output:
- 5 structured scenes
- Voice-over narration
- AI-generated visuals
- Smooth scene transitions
📁 Final File: final_explainer_video.mp4
-
Script Creation Sends your topic to GPT-4 via OpenRouter → Receives 5 scenes with narration and visuals.
-
Text-to-Speech Google Cloud TTS converts narration into professional-quality voiceovers.
-
Image Generation Visual descriptions are sent to DeepAI to create illustrations for each scene.
-
Video Editing Uses MoviePy to combine images and audio into a cohesive video.
- Built an end-to-end AI orchestration system
- Integrated multiple APIs and handled fallbacks
- Generated educational videos in < 2 minutes
- Seamless visual and audio sync
- Handling asynchronous API responses
- Fallback strategies for failed API calls
- Automating educational content workflows
- Audio-video alignment and compression
- 🌐 Web-based interface for drag-and-drop concepts
- 🎙️ Support for multilingual narration (Kannada, Hindi, etc.)
- 📚 Batch generation for syllabus-based content
- 📺 YouTube / Google Classroom integration
- 📝 Subtitles and closed captions
“From concept to classroom — in under 2 minutes.”
Amrutha Kanakatte Ravishankar([email protected])
Sneha Venkatesh([email protected]
Version: 1.0
License: MIT License
graspit/
├── main.py
├── .env
├── final_explainer_video.mp4
├── scene_*.mp3 / .png (temporary files)
└── README.md
This project is licensed under the MIT License. See the LICENSE file for details.