Web app that generates image from text/image/audio prompt

bryanmylee/audio-diffusion

Audio Diffuser

An audio-thumbnail generator.

The application can currently classify these 10 sounds:

  • Air Conditioner
  • Car Horn
  • Children Playing
  • Dog Bark
  • Drilling
  • Engine Idling
  • Gun Shot
  • Jackhammer
  • Siren
  • Street Music
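
To make the class list concrete, here is a minimal sketch of how a classifier's output could be mapped to one of the labels above. It assumes the model returns one score per class, in the order listed, which this README does not confirm. `LABELS` and `top_label` are hypothetical names, not part of the project.

```python
# Hypothetical label table. The ordering is an assumption for illustration,
# not something taken from the project's code.
LABELS = [
    "Air Conditioner", "Car Horn", "Children Playing", "Dog Bark", "Drilling",
    "Engine Idling", "Gun Shot", "Jackhammer", "Siren", "Street Music",
]


def top_label(scores):
    """Return the label whose score is highest.

    `scores` must contain exactly one number per entry in LABELS.
    """
    if len(scores) != len(LABELS):
        raise ValueError(f"expected {len(LABELS)} scores, got {len(scores)}")
    # Index of the largest score; ties resolve to the earliest index.
    best = max(range(len(scores)), key=scores.__getitem__)
    return LABELS[best]
```

For example, a score vector whose largest entry is in the ninth position would map to "Siren".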

Server

Requirements

A machine that is capable of running TensorFlow and PyTorch.

Installation

Navigate to /server.

Install all dependencies listed in requirements.txt. The exact steps depend on your system architecture.

If using Anaconda / Miniconda, create a new environment specifically for this project:

$ conda env create -f environment.yml

Activate the audio-diffusion environment.

$ conda activate audio-diffusion

Download the pre-trained models.

$ python3 download.py

Lastly, set the required environment variables by copying .env.sample to .env.
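
The copy step can also be scripted. The sketch below is one possible way to do it from Python without overwriting an existing .env; `create_env_file` is a hypothetical helper, not part of the project, and it assumes .env.sample sits in the directory you pass in.

```python
import shutil
from pathlib import Path


def create_env_file(directory="."):
    """Copy .env.sample to .env inside `directory`.

    Returns True if .env was created, False if it already existed
    (in which case the existing file is left untouched).
    Raises FileNotFoundError if .env.sample is missing.
    """
    sample = Path(directory) / ".env.sample"
    target = Path(directory) / ".env"
    if target.exists():
        return False
    shutil.copyfile(sample, target)
    return True
```

Calling `create_env_file()` from within /server would create .env on the first run and do nothing on later runs, so local edits to .env are preserved.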

Run

$ python3 app.py

The server will be accessible at http://localhost:5001.
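
To confirm the server is actually listening, a quick port check can help. This is a minimal sketch using only the standard library; `port_is_open` is a hypothetical helper, and it only verifies that something accepts TCP connections on the port, not that it is this application.

```python
import socket


def port_is_open(host="localhost", port=5001, timeout=1.0):
    """Return True if a TCP connection to (host, port) succeeds."""
    try:
        # create_connection raises OSError (e.g. connection refused, timeout)
        # when nothing is reachable on the given address.
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        return False
```

With the server running, `port_is_open()` should return True; if it returns False, check that `python3 app.py` started without errors.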

Web app

Requirements

  • npm or yarn
  • Node

Installation

Navigate to /webapp.

$ yarn install

Configure .env.local to set whether the application should use a server mock.

Run

$ yarn dev

The web application will be accessible at http://localhost:3000.
