Topics tagged audio

Topic	Replies	Views	Activity
Introducing GPT-4o Transcribe Diarize: Now Available in the Audio API Community announcement , api , audio , transcribe , diarize	7	3177	November 14, 2025
Can't Hear Inbound Audio from OpenAI Realtime Agent (WebRTC) - Oubound Works, Inbound Stuck at 1 kbps API ios-app , agents , audio , api-realtime-speech	2	121	November 10, 2025
Best Practices for Maintaining Speaker Identity Across Chunks with gpt-4o-transcribe-diarize? API gpt-4 , audio , diarize	3	358	November 4, 2025
GPT-Audio Not working - Error 500 API audio	14	548	October 15, 2025
OpenAI Text To Speech - Speaking with emotion/effets / price API tts , audio	0	62	September 16, 2025
New in Evals: Full Audio Support Community api , audio , evals	0	88	September 12, 2025
:speaking_head: Want to Listen to ChatGPT Voices Reading Your Text? Now You Can! Introducing GPT Reader: A Free ChatGPT Powered Text to Speech Extension! Community chatgpt , plugin-development , audio , text2audio , browser-extensions	2	1923	September 6, 2025
AI Audio Gone Wild gallery 2025 - Your Audio Clips from gpt-4o-audio and tts Community tts , audio , gpt-4o-audio-preview	5	407	August 13, 2025
librosa + numpy compatibility bug broke all .wav and .flac audio analysis in GPT-4o Bugs audio	4	410	July 7, 2025
Realtime "modalities" session config not disabling local->model audio channel API audio , realtime , api-realtime	3	254	June 6, 2025
Project: Running your own Whisper-Large-v3 model and extract Audio Embeddings Community embeddings , project , whisper , audio	10	4557	May 11, 2025
What is the difference between realtime-transcription and speech-to-text for Streaming the transcription of an ongoing audio recording? API api , whisper , audio , realtime , api-realtime	2	551	April 1, 2025
Create transcription with gpt-4o-transcribe – max audio file length/size? API audio	0	210	March 24, 2025
Audio Models in the API March 20, 2025 API audio	0	222	March 20, 2025
Speech to Text (ASR) Strategy Community whisper , audio , gpt-4o-audio-preview	8	606	March 10, 2025
Gpt-4o-audio-preview responds in text, not audio Bugs audio , gpt-4o	6	1690	January 25, 2025
Realtime API re-consuming it's own output audio as input audio API audio , realtime , api-realtime , api-realtime-speech	10	1248	January 10, 2025
Is there a way to prevent gpt-4o-audio-preview from returning audio? API audio	8	699	December 17, 2024
Can`t get the right audio format for recording in web application with whisper on IOS API gpt-4 , chatgpt , api , whisper , audio	0	107	November 20, 2024
Logit_bias for gpt-4o-audio-preview API gpt-4 , api , audio	1	55	November 13, 2024
CoT with 4o Audio or Real Time Prompting audio , realtime , api-realtime-speech	4	420	November 12, 2024
500 error in request to gpt-4o-audio-* model API api , audio , gpt-4o	13	479	November 12, 2024
Multiturn conversation format using gpt-4o-audio-preview with audio input API audio	1	528	November 12, 2024
Issues with gpt-4o-audio-preview when using tools/functions API tools , audio	1	433	November 12, 2024
Cached input audio_tokens is always 0 API audio , realtime	3	506	November 8, 2024
How to replace my GPT TTS call for better performance? API tts , audio	1	318	November 5, 2024
Waiting for gpt-4o-audio-preview API audio	11	3954	November 4, 2024
TranscriptionVerbose.duration is a number, not a string Documentation whisper , audio	0	85	October 25, 2024
Translation api returns incorect api key while the same key works for chat Bugs whisper , audio	2	103	October 11, 2024
'Transcription Outsourcing, LLC' repeated throughout whisper transcript API api , whisper , hallucinations , audio	18	1506	October 5, 2024