I need a plan of action to support audio models a part from chat and embeddings - transcription - text to speech https://platform.openai.com/docs/guides/audio