So I have the api working as in I can send audio files and get text back but what I am looking for is a robust way to have streaming functionality. For example, if there is a small duration of silence it should stop recording and send the audio to api etc.
Is there any such library in python?
I found this so far: https://github.com/KoljaB/RealtimeSTT
Maybe I can modify it to use whisper api.