live speech and tts
planned
Sven Kernke
Does that mean it involves two parts? One “input” via voice and one output as spoken language, or just one of the two? If so, which one? :)
endu
Sven Kernke Voice input to text, to fill in your prompt box instead of typing it out! You will have that soon, we are working on it. Reading out loud is something we can also do, but we are not doing it right now.
endu
planned
endu
Hey! Totally hear you on this—live speech is a bit tricky right now since it depends on OAI expanding their limits, but we’ve got TTS on the way! It’s actually part of our roadmap, so stay tuned—it’s coming soon! 😊
Radosław Domański
endu hey. It's actually super simple to implement using a local (browser) engine on the web and the system engine on the native app
If you'd like to - I can help you out on this, I have example code for both of those cases. It's literally a few hours of work to make it happen
that being said - for myself I just use this: https://dictanote.co/voicein/
works flawlessly
k nabu
endu also, OAI is not the be-all and end-all! Look into Hume Octave and/or Sesame!
endu
Radosław Domański: Hey, thanks for sharing! 😊 Yeah, we've tried similar approaches, and while it's definitely doable, we found the voices to still sound a bit robotic, don’t you think? Would love your thoughts if you've found a way around that!
endu
k nabu: Alright, I’ll check them out and let you know how it goes. Thanks for the suggestions!
Radosław Domański
endu I was talking about the speech-to-text part. As far as I'm concerned, that's a separate topic from TTS, and a much more powerful one. TTS is nice, but not required. So my previous answer was geared towards that.
but also - it's better to have a slightly robotic voice for free than none ;)
endu
Radosław Domański: Ahh, got it!