Hacker Timesnew | past | comments | ask | show | jobs | submitlogin

As an aside, it seems you're interested in speech recognition, or speech to text, not voice recognition. Voice recognition is a different problem, where the particular speaker needs to be recognized from voice.


I used to work in the speech reco space for many years. Fighting this battle was lost long long long ago. Virtually no one outside of the space really cares, and people use speech and voice recognition interchangeably. It’s really more the case that voice recognition is just an ambiguous term.

It’s pretty hard to blame lay people when speech reco products like Dragon are widely marketed as “voice recognition”.

If you want to be clear and not just pedantic just call it speaker recognition.


I'd primarily like speech-to-text and the ability to know who is speaking. I have low expectations of the identification of speaker however.


If you're wanting a lot of people to use your solution as you described, recognition of who is speaking could add a lot of extra possibilities.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: