As an aside, it seems you're interested in speech recognition, or speech to text...

catblast · on March 14, 2020

I used to work in the speech reco space for many years. Fighting this battle was lost long long long ago. Virtually no one outside of the space really cares, and people use speech and voice recognition interchangeably. It’s really more the case that voice recognition is just an ambiguous term.

It’s pretty hard to blame lay people when speech reco products like Dragon are widely marketed as “voice recognition”.

If you want to be clear and not just pedantic just call it speaker recognition.

rs23296008n1 · on March 14, 2020

I'd primarily like speech-to-text and the ability to know who is speaking. I have low expectations of the identification of speaker however.

peglasaurus · on March 16, 2020

If you're wanting a lot of people to use your solution as you described, recognition of who is speaking could add a lot of extra possibilities.