API support to know who spoke what
I am trying to build a system that should be able to recognize all the speakers and what each speaker has said.
I was trying to build the solution using the “Speaker Recognition API”. I am passing the voice/audio stream to the identification API and can tell who the speakers are, but I didn’t find a way to know who spoke what.
Is there any way to know who spoke what using the “Speaker Recognition API”? This is required for my solution.
References to any other APIs Microsoft is building would be really helpful.