Can I Fine-Tune the Diarization Model to Recognize a Specific Individual's Voice? #234
Unanswered
shivamtawari
asked this question in
Q&A
Replies: 1 comment 1 reply
-
It's doable but not through finetuning, you will use the intermediate embeddings generated from MSDD model and compare them to reference embeddings that you generated to identify which speaker is XYZ |
Beta Was this translation helpful? Give feedback.
1 reply
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
Hi @MahmoudAshraf97
I'm curious to know if it's possible to customize the diarization output. Specifically, can we assign a custom name, such as 'Mr. XYZ', to dialogues spoken by a particular person, while the rest are labeled as 'Person 0', 'Person 1', etc.?
Thanks!
Beta Was this translation helpful? Give feedback.
All reactions