Understanding diarization labels about uis-rnn HOT 6 CLOSED

google commented on July 28, 2024

Understanding diarization labels

from uis-rnn.

Comments (6)

wq2012 commented on July 28, 2024

I really don't understand your questions. Please clarify.

"We can't understand the relationship between real tags and predictive tags."

Which part don't you understand?

"it makes it impossible to find out who the speaker is."

What do you mean?

zyc1310517843 commented on July 28, 2024

For example, I used 46 people to train the model, where train_cluster_id is [0, 0, 0, ..., 45, 45, 45], and then I used the 46th person for prediction, where test_cluster_id is [0, 0, 0, 0, 0, ...]. The predicted result is [0, 0, 0, 0, 0, ...]. My question is: shouldn't the predicted label be [45, 45, 45, ...]? I hope you can understand what I mean.

wq2012 commented on July 28, 2024

In diarization, the labels are relative labels, not absolute labels. They are identity-agnostic.

Labels are meaningless across utterances.

For example, if the labels in an utterance are [0, 0, 1], it means the first two segments are from one speaker, while the last segment is from a different speaker. The labels do NOT refer to any specific speaker.

If another utterance has labels [0, 1, 1], the two speakers in that utterance have no connection with the speakers in the previous utterance.
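To make the point about relative labels concrete, here is a minimal sketch (not part of uis-rnn; the helper names `canonicalize` and `same_partition` are hypothetical). It renumbers a label sequence by order of first appearance, which shows that [45, 45, 45] and [0, 0, 0] describe the same diarization result: one speaker throughout.

```python
def canonicalize(labels):
    """Renumber labels by order of first appearance (0, 1, 2, ...)."""
    mapping = {}
    result = []
    for label in labels:
        if label not in mapping:
            # First time this label is seen: give it the next free index.
            mapping[label] = len(mapping)
        result.append(mapping[label])
    return result


def same_partition(a, b):
    """True if two label sequences group the segments identically."""
    return len(a) == len(b) and canonicalize(a) == canonicalize(b)


# One speaker throughout: the numeric value of the label carries no identity.
print(same_partition([45, 45, 45], [0, 0, 0]))
# Same grouping with the label names swapped.
print(same_partition([0, 0, 1], [1, 1, 0]))
# Different grouping: the speaker change happens at a different segment.
print(same_partition([0, 0, 1], [0, 1, 1]))
```

Under this view, a prediction of [0, 0, 0, ...] for a single-speaker test utterance agrees with the ground truth whichever training speaker it came from.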

zyc1310517843 commented on July 28, 2024

I understand exactly what you said. Can I get the absolute label? Because I want to know who the speaker is. Thank you.

zyc1310517843 commented on July 28, 2024

I understand exactly what you said. Can I get the absolute label? Because I want to know who the speaker is. Thank you.

---Original---
From: "Quan Wang"
Date: Mon, Jun 10, 2019 11:39 AM
To: "google/uis-rnn"
Subject: Re: [google/uis-rnn] Understanding diarization labels (#51)

wq2012 commented on July 28, 2024

If you want absolute labels, you are looking at the wrong technique and the wrong repo. That is not the problem diarization is trying to solve. It is speaker recognition, which is much easier than diarization. You can simply compute the cosine similarity between different embeddings.
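As an illustration of the cosine-similarity approach, here is a minimal sketch in plain Python. It assumes you already have one embedding per known speaker and one embedding for the test utterance. The function `identify_speaker`, the `enrolled` dictionary, the toy 3-dimensional vectors, and the 0.7 threshold are all hypothetical and are not part of uis-rnn.

```python
import math


def cosine_similarity(u, v):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0.0 or norm_v == 0.0:
        raise ValueError("cosine similarity is undefined for a zero vector")
    return dot / (norm_u * norm_v)


def identify_speaker(test_embedding, enrolled, threshold=0.7):
    """Return (name, score) of the closest enrolled speaker.

    `enrolled` maps a speaker name to that speaker's embedding.
    The name is None when no score reaches `threshold`
    (the threshold value here is an assumption, tune it on your data).
    """
    best_name, best_score = None, -1.0
    for name, embedding in enrolled.items():
        score = cosine_similarity(test_embedding, embedding)
        if score > best_score:
            best_name, best_score = name, score
    if best_score < threshold:
        return None, best_score
    return best_name, best_score


# Toy embeddings; real ones would come from a speaker encoder.
enrolled = {
    "alice": [1.0, 0.0, 0.0],
    "bob": [0.0, 1.0, 0.0],
}
name, score = identify_speaker([0.9, 0.1, 0.0], enrolled)
print(name, round(score, 3))
```

The returned name is an absolute identity, because it comes from comparing against enrolled speakers rather than from clustering within a single utterance.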
