Inquiry about Embedding Extraction #1663

PhilipAmadasun · 2024-03-01T23:50:16Z

PhilipAmadasun
Mar 1, 2024

I have two questions about extracting embeddings

First:
from scipy.spatial.distance import cdist distance = cdist(embedding1, embedding2, metric="cosine")[0,0]
Gives an error, I was following the tutorial here
I have to do this instead:
distance = cdist(np.expand_dims(embedding1,axis=0), np.expand_dims(embedding2, axis=0), metric="cosine")[0,0]
To get results. And now I'm confused if I'm even supposed to do this.
Second:
Do you get the most accurate "Essence" of someone's voice via embedding by:
- Extracting embedding from longer audio clips of the persons talking?
- Gathering (let's say) 30 millisecond chunks of audio, extracting the embeddings from each chunk, then getting an average embedding from them?
Which one of these is the way to go?

fungus75 · 2024-03-15T10:46:21Z

fungus75
Mar 15, 2024

First:
I guess they use nowadays speechbrain for identification of speakers: Link

Therefore the similarity-Identification might be done somehow like that/that.

I tried out the following:

similarity = torch.nn.CosineSimilarity(dim=-1, eps=1e-6)
sim = similarity(torch.from_numpy(embedding1), torch.from_numpy(embedding2))

But for me the question is, how big this "sim" must be to be identical.
According their code 0.25 is a good value? For me it is way too low.

2 replies

PhilipAmadasun Mar 20, 2024
Author

@fungus75 The way I previously described still works when it's people whose voice embeddings I've already saved. I don't have a good threshold to pick put unknown voices. I'll try this way you've mentioned.

PhilipAmadasun Mar 21, 2024
Author

@fungus75 Yeah after some brief testing I'm not sure their way can work for me. Have you tried the way I mentioned previously and compared the two methods? I'm curious. I'm kinda working in briefs waves due to illness currently, would appreciate more input on this as I can't carry out this task currently. Maybe it it will also help you as well hopefully.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Inquiry about Embedding Extraction #1663

{{title}}

{{editor}}'s edit

{{editor}}'s edit

Replies: 1 comment 2 replies

{{title}}

{{editor}}'s edit

{{editor}}'s edit

{{title}}

{{title}}

Select a reply

Inquiry about Embedding Extraction #1663

PhilipAmadasun Mar 1, 2024

Replies: 1 comment · 2 replies

fungus75 Mar 15, 2024

PhilipAmadasun Mar 20, 2024 Author

PhilipAmadasun Mar 21, 2024 Author

PhilipAmadasun
Mar 1, 2024

Replies: 1 comment 2 replies

fungus75
Mar 15, 2024

PhilipAmadasun Mar 20, 2024
Author

PhilipAmadasun Mar 21, 2024
Author