When we are building a speaker verification model, we have to build a test set to evaluate the performance of our model. For example, you will use this test to compute EER or minDCF.
The Channel-wise squeeze-excitation module (SE module) has achieved a great success in both computer vision and speech processing fields. In this tutorial, we will introduce it for beginners.
In this tutorial, we will use some examples to show you how to use torch.nn.AdaptiveAvgPool1d() in pytorch, which is very useful when you are building a CNN networks.
In voiceprint and face recognition, one of the important things is to determine similarity threshold. In this tutorial, we will introduce you how to get this threshold value.