Meta-learning with Latent Space Clustering in Generative Adversarial Network for Speaker Diarization

The performance of most speaker diarization systems with x-vector embeddings is both vulnerable to noisy environments and lacks domain robustness. Earlier work on speaker diarization using generative adversarial network (GAN) with an encoder network (ClusterGAN) to project input x-vectors into a lat...

Bibliographic Details
Published in: IEEE/ACM Trans Audio Speech Lang Process
Main Authors: Pal, Monisankha; Kumar, Manoj; Peri, Raghuveer; Park, Tae Jin; Kim, So Hyun; Lord, Catherine; Bishop, Somer; Narayanan, Shrikanth
Format: Article
Language: English
Published: 2021
Online Access: https://ncbi.nlm.nih.gov/pmc/articles/PMC8118028/
https://ncbi.nlm.nih.gov/pubmed/33997106
http://dx.doi.org/10.1109/taslp.2021.3061885