Triplet Entropy Loss: Improving The Generalisation of Short Speech Language Identification Systems
We present several methods to improve the generalisation of language identification (LID) systems to new speakers and new domains. These methods involve spectral augmentation, where spectrograms are masked in frequency or time bands during training, and the use of CNN architectures pre-trained on the ImageNet dataset. The paper also introduces the novel Triplet Entropy Loss training method, in which a network is trained simultaneously with Cross Entropy and Triplet loss. All three methods were found to improve the generalisation of the models, though not significantly. Even though the models trained with Triplet Entropy Loss showed a better understanding of the languages and achieved higher accuracies, it appears that the models still memorise word patterns present in the spectrograms rather than learning the finer nuances of a language. The research shows that Triplet Entropy Loss has great potential and should be investigated further, not only in language identification tasks but in any classification task.
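The combined objective described above can be sketched in minimal form. This is an illustrative reconstruction, not the paper's implementation: the weighting factor `alpha` and the margin value are assumptions, and a real training loop would compute these terms over batches with a deep-learning framework rather than in pure Python.

```python
import math

def cross_entropy(logits, label):
    # Softmax cross-entropy for a single example (numerically stable log-sum-exp).
    m = max(logits)
    log_sum_exp = m + math.log(sum(math.exp(z - m) for z in logits))
    return log_sum_exp - logits[label]

def euclidean(a, b):
    # Euclidean distance between two embedding vectors.
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))

def triplet_loss(anchor, positive, negative, margin=0.2):
    # Hinge loss: push the anchor-negative distance to exceed the
    # anchor-positive distance by at least `margin`.
    return max(euclidean(anchor, positive) - euclidean(anchor, negative) + margin, 0.0)

def triplet_entropy_loss(logits, label, anchor, positive, negative,
                         margin=0.2, alpha=1.0):
    # Triplet Entropy Loss as described: the classification term (Cross Entropy)
    # and the metric-learning term (Triplet loss) optimised jointly.
    # `alpha` (a hypothetical weighting between the two terms) is an assumption.
    return cross_entropy(logits, label) + alpha * triplet_loss(
        anchor, positive, negative, margin)
```

The intuition is that the cross-entropy term supervises the language label directly, while the triplet term shapes the embedding space so that utterances of the same language cluster together regardless of speaker.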