基于端到端的文本无关声纹识别研究任务书-任务书网

全文总字数：2733字

1. 毕业设计（论文）主要内容：

我们正在快速进入“识别技术”时代。

继“刷脸”之后，声纹识别开始进入人们的生活。

目前声纹识别技术已经在部分银行投入使用，用户登录手机银行APP，进行转账、支付等交易。

剩余内容已隐藏，您需要先支付后才能查看该篇文章全部内容！

2. 毕业设计（论文）主要任务及要求

完成深度学习络结构搭建使python语完成声纹识别的代码编写，可以使现有机器学习类库（Pytorch\Tensorflow）进编写。
查询声纹识别的相关国内外文献，声纹识别技术包括前端的语音处理部分和后端的网络结构，前端主要是语音特征提取，提取用于辨别说话人的特征，后端核心为训练说话人识别模型。
整个研究包括信号处理、声学研究和深度学习算法、框架。
剩余内容已隐藏，您需要先支付后才能查看该篇文章全部内容！
3. 毕业设计（论文）完成任务的计划与安排
1. 2020年1月11日-2020年1月31日查看相关国内外文献。
2. 2020年2月1日-2020年2月28日针对性的学习Python编程语言；完成Tensorflow框架搭建；熟悉声纹识别流程；尝试实现已阅读文献中的方法；完成开题报告。
3. 2020年3月1日-2020年3月31日处理原始数据集，数据集分为训练集和测试集，提取声纹特征(特征工程)。
  剩余内容已隐藏，您需要先支付后才能查看该篇文章全部内容！
  4. 主要参考文献
  1. Chao Li et al. “Deep Speaker: an End-to-End Neural Speaker Embedding System”. In: arXiv e-prints, arXiv:1705.02304 (2017), arXiv:1705.02304. 2. Xingang Pan et al. “Two at Once: Enhancing Learning and Generalization Capacities via IBN-Net”. In:ECCV2018.3. Jie Hu et al. “Squeeze-and Excitation Network”. In:CVPR2018.David Snyder et al. “X-vectors: Robust dnn embeddings for speaker recognition”. In: Proc. of ICASSP. IEEE. 2018, pp. 5329–5333.4. G. Bhattacharya, J. Alam, and P. Kenny. “Deep Speaker Embeddings for Short-Duration Speaker Verification”. In: Proc. Interspeech. 2017, pp. 1517–1521.5. Koji Okabe, Takafumi Koshinaka, and Koichi Shinoda. “Attentive Statistics Pooling for Deep Speaker Embedding”. In: Proc. Interspeech. 2018, pp. 2252–2256.Chunlei Zhang and Kazuhito Koishida. “End-to-End Text-Independent Speaker Verification with Triplet Loss on Short Utterances”. In: Proc. Interspeech. 2017, pp. 1487–1491.Li Wan et al. “Generalized end-to-end loss for speaker verification”. In: Proc. of ICASSP. 2018, pp. 4879–4883.Weicheng Cai, Jinkun Chen, and Ming Li. “Exploring the Encoding Layer and Loss Function in End-to-End Speaker and Language Recognition System”. In: Proc. Speaker Odyssey. 2018, pp. 74–81.Zili Huang, Shuai Wang, and Kai Yu. “Angular Softmax for Short-Duration Text-independent Speaker Verification.” In: Proc. of Interspeech. 2018, pp. 3623–3627.Joon Son Chung, Arsha Nagrani, and Andrew Zisserman. “VoxCeleb2: Deep Speaker Recognition”. In: Proc. INTERSPEECH. 2018, pp. 1086–1090.
  剩余内容已隐藏，您需要先支付 10元 才能查看该篇文章全部内容！立即支付

免费ai写开题、写任务书：免费Ai开题 | 免费Ai任务书 | 免费降AI率 | 免费降重复率 | 论文免费排版

注册

找回密码

基于端到端的文本无关声纹识别研究任务书

1. 毕业设计（论文）主要内容：

2. 毕业设计（论文）主要任务及要求

3. 毕业设计（论文）完成任务的计划与安排

4. 主要参考文献

您可能感兴趣的文章