基于卷積神經網絡的連續語音識別

<th id="5nh9l"></th><strike id="5nh9l"></strike>

<th id="5nh9l"><noframes id="5nh9l"><th id="5nh9l"></th><strike id="5nh9l"></strike>

<progress id="5nh9l"><noframes id="5nh9l"><th id="5nh9l"><noframes id="5nh9l">

<th id="5nh9l"></th> <strike id="5nh9l"><noframes id="5nh9l"><span id="5nh9l"></span>

<progress id="5nh9l"><noframes id="5nh9l"><span id="5nh9l"><noframes id="5nh9l"><span id="5nh9l"></span>

<strike id="5nh9l"><noframes id="5nh9l"><strike id="5nh9l"></strike>

<span id="5nh9l"><noframes id="5nh9l">

<span id="5nh9l"><noframes id="5nh9l">

<span id="5nh9l"></span>

<th id="5nh9l"><noframes id="5nh9l"><th id="5nh9l"></th>

<progress id="5nh9l"><noframes id="5nh9l">

留言板

尊敬的讀者、作者、審稿人, 關于本刊的投稿、審稿、編輯和出版的任何問題, 您可以本頁添加留言。我們將盡快給您答復。謝謝您的支持!

姓名
郵箱
手機號碼
標題
留言內容
驗證碼

基于卷積神經網絡的連續語音識別

張晴晴, 劉勇, 潘接林, 顏永紅

文章導航 > 工程科學學報 > 2015 > 37(9): 1212-1217

張晴晴, 劉勇, 潘接林, 顏永紅. 基于卷積神經網絡的連續語音識別[J]. 工程科學學報, 2015, 37(9): 1212-1217. doi: 10.13374/j.issn2095-9389.2015.09.015

引用本文:

張晴晴, 劉勇, 潘接林, 顏永紅. 基于卷積神經網絡的連續語音識別[J]. 工程科學學報, 2015, 37(9): 1212-1217. doi: 10.13374/j.issn2095-9389.2015.09.015

ZHANG Qing-qing, LIU Yong, PAN Jie-lin, YAN Yong-hong. Continuous speech recognition by convolutional neural networks[J]. Chinese Journal of Engineering, 2015, 37(9): 1212-1217. doi: 10.13374/j.issn2095-9389.2015.09.015

Citation:

ZHANG Qing-qing, LIU Yong, PAN Jie-lin, YAN Yong-hong. Continuous speech recognition by convolutional neural networks[J]. Chinese Journal of Engineering, 2015, 37(9): 1212-1217. doi: 10.13374/j.issn2095-9389.2015.09.015

張晴晴, 劉勇, 潘接林, 顏永紅. 基于卷積神經網絡的連續語音識別[J]. 工程科學學報, 2015, 37(9): 1212-1217. doi: 10.13374/j.issn2095-9389.2015.09.015

引用本文:

張晴晴, 劉勇, 潘接林, 顏永紅. 基于卷積神經網絡的連續語音識別[J]. 工程科學學報, 2015, 37(9): 1212-1217. doi: 10.13374/j.issn2095-9389.2015.09.015

ZHANG Qing-qing, LIU Yong, PAN Jie-lin, YAN Yong-hong. Continuous speech recognition by convolutional neural networks[J]. Chinese Journal of Engineering, 2015, 37(9): 1212-1217. doi: 10.13374/j.issn2095-9389.2015.09.015

Citation:

ZHANG Qing-qing, LIU Yong, PAN Jie-lin, YAN Yong-hong. Continuous speech recognition by convolutional neural networks[J]. Chinese Journal of Engineering, 2015, 37(9): 1212-1217. doi: 10.13374/j.issn2095-9389.2015.09.015

基于卷積神經網絡的連續語音識別

doi: 10.13374/j.issn2095-9389.2015.09.015

中國科學院語言聲學與內容理解重點實驗室, 北京 100190

基金項目:

國家高技術研究發展計劃資助項目(2012AA012503)

國家自然科學基金資助項目(11161140319，91120001，61271426)

中國科學院戰略性先導科技專項(XDA06030100，XDA06030500)

中國科學院重點部署項目(KGZD-EW-103-2)

通訊作者:
張晴晴,E-mail:zhangqingqing@hccl.ioa.ac.cn

中圖分類號: TN912.34
計量
- 文章訪問數: 276
- HTML全文瀏覽量: 40
- PDF下載量: 24
- 被引次數: 0
出版歷程
- 收稿日期: 2014-05-08
- 網絡出版日期: 2021-07-10

Continuous speech recognition by convolutional neural networks

Key Laboratory of Speech Acoustics and Content Understanding,Chinese Academy of Sciences, Beijing 100190, China

摘要: 在語音識別中,卷積神經網絡(convolutional neural networks,CNNs)相比于目前廣泛使用的深層神經網絡(deep neural network,DNNs),能在保證性能的同時,大大壓縮模型的尺寸.本文深入分析了卷積神經網絡中卷積層和聚合層的不同結構對識別性能的影響情況,并與目前廣泛使用的深層神經網絡模型進行了對比.在標準語音識別庫TIMIT以及大詞表非特定人電話自然口語對話數據庫上的實驗結果證明,相比傳統深層神經網絡模型,卷積神經網絡明顯降低模型規模的同時,識別性能更好,且泛化能力更強.
- 卷積神經網絡 /
- 連續語音識別 /
- 權值共享 /
- 聚合 /
- 泛化性
Abstract: Convolutional neural networks (CNNs), which show success in achieving translation invariance for many image processing tasks, were investigated for continuous speech recognition. Compared to deep neural networks (DNNs), which are proven to be successful in many speech recognition tasks nowadays, CNNs can reduce the neural network model sizes significantly, and at the same time achieve even a better recognition accuracy. Experiments on standard speech corpus TIMIT and conversational speech corpus show that CNNs outperform DNNs in terms of the accuracy and the generalization ability.
- convolutional neural networks /
- continuous speech recognition /
- weight sharing /
- pooling /
- generalization

參考文獻(0)

資源附件(0)

WeChat

點擊查看大圖

計量

文章訪問數: 276
HTML全文瀏覽量: 40
PDF下載量: 24
被引次數: 0

/

下載: 全尺寸圖片幻燈片

分享

用微信掃碼二維碼

分享至好友和朋友圈

返回

<th id="5nh9l"></th><strike id="5nh9l"></strike>

<th id="5nh9l"><noframes id="5nh9l"><th id="5nh9l"></th><strike id="5nh9l"></strike>

<progress id="5nh9l"><noframes id="5nh9l"><th id="5nh9l"><noframes id="5nh9l">

<th id="5nh9l"></th> <strike id="5nh9l"><noframes id="5nh9l"><span id="5nh9l"></span>

<progress id="5nh9l"><noframes id="5nh9l"><span id="5nh9l"><noframes id="5nh9l"><span id="5nh9l"></span>

<strike id="5nh9l"><noframes id="5nh9l"><strike id="5nh9l"></strike>

<span id="5nh9l"><noframes id="5nh9l">

<span id="5nh9l"><noframes id="5nh9l">

<span id="5nh9l"></span>

<th id="5nh9l"><noframes id="5nh9l"><th id="5nh9l"></th>

<progress id="5nh9l"><noframes id="5nh9l">