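Corpus-Based Recognition of Regional Speech in "Putonghua"
Cite this article: WANG Hong. Corpus-based recognition of regional speech in "Putonghua" [J]. Chinese Journal of Forensic Sciences, 2014, 0(1): 75-79
Author: WANG Hong
Affiliation: [1] Key Laboratory of Questioned Document Examination of the Ministry of Public Security, Department of Questioned Document Examination, Police University of China (中国刑事警察学院), Shenyang, Liaoning 110035; [2] Natural Language Processing Laboratory, Northeastern University, Shenyang, Liaoning 110004
Funding: Ministry of Public Security basic special project for strengthening policing through science and technology (2011GABJC026); Open Fund of the Key Laboratory of the Ministry of Public Security (2011KFKT09)
Abstract: Objective: To investigate regional differences within spoken Chinese "Putonghua" (standard Mandarin), identify the distinguishing speech features, and find ways to extract more regional features from limited speech material pronounced in Putonghua, thereby raising the level of regional speech recognition and helping case analysis narrow its direction more effectively. Methods: Statistical analysis and inductive summarization were carried out on the basis of a relatively large-scale corpus, the "Chinese 'Putonghua' Corpus for Case-Oriented Speech Recognition" (《面向案件言语识别应用的汉语"普通话"语料库》), and its query and retrieval system. Results: When speaking Putonghua, people show, to varying degrees, the inherent features of their native dialect in phonetic aspects such as initials, finals, tones, stress, rhotacization (erhua), and neutral tone, as well as in vocabulary and grammar. Recognition can rely on approaches such as "centering on tone-value features, combined with initial and final features" and "making a comprehensive judgment from the sum of all feature types". Conclusion: Identifying a speaker's regional origin from "Putonghua" speech is a practical and feasible technique.

Keywords: dialect-accented Putonghua; corpus; regional speech recognition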

Corpus-based Regional Mandarin Recognition
WANG Hong. Corpus-based Regional Mandarin Recognition[J]. Chinese Journal of Forensic Sciences, 2014, 0(1): 75-79
Authors:WANG Hong
Affiliation: WANG Hong (1. Key Laboratory of Questioned Document Examination of the Ministry of Public Security, Department of Questioned Document Examination, Police University of China, Shenyang 110035, China; 2. Natural Language Processing Laboratory, Northeastern University, Shenyang 110004, China)
Abstract: Objective To study the speech characteristics of different regional varieties of Mandarin and to establish a method for extracting regional speech characteristics from speech material pronounced in Mandarin. Method A large-scale Chinese Mandarin corpus for speech recognition was established, together with a query system. The speech data in the corpus were analyzed statistically and summarized. Results Speakers revealed their native dialects in the phonetic characteristics of initials, finals, tones, stress, retroflex suffixation, and neutral tones, as well as in vocabulary and grammar. Regional speech can therefore be recognized by a comprehensive analysis of the various characteristics, with emphasis on tone values in combination with the features of initials and finals. Conclusion It is practicable to identify a speaker's region by analyzing his or her Mandarin speech.
Keywords:dialect Mandarin  corpus  regional speech recognition
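
The recognition strategy summarized in the abstract (tone values as the central cue, combined with initial and final features, and a judgment drawn from the sum of all features) could be sketched as a simple weighted match score. This is only an illustrative sketch, not the paper's implementation: the category weights, the feature labels, and the `region_A`/`region_B` profiles below are hypothetical placeholders and are not data from the corpus.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Set, Tuple

# Hypothetical category weights: tone features count most, mirroring the
# "tone values as the center" idea. The numbers are illustrative assumptions.
WEIGHTS: Dict[str, float] = {"tone": 3.0, "initial": 1.5, "final": 1.5, "lexical": 1.0}


@dataclass
class RegionProfile:
    name: str
    # Feature category -> set of feature labels assumed typical of the region.
    features: Dict[str, Set[str]] = field(default_factory=dict)


def score(observed: Dict[str, Set[str]], profile: RegionProfile) -> float:
    """Sum the weighted number of observed features that match the profile."""
    total = 0.0
    for category, labels in observed.items():
        matched = labels & profile.features.get(category, set())
        total += WEIGHTS.get(category, 1.0) * len(matched)
    return total


def rank_regions(
    observed: Dict[str, Set[str]], profiles: List[RegionProfile]
) -> List[Tuple[str, float]]:
    """Return (region name, score) pairs, best match first."""
    scored = [(p.name, score(observed, p)) for p in profiles]
    return sorted(scored, key=lambda pair: pair[1], reverse=True)


# Placeholder profiles and observations (not real dialect data).
profiles = [
    RegionProfile("region_A", {"tone": {"T1=44", "T3=213"}, "initial": {"zh->z"}}),
    RegionProfile("region_B", {"tone": {"T1=55"}, "final": {"eng->en"}}),
]
observed = {"tone": {"T1=44"}, "initial": {"zh->z"}, "final": {"eng->en"}}
ranking = rank_regions(observed, profiles)
```

Summing over every category, rather than deciding on a single cue, reflects the "comprehensive judgment" idea: a weak match in one category can be outweighed by agreement in the others.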
This article is indexed in CNKI, VIP (维普), and other databases.