离线环境的中文语音识别 paddlepaddle 与 Speech_Recognition(Sphinx)实践

文章大纲

; 简介

本文主要针对中文语音识别问题,选用常用的模型进行 离线 demo 搭建及实践说明。

语音识别基础

语音识别技术是允许机器通过识别和理解的过程将语音信号转换成相应的文本或命令的技术。

[En]

Speech recognition technology is the technology that allows the machine to transform the speech signal into the corresponding text or command through the process of recognition and understanding.

  • 语音识别本质上是一个模式识别的过程。将未知语音的模式与已知语音的参考模式逐一进行比较,并将最佳匹配的参考模式作为识别结果。
    [En]

    speech recognition is essentially a process of pattern recognition. The pattern of unknown speech is compared with the reference pattern of known speech one by one, and the best matching reference pattern is taken as the recognition result.*

  • 语音识别的目的是让机器给人听觉特征,听懂人说的话,并相应地采取行动。目前,从语音生成机制的角度来看,大多数语音识别技术都是基于统计模式的。
    [En]

    the purpose of speech recognition is to enable the machine to give people the auditory characteristics, understand what people say, and act accordingly. At present, most speech recognition technologies are based on statistical mode, from the point of view of speech generation mechanism.*

Original: https://blog.csdn.net/wangyaninglm/article/details/118416865
Author: shiter
Title: 离线环境的中文语音识别 paddlepaddle 与 Speech_Recognition(Sphinx)实践

原创文章受到原创版权保护。转载请注明出处:https://www.johngo689.com/512785/

转载文章受原作者版权保护。转载请注明原作者出处!

(0)

大家都在看

亲爱的 Coder【最近整理,可免费获取】👉 最新必读书单  | 👏 面试题下载  | 🌎 免费的AI知识星球