Note: this is a self-study log, based on a Bilibili video
and on this blog post: https://blog.csdn.net/Datapad/article/details/82970253
Installing SpeechRecognition
C:\Users\Administrator>pip3 install SpeechRecognition
......
Installing collected packages: SpeechRecognition
Successfully installed SpeechRecognition-3.8.1
A successful installation ends with the "Successfully installed" line shown above.
Installing pocketsphinx
C:\Users\Administrator>pip install pocketsphinx
......
Installing collected packages: pocketsphinx
Successfully installed pocketsphinx-0.1.15
A successful installation ends with the "Successfully installed" line shown above.
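As a quick sanity check after both installs, the sketch below looks up whether the two packages can be imported without actually importing them. The helper name `missing_modules` is made up for this illustration; the import names `speech_recognition` and `pocketsphinx` are the ones the packages expose.

```python
import importlib.util


def missing_modules(names):
    """Return the module names from `names` that cannot be found for import."""
    return [name for name in names if importlib.util.find_spec(name) is None]


if __name__ == "__main__":
    # Import names differ from the pip names: SpeechRecognition -> speech_recognition
    missing = missing_modules(["speech_recognition", "pocketsphinx"])
    if missing:
        print("Not installed:", ", ".join(missing))
    else:
        print("Both packages are importable.")
```

If a package shows up as missing, the usual cause is that `pip` and `pip3` point at different Python installations, so it may be worth running the install with the same interpreter that runs the script.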
Code example
Read a WAV speech file and recognize it either in full or in part.
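import speech_recognition as sr

r = sr.Recognizer()
# Raw string, so the backslashes in the Windows path are not read as escapes
harvard = sr.AudioFile(r'E:\speek\harvard.wav')

# Recognize the whole file
with harvard as source:
    all_audio = r.record(source)
print(type(all_audio))
all_text = r.recognize_sphinx(all_audio)
print(all_text)

# Recognize only a part: skip the first 4 seconds, then read 3 seconds
with harvard as source:
    audio = r.record(source, offset=4, duration=3)
text = r.recognize_sphinx(audio)
print(text)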
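The partial read above uses a hand-picked `offset` and `duration`. As a rough sketch of how such pairs could be derived for a whole file, the snippet below reads the length of a WAV file with the standard-library `wave` module and splits it into fixed-size windows. The helper names `wav_duration_seconds` and `segment_windows` are hypothetical and not part of SpeechRecognition, and fixed-size windows are only an assumption for illustration: they can cut through the middle of a word.

```python
import wave


def wav_duration_seconds(path):
    """Return the length of a WAV file in seconds (frames / frame rate)."""
    with wave.open(path, "rb") as wav:
        return wav.getnframes() / wav.getframerate()


def segment_windows(total_seconds, window_seconds):
    """Split [0, total_seconds) into consecutive (offset, duration) pairs."""
    if window_seconds <= 0:
        raise ValueError("window_seconds must be positive")
    windows = []
    offset = 0.0
    while offset < total_seconds:
        # The last window is shortened so it does not run past the end
        duration = min(window_seconds, total_seconds - offset)
        windows.append((offset, duration))
        offset += window_seconds
    return windows
```

Each pair could then be passed to `r.record(source, offset=offset, duration=duration)` inside its own `with harvard as source:` block, as in the example above. `wave.open` raises `wave.Error` when the file is not a WAV file it can parse, which gives an early hint that the input format is wrong.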
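Notes
1. If you know the structure of the speech in the audio file in advance, the offset and duration keyword arguments are very useful for segmenting it. Used carelessly, however, they lead to poor transcriptions, because a segment may start or end in the middle of a word.
2. The audio file must be PCM WAV, AIFF/AIFF-C, or native FLAC; any other format raises an error.
3. Audio file download: https://pan.baidu.com/s/10oClt_NWgjOsDmIPuqQGzg (extraction code: 0wv4)

Original: https://blog.csdn.net/BuLingLings/article/details/109510391
Author: BuLingLings
Title: Python语音识别基础实践-SpeechRecognition (Python Speech Recognition Basics in Practice: SpeechRecognition)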