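Voiceprint Technology (2): Fundamentals of Audio Signal Processing [analog signal (continuous) –sampling–> digital signal (discrete) –quantization–> amplitudes reduced to integers –encoding–> binary sequence] [WAV audio format] [SoX] [framing - windowing -]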

2.1 To Understand Voiceprints, First Learn Audio

As a discipline, voiceprint technology is a branch of speech signal processing, which in turn falls under audio signal processing.

Speech signals and audio signals differ as follows:

  • A speech signal is specifically the socially meaningful sound that humans produce when speaking.

  • An audio signal generally means any sound that humans can hear. Sounds made by musical instruments, animals, car engines, or people snoring, sneezing, and coughing are all audio signals in the broad sense, but they are not speech signals, so they usually fall outside the scope of voiceprint technology.

Many basic concepts from audio signal processing are essential for learning voiceprint technology.

No voiceprint system, however advanced its model or sophisticated its algorithm, can do without processing sound. Only when the right audio signal is fed in and meaningful feature representations are extracted from it can the downstream model perform at its best.

This chapter therefore works through these sound-related concepts concretely and systematically. It covers a wide range of subfields, including human auditory perception, audio interfaces, coding techniques, and discrete signal processing. At first glance these subfields seem to have little to do with one another. Once we actually start a research or engineering project in the voiceprint field, however, we find that knowledge from every one of them is inevitably needed. In an enterprise or research institution

Original: https://blog.csdn.net/u013250861/article/details/124523119
Author: u013250861
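Title: Voiceprint Technology (2): Fundamentals of Audio Signal Processing [analog signal (continuous) –sampling–> digital signal (discrete) –quantization–> amplitudes reduced to integers –encoding–> binary sequence] [WAV audio format] [SoX] [framing - windowing -]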
