语音标注的具体应用场景

2023年5月25日上午7:04 • 人工智能 • 阅读 83

语音标注是我们的注释员不断重写语言信息并让手动系统学习的过程。目前，语音识别技术在我们日常生活的很多方面都很流行，比如我们的微信翻译、语音助手、智能音频、智能客服等。随着人工智能的逐步发展，人机语音交互场景将向更多方向延伸，这对识别精度、场景优化、语音识别技术等提出了更高的要求。

[En]

Voice tagging is a process in which our annotators constantly rewrite the language information and let the manual system learn. At present, speech recognition technology is popular in many aspects of our daily life, such as our Wechat translation, voice assistant, intelligent audio, intelligent customer service and so on. With the gradual development of artificial intelligence, the human-computer voice interaction scene will be extended to more directions, which puts forward higher requirements for recognition accuracy, scene optimization, speech recognition technology and so on.

语音标注的应用场景

1、语音输入

语音输入是语音识别中常见的一种，它可以识别我们所说的话，并将语音转换为文本输入，这大大提高了效率。语音输入可以摆脱生僻字和拼音障碍，使用语音即时输入。可以有效识别带有轻微口音的普通话、广东话、四川话、英语和法语，还可以根据句子的意思自动更正，并可以自动添加标点符号，使输入更快，沟通更顺畅。

[En]

Speech input is common in speech recognition, which can recognize what we say and convert speech into text input, which greatly improves the efficiency. Voice input can get rid of rare words and pinyin barriers and use voice instant input. Mandarin, Cantonese, Sichuan dialect, English and French with slight accent can be effectively identified, and can also be automatically corrected according to the meaning of the sentence and punctuation can be added automatically to make the input faster and the communication more smooth.

文本的实时语音识别可以应用于语音聊天、语音输入、语音搜索、语音点餐、语音指令、语音问答等场景，在日常生活中，例如客服电话的语音转录、会议转录、通讯产品的语音输入和转录、语音病历、电影字幕、电视机等智能家居命令的自动生成，这些都使用了这项技术。在医疗领域，声音也经常被用来生成和编辑专业的医疗报告。

[En]

Real-time speech recognition of text can be applied to voice chat, voice input, voice search, voice orders, voice instructions, voice question and answer and other scenarios, in daily life, for example, voice transcription of customer service calls, conference transcription, voice input and transcription of communication products, voice medical records, automatic generation of movie subtitles, TV sets and other smart home commands, these all use this technology. In the medical field, sound is also often used to generate and edit professional medical reports.

2、语音合成

语音合成能将任意文字信息实时转化为标准流畅的语音朗读出来，相当于给机器装上了人工的嘴巴。例如app中的实时播报、合成特定人的声音、验证码内容语音合成、客服、导航软件，大厅，售货机等各场景的语音提示、语音早教机的语言发音学习、便携等场景。

3、声纹识别

声纹识别是一种生物识别技术，也称为说话人识别，包括说话人识别和说话人验证。声纹识别是将声音信号转换为电信号，然后由计算机进行识别。例如，使用声纹密码进行身份认证、登录、授权、打卡、公安身份存储、语音唤醒等。

[En]

Voiceprint recognition is a kind of biometric technology, also known as speaker recognition, including speaker identification and speaker verification. Voiceprint recognition is to convert sound signals into electrical signals and then recognize them by computer. For example, the use of voiceprint password for identity authentication, login, authorization, clocking in, public security identity storage, voice awakening and so on.

语音标注的应用前景

语音识别正逐渐成为信息技术中人机界面的关键技术。语音识别技术和语音合成技术的结合，使人们摆脱了键盘，通过语音命令进行操作。语音技术的应用已成为日常生活中竞争激烈的高新技术产业。

[En]

Speech recognition is gradually becoming the key technology of man-machine interface in information technology. The combination of speech recognition technology and speech synthesis technology enables people to get rid of the keyboard and operate through voice commands. The application of voice technology has become a competitive new high-tech industry in daily life.

今天，随着语音识别技术的发展，特别是中小词汇量独立语音识别系统的识别准确率已经超过98%，特定人语音识别系统的识别准确率更高。这些技术已经能够满足常见应用的要求。

[En]

Today, with the development of speech recognition technology, especially the recognition accuracy of small and medium vocabulary independent speech recognition system has been more than 98%, the recognition accuracy of person-specific speech recognition system is even higher. These technologies have been able to meet the requirements of common applications.

如今，很多用户都可以享受到语音识别技术带来的便利，比如智能手机的语音操作等。然而，这与真正的人机交流的实现还有一段距离。目前用户的计算机语音识别程度不高，在人机交互方面还存在一些问题，必须有所突破，这也是未来语音识别技术的发展方向。

[En]

Nowadays, many users can enjoy the convenience brought by speech recognition technology, such as the voice operation of smart phones and so on. However, there is still some distance between this and the realization of real man-machine communication. At present, the degree of computer speech recognition of users is not high, there are still some problems in human-computer interaction, we must make a breakthrough, which is also the development direction of speech recognition technology in the future.

景联文科技为 语音识别 技术提供一站式 数据 解决方案

作为人工智能的”养料”，机器想要实现智能化就必须有海量的有效数据来做支撑，而这些数据就需要我们的标注员进行数据标注分析与处理才能得出来。

景联文科技作为一家专业的数据采集标注公司，采集了《50800段车内录音采集数据集》、《60000段中文语音数据集》、《100个id12000段中国人读英语唤醒词数据集》等可用于研究语音识别技术的算法的数据集，可有效的提升企业的测试效率，减少研发时间。还可以针对特定人群、特定场景、特定语种提供个性化的数据定制服务。

为提高数据标注员的标注效率，景联文科技还自建数据标注平台与成熟的标注、审核、质检机制，支持语音工程（语音切割、ASR语音转写、语音情绪判定、声纹识别标注等）、计算机视觉（拉框标注、语义分割、3D点云标注、关键点标注、线标注、2D/3D融合标注、目标跟踪、图片分类等）、自然语言处理（OCR转写、文本信息抽取、NLU语句泛化）多类型数据标注。

此外，景联文科技自研专业的数据采集标注平台，已实现标审分离，完善平台风险管控机制，且设置了严格的数据隐私安全保障措施，全面保障数据安全。平台已实现Al数据的全流程线上标注和质量管理，全面支持音频、图像等数据标注需求、支持多类型标注模板、标注结果支持多种格式在线导出等。

随着语音识别技术的不断进步，人与机器的交流会越来越顺畅，人与机器的关系会越来越紧密，人们的生活也会越来越便利。未来，京联科技将继续为语音标注提供更精准的数据。

[En]

With the continuous progress of speech recognition technology, the communication between people and machines will be more and more smooth, the relationship between people and machines will be closer and closer, and people’s life will become more and more convenient. In the future, Jinglian Technology will continue to provide more accurate data for voice tagging.

语音标注的具体应用场景

Original: https://blog.csdn.net/weixin_55551028/article/details/122135144
Author: 景联文科技
Title: 语音标注的具体应用场景

原创文章受到原创版权保护。转载请注明出处：https://www.johngo689.com/512814/

转载文章受原作者版权保护。转载请注明原作者出处！

人工智能

【自取】最近整理的，有需要可以领取学习：

Linux核心资料大放送~

全栈面试题汇总（持续更新&可下载）

一个提高学习100%效率的工具！

【超详细】深度学习面试题目！

LeetCode Python刷题答案下载！

LeetCode Java版刷题答案下载！

LeetCode C++ 版本，抓紧保存！

LeetCode GO语言刷题答案下载！

基于 PyTorch 的 cifar-10 图像分类

文章目录前言一、cifar-10 数据集介绍二、环境配置三、实验代码 * 1.简单网络的代码 2.VGG加深网络的代码四、运行结果五、遇到的问题总结前言本文的主要…

人工智能 2023年6月13日
0057
盘点10个冷门Python库，原来Python还能实现这些功能？

目录 👉 1 PrettyErrors 👉 2 Rich 👉 3 Dear PyGui 👉 4 HummingBird 👉 5 HiPlot 👉 6 Norfair 👉 7 Geo…

人工智能 2023年7月25日
0084
手把手教你使用LabVIEW人工智能视觉工具包快速实现图像读取与采集

文章目录前言一、工具包位置二、图像采集与色彩空间转换 * 1.文件读写 2.实现图片读取 3.使用算子cvtColor实现颜色空间转换三、从摄像头采集图像 * 1.Came…

人工智能 2023年6月15日
0085
【一起啃西瓜书】机器学习-期末复习（不挂科）

【一起啃西瓜书】机器学习-期末复习（不挂科）前言试卷题型第一章：绪论 * 一般过程任务数据训练集验证集监督学习无监督学习半监督学习第二章：模型评估与选择 * …

人工智能 2023年6月15日
0071
阿里P8架构师Spring源码阅读心得，都记录在这份PDF文档里面了

为什么学习读源码我们每天都和代码打交道。经过数年的基础教育和职业培训，大部分程序员都会「写」代码，或者至少会抄代码和改代码。但是，会读代码的并不在多数，会读代码又真正读懂一些…

人工智能 2023年6月28日
0072
Uniapp——使用安卓原生插件

Uniapp——使用安卓原生插件 1. 开发环境 2. 解压SDK压缩包 3. 导入UniPlugin-Hello-AS项目、并切换为project显示 4. 可删除提供的demo…

人工智能 2023年6月27日
00100
mmdetection训练自己的COCO数据集及常见问题

训练自己的VOC数据集及常见问题见下文： mmdetection训练自己的VOC数据集及常见问题_不瘦8斤的妥球球饼的博客-CSDN博客_mmdetection训练voc 目录一…

人工智能 2023年6月16日
0085
RobotStudio的基本布局方法，模型加载，工件坐标系的创建，手动操作机器人示教，以及模拟仿真机器人运动轨迹。

2、在【基本】功能选项卡中，打开【ABB模型库】，选择【IRB2600】。3、设定好数值，然后单击【确认】。4、在【基本】功能选项里，打开【导入模型库】—【设备】，选择【myToo…

人工智能 2023年6月1日
00322
清风数学建模学习笔记——系统(层次)聚类原理详解及案例分析

系统聚类系统聚类的合并算法通过计算两类数据点间的距离，对最为接近的两类数据点进行组合，并反复迭代这一过程，直到将所有数据点合成一类，并生成聚类谱系图。此外，系统聚类可以解决簇数 …

人工智能 2023年5月31日
0065
学习记录：正负样本分配策略之YoloX | SimOTA-简单易懂版

学习记录：正负样本分配策略之YoloX | SimOTA-简单易懂版文献阅读和分享目标检测领域趋势正负样本分配策略——SimOTA * 网络训练（恋爱历程） SimOTA具体…

人工智能 2023年6月17日
0048
Normalization）是什么

详细解决问题：关于Normalization是什么这个问题 1. 介绍在数据处理和机器学习任务中，Normalization（归一化）是一种常用的数据预处理技术。它通过对原始数据…

人工智能 2024年1月2日
0019
机器学习：使用matlab实现曲线线性回归拟合并绘制学习曲线

文章目录数据集划分数据可视化代价-梯度函数求解线性拟合绘制学习曲线多项式拟合再次求解选择合适的正则参数数据集划分先将数据集划分为训练集、验证集和测试集，标记为…

人工智能 2023年6月15日
00121
统计学三种相关系数【pearson、spearman、kendall】

1. pearson pearson系数的取值范围为[ − 1.0 , 1.0 ] [-1.0,1.0][−1 .0 ,1 .0 ]之间，接近0表示无相关性，接近1或-1表示强相关…

人工智能 2023年7月15日
0082
多模态预训练中的Prompt（ALPRO，Frozen）

以往的文章更新过，背景就不再赘述： Cross-modal Pretraining in BERT（多模态预训练） CLIP，DALL-E 预训练新范式（Prompt-tuning…

人工智能 2023年5月28日
0070
微信小程序组件化

组件定义 1、创建组件构造器使用的时Component 配置文件中设置component:true 2、引入组件首先声明这个组件，在配置文件声明 "usingComp…

人工智能 2023年7月30日
0045
win10+python3.6+tensorflow-cpu+Pycharm环境下的tensorflow配置方法

参考博客win10+python3.6+tensorflow-cpu+keras+Pycharm环境下的tensorflow配置方法_未来可期-CSDN博客成功了！ [En] S…

人工智能 2023年5月25日
0079

2024 年 4 月
一	二	三	四	五	六	日
1	2	3	4	5	6	7
8	9	10	11	12	13	14
15	16	17	18	19	20	21
22	23	24	25	26	27	28
29	30

语音标注的具体应用场景

大家都在看