面向自然语言处理的对抗攻防与鲁棒性分析综述 Survey of Adversarial Attack, Defense and Robustness Analysis for Natural Lang

6.面向自然语言处理的对抗攻防与鲁棒性分析综述
Survey of Adversarial Attack, Defense and Robustness Analysis for Natural Language Processing

摘要:随着人工智能技术的飞速发展,深度神经网络在计算机视觉、信号分析和自然语言处理等领域中都得到了广泛应用.自然语言处理通过语法分析、语义分析、篇章理解等功能帮助机器处理、理解及运用人类语言.但是,已有研究表明深度神经网络容易受到对抗文本的攻击,通过产生不可察觉的扰动添加到正常文本中,就能使自然语言处理模型预测错误.为了提高模型的鲁棒安全性,近年来也出现了防御相关的研究工作.针对已有的研究,全面地介绍自然语言处理攻防领域的相关工作,具体而言,首先介绍了自然语言处理的主要任务与相关方法;其次,根据攻击和防御机制对自然语言处理的攻击方法和防御方法进行分类介绍;然后,进一步分析自然语言处理模型的可验证鲁棒性和评估基准数据集,并提供自然语言处理应用平台和工具包的详细介绍;最后总结面向自然语言处理的攻防安全领域在未来的研究发展方向.

[En]

Abstract: with the rapid development of artificial intelligence technology, deep neural network has been widely used in the fields of computer vision, signal analysis and natural language processing. Natural language processing helps machines process, understand and use human language through grammatical analysis, semantic analysis, text understanding and other functions. However, previous studies have shown that deep neural networks are vulnerable to attacks against texts, and the prediction error of natural language processing models can be made by generating imperceptible disturbances to normal texts. In order to improve the robust security of the model, defense-related research work has appeared in recent years. In view of the existing research, this paper comprehensively introduces the relevant work in the field of attack and defense of natural language processing, specifically, firstly, it introduces the main tasks and related methods of natural language processing; secondly, the attack and defense methods of natural language processing are classified according to the attack and defense mechanism. Then, the verifiable robustness of the natural language processing model and the evaluation benchmark data set are further analyzed, and the detailed introduction of the natural language processing application platform and toolkit is provided. Finally, the future research and development direction of attack and defense security for natural language processing is summarized.

关键词: 深度神经网络, 自然语言处理, 对抗攻击, 防御, 鲁棒性

Abstract:With the rapid development of artificial intelligence, deep neural networks have been widely applied in the fields of computer vision, signal analysis, and natural language processing. It helps machines process understand and use human language through functions such as syntax analysis, semantic analysis, and text comprehension. However, existing studies have shown that deep models are vulnerable to the attacks from adversarial texts. Adding imperceptible adversarial perturbations to normal texts, natural language processing models can make wrong predictions. To improve the robustness of the natural language processing model, defense-related researches have also developed in recent years. Based on the existing researches, we comprehensively detail related works in the field of adversarial attacks, defenses, and robustness analysis in natural language processing tasks. Specifically, we first introduce the research tasks and related natural language processing models. Then, attack and defense approaches are stated separately. The certified robustness analysis and benchmark datasets of natural language processing models are further investigated and a detailed introduction of natural language processing application platforms and toolkits is provided. Finally, we summarize the development direction of research on attacks and defenses in the future.

Key words: deep neural network, natural language processing, adversarial attack, defense, robustness

syntax:n.语法
semantic:n.语意
comprehension:n.理解
vulnerable:adj.易受攻击的
adversarial:adj.对抗的
imperceptible:adj.难以察觉的
perturbations:n.扰动
prediction:n.预测
robustness:n.鲁棒性=稳健性
defense-related:n.防卫性事务
comprehensively:adv.完全地
defenses:n.防御
Specifically:adv.具体来说
state:v.陈述
separately:adv.单独地
certify:v.认证
benchmark:n.基准
dataset:n.数据集
investigate:v.调查
toolkit:工具集
direction:n.方向

Original: https://blog.csdn.net/daisyxyr/article/details/124370699
Author: daisyxyr
Title: 面向自然语言处理的对抗攻防与鲁棒性分析综述 Survey of Adversarial Attack, Defense and Robustness Analysis for Natural Lang

原创文章受到原创版权保护。转载请注明出处:https://www.johngo689.com/76568/

转载文章受原作者版权保护。转载请注明原作者出处!

(0)

大家都在看

最近整理资源【免费获取】:   👉 程序员最新必读书单  | 👏 互联网各方向面试题下载 | ✌️计算机核心资源汇总