What Are the Applications of Semi-Supervised Learning Algorithms in Anomaly Detection?

January 1, 2024, 5:39 AM • Artificial Intelligence • 43 views

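Problem Introduction

Semi-supervised learning algorithms can play an important role in anomaly detection. Anomaly detection aims to identify data points that do not conform to the pattern of normal data, which matters in many real-world scenarios such as credit card fraud detection and network intrusion detection. Supervised anomaly detection methods typically rely on a large number of labeled anomalous samples to train a model, but collecting many accurately labeled anomalies is difficult and expensive. Semi-supervised learning algorithms improve performance on anomaly detection problems by combining a small number of labeled anomalous samples with a large number of unlabeled samples, most of which are assumed to be normal.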

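Algorithm Principle

A commonly used semi-supervised approach to anomaly detection is the semi-supervised variational autoencoder (Semi-Supervised VAE), used here in a twin configuration. The twin configuration consists of two autoencoders with exactly the same structure: one for normal samples and one for anomalous samples. Training minimizes the reconstruction error of the normal-sample autoencoder on normal samples and the reconstruction error of the anomaly-sample autoencoder on anomalous samples. Each network therefore learns to reconstruct its own class well, so a new sample can be judged by which of the two networks reconstructs it with the smaller error.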
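The decision rule of the twin setup can be illustrated with a small NumPy sketch. To keep it self-contained, each VAE is replaced by a PCA-style linear autoencoder; this stand-in, the function names, and the toy data are assumptions made for illustration and are not part of the original method.

```python
import numpy as np

def fit_linear_autoencoder(X, n_components=1):
    """Fit a PCA-style linear autoencoder; returns (mean, components)."""
    mu = X.mean(axis=0)
    # Rows of vt are the principal directions of the centered data.
    _, _, vt = np.linalg.svd(X - mu, full_matrices=False)
    return mu, vt[:n_components]

def reconstruction_error(X, model):
    """Squared reconstruction error of each row of X under the given model."""
    mu, comps = model
    Z = (X - mu) @ comps.T   # encode: project onto the components
    X_hat = Z @ comps + mu   # decode: map back to input space
    return np.sum((X - X_hat) ** 2, axis=1)

def is_anomaly(X, normal_model, anomaly_model):
    """Flag a sample when the anomaly model reconstructs it better."""
    return reconstruction_error(X, anomaly_model) < reconstruction_error(X, normal_model)

# Toy data (hypothetical): normal points lie along the x-axis,
# anomalous points lie along the vertical line x = 5.
rng = np.random.default_rng(0)
t = rng.standard_normal(200)
normal_train = np.column_stack([t, 0.01 * rng.standard_normal(200)])
anomaly_train = np.column_stack([5.0 + 0.01 * rng.standard_normal(200), t])

normal_model = fit_linear_autoencoder(normal_train)
anomaly_model = fit_linear_autoencoder(anomaly_train)

X_new = np.array([[0.5, 0.0], [5.0, 2.0]])
flags = is_anomaly(X_new, normal_model, anomaly_model)
print(flags)
```

The same comparison would apply with two trained VAEs in place of the linear models: only `reconstruction_error` needs to change.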

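Formula Derivation

The objective of the Semi-Supervised VAE is to minimize the following loss function:

$$L_{\text{total}} = L_{\text{normal}} + L_{\text{anomaly}}$$

where the loss of the normal-sample autoencoder, $L_{\text{normal}}$, is

$$L_{\text{normal}} = \frac{1}{N}\sum_{i=1}^N \|x_i - \hat{x}_i\|^2 + \beta \cdot KL\left(D(z_{\mu\sigma}) \,\|\, N(0, I)\right)$$

and the loss of the anomaly-sample autoencoder, $L_{\text{anomaly}}$, is

$$L_{\text{anomaly}} = \frac{1}{M}\sum_{j=1}^M \|x_j - \hat{x}_j\|^2 + \beta \cdot KL\left(D(z_{\mu\sigma}) \,\|\, N(0, I)\right)$$

Here $x_i$ is the $i$-th normal sample, $x_j$ is the $j$-th anomalous sample, $N$ and $M$ are the numbers of normal and anomalous samples, $\hat{x}_i$ and $\hat{x}_j$ are the reconstructions of the normal and anomalous samples, $z_{\mu\sigma}$ is the latent output of the autoencoder (parameterized by a mean $\mu$ and a standard deviation $\sigma$) with distribution $D(z_{\mu\sigma})$, $KL$ denotes the KL divergence, and $\beta$ is the weight that balances the reconstruction error against the KL divergence in the latent space.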
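As a minimal sketch, the objective above can be written out in NumPy. It assumes a diagonal Gaussian latent distribution parameterized by a mean and a log-variance, for which the KL term has a standard closed form; the function names and the tiny hand-made batch are hypothetical.

```python
import numpy as np

def kl_to_standard_normal(mean, log_var):
    """Per-sample KL(N(mean, exp(log_var)) || N(0, I)) for a diagonal Gaussian."""
    return -0.5 * np.sum(1.0 + log_var - mean**2 - np.exp(log_var), axis=1)

def vae_loss(x, x_hat, mean, log_var, beta):
    """Mean squared reconstruction error plus beta times the mean KL term."""
    recon = np.mean(np.sum((x - x_hat) ** 2, axis=1))
    kl = np.mean(kl_to_standard_normal(mean, log_var))
    return recon + beta * kl

def total_loss(normal, anomaly, beta=0.01):
    """L_total = L_normal + L_anomaly; each argument is (x, x_hat, mean, log_var)."""
    return vae_loss(*normal, beta) + vae_loss(*anomaly, beta)

# Tiny hand-made batch: perfect reconstruction, latent mean shifted by 1
# in one dimension, unit variance (log_var = 0).
x = np.array([[1.0, 2.0]])
mean = np.array([[1.0, 0.0]])
log_var = np.zeros((1, 2))
batch = (x, x.copy(), mean, log_var)

print(total_loss(batch, batch, beta=0.01))
```

With perfect reconstruction only the KL term contributes, which makes the effect of $\beta$ easy to see in isolation.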

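Computation Steps

1. Build the encoder and decoder networks of the Semi-Supervised VAE.
2. Train the Semi-Supervised VAE on normal and anomalous samples together and compute the loss function.
3. Optimize the loss function, updating the network parameters via backpropagation.
4. Use the trained Semi-Supervised VAE model to detect anomalies in new samples.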

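Algorithm Example

The Python code below walks through the implementation details of the Semi-Supervised VAE algorithm. First, we import the necessary libraries and generate a dataset.

import numpy as np
import tensorflow as tf
from tensorflow import keras
from sklearn.datasets import make_blobs

# Generate a synthetic dataset (a single 2-D cluster)
X, y = make_blobs(n_samples=10000, centers=1, random_state=42)
X = X.astype(np.float32)  # match the float32 dtype used by the Keras layers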

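Encoder Network

Next, we define the encoder network of the Semi-Supervised VAE. The encoder consists of several fully connected layers; it takes a data sample as input and outputs the mean and log-variance of the latent distribution.

def build_encoder():
    model = keras.models.Sequential([
        keras.layers.Dense(32, activation='relu', input_shape=[2]),
        keras.layers.Dense(16, activation='relu'),
        keras.layers.Dense(8, activation='relu'),
        # 4 outputs: 2 for the latent mean and 2 for the latent log-variance,
        # so that each half matches the decoder's 2-D input
        keras.layers.Dense(4)
    ])
    return model

encoder = build_encoder()
latent = encoder(X)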

Sampling from the Latent Space

To generate samples from the latent space, we sample using the latent mean and log-variance (the reparameterization trick).

def sample_from_latent(latent):
    # Split the encoder output into its mean and log-variance halves
    mean, log_var = tf.split(latent, num_or_size_splits=2, axis=1)
    std = tf.exp(0.5 * log_var)
    epsilon = tf.random.normal(shape=tf.shape(std))
    return mean + std * epsilon

latent_sample = sample_from_latent(latent)
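The sampling step can be checked numerically without TensorFlow. The NumPy sketch below mirrors the same computation, with the mean and log-variance passed in separately; the function name `reparameterize` and the chosen values are illustrative assumptions. It confirms that the samples have the intended mean and standard deviation.

```python
import numpy as np

def reparameterize(mean, log_var, rng):
    """Draw z = mean + std * eps with eps ~ N(0, I)."""
    std = np.exp(0.5 * log_var)
    eps = rng.standard_normal(size=mean.shape)
    return mean + std * eps

rng = np.random.default_rng(0)
n = 100_000
mean = np.full((n, 1), 2.0)
log_var = np.full((n, 1), np.log(0.25))  # variance 0.25, so std 0.5

z = reparameterize(mean, log_var, rng)
print(z.mean(), z.std())  # should be close to 2.0 and 0.5
```

Because the randomness enters only through `eps`, the sample is a differentiable function of the mean and log-variance, which is what allows gradients to flow back into the encoder.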

Decoder Network

We define the decoder network of the Semi-Supervised VAE, which decodes samples drawn from the latent space into reconstructed samples.

def build_decoder():
    model = keras.models.Sequential([
        keras.layers.Dense(8, activation='relu', input_shape=[2]),  # 2-D latent input
        keras.layers.Dense(16, activation='relu'),
        keras.layers.Dense(32, activation='relu'),
        keras.layers.Dense(2)  # reconstruct the 2-D input sample
    ])
    return model

decoder = build_decoder()
reconstructed_sample = decoder(latent_sample)

Computing the Loss Function

The loss function has two parts: the reconstruction error and the KL divergence. For the reconstruction error, we use the mean squared error (MSE) as the metric.

mse = tf.reduce_mean(tf.square(X - reconstructed_sample))

Computing the KL divergence involves the square of the latent mean and the exponential of the latent log-variance. We also need to specify a weight for the KL term.

mean, log_var = tf.split(latent, num_or_size_splits=2, axis=1)
# Sum over latent dimensions, then average over samples to get a scalar
latent_loss = tf.reduce_mean(-0.5 * tf.reduce_sum(1 + log_var - tf.square(mean) - tf.exp(log_var), axis=1))
kl_weight = 0.01  # weight of the KL divergence term
total_loss = mse + kl_weight * latent_loss
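The expression used for latent_loss corresponds to the standard closed form of the KL divergence between a diagonal Gaussian and the standard normal distribution. As a reminder, writing $d$ for the latent dimension (a symbol not used elsewhere in this article):

$$KL\left(N(\mu, \mathrm{diag}(\sigma^2)) \,\|\, N(0, I)\right) = -\frac{1}{2}\sum_{k=1}^{d}\left(1 + \log\sigma_k^2 - \mu_k^2 - \sigma_k^2\right)$$

With log_var standing for $\log\sigma^2$, the term tf.exp(log_var) is $\sigma^2$ and tf.square(mean) is $\mu^2$, so the sum over axis=1 runs over the $d$ latent dimensions of each sample. The divergence is zero exactly when $\mu = 0$ and $\sigma = 1$.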

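Optimizer and Backpropagation

We use the Adam optimizer to minimize the loss function and update the network parameters via backpropagation. In TensorFlow 2 eager execution, the gradients are obtained by recomputing the loss inside a tf.GradientTape block and are then applied with optimizer.apply_gradients, as the training loop in the complete code does.

optimizer = tf.keras.optimizers.Adam(learning_rate=0.001)
trainable_vars = encoder.trainable_variables + decoder.trainable_variables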

Complete Code

The complete code example below combines all of the steps above:

import numpy as np
import tensorflow as tf
from tensorflow import keras
from sklearn.datasets import make_blobs

# Generate a synthetic dataset (a single 2-D cluster)
X, y = make_blobs(n_samples=10000, centers=1, random_state=42)
X = X.astype(np.float32)  # match the float32 dtype used by the Keras layers

def build_encoder():
    model = keras.models.Sequential([
        keras.layers.Dense(32, activation='relu', input_shape=[2]),
        keras.layers.Dense(16, activation='relu'),
        keras.layers.Dense(8, activation='relu'),
        # 4 outputs: 2 for the latent mean and 2 for the latent log-variance
        keras.layers.Dense(4)
    ])
    return model

def sample_from_latent(latent):
    # Split the encoder output into its mean and log-variance halves
    mean, log_var = tf.split(latent, num_or_size_splits=2, axis=1)
    std = tf.exp(0.5 * log_var)
    epsilon = tf.random.normal(shape=tf.shape(std))
    return mean + std * epsilon

def build_decoder():
    model = keras.models.Sequential([
        keras.layers.Dense(8, activation='relu', input_shape=[2]),
        keras.layers.Dense(16, activation='relu'),
        keras.layers.Dense(32, activation='relu'),
        keras.layers.Dense(2)
    ])
    return model

# Build the networks
encoder = build_encoder()
decoder = build_decoder()

# Compute the latent output and a latent sample
latent = encoder(X)
latent_sample = sample_from_latent(latent)

# Compute the reconstructed samples
reconstructed_sample = decoder(latent_sample)

# Compute the loss (untrained networks, full dataset)
mse = tf.reduce_mean(tf.square(X - reconstructed_sample))
mean, log_var = tf.split(latent, num_or_size_splits=2, axis=1)
latent_loss = tf.reduce_mean(-0.5 * tf.reduce_sum(1 + log_var - tf.square(mean) - tf.exp(log_var), axis=1))
kl_weight = 0.01  # weight of the KL divergence term
total_loss = mse + kl_weight * latent_loss

# Optimizer; gradients are computed with tf.GradientTape in the loop below
optimizer = tf.keras.optimizers.Adam(learning_rate=0.001)
trainable_vars = encoder.trainable_variables + decoder.trainable_variables

# Train the model
epochs = 100
batch_size = 32
num_batches = X.shape[0] // batch_size

for epoch in range(epochs):
    for batch in range(num_batches):
        # Draw a random mini-batch (with replacement)
        indices = np.random.randint(0, X.shape[0], size=batch_size)
        X_batch = X[indices]

        with tf.GradientTape() as tape:
            latent = encoder(X_batch)
            latent_sample = sample_from_latent(latent)
            reconstructed_sample = decoder(latent_sample)
            mse = tf.reduce_mean(tf.square(X_batch - reconstructed_sample))
            mean, log_var = tf.split(latent, num_or_size_splits=2, axis=1)
            latent_loss = tf.reduce_mean(-0.5 * tf.reduce_sum(1 + log_var - tf.square(mean) - tf.exp(log_var), axis=1))
            total_loss = mse + kl_weight * latent_loss

        grads = tape.gradient(total_loss, trainable_vars)
        optimizer.apply_gradients(zip(grads, trainable_vars))

# Use the trained model to obtain latent representations for anomaly detection
latent = encoder.predict(X)

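In the code above, we built a synthetic dataset, trained the VAE with gradient-based optimization (Adam), and used the trained encoder to obtain the latent representation of each sample. Note that this example trains a single VAE on unlabeled data only; it does not include the separate anomaly-sample autoencoder described earlier. By further analyzing the latent representation, we can identify anomalous samples that deviate from the normal pattern.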
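One common way to turn a trained autoencoder into a detector, sketched here as an assumption rather than as the article's method, is to score each sample by its reconstruction error and flag scores above a high quantile of the scores seen on reference data. The `reconstruct` callable, the function names, and the 0.99 quantile are hypothetical; a trivial stand-in replaces the trained network so the sketch runs on its own.

```python
import numpy as np

def anomaly_scores(X, reconstruct):
    """Squared reconstruction error per sample; higher means more anomalous."""
    X = np.asarray(X, dtype=np.float32)
    X_hat = np.asarray(reconstruct(X))
    return np.sum((X - X_hat) ** 2, axis=1)

def flag_anomalies(scores, reference_scores, quantile=0.99):
    """Flag scores above the given quantile of the reference scores."""
    threshold = np.quantile(reference_scores, quantile)
    return scores > threshold, threshold

# Stand-in for a trained model: always reconstructs the cluster center (origin).
def reconstruct(X):
    return np.zeros_like(X)

rng = np.random.default_rng(0)
X_reference = rng.standard_normal((1000, 2))   # assumed mostly normal data
X_new = np.array([[0.0, 0.0], [10.0, 10.0]])   # one typical point, one far outlier

reference_scores = anomaly_scores(X_reference, reconstruct)
flags, threshold = flag_anomalies(anomaly_scores(X_new, reconstruct), reference_scores)
print(flags, threshold)
```

With the networks from this article, `reconstruct` could pass a sample through the encoder, take the mean half of its output as the latent code, and decode it. The quantile controls the trade-off between missed anomalies and false alarms and would need to be tuned on real data.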

This original article is protected by copyright. Please credit the source when reposting: https://www.johngo689.com/822395/