What Is the Application of Semi-Supervised Learning Algorithms in Anomaly Detection Tasks?

Problem Introduction

Semi-supervised learning algorithms can play an important role in anomaly detection tasks. Anomaly detection aims to identify data points that do not conform to the patterns of normal data, which matters in many real-world scenarios such as credit card fraud detection and network intrusion detection. Traditional anomaly detection algorithms typically rely on large numbers of labeled anomalous samples to train a model; however, obtaining many accurately labeled anomalies is difficult and expensive. Semi-supervised learning algorithms improve performance on anomaly detection problems by exploiting a small number of labeled anomalous samples together with a large number of unlabeled normal samples.

Algorithm Principle

A commonly used anomaly detection algorithm in semi-supervised learning is the semi-supervised variational autoencoder (Semi-Supervised VAE). The twin-autoencoder design referred to here consists of two autoencoders with identical structure: a normal-sample autoencoder and an anomalous-sample autoencoder. The goal of the semi-supervised VAE is to effectively identify anomalous samples by minimizing the reconstruction error of the normal-sample autoencoder together with that of the anomalous-sample autoencoder.

Formula Derivation

The objective of the semi-supervised VAE can be expressed as minimizing the following loss function:

$$L_{total} = L_{normal} + L_{anomaly}$$

where:

the normal-sample autoencoder loss function $L_{normal}$ is:

$$L_{normal} = \frac{1}{N}\sum_{i=1}^{N} \|x_i - \hat{x}_i\|^2 + \beta \cdot D_{KL}\big(\mathcal{N}(\mu_i, \sigma_i^2) \,\|\, \mathcal{N}(0, I)\big)$$

and the anomalous-sample autoencoder loss function $L_{anomaly}$ is:

$$L_{anomaly} = \frac{1}{M}\sum_{j=1}^{M} \|x_j - \hat{x}_j\|^2 + \beta \cdot D_{KL}\big(\mathcal{N}(\mu_j, \sigma_j^2) \,\|\, \mathcal{N}(0, I)\big)$$

where $x_i$ is the $i$-th normal sample, $x_j$ is the $j$-th anomalous sample, $\hat{x}_i$ and $\hat{x}_j$ are their respective reconstructions, $\mu$ and $\sigma^2$ are the mean and variance produced by the encoder's latent layer, $D_{KL}$ denotes the KL divergence, and $\beta$ is a weight that balances the reconstruction error against the KL divergence on the latent space.

Computation Steps

  1. Build the encoder and decoder networks of the semi-supervised VAE.
  2. Train the semi-supervised VAE on normal and anomalous samples together, and compute the loss function.
  3. Optimize the loss function, updating the network parameters via backpropagation.
  4. Use the trained semi-supervised VAE model to perform anomaly detection on new samples.

Algorithm Example

Below, we use Python code to walk through the implementation details of the semi-supervised VAE algorithm. First, we import the necessary libraries and generate a dataset.

import numpy as np
import tensorflow as tf
from tensorflow import keras
from sklearn.datasets import make_blobs

# Generate a synthetic dataset (a single blob of "normal" points, 2 features)
X, y = make_blobs(n_samples=10000, centers=1, random_state=42)
X = X.astype(np.float32)  # cast to float32 to match Keras' default dtype

Encoder Network

Next, we define the encoder network of the semi-supervised VAE. The encoder consists of fully connected layers; it takes a data sample as input and outputs the mean and log-variance of the latent distribution.

def build_encoder():
    model = keras.models.Sequential([
        keras.layers.Dense(32, activation='relu', input_shape=[2]),
        keras.layers.Dense(16, activation='relu'),
        keras.layers.Dense(8, activation='relu'),
        keras.layers.Dense(2)  # 2 outputs: a 1-dim latent mean and a 1-dim latent log-variance
    ])
    return model

encoder = build_encoder()
latent = encoder(X)

Sampling from the Latent Space

To generate samples from the latent space, we sample from the Gaussian defined by the latent mean and log-variance (the reparameterization trick).
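Written out, the reparameterization trick that the function below implements draws the noise separately, so that gradients can flow through the mean and standard deviation:

$$z = \mu + \sigma \odot \epsilon, \qquad \epsilon \sim \mathcal{N}(0, I), \qquad \sigma = \exp\!\left(\tfrac{1}{2}\log\sigma^2\right)$$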

def sample_from_latent(latent):
    mean, log_var = tf.split(latent, num_or_size_splits=2, axis=1)
    std = tf.exp(0.5 * log_var)
    epsilon = tf.random.normal(shape=tf.shape(std))
    return mean + std * epsilon

latent_sample = sample_from_latent(latent)

Decoder Network

We now define the decoder network of the semi-supervised VAE, which decodes a latent-space sample back into a reconstructed sample.

def build_decoder():
    model = keras.models.Sequential([
        keras.layers.Dense(8, activation='relu', input_shape=[1]),  # the latent sample is 1-dimensional
        keras.layers.Dense(16, activation='relu'),
        keras.layers.Dense(32, activation='relu'),
        keras.layers.Dense(2)  # reconstruct the 2-dimensional input
    ])
    return model

decoder = build_decoder()
reconstructed_sample = decoder(latent_sample)

Computing the Loss Function

The loss function consists of two parts: the reconstruction error and the KL divergence. We measure the reconstruction error with the mean squared error (MSE).

mse = tf.reduce_mean(tf.square(X - reconstructed_sample))

Computing the KL divergence involves the square of the latent mean and the exponential of the log-variance. We also need to specify a weight for the KL term.
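For a diagonal Gaussian posterior against a standard normal prior, the KL divergence has the closed form that the code below implements:

$$D_{KL}\big(\mathcal{N}(\mu, \sigma^2) \,\|\, \mathcal{N}(0, I)\big) = -\frac{1}{2}\sum_{k}\left(1 + \log\sigma_k^2 - \mu_k^2 - \sigma_k^2\right)$$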

mean, log_var = tf.split(latent, num_or_size_splits=2, axis=1)
latent_loss = tf.reduce_mean(
    -0.5 * tf.reduce_sum(1 + log_var - tf.square(mean) - tf.exp(log_var), axis=1))
kl_weight = 0.01  # weight of the KL-divergence term
total_loss = mse + kl_weight * latent_loss

Optimizer and Backpropagation

We use the Adam optimizer to minimize the loss. In TensorFlow 2, gradients are computed with tf.GradientTape and applied with optimizer.apply_gradients, as shown in the training loop of the complete code below.

optimizer = tf.keras.optimizers.Adam(learning_rate=0.001)
# Gradients are taken inside a tf.GradientTape in the training loop below
# and applied with optimizer.apply_gradients.

Complete Code

Below is a complete code example that puts all of the steps above together:

import numpy as np
import tensorflow as tf
from tensorflow import keras
from sklearn.datasets import make_blobs

# Generate a synthetic dataset (a single blob of "normal" points, 2 features)
X, y = make_blobs(n_samples=10000, centers=1, random_state=42)
X = X.astype(np.float32)  # cast to float32 to match Keras' default dtype

def build_encoder():
    model = keras.models.Sequential([
        keras.layers.Dense(32, activation='relu', input_shape=[2]),
        keras.layers.Dense(16, activation='relu'),
        keras.layers.Dense(8, activation='relu'),
        keras.layers.Dense(2)  # 2 outputs: a 1-dim latent mean and a 1-dim latent log-variance
    ])
    return model

def sample_from_latent(latent):
    mean, log_var = tf.split(latent, num_or_size_splits=2, axis=1)
    std = tf.exp(0.5 * log_var)
    epsilon = tf.random.normal(shape=tf.shape(std))
    return mean + std * epsilon

def build_decoder():
    model = keras.models.Sequential([
        keras.layers.Dense(8, activation='relu', input_shape=[1]),  # the latent sample is 1-dimensional
        keras.layers.Dense(16, activation='relu'),
        keras.layers.Dense(32, activation='relu'),
        keras.layers.Dense(2)  # reconstruct the 2-dimensional input
    ])
    return model

# Build the networks
encoder = build_encoder()
decoder = build_decoder()

# Optimizer and loss weight
optimizer = tf.keras.optimizers.Adam(learning_rate=0.001)
kl_weight = 0.01  # weight of the KL-divergence term

# Train the model
epochs = 100
batch_size = 32
num_batches = X.shape[0] // batch_size

for epoch in range(epochs):
    for batch in range(num_batches):
        indices = np.random.randint(0, X.shape[0], size=batch_size)
        X_batch = X[indices]

        with tf.GradientTape() as tape:
            latent = encoder(X_batch)
            latent_sample = sample_from_latent(latent)
            reconstructed_sample = decoder(latent_sample)
            mse = tf.reduce_mean(tf.square(X_batch - reconstructed_sample))
            mean, log_var = tf.split(latent, num_or_size_splits=2, axis=1)
            latent_loss = tf.reduce_mean(
                -0.5 * tf.reduce_sum(1 + log_var - tf.square(mean) - tf.exp(log_var), axis=1))
            total_loss = mse + kl_weight * latent_loss

        grads = tape.gradient(total_loss, encoder.trainable_variables + decoder.trainable_variables)
        optimizer.apply_gradients(zip(grads, encoder.trainable_variables + decoder.trainable_variables))

# Use the trained model for anomaly detection
latent = encoder.predict(X)

In the code above, we built a synthetic dataset, trained the semi-supervised VAE model with the Adam optimizer, and used the trained model to obtain latent-space representations of the data samples. By further analyzing these latent representations, we can identify samples that deviate from the normal pattern.
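To make that last step concrete, here is a minimal scoring sketch. It reuses the trained encoder and decoder to compute a per-sample reconstruction error as an anomaly score; the 99th-percentile threshold is an illustrative assumption rather than part of the original method (with labeled anomalies available, the threshold would normally be tuned on them).

# Minimal anomaly-scoring sketch (assumes the trained encoder/decoder above;
# the 99th-percentile cutoff is an illustrative assumption, not a tuned value)
latent = encoder(X)
latent_sample = sample_from_latent(latent)
reconstructed = decoder(latent_sample)
scores = tf.reduce_sum(tf.square(X - reconstructed), axis=1).numpy()  # per-sample error
threshold = np.percentile(scores, 99)  # flag the top 1% as candidate anomalies
anomaly_mask = scores > threshold
print(f"Flagged {anomaly_mask.sum()} of {len(X)} samples as candidate anomalies")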

I hope this informal explanation helps; if you have any other questions, feel free to ask.
