pytorch搭建孪生网络比较人脸相似性

2023年7月26日上午8:59 • 人工智能 • 阅读 68

参考文献：

神经网络学习小记录52——Pytorch搭建孪生神经网络（Siamese network）比较图片相似性_Bubbliiiing的博客-CSDN博客_神经网络图片相似性

Python – 深度学习系列2-人脸比对 Siamese_yukai08008的博客-CSDN博客

1.孪生网络

孪生神经网络（Siamese network）即”连体的神经网络”，

神经网络的”连体”是通过共享权值来实现的，如图所示。

孪生神经网络有两个输入（Input1 and Input2），利用特征提取网络将输入映射到新的空间，形成输入在新的空间中的表示。然后对得到的两个输出进行相减，得到新的输出，并进行全连接层分类，最后输出一个向量，再通过Sigmoid函数将其转化到0-1之间，该值即为两个输入的相似度。

2.孪生网络

（1）特征提取部分

本孪生网络采用vgg16的features作为特征提取网络，提取完后将两个向量展平，便于相减得到新的向量并进行全连接层分类。

代码实现：


vgg16 = models.vgg16(pretrained=True)
&#x83B7;&#x53D6;VGG16&#x7684;&#x7279;&#x5F81;&#x63D0;&#x53D6;&#x5C42;
vgg = vgg16.features

class SiameseNetwork(nn.Module):
    def __init__(self, input_shape):
        super(SiameseNetwork, self).__init__()

        self.vgg = vgg

    def forward_once(self, x):
        output = self.vgg(x)
        output = torch.flatten(output, 1)

    def forward(self, input1, input2):
        output1 = self.forward_once(input1)
        output2 = self.forward_once(input2)

这里最好不要将features的权重冻结，因为这样不能很好提取我们所需图片的特征，泛化能力也不好。

（2）全连接层

将得到的两个输出（output1和output2）进行相减，得到output，并对output进行全连接层，注意：其展平长度需通过计算得出，最后通过三个全连接层得到一个输出通道，并采取Sigmoid将其范围控制在0到1之间。（由于我们使用的损失函数是BCEWithLogitsLoss，即进行损失计算前会对预测值进行Sigmoid，因此在这里我们就不加Sigmoid）

代码实现：


def get_img_output_length(width, height):
    def get_output_length(input_length):
        # input_length += 6
        filter_sizes = [2, 2, 2, 2, 2]
        padding = [0, 0, 0, 0, 0]
        stride = 2
        for i in range(5):
            input_length = (input_length + 2 * padding[i] - filter_sizes[i]) // stride + 1
        return input_length

    return get_output_length(width) * get_output_length(height)

class SiameseNetwork(nn.Module):
    def __init__(self, input_shape):
        super(SiameseNetwork, self).__init__()

        flat_shape = 512 * get_img_output_length(input_shape[1], input_shape[0])
        # flat_shape = 1000
        self.fc = nn.Sequential(
            nn.Linear(flat_shape, 512),
            nn.ReLU(inplace=True),

            nn.Linear(512, 256),
            nn.ReLU(inplace=True),

            nn.Linear(256, 1))

    def forward(self, input1, input2):

        output = output1 - output2
        output = self.fc(output)
        # output = nn.Sigmoid(output)
        return output

3.标签的生成

对于相似的图片，我们标签为1；对于不同的图片，我们将标签设置为0

代码实现：

class SiameseNetworkDataset(Dataset):

    def __init__(self, imageFolderDataset, transform=None, should_invert=True):
        self.imageFolderDataset = imageFolderDataset
        self.transform = transform
        self.should_invert = should_invert

    def __getitem__(self, index):
        img0_tuple = random.choice(self.imageFolderDataset.imgs)
        # we need to make sure approx 50% of images are in the same class
        should_get_same_class = random.randint(0, 1)
        if should_get_same_class:
            while True:
                # keep looping till the same class image is found
                img1_tuple = random.choice(self.imageFolderDataset.imgs)
                if img0_tuple[1] == img1_tuple[1]:
                    break
        else:
            while True:
                # keep looping till a different class image is found

                img1_tuple = random.choice(self.imageFolderDataset.imgs)
                if img0_tuple[1] != img1_tuple[1]:
                    break

        img0 = Image.open(img0_tuple[0])
        img1 = Image.open(img1_tuple[0])
        img0 = img0.convert("RGB")
        img1 = img1.convert("RGB")

        if self.should_invert:
            img0 = PIL.ImageOps.invert(img0)
            img1 = PIL.ImageOps.invert(img1)

        if self.transform is not None:
            img0 = self.transform(img0)
            img1 = self.transform(img1)

        return img0, img1, torch.from_numpy(np.array([int(img1_tuple[1] == img0_tuple[1])], dtype=np.float32))

    def __len__(self):
        return len(self.imageFolderDataset.imgs)

4.损失函数和优化器

criterion = torch.nn.BCEWithLogitsLoss()
optimizer = torch.optim.Adam(net.parameters(), 0.001, betas=(0.9, 0.999))

5.训练过程

（1）参数设置：


training_dir = r"D:\Siamese_for_Face\data\faces\training"
train_batch_size = 16
train_number_epochs = 200
input_shape = [224, 224]

（2）数据集加载


transform = transforms.Compose([transforms.Resize((224, 224)),
                                transforms.ToTensor()])
folder_dataset = dset.ImageFolder(root=training_dir)
siamese_dataset = SiameseNetworkDataset(imageFolderDataset=folder_dataset,
                                        transform=transform,
                                        should_invert=False)
train_dataloader = DataLoader(siamese_dataset,
                              shuffle=True,
                              num_workers=0,
                              batch_size=train_batch_size)

（3）网络的加载并移到GPU训练

net = SiameseNetwork(input_shape)
device = torch.device("cuda:0" if torch.cuda.is_available() else "cpu")
net.to(device)

（4）训练循环

counter = []
loss_history = []
iteration_number = 0

if __name__ == '__main__':

    for epoch in range(0, train_number_epochs):
        for i, data in enumerate(train_dataloader, 0):
            img0, img1, label = data
            img0, img1, label = img0.to(device), img1.to(device), label.to(device)
            optimizer.zero_grad()
            output = net(img0, img1)
            loss_contrastive = criterion(output, label)
            loss_contrastive.backward()
            optimizer.step()
            if i % 10 == 0:
                print("Epoch number {}\n Current loss {}\n".format(epoch, loss_contrastive.item()))
                iteration_number += 10
                counter.append(iteration_number)
                loss_history.append(loss_contrastive.item())
    plt.plot(counter, loss_history)
    plt.show()
    torch.save(net.state_dict(), 'weights/vgg.pkl')

6.测试过程

（1）展示图片


def imshow(img, text=None, should_save=False):
    npimg = img.numpy()
    plt.axis("off")
    if text:
        plt.text(75, 8, text, style='italic', fontweight='bold',
                 bbox={'facecolor': 'white', 'alpha': 0.8, 'pad': 10})
    plt.imshow(np.transpose(npimg, (1, 2, 0)))
    plt.show()

（2）参数设置


testing_dir = r"D:\Siamese_for_Face\data\faces\testing"
input_shape = [224, 224]

（3）加载数据集

testing_dir = r"D:\Siamese_for_Face\data\faces\testing"
input_shape = [224, 224]

transform = transforms.Compose([transforms.Resize((224, 224)),
                                transforms.ToTensor()])
folder_dataset_test = dset.ImageFolder(testing_dir)
siamese_dataset = SiameseNetworkDataset(imageFolderDataset=folder_dataset_test,
                                        transform=transform,
                                        should_invert=False)
test_dataloader = DataLoader(siamese_dataset, num_workers=0, batch_size=1, shuffle=True)

（4）加载网络和训练过的权重

net = SiameseNetwork(input_shape)
device = torch.device("cuda:0" if torch.cuda.is_available() else "cpu")
net.to(device)
net.load_state_dict(torch.load(r'D:\Siamese_for_Face\weights\vgg.pkl'))

（5）测试过程

if __name__ == '__main__':

    dataiter = iter(test_dataloader)
    x0, _, _ = next(dataiter)

    for i in range(10):
        _, x1, label2 = next(dataiter)
        x0, x1, label2 = x0.to(device), x1.to(device), label2.to(device)
        concatenated = torch.cat((x0, x1), 0)
        # output1, output2 = net(Variable(x0), Variable(x1))
        output = net(Variable(x0), Variable(x1))[0]
        output = torch.nn.Sigmoid()(output)
        # euclidean_distance = F.pairwise_distance(output1, output2)
        imshow(torchvision.utils.make_grid(concatenated).cpu(),
               'similarity: {:.2f}'.format(output.item()))

7.网络的效果

这里我是设置了0.0005的学习率和400个epochs

感觉最后训练的效果很好，说明vgg16网络的features的特征提取能力很强大，这里要注意的是，不要设置太大的学习率，因为我们这是迁移学习，主要是利用vgg16特征提取的权重，设置太大的学习会将原本训练好的vgg16的权重扭曲太多。

8.代码

（1）gitee

Siamese_for_Face.zip · xuxuxuxu/xuxuxuxu – 码云 – 开源中国 (gitee.com)

（2）github

xuxuxuxu/Siamese_for_Face.zip at main · xuxuxuxuxuxu97/xuxuxuxu (github.com)

Original: https://blog.csdn.net/weixin_52950958/article/details/126226752
Author: xukobe97
Title: pytorch搭建孪生网络比较人脸相似性

原创文章受到原创版权保护。转载请注明出处：https://www.johngo689.com/716615/

转载文章受原作者版权保护。转载请注明原作者出处！

人工智能

【自取】最近整理的，有需要可以领取学习：

Linux核心资料大放送~

全栈面试题汇总（持续更新&可下载）

一个提高学习100%效率的工具！

【超详细】深度学习面试题目！

LeetCode Python刷题答案下载！

LeetCode Java版刷题答案下载！

LeetCode C++ 版本，抓紧保存！

LeetCode GO语言刷题答案下载！

【数据挖掘】期末复习模拟题（暨考试题）

抵扣说明： 1.余额是钱包充值的虚拟货币，按照1:1的比例进行支付金额的抵扣。2.余额无法直接购买下载，可以购买VIP、C币套餐、付费专栏及课程。 Original: https:…

人工智能 2023年7月17日
0048
python baidu语音转文字

from aip import AipSpeech #baidu-aip APP_ID = ‘ ‘ API_KEY = ‘ ‘ SECRET_KEY = ‘ ‘ 百度AI库获取的参…

人工智能 2023年5月25日
0081
Logistic回归——二分类 —— matlab

目录 1.简介 2.应用范围 3.分类 3.应用条件 4.原理详解 4.1 sigmod分类函数 4.2 建立目标函数 4.3 求解相关参数 5.实列分析 5.1 读取数据（exc…

人工智能 2023年6月30日
0093
【机器学习】回归决策树

回归决策树 1. 原理概述 2. 算法描述 3. 简单实例 * 3.1 实例计算过程 3.2 回归决策树和线性回归对比 4. 小结原理概述上篇文章已经讲到，关于数据类型，我们…

人工智能 2023年6月17日
0070
（论文阅读）Document-level Relation Extraction as Semantic Segmentation

题目：Document-level Relation Extraction as Semantic Segmentation来源：2021 IJCAI原文链接：https://ar…

人工智能 2023年5月28日
0083
五、CNN-LSTM数据驱动模型

CNN-LSTM数据驱动模型深度学习是机器学习前沿且热门的理论，而其中的两大框架卷积神经网络（CNN）以及长短期记忆网络（LSTM）是深度学习的代表，CNN能过够通过使用卷积核从…

人工智能 2023年7月28日
0064
语音识别：在Kaldi上使用CVTE模型-已训练好的开源中文ASR模型

在前一篇文章中，我把Kaldi安装并编译了。相当于把利用Kaldi做语音识别的基本运行环境布置好了。这一篇文章记录我用CVTE开源的kaldi模型来进行语音识别模型的建立和使用。友…

人工智能 2023年5月25日
0083
ROS中的分布式通讯（树莓派与虚拟机）

ROS中的分布式通讯（树莓派与虚拟机）一、前言二、树莓派连接WIFI 三、查找局域网下的其他设备 IP 四、确定可以ping通五、配置文件修改六、配置主机 IP 七、配置从…

人工智能 2023年6月10日
0083
cspj2022 T4 上升点列(point)题解（floyd）

样例一： 8 23 13 23 33 61 22 25 55 3 样例一输出： 8 样例二： 4 10010 1015 2520 2030 30 样例二输出： 103 一、题目解析…

人工智能 2023年6月30日
0045
Java本地搭建宝塔部署实战固定资产设备管理系统源码

啊哦~你想找的内容离你而去了哦内容不存在，可能为如下原因导致： ① 内容还在审核中 ② 内容以前存在，但是由于不符合新的规定而被删除 ③ 内容地址错误 ④ 作者删除了内容。可…

人工智能 2023年6月29日
0065
什么是马尔科夫决策过程（MDP）

什么是马尔科夫决策过程（MDP）？马尔科夫决策过程（Markov Decision Process，MDP）是一种用于建模序贯决策问题的数学框架。在MDP中，决策制定者通过在每个…

人工智能 2023年12月29日
0041
OFDM雷达信号模糊函数MATLAB仿真分析

OFDM雷达信号模糊函数MATLAB仿真分析 OFDM大家都不陌生，特别是主要研究通信大法的小伙伴们。正交频分复用 (OFDM) 是一种可以在多个正交子载波上编码通信数据的多载波…

人工智能 2023年6月24日
0086
pytorch 中的torchsummary

torchsummary能够查看模型的输入和输出的形状，可以更加清楚地输出模型的结构。 torchsummary.summary(model, input_size, batch_…

人工智能 2023年7月22日
0048
保姆级详细教程：Windows 安装 Visual Studio + OpenCV + OpenCV contrib

目录 0.写作背景 1.安装visual studio 2.下载OpenCV相关的源码下载OpenCV原始的源码下载OpenCV contrib的源码下载安装cmake 3….

人工智能 2023年6月19日
0086
一种由视频和音频共同驱动的说话人脸合成方法简介

最近做作业看到了一篇挺有意思的文章《Pose-Controllable Talking Face Generation by Implicitly Modularized Audi…

人工智能 2023年6月20日
0052
ubuntu18.04 安装ros2教程及环境配置

ubuntu18.04 安装ros2教程及环境配置 * – 1. ros2安装 – + 1.1添加ros2 软件源 + 1.2安装ros-eloquent和…

人工智能 2023年6月1日
0076

2024 年 4 月
一	二	三	四	五	六	日
1	2	3	4	5	6	7
8	9	10	11	12	13	14
15	16	17	18	19	20	21
22	23	24	25	26	27	28
29	30

pytorch搭建孪生网络比较人脸相似性

大家都在看