GAN: Generative Adversarial Networks (PyTorch), Collection (1): GAN, DCGAN, CGAN

July 14, 2023, 6:40 AM • Artificial Intelligence

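Vanilla GAN

(Generative Adversarial Nets)
Training follows the usual three steps; to spell them out once more:

1. Train the discriminator on real images, with the label "real".
2. Train the discriminator on images produced by the generator, with the label "fake". Because these images are computed by the generator, detach them from the computation graph before feeding them to the discriminator, so that no gradients flow back into the generator during the discriminator update.
3. Train the generator: feed the generated images to the discriminator with the label "real", and update the generator.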
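The three steps can be related to the objective in the original paper. The following is a sketch in the paper's notation, where $D$ is the discriminator, $G$ the generator, $x$ a real image and $z$ a noise vector; the loss names $L_D$ and $L_G$ are introduced here for illustration only. Steps 1 and 2 minimize $L_D$ (binary cross-entropy with targets 1 and 0), and step 3 minimizes $L_G$, the non-saturating generator loss that results from labeling generated images as real.

```latex
\begin{align}
\min_G \max_D V(D, G) &= \mathbb{E}_{x \sim p_{\text{data}}(x)}\big[\log D(x)\big]
  + \mathbb{E}_{z \sim p_z(z)}\big[\log\big(1 - D(G(z))\big)\big] \\
L_D &= -\,\mathbb{E}_{x \sim p_{\text{data}}(x)}\big[\log D(x)\big]
  - \mathbb{E}_{z \sim p_z(z)}\big[\log\big(1 - D(G(z))\big)\big] \\
L_G &= -\,\mathbb{E}_{z \sim p_z(z)}\big[\log D(G(z))\big]
\end{align}
```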

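Paper: https://arxiv.org/abs/1406.2661
This is arguably the paper that started it all for GANs.

Generator and discriminator architectures on the MNIST dataset; for the full code, see the second code block of my earlier post: vanilla GAN code, MNIST dataset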


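import torch.nn as nn


class Generator(nn.Module):
    """Maps a 100-dimensional noise vector to a 28x28 image with values in [-1, 1]."""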
    def __init__(self):
        super(Generator, self).__init__()
        self.linear = nn.Sequential(
            nn.Linear(100, 256),
            nn.Tanh(),
            nn.Linear(256, 512),
            nn.Tanh(),
            nn.Linear(512, 28*28),
            nn.Tanh()
        )

    def forward(self, x):
        x = self.linear(x)
        x = x.view(-1, 28, 28)
        return x

class Discriminator(nn.Module):
    def __init__(self):
        super(Discriminator, self).__init__()
        self.linear = nn.Sequential(
            nn.Linear(28*28, 512),
            nn.LeakyReLU(),

            nn.Linear(512, 256),
            nn.LeakyReLU(),

            nn.Linear(256, 1),
            nn.Sigmoid()
        )

    def forward(self, x):
        x = x.view(-1, 28*28)
        x = self.linear(x)
        return x

[Figure: generator architecture]

[Figure: discriminator architecture]

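DCGAN

(Deep Convolutional GAN)
You might say: isn't this just swapping the fully connected layers for convolutional layers? Not quite. DCGAN makes many changes on top of GAN, including but not limited to dropping pooling layers, using transposed convolution ("deconvolution") layers, and using batch normalization (BN) layers. If you are interested, read the original paper; I won't go into the details here: https://arxiv.org/pdf/1511.06434.pdf

Network architecture code:
Dropout is hard to draw in a diagram, so just remember that it is there; it keeps the discriminator from learning too fast.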


import torch
import torch.nn as nn
import torch.nn.functional as F


class Generator(nn.Module):
    def __init__(self):
        super(Generator, self).__init__()
        self.linear1 = nn.Linear(100, 256*7*7)
        self.bn1 = nn.BatchNorm1d(256*7*7)
        self.deconv1 = nn.ConvTranspose2d(256, 128,
                                          kernel_size=(3, 3),
                                          stride=1,
                                          padding=1
                                          )
        self.bn2 = nn.BatchNorm2d(128)
        self.deconv2 = nn.ConvTranspose2d(128, 64,
                                          kernel_size=(4, 4),
                                          stride=2,
                                          padding=1
                                          )
        self.bn3 = nn.BatchNorm2d(64)
        self.deconv3 = nn.ConvTranspose2d(64, 1,
                                          kernel_size=(4, 4),
                                          stride=2,
                                          padding=1
                                          )

    def forward(self, x):
        x = F.relu(self.linear1(x))
        x = self.bn1(x)
        x = x.view(-1, 256, 7, 7)
        x = F.relu(self.deconv1(x))
        x = self.bn2(x)
        x = F.relu(self.deconv2(x))
        x = self.bn3(x)
        x = torch.tanh(self.deconv3(x))
        return x

class Discriminator(nn.Module):
    def __init__(self):
        super(Discriminator, self).__init__()
        self.conv1 = nn.Conv2d(1, 64, kernel_size=3, stride=2)
        self.conv2 = nn.Conv2d(64, 128, 3, 2)
        self.bn = nn.BatchNorm2d(128)
        self.fc = nn.Linear(128*6*6, 1)

    def forward(self, x):
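        # training=self.training turns dropout off when the module is in eval mode
        x = F.dropout2d(F.leaky_relu(self.conv1(x)), p=0.3, training=self.training)
        x = F.dropout2d(F.leaky_relu(self.conv2(x)), p=0.3, training=self.training)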
        x = self.bn(x)
        x = x.view(-1, 128*6*6)
        x = torch.sigmoid(self.fc(x))
        return x

The full training code is also available in my earlier post, where the third code block is the DCGAN: https://blog.csdn.net/qq_45882032/article/details/123432603
Generating anime avatars is also fun: https://blog.csdn.net/qq_45882032/article/details/124306864

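[Figure: DCGAN generator]

Relationship between input and output size for a transposed convolution (dilation 1, no output padding):
output = (input - 1) × stride - 2 × padding + kernel_size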
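As a quick check of the generator's layer sizes, here is a minimal sketch of the output-size rule documented for `nn.ConvTranspose2d`, applied to the three transposed convolutions in the generator above. The helper name `conv_transpose2d_out` is hypothetical and is not part of PyTorch or of the original post.

```python
def conv_transpose2d_out(size, kernel_size, stride=1, padding=0,
                         output_padding=0, dilation=1):
    """Output height/width of a transposed convolution along one spatial dimension."""
    return ((size - 1) * stride - 2 * padding
            + dilation * (kernel_size - 1) + output_padding + 1)


# (kernel_size, stride, padding) of deconv1, deconv2, deconv3 in the generator
layers = [(3, 1, 1), (4, 2, 1), (4, 2, 1)]

sizes = [7]  # the linear layer's output is reshaped to 256 x 7 x 7
for k, s, p in layers:
    sizes.append(conv_transpose2d_out(sizes[-1], k, s, p))

print(sizes)  # [7, 7, 14, 28]
```

The first layer keeps the 7x7 resolution and each of the next two doubles it, which is how the generator arrives at a 28x28 MNIST-sized image.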

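This was my first time using Visio, and the drawing took ages. The final output also goes through a Tanh activation, which I forgot to draw in the figure.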

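[Figure: DCGAN discriminator]

Convolution output size (dilation 1):
output = floor((input + 2 × padding - kernel_size) / stride) + 1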
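The same kind of check works for the discriminator. This sketch implements the output-size rule documented for `nn.Conv2d` and shows where the `128*6*6` input size of the final linear layer comes from. The helper name `conv2d_out` is hypothetical.

```python
def conv2d_out(size, kernel_size, stride=1, padding=0, dilation=1):
    """Output height/width of a convolution along one spatial dimension."""
    return (size + 2 * padding - dilation * (kernel_size - 1) - 1) // stride + 1


# (kernel_size, stride, padding) of conv1 and conv2 in the discriminator
layers = [(3, 2, 0), (3, 2, 0)]

sizes = [28]  # MNIST images are 28 x 28
for k, s, p in layers:
    sizes.append(conv2d_out(sizes[-1], k, s, p))

fc_in = 128 * sizes[-1] * sizes[-1]  # 128 channels after conv2

print(sizes)  # [28, 13, 6]
print(fc_in)  # 4608, i.e. 128*6*6
```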

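CGAN

(Conditional Generative Adversarial Network) Conditional GAN. The diagrams drawn in PowerPoint seem to look a bit better than the Visio ones.
The input label is fed into the network so that it actually influences the result: in the discriminator, even a good-looking generated image is still judged fake if its label is wrong.
Original paper: https://arxiv.org/pdf/1411.1784.pdf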
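For reference, the CGAN paper states the objective by conditioning both networks on extra information $y$, here the class label. The notation follows the paper, with $D$ the discriminator, $G$ the generator, $x$ a real image and $z$ a noise vector. In this post's code, the conditioning is done by passing the one-hot label as a second input to both networks.

```latex
\min_G \max_D V(D, G) = \mathbb{E}_{x \sim p_{\text{data}}(x)}\big[\log D(x \mid y)\big]
  + \mathbb{E}_{z \sim p_z(z)}\big[\log\big(1 - D(G(z \mid y))\big)\big]
```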

[Figure: CGAN generator]

[Figure: CGAN discriminator]

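I have not written this one up before, so the code is below. Two small tricks are worth repeating: 1. use dropout so the discriminator does not learn too fast; 2. when optimizing with Adam, start the discriminator with a slightly lower learning rate so that it learns more slowly. The discriminator very easily becomes too good; it then separates the generator's fake images correctly every time, and the generator no longer knows how to update. Also note that the input label is one-hot encoded.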

import torch
import torch.nn as nn
import torch.utils.data as Data
import torch.nn.functional as F
import torch.optim as optim
import numpy as np
import matplotlib.pyplot as plt
import torchvision
from torchvision import transforms

transform = transforms.Compose([
    transforms.ToTensor(),
    transforms.Normalize(0.5, 0.5)
])

def one_hot(x, class_count=10):
    return torch.eye(class_count)[x]

dataset = torchvision.datasets.MNIST('data',
                                     train=True,
                                     transform=transform,
                                     target_transform=one_hot,
                                     download=True)

dl = Data.DataLoader(dataset, batch_size=64, shuffle=True)

class Generator(nn.Module):
    def __init__(self):
        super(Generator, self).__init__()
        self.linear1 = nn.Linear(100, 128*7*7)
        self.bn1 = nn.BatchNorm1d(128*7*7)

        self.linear2 = nn.Linear(10, 128*7*7)
        self.bn2 = nn.BatchNorm1d(128*7*7)

        self.deconv1 = nn.ConvTranspose2d(256, 128,
                                          kernel_size=(3, 3),
                                          stride=1,
                                          padding=1
                                          )
        self.bn3 = nn.BatchNorm2d(128)
        self.deconv2 = nn.ConvTranspose2d(128, 64,
                                          kernel_size=(4, 4),
                                          stride=2,
                                          padding=1
                                          )
        self.bn4 = nn.BatchNorm2d(64)
        self.deconv3 = nn.ConvTranspose2d(64, 1,
                                          kernel_size=(4, 4),
                                          stride=2,
                                          padding=1
                                          )

    def forward(self, x, label):
        x = F.relu(self.linear1(x))
        x = self.bn1(x)
        x = x.view(-1, 128, 7, 7)

        label = F.relu(self.linear2(label))
        label = self.bn2(label)
        label = label.view(-1, 128, 7, 7)

        x = torch.cat([x, label], dim=1)  # concatenate along channels -> 256 x 7 x 7

        x = F.relu(self.deconv1(x))
        x = self.bn3(x)
        x = F.relu(self.deconv2(x))
        x = self.bn4(x)
        x = torch.tanh(self.deconv3(x))
        return x

class Discriminator(nn.Module):
    def __init__(self):
        super(Discriminator, self).__init__()
        self.linear = nn.Linear(10, 1*28*28)

        self.conv1 = nn.Conv2d(2, 64, kernel_size=3, stride=2)
        self.conv2 = nn.Conv2d(64, 128, 3, 2)
        self.bn = nn.BatchNorm2d(128)
        self.fc = nn.Linear(128*6*6, 1)

    def forward(self, x, label):
        label = F.leaky_relu(self.linear(label))
        label = label.view(-1, 1, 28, 28)
        x = torch.cat([label, x], dim=1)  # label plane + image -> 2 x 28 x 28

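        # training=self.training turns dropout off when the module is in eval mode
        x = F.dropout2d(F.leaky_relu(self.conv1(x)), p=0.3, training=self.training)
        x = F.dropout2d(F.leaky_relu(self.conv2(x)), p=0.3, training=self.training)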
        x = self.bn(x)
        x = x.view(-1, 128*6*6)
        x = torch.sigmoid(self.fc(x))
        return x

device = 'cuda' if torch.cuda.is_available() else 'cpu'
if device == 'cuda':
    print('using cuda:', torch.cuda.get_device_name(0))
else:
    print(device)

Gen = Generator().to(device)
Dis = Discriminator().to(device)

loss_fun = nn.BCELoss()
d_optimizer = torch.optim.Adam(Dis.parameters(), lr=1e-5)
g_optimizer = torch.optim.Adam(Gen.parameters(), lr=1e-4)

def generate_and_save_image(model, label_input, test_input):
    predictions = np.squeeze(model(test_input, label_input).cpu().numpy())
    fig = plt.figure(figsize=(4, 4))
    for i in range(predictions.shape[0]):
        plt.subplot(4, 4, i+1)
        plt.imshow((predictions[i]+1) / 2, cmap='gray')
        plt.axis('off')
    plt.show()

noise_seed = torch.randn(16, 100, device=device)
label_seed = torch.randint(0, 10, size=(16,))
label_seed = one_hot(label_seed).to(device)
D_loss = []
G_loss = []

for epoch in range(30):
    d_epoch_loss = 0
    g_epoch_loss = 0
    count = len(dl)
    for step, (img, label) in enumerate(dl):
        img = img.to(device)
        label = label.to(device)
        size = img.size(0)
        random_noise = torch.randn(size, 100, device=device)

        # Step 1: train the discriminator on real images, target = 1
        d_optimizer.zero_grad()
        real_output = Dis(img, label)

        d_real_loss = loss_fun(real_output,
                                    torch.ones_like(real_output)
                                    )
        d_real_loss.backward()

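        # Step 2: train the discriminator on generated images, target = 0;
        # detach() keeps gradients from flowing back into the generator
        gen_img = Gen(random_noise, label)
        fake_output = Dis(gen_img.detach(), label)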

        d_fake_loss = loss_fun(fake_output,
                                    torch.zeros_like(fake_output)
                                    )
        d_fake_loss.backward()
        d_loss = d_real_loss + d_fake_loss
        d_optimizer.step()

        # Step 3: train the generator, target = 1 for the generated images
        g_optimizer.zero_grad()
        fake_output = Dis(gen_img, label)
        g_loss = loss_fun(fake_output,
                               torch.ones_like(fake_output))
        g_loss.backward()
        g_optimizer.step()

        with torch.no_grad():
            d_epoch_loss += d_loss.item()
            g_epoch_loss += g_loss.item()

    with torch.no_grad():
        d_epoch_loss /= count
        g_epoch_loss /= count
        D_loss.append(d_epoch_loss)
        G_loss.append(g_epoch_loss)
        print('Epoch:', epoch+1)
        generate_and_save_image(model=Gen, label_input=label_seed, test_input=noise_seed)

    plt.plot(D_loss, label='D_loss')
    plt.plot(G_loss, label='G_loss')
    plt.legend()
    plt.show()

Original: https://blog.csdn.net/qq_45882032/article/details/124363826
Author: 挂科难
Title: GAN: Generative Adversarial Networks (PyTorch), Collection (1): GAN, DCGAN, CGAN
