Semantic Segmentation Model: LinkNet

The full title of the paper is
LinkNet: Exploiting Encoder Representations for Efficient Semantic Segmentation
Readers who are interested can download it and read the details themselves.

(1) Introduction to LinkNet
LinkNet follows the idea of an auto-encoder: its architecture is divided into two parts, an encoder and a decoder. The encoder maps the input into a low-dimensional representation, and the decoder reconstructs the output from that low-dimensional representation.
The network consists of an initial block, a final block, an encoder made up of four encoder (convolution) blocks, and a decoder made up of four decoder (deconvolution) blocks.
The architecture is shown below, with the encoder blocks on the left and the decoder blocks on the right.

[Figure: LinkNet architecture diagram, encoder blocks on the left, decoder blocks on the right]
(2) Deconvolution and Skip Connections
Deconvolution (more precisely, transposed convolution) works in the opposite direction of convolution with respect to spatial size: it upsamples a feature map instead of downsampling it, so it can loosely be regarded as the reverse of the convolution operation. An illustration is shown below.
[Figure: illustration of a deconvolution / transposed convolution]
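
As a quick sketch of this size behaviour in Keras (the shapes below are made up purely for illustration), a stride-2 convolution halves the spatial size and a stride-2 transposed convolution doubles it again:

from keras.layers import Input, Conv2D, Conv2DTranspose
from keras.models import Model

inp = Input(shape=(64, 64, 32))
down = Conv2D(32, (3, 3), strides=(2, 2), padding="same")(inp)           # 64x64 -> 32x32
up = Conv2DTranspose(32, (3, 3), strides=(2, 2), padding="same")(down)   # 32x32 -> 64x64

print(Model(inp, up).output_shape)  # (None, 64, 64, 32)
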
Skip connections appear as the horizontal lines between the encoder and the decoder in the LinkNet architecture diagram. They allow the network to discard some information during encoding, because that information can be looked at again during decoding; as a result the decoder needs relatively little information to reconstruct the image, which reduces the number of parameters the network requires. Skip connections can be implemented with different operations, and a further benefit is that the backward gradient flow can pass easily along the same connections. LinkNet adds the hidden encoder output to the corresponding decoder input, whereas another semantic segmentation network, Tiramisu, concatenates the two and passes the result on to the next layer.
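
The difference between the two fusion styles can be sketched in a few lines of Keras (the tensors and shapes here are illustrative only):

from keras.layers import Input, add, concatenate

# two feature maps with the same spatial size and channel count
encoder_out = Input(shape=(32, 32, 64))
decoder_in = Input(shape=(32, 32, 64))

# LinkNet-style fusion: element-wise sum, channel count stays at 64
fused_linknet = add([decoder_in, encoder_out])

# Tiramisu-style fusion: channel-wise concatenation, channel count grows to 128
fused_tiramisu = concatenate([decoder_in, encoder_out], axis=-1)
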
(3) The Model
1. Convolution block (ConvBlock)
A convolution block consists of a convolution, batch normalization and a ReLU activation. Batch normalization lets the network learn from a more stable input distribution, which speeds up convergence.
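
A minimal sketch of such a block as a Keras helper (the function name conv_block is illustrative, not taken from the paper):

from keras.layers import Conv2D, BatchNormalization, Activation

def conv_block(x, filters, kernel_size=(3, 3), strides=(1, 1)):
    # convolution, then batch normalization, then a ReLU activation
    x = Conv2D(filters, kernel_size, strides=strides, padding="same")(x)
    x = BatchNormalization()(x)
    return Activation("relu")(x)
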

2. Deconvolution block (DeconvBlock)
This is the building block of the decoder. It is analogous to the convolution block and consists of a transposed convolution, BatchNorm and ReLU; in PyTorch terms, the only difference is that torch.nn.Conv2d is replaced by torch.nn.ConvTranspose2d. (The Keras implementation below achieves the same upsampling with UpSampling2D followed by Conv2D.)
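
An equivalent sketch in Keras, using Conv2DTranspose in place of torch.nn.ConvTranspose2d (the helper name deconv_block is illustrative):

from keras.layers import Conv2DTranspose, BatchNormalization, Activation

def deconv_block(x, filters, kernel_size=(3, 3), strides=(2, 2)):
    # transposed convolution (upsampling), then batch normalization, then ReLU
    x = Conv2DTranspose(filters, kernel_size, strides=strides, padding="same")(x)
    x = BatchNormalization()(x)
    return Activation("relu")(x)
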

3. Encoder block (EncoderBlock)
As shown in the figure below, each encoder block in LinkNet consists of four convolution blocks. The first two convolution blocks form module one; its output is added to a residual (shortcut) connection from the block input, and the result is passed on to module two. This is implemented by encoder_block in the code listing below.

[Figure: structure of an encoder block]
4. Decoder block (DecoderBlock)
The decoder block is relatively simple: it essentially applies the deconvolution (upsampling) operation to its input, as sketched in the diagram below. It is implemented by decoder_block in the code listing below.
[Figure: structure of a decoder block]
Code implementation (Keras):
from __future__ import absolute_import
from __future__ import print_function

from keras.layers import Input, Conv2D, MaxPooling2D, Activation, UpSampling2D, BatchNormalization, add
from keras.models import Model
from keras.regularizers import l2
import keras.backend as K

def _shortcut(input, residual):
    """Adds a shortcut between input and residual block and merges them with an element-wise sum."""

    input_shape = K.int_shape(input)
    residual_shape = K.int_shape(residual)
    stride_width = int(round(input_shape[1] / residual_shape[1]))
    stride_height = int(round(input_shape[2] / residual_shape[2]))
    equal_channels = input_shape[3] == residual_shape[3]

    shortcut = input

    if stride_width > 1 or stride_height > 1 or not equal_channels:
        shortcut = Conv2D(filters=residual_shape[3],
                          kernel_size=(1, 1),
                          strides=(stride_width, stride_height),
                          padding="valid",
                          kernel_initializer="he_normal",
                          kernel_regularizer=l2(0.0001))(input)

    return add([shortcut, residual])

def encoder_block(input_tensor, m, n):
    # m: input channel count (kept to mirror the paper's notation, not used directly)
    # n: output channel count; the first convolution downsamples with stride 2
    x = BatchNormalization()(input_tensor)
    x = Activation('relu')(x)
    x = Conv2D(filters=n, kernel_size=(3, 3), strides=(2, 2), padding="same")(x)

    x = BatchNormalization()(x)
    x = Activation('relu')(x)
    x = Conv2D(filters=n, kernel_size=(3, 3), padding="same")(x)

    added_1 = _shortcut(input_tensor, x)

    x = BatchNormalization()(added_1)
    x = Activation('relu')(x)
    x = Conv2D(filters=n, kernel_size=(3, 3), padding="same")(x)

    x = BatchNormalization()(x)
    x = Activation('relu')(x)
    x = Conv2D(filters=n, kernel_size=(3, 3), padding="same")(x)

    added_2 = _shortcut(added_1, x)

    return added_2

def decoder_block(input_tensor, m, n):
    # m: input channel count, n: output channel count
    # upsampling is done with UpSampling2D + Conv2D instead of a transposed convolution
    x = BatchNormalization()(input_tensor)
    x = Activation('relu')(x)
    x = Conv2D(filters=int(m/4), kernel_size=(1, 1))(x)

    x = UpSampling2D((2, 2))(x)
    x = BatchNormalization()(x)
    x = Activation('relu')(x)
    x = Conv2D(filters=int(m/4), kernel_size=(3, 3), padding='same')(x)

    x = BatchNormalization()(x)
    x = Activation('relu')(x)
    x = Conv2D(filters=n, kernel_size=(1, 1))(x)

    return x

def LinkNet(input_shape=(256, 256, 3), classes=1):
    inputs = Input(shape=input_shape)

    # initial block: 7x7 convolution with stride 2, followed by a 3x3 max-pooling with stride 2
    x = BatchNormalization()(inputs)
    x = Activation('relu')(x)
    x = Conv2D(filters=64, kernel_size=(7, 7), strides=(2, 2), padding="same")(x)

    x = MaxPooling2D((3, 3), strides=(2, 2), padding="same")(x)

    encoder_1 = encoder_block(input_tensor=x, m=64, n=64)

    encoder_2 = encoder_block(input_tensor=encoder_1, m=64, n=128)

    encoder_3 = encoder_block(input_tensor=encoder_2, m=128, n=256)

    encoder_4 = encoder_block(input_tensor=encoder_3, m=256, n=512)

    decoder_4 = decoder_block(input_tensor=encoder_4, m=512, n=256)

    decoder_3_in = add([decoder_4, encoder_3])
    decoder_3_in = Activation('relu')(decoder_3_in)

    decoder_3 = decoder_block(input_tensor=decoder_3_in, m=256, n=128)

    decoder_2_in = add([decoder_3, encoder_2])
    decoder_2_in = Activation('relu')(decoder_2_in)

    decoder_2 = decoder_block(input_tensor=decoder_2_in, m=128, n=64)

    decoder_1_in = add([decoder_2, encoder_1])
    decoder_1_in = Activation('relu')(decoder_1_in)

    decoder_1 = decoder_block(input_tensor=decoder_1_in, m=64, n=64)

    x = UpSampling2D((2, 2))(decoder_1)
    x = BatchNormalization()(x)
    x = Activation('relu')(x)
    x = Conv2D(filters=32, kernel_size=(3, 3), padding="same")(x)

    x = BatchNormalization()(x)
    x = Activation('relu')(x)
    x = Conv2D(filters=32, kernel_size=(3, 3), padding="same")(x)

    # final block: upsample back to the input resolution and project to `classes` channels (logits, no activation)
    x = UpSampling2D((2, 2))(x)
    x = BatchNormalization()(x)
    x = Activation('relu')(x)

    x = Conv2D(filters=classes, kernel_size=(2, 2), padding="same")(x)

    model = Model(inputs=inputs, outputs=x)

    return model
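
A brief usage sketch (illustrative only): since the final convolution above outputs logits with no activation, for binary segmentation one would typically append a sigmoid or train with a from_logits loss.

import numpy as np

model = LinkNet(input_shape=(256, 256, 3), classes=1)
model.summary()

# a dummy batch just to check that the output resolution matches the input
x = np.random.rand(1, 256, 256, 3).astype("float32")
print(model.predict(x).shape)  # expected: (1, 256, 256, 1)
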

Original: https://blog.csdn.net/weixin_45807161/article/details/123689849
Author: 你这个代码我看不懂.
Title: 语义分割模型–LinkNet
