Multi-Modal Knowledge Graph Construction and Application: A Survey

June 1, 2023, 9:48 AM • Artificial Intelligence • 118 views

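Abstract:

Problems: 1. real-world knowledge is growing explosively; 2. existing KGs are represented with pure symbols, which makes it hard for machines to understand the real world.

-> Proposed solution: multi-modal KGs, a key step toward human-level machine intelligence.

-> Result: MMKG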

Overview:

1. Definition of MMKGs;

2. Preliminaries on multi-modal tasks and techniques;

3. A systematic review of the challenges, progress, and opportunities in the construction and application of MMKGs;

4. An analysis of the strengths and weaknesses of different solutions.

1. Introduction

On one hand: the symbol "dog" versus the experience of dogs. Symbols need to be linked to their meaning in the physical world.

On the other hand:

1. Images make tasks such as relation extraction and attribute extraction easier (e.g., PartOf: the keyboard and the screen are parts of a laptop).

2. With an MMKG, a system can produce a more informative entity-level sentence instead of a vague concept-level one (e.g., with an MMKG: "Donald Trump is making a speech"; without an MMKG: "A tall man with blond hair is making a speech").

Construction (two opposite directions) [challenges, progress, opportunities]:

One is from images to symbols, i.e. labeling images with symbols in the KG.
The other is from symbols to images, i.e. grounding symbols in the KG to images.

Application:

In-MMKG: applications that address quality or integration issues of the MMKG itself.
Out-of-MMKG: general multi-modal tasks that an MMKG can help with.

2. Definition and Preliminaries

2.1 first defines two ways of representing MMKGs;

2.2 reviews some preliminaries on multi-modal tasks and techniques;

2.3 follows with a discussion of the connections between MMKGs and existing multi-modal tasks and techniques.

2.1 Definitions and Representation of MMKGs

Two different ways of representing MMKGs:

A-MMKG: takes multi-modal data as particular attribute values of entities/concepts.
N-MMKG: takes multi-modal data as entities in the KG.
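
The difference between the two representations can be sketched with plain (subject, predicate, object) triples. This is only an illustration: the entity names, predicate names (`hasImage`, `similarTo`, `contains`), the `img:` prefix, and the file name are all made up here and are not taken from the survey.

```python
# Triples are (subject, predicate, object) tuples.
# All names below are hypothetical and exist only for illustration.

# A-MMKG: the image is an attribute value attached to an entity.
a_mmkg = [
    ("Laptop", "hasPart", "Keyboard"),
    ("Laptop", "hasImage", "laptop_01.jpg"),  # image stored as an attribute value
]

# N-MMKG: the image is itself an entity (a node), so it can carry relations.
n_mmkg = [
    ("Laptop", "hasPart", "Keyboard"),
    ("Laptop", "hasImage", "img:laptop_01"),
    ("img:laptop_01", "similarTo", "img:laptop_02"),
    ("img:laptop_01", "contains", "img:keyboard_07"),
]


def image_nodes(triples):
    """Return image identifiers that appear as subjects, i.e. as first-class entities."""
    return {s for s, _, _ in triples if s.startswith("img:")}
```

In this sketch `image_nodes(a_mmkg)` is empty, because an attribute value never acts as a subject, while `image_nodes(n_mmkg)` contains `img:laptop_01`.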

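An N-MMKG usually abstracts an image into several image descriptors, which are typically feature vectors summarizing the image entity at the pixel level. Relations between images can therefore be obtained through simple computation (e.g., image similarity via the inner product of the image descriptor vectors).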
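
As a minimal sketch of the inner-product idea, assume each image has already been reduced to a fixed-length descriptor vector. The descriptor values below are invented, and the cosine normalization is an optional extra that the notes do not mention.

```python
import math


def inner_product(u, v):
    """Inner product of two descriptor vectors of equal length."""
    if len(u) != len(v):
        raise ValueError("descriptors must have the same length")
    return sum(a * b for a, b in zip(u, v))


def cosine_similarity(u, v):
    """Inner product normalized by vector lengths (an added assumption)."""
    norm = math.sqrt(inner_product(u, u)) * math.sqrt(inner_product(v, v))
    if norm == 0:
        raise ValueError("descriptors must be non-zero")
    return inner_product(u, v) / norm


# Hypothetical descriptors for three images.
desc_a = [1.0, 0.0, 2.0]
desc_b = [2.0, 0.0, 4.0]  # same direction as desc_a
desc_c = [0.0, 3.0, 0.0]  # orthogonal to desc_a

print(inner_product(desc_a, desc_b))  # large value: similar images
print(inner_product(desc_a, desc_c))  # zero: no shared features
```

The raw inner product grows with vector magnitude, so a normalized score such as `cosine_similarity` is often easier to compare across image pairs.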

2.2 Preliminaries on Multi-Modal Tasks and Techniques

well-studied multi-modal tasks
multi-modal learning techniques
followed by important progress on multi-modal pretrained language models

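Multi-Modal Tasks

(a problem is characterized as multi-modal if it involves data of multiple modalities)

Multi-modal tasks integrate and model multiple communicative modalities in order to obtain knowledge or understanding from multi-modal data.

Multi-Modal Learning

Multi-modal learning mainly models the correspondences between modalities in order to understand multi-modal data.

Challenges: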

Multi-Modal Representation
Multi-Modal Translation
Multi-Modal Alignment
Multi-Modal Fusion
Multi-Modal Co-Learning

Multi-Modal Pretrained Language Models

In recent years, researchers have designed a number of self-supervised pretraining tasks.

In terms of the Transformer-based fusion process across modalities, multi-modal pretrained language models can be divided into:

single-stream models
two-stream models

2.3 Discussion

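Although much research has applied multi-modal learning techniques and multi-modal pretrained language models to a variety of multi-modal tasks, introducing multi-modal knowledge to improve the performance of existing multi-modal tasks is still an emerging trend. MMKGs can benefit these downstream tasks in the following ways:

An MMKG provides sufficient background knowledge to enrich the representation of entities and concepts, especially long-tail ones.
An MMKG enables the understanding of unseen objects in images.
An MMKG enables multi-modal reasoning.
An MMKG usually provides multi-modal data as additional features to bridge the information gaps in some NLP tasks.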

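PS:

The long-tail problem:

A long tail is a data distribution that shows up in real production data. Its key characteristic is that the part carrying relatively little weight contains a large number of instances. For example, take 1,000,000 Weibo posts on a given topic and rank the characters in them by frequency: apart from the head of the ranking, there is an enormous number of low-frequency characters.

Common solutions to the long-tail problem:

High-frequency part: manual filtering plus manual annotation, producing high-quality usable data.
Low-frequency part: automated construction, producing usable data at a specified quality level.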
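
The character-frequency example and the head/tail split above can be sketched as follows. The sample posts and the `head_size` cutoff are assumptions chosen for illustration; in practice the cutoff depends on the data and the annotation budget.

```python
from collections import Counter


def split_head_tail(texts, head_size):
    """Rank characters by frequency and split the ranking into head and tail."""
    counts = Counter(ch for text in texts for ch in text if not ch.isspace())
    ranked = counts.most_common()  # sorted by descending frequency
    return ranked[:head_size], ranked[head_size:]


# Toy stand-in for a large collection of posts (hypothetical data).
posts = ["aaaa bbb", "aab c", "a d e"]
head, tail = split_head_tail(posts, head_size=2)

print("head (manual filtering and annotation):", head)
print("tail (automated construction):", tail)
```

Even in this toy input the tail holds more distinct characters than the head while accounting for far fewer occurrences, which is the shape the long-tail description refers to.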

To sum up:

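Without the support of large-scale MMKGs, previous efforts to use multi-modal information have remained limited. We envision that many tasks can be further improved once large-scale, high-quality MMKGs become available.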

Original: https://blog.csdn.net/qq_42018489/article/details/123198882
Author: 小学生R_rrr
Title: Multi-Modal Knowledge Graph Construction and Application: A Survey

