Markov Chains (Markov Chain, Markov Model) Explained (First-Order and Higher-Order) and Their Application (Modeling for Data Prediction)

May 27, 2023, 8:49 PM • Artificial Intelligence • 96 reads

This article briefly introduces the concept of the Markov chain (first-order, second-order, and higher-order chains) and its application (how to predict data by building a model).

In short, a Markov chain is a statistics-based mathematical model.

So what does "statistics-based" mean? Consider one of the most common scenarios in everyday life. When we type with an input method, it automatically pops up suggested completions. This is especially noticeable for very common nouns, such as names. Suppose someone's name is Olivier. The first time we type this name, we have to spell every letter completely and correctly, otherwise the input method may turn it into something like 奥利维尔. After we have typed it many times, however, Olivier pops up directly, and we may only need to type O for the input method to suggest the full name Olivier. The internal processing of the input method follows the logic of a Markov chain: when the user enters a phrase frequently, its usage count rises, so when the user types only the beginning of that phrase, it is likely that they want the whole phrase, and that suggestion gets a higher priority.

The purpose of this example is to convey one basic idea: a Markov chain model is built from historical data. So how do we collect statistics from historical data and build a Markov model? To answer that, we start with the definition of a Markov chain.

A Markov chain is a mathematical model that describes the process by which the state of an object changes.

Let us again start with an example. Suppose we have historical data recording the daily weather in one place over the past year (2021). We simply define three weather types: cloudy, sunny, and rainy. The historical data looks like this:

1.1 sunny, 1.2 sunny, 1.3 cloudy, 1.4 rainy … 12.30 rainy, 12.31 rainy.
The weather type here can be regarded as the state of the object. For example, cloudy is state 1, sunny is state 2, and rainy is state 3.

The historical data can then be rewritten as
1.1 2, 1.2 2, 1.3 1, 1.4 3 … 12.30 3, 12.31 3.
It can be viewed as one long list of length 365:
[2, 2, 1, 3 … 3, 3]
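The encoding step above can be sketched in a few lines of Python. This is only an illustration: the English labels and the names `STATE_OF` and `encode` are assumptions introduced here, and the input is just the first four days of the example record.

```python
# Hypothetical label-to-state mapping that follows the article's numbering:
# cloudy -> 1, sunny -> 2, rainy -> 3.
STATE_OF = {"cloudy": 1, "sunny": 2, "rainy": 3}


def encode(weather):
    """Turn a list of weather labels into a list of state numbers."""
    return [STATE_OF[w] for w in weather]


# First four days of the example record (1.1 to 1.4).
sample = ["sunny", "sunny", "cloudy", "rainy"]
states = encode(sample)
print(states)  # [2, 2, 1, 3]
```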
We now have the historical data for 2021, and what we want to do is predict the weather on some future day in 2022.

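Of course, this example is not very practical in itself, because Markov chains do not predict weather well. It is only a real-life example to help explain how a Markov chain model is built.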

Now we use a Markov chain to predict tomorrow's weather. We have several options: we can predict tomorrow's weather from today's weather, from yesterday's and today's weather, or even from the weather of the past week. Intuitively, a prediction based only on today's weather is less accurate than one based on a longer history.

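This already contains the concept of the order of a Markov chain. Predicting tomorrow from today requires a first-order Markov chain; predicting tomorrow from two days (yesterday and today) requires a second-order Markov chain, and so on up to an n-th-order Markov chain.

An n-th-order Markov chain is one in which the state at the next step depends only on the states of the previous n steps.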
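Written as a formula, the definition above says that conditioning on the full history gives the same distribution as conditioning on the last n states. The notation is an assumption introduced here for illustration: $X_t$ denotes the state at step $t$ and $s$ is any possible state.

```latex
P(X_{t+1} = s \mid X_t, X_{t-1}, \dots, X_1)
  = P(X_{t+1} = s \mid X_t, X_{t-1}, \dots, X_{t-n+1})
```

For $n = 1$ this reduces to $P(X_{t+1} = s \mid X_t)$, the first-order case in which only today's weather matters.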

The next step is to collect statistics from the data. We take building a first-order Markov chain as the example. For a first-order chain, we need to predict the next state from the current state. This leads to the key tool of a Markov chain: the probability state-transition matrix. We need to estimate the following probabilities:

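Probability that the next state is 1 given that the current state is 1 (P11)
Probability that the next state is 2 given that the current state is 1 (P12)
Probability that the next state is 3 given that the current state is 1 (P13)
Probability that the next state is 1 given that the current state is 2 (P21)
Probability that the next state is 2 given that the current state is 2 (P22)
Probability that the next state is 3 given that the current state is 2 (P23)
Probability that the next state is 1 given that the current state is 3 (P31)
Probability that the next state is 2 given that the current state is 3 (P32)
Probability that the next state is 3 given that the current state is 3 (P33)
and put them into a 3×3 matrix.
For example, to compute the probability that the next state is 2 given that the current state is 1, we count how many times the consecutive pair 1, 2 appears in the long list. Dividing that count by the total number of times state 1 is followed by any state gives the relative frequency, which we take as an approximation of the probability.
This gives us the following probability state-transition matrix: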

P11 P12 P13
P21 P22 P23
P31 P32 P33
Its values might be, for example:
0.1 0.3 0.6
0.2 0.5 0.3
0.4 0.4 0.2
Note that every row must sum to 1, because the next state has to be one of a finite set of states. For example, if the current state is 2, the next state can only be 1, 2, or 3.
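The counting procedure described above can be sketched with NumPy. This is a minimal sketch rather than the author's implementation: the function name `transition_matrix` and the short sequence `toy` are made up for illustration, and states use the article's 1-based numbering.

```python
import numpy as np


def transition_matrix(states, n_states=3):
    """Estimate first-order transition probabilities from a state sequence.

    `states` uses 1-based state numbers (1, 2, 3), as in the article.
    """
    counts = np.zeros((n_states, n_states))
    # Count every consecutive pair (current state, next state).
    for cur, nxt in zip(states[:-1], states[1:]):
        counts[cur - 1, nxt - 1] += 1
    row_totals = counts.sum(axis=1, keepdims=True)
    # Divide each row by its total; rows with no observations stay all zero.
    return np.divide(counts, row_totals,
                     out=np.zeros_like(counts), where=row_totals > 0)


# A short made-up sequence, not real weather data.
toy = [2, 2, 1, 3, 3, 1, 2, 3, 3, 3]
P = transition_matrix(toy)
print(P)
print(P.sum(axis=1))  # every row with observations sums to 1
```

With a full year of data the same function would be called on the 365-element list. A state that never appears as a current state yields an all-zero row, which is a design choice of this sketch and would need separate handling before prediction.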

Computing the cumulative state-transition matrix from the probability state-transition matrix

Each entry is obtained by adding the entry at the same position in the probability matrix to all entries to its left in the same row, that is, a running sum along each row. The cumulative state-transition matrix computed from the probability state-transition matrix above is

0.1 0.4 1
0.2 0.7 1
0.4 0.8 1
If the probability state-transition matrix is correct, every entry in the rightmost column of the cumulative state-transition matrix must be 1.
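The running sum can be reproduced with `numpy.cumsum` along the row axis. This is a small sketch using the example matrix from the article; the variable names `P` and `C` are chosen here for illustration.

```python
import numpy as np

# The example probability state-transition matrix from the article.
P = np.array([
    [0.1, 0.3, 0.6],
    [0.2, 0.5, 0.3],
    [0.4, 0.4, 0.2],
])

# Running sum along each row (axis=1).
C = np.cumsum(P, axis=1)
print(C)

# Sanity check: the last column should be 1, up to floating-point rounding.
assert np.allclose(C[:, -1], 1.0)
```

Comparing with `np.allclose` instead of `==` avoids false alarms from floating-point rounding in the sums.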

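State prediction

Generate a random value between 0 and 1 and use it as a threshold to compare against row x of the cumulative state-transition matrix, where x is the current state. For example, if the current state is 2, compare against the second row. If the threshold is greater than or equal to column y of that row and less than column y+1, the next state is predicted to be y+1 (if it is less than the first column, the next state is 1). The same rule applies to the remaining columns.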
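The threshold comparison above can be sketched with `numpy.searchsorted`, which finds the column interval that a value falls into. The function names `next_state` and `simulate`, the fixed seed, and the guard against rounding in the last column are assumptions of this sketch, not part of the original article.

```python
import numpy as np

# Cumulative state-transition matrix from the article's example.
C = np.array([
    [0.1, 0.4, 1.0],
    [0.2, 0.7, 1.0],
    [0.4, 0.8, 1.0],
])


def next_state(current, r, cumulative=C):
    """Return the next state (1-based) for threshold r in [0, 1)."""
    row = cumulative[current - 1]
    # side="right" gives the count of columns whose value is <= r,
    # so r below the first column yields 0, i.e. state 1.
    idx = int(np.searchsorted(row, r, side="right"))
    # Guard in case the last column is slightly below 1 due to rounding.
    return min(idx, len(row) - 1) + 1


def simulate(start, n_steps, seed=0):
    """Generate a state sequence by repeatedly drawing a random threshold."""
    rng = np.random.default_rng(seed)
    states = [start]
    for _ in range(n_steps):
        states.append(next_state(states[-1], rng.random()))
    return states


print(next_state(2, 0.15))  # 1: 0.15 is below the first column (0.2)
print(next_state(2, 0.5))   # 2: 0.2 <= 0.5 < 0.7
print(next_state(2, 0.7))   # 3: 0.7 <= 0.7 < 1.0
print(simulate(start=2, n_steps=10))
```

Each call to `simulate` feeds the predicted state back in as the new current state, which is how a multi-day forecast or a synthetic series would be produced from the one-step rule.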

For higher-order chain modeling, the formulas, and the detailed state prediction procedure, see "Synthetic High-Resolution Wind Data Generation Based on Markov Model".

Original: https://blog.csdn.net/onesway2018/article/details/123899800
Author: onesway2018
Title: Markov Chains (Markov Chain, Markov Model) Explained (First-Order and Higher-Order) and Their Application (Modeling for Data Prediction)

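Original articles are protected by copyright. When reposting, please credit the source: https://www.johngo689.com/527661/

Reposted articles are protected by the original author's copyright. When reposting, please credit the original author!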
