华中杯数学建模 A题简单复盘（附Python源码）

2023年5月31日上午7:12 • 人工智能 • 阅读 86

华中杯 A题简单复盘（附Python 源码）

文章目录

华中杯 A题简单复盘（附Python 源码）
*
前言
题目简介
–
- 问题背景
题目以及思路
–
Code（Python）
总结

前言

五一和队友们一起坐牢了打完了华中杯，只能说确实有点小折磨。这是虽然基本上都完成了题目，但是队内的共识还是觉得这次完成的不够好，所以我专门做了一次关于这次数模比赛的复盘，希望能够有所收获。
源码在最后边。

题目简介

这次的A题是 分拣系统优化问题。

问题背景

电商公司配送中心的工作流程中，分拣环节的是影响配送中心整体性能的关键因素。

系统的流程如下

首先，系统统计汇总出当天全部待配送订单所包含的所有货品及相应数量。然后，转运工将这些货品由仓库转运至分拣处，并放置到货架上，等待分拣。而问题也做了简化，不需要考虑货架的承载量，一个货架只放置一种货品，对件数没有限制。最后，分拣工按任务单依次分拣出每一个订单的货品。

而题目就是在这样的背景下，需要我们对分拣系统做优化。

其实这个背景就划分了流程：分批、上架、分拣。
而下面的题目也是这样。

题目以及思路

具体题目和数据可以直接去官网查看“华中杯”大学生数学建模挑战赛 (hzbmmc.com)

分批算法设计

我们需要设计一个算法，实现将订单分批。货架的个数有限，而且每一个货架只能放置一种货物，当订单全部的货物总数超过货架数时，需要对订单进行分批。
算法的结果，需要计算附件中订单的信息在货架总数N为200时，输出最少的批次数，和分批中每批的订单数、货品种类数等信息。

MindMap

相信看完这道题目，很多人和我一样想到了遗传算法和聚类算法。的确遗传算法是适合寻求最优解、而聚类算法则是很适合利用样本空间中样本的相似度进行聚类。

而以下就是我对基于该题目遗传和聚类的分析。

; 遗传算法优缺点

优点

遗传算法的优点在于可以通过不断的迭代，筛选出优秀基因，针对这道题，我觉得其实就是筛选出优秀的订单的分批。

缺点

但是不同的是，我们在遗传算法的过程中，是有基因交换和基因变异两种情况的，这是遗传算法的筛选的途径。但是基因交换时，是将相同的位置（相对）的染色体进行交换，而我们如果是将不同批次的订单拆分后需要考虑的是交换后的新增种类数，而不是长度，所以单纯的遗传不一定适合。

聚类算法优缺点

优点

可以有效地将数据进行归类聚集。

缺点

就本题而言，聚类本身依靠的样本数据地相似程度，而该题需要的是新增种类，这显然不是可以通过改变相似程度的衡量方式可以处理的。直接聚类的结果可能会使得每个类都很聚集，但是不能使得类充分接近甚至达到货品种类数N的情况。

结合这两种算法，我提出了一种较为朴素的分批算法模型，算法的内核是贪心思想。

基于贪心的分批算法

分批过程，可以参考下图。

图片来自于我的队友（太强了，吹吹我的队友）

我们可以先将所有的订单按照种类数进行分类，划分大小订单。然后是根据大小订单，先用大订单快速生成批次，然后再从小订单中将订单加入批次。

关于这个模型，有几个重点。

; 订单加入批次——贪心

无论是大订单还是小订单，都先从已有的批次中选取可以添加当前订单且加入后批次 新增种类数最小，这就是一个 贪心 的体现。

先大后小

我们在该模型中提出，需要先从大订单中选取，然后再选取小订单，为什么这样呢？

原因很简单，大订单因为自身货品的种类数较多，所以彼此之间种类的差异度会较大，我们可以利用这种特性，快速生成较多的批次，同时又能够使得生成的批次中种类的数量较多。

货品上架——遗传、贪心

在分好批次之后，针对每个批次，我们需要确定批次中货品的种类的上架顺序，根据每个订单的货品种类在该批次中的最大最小次序，有下列关于订单的拣选距离定义公式

我们需要设计出算法，使得批次中订单拣选距离的总和尽可能小。

我们参考了遗传算法中的 基因交换的思路，我们先将批次中的货品的种类任意排序，然后经过多次 迭代，每次迭代的过程： 随机选取两个种类，如果种类交换次序后，批次中订单拣选距离相比原来拣选距离更小，我们就更新为新的次序。这里其实也用了贪心的思想，每次迭代保存局部最优解，然后用局部最优解去逼近全局最优解。

订单指派算法——贪心、并行分配

在分批、上架完成之后，我们需要对分拣工进行订单的指派，目的是使完成的时间尽可能小，每个分拣工的距离尽可能均衡。需要求取的结果是分拣工人数为5的时候的结果。

我们的算法是在每次优先选取移动距离最小的员工进行指派，然后优先选取Ta最近的订单，进行分配。

首先选取移动距离最小的员工这一点很好理解，因为该员工是所有员工中最先完成，所以需要优先分配，然后为了减少不必要的移动距离，需要给ta 分配最近的订单。

Code（Python）

import random

import pandas as pd
import time

N = 200

def big_and_small(choose0, order):
"""
    该函数用于在未保留的订单中划分大订单和小订单，大订单占1/3
    :param choose0:
    :param order:
    :return:
"""
    big1 = []
    small0 = []
    choose0 = dict(choose0)

    choose_order = {}

    for i1 in choose0:
        if choose0[i1] == 0:
            choose_order[i1] = len(order[i1])

    choose_order = sorted(choose_order.items(), key=lambda x: x[1])
    new_len = len(choose_order)

    mid_index = new_len * 2 // 3
    index0 = 0
    for i1 in choose_order:
        if index0 < mid_index:
            small0.append(i1[0])
        else:
            big1.append(i1[0])
        index0 += 1

    return big1, small0

def is_in(batch, order):
"""
    其实应该是得出新增订单的新增货品种类数
    :param batch: 当前批订单, 内容是包含的货品
    :param order: 当前订单，也是货品
    :return: 新增货品种类数
"""

    batch = list(batch)
    order = list(order)
    ans = 0

    for o0 in order:
        if o0 not in batch:
            ans += 1

    return ans

def choose_batch(orders1, batch01, batch02, keys0):
"""
    该函数用于在大订单集或者小订单集将这些订单实现分批
    引入洗牌算法，将选取的订单的顺序进行打乱随机
    :param orders1: 订单
    :param batch01: 批次列表1，存储批次对应的订单
    :param batch02: 批次列表2，存储批次对应的货品种类
    :param keys0: 选取订单的列表, 内容是订单号
    :return: 返回选取加入后的批次1和2
"""

    batch1 = dict(batch01)
    batch2 = dict(batch02)

    i0 = len(batch1) - 1

    if i0 < 0:
        i0 = 0
    for key0 in keys0:

        if i0 not in batch1:
            batch1[i0] = []
            batch2[i0] = []

        add_type = -1
        index0 = -1
        flag = False
        for j in range(i0 + 1):
            if key0 in batch1[j]:
                flag = True
                break
            at = is_in(batch2[j], orders[key0])

            if at == 0:
                index0 = j
                break
            if (add_type < 0 or add_type >= at) and len(batch2[j]) + at  N:
                add_type = at
                index0 = j
        if flag:
            continue

        if index0 >= 0:
            batch1[index0].append(key0)
            for good in orders1[key0]:
                if good not in batch2[index0]:
                    batch2[index0].append(good)
        else:
            i0 += 1
            batch1[i0] = []
            batch1[i0].append(key0)
            batch2[i0] = orders1[key0]

    return batch1, batch2

def choose_small(orders1, batch1, batch2, small1):
    orders1 = dict(orders1)
    keys = list(small1)
    batch1 = dict(batch1)
    batch2 = dict(batch2)
    batch1, batch2 = choose_batch(orders1, batch1, batch2, keys)
    return batch1, batch2

def choose_big(big1, orders1, batch, batch2):
    orders1 = dict(orders1)
    keys = list(big1)

    batch1 = dict(batch)
    batch2 = dict(batch2)
    batch1, batch2 = choose_batch(orders1, batch1, batch2, keys)
    return batch1, batch2

def gen_ite(order, btch_1, btch_2):
"""
    参考遗传算法的精英基因保留的思想，我们在迭代的过程中将批次中订单数较多的保留，这里每次选取10%的批次
    然后需要将这些批次标记为已经选择然后重新在为选取的订单集划分大小订单
    选取的批次可以直接新增在下一代的开头
    :param order: 订单字典，订单号对应相应的货品种类
    :param btch_1:
    :param btch_2:
    :return: 最终多次迭代后的结果
"""
    btch_1 = dict(btch_1)
    btch_2 = dict(btch_2)
    for i0 in range(1):

        btch = sorted(btch_1.items(), key=lambda x: len(x[1]), reverse=True)
        btch_len = len(btch)
        index0 = btch_len // 5

        bt1 = {}
        bt2 = {}

        choose_0 = {}
        for j in order.keys():
            choose_0[j] = 0
        for k in range(index0):
            for o0 in btch_1[k][1]:
                choose_0[o0] = 1
            bt1[k] = btch_1[btch[k][0]]
            bt2[k] = btch_2[btch[k][0]]
        keys = []
        for m in order.keys():
            if choose[m] == 0:
                keys.append(m)

        bt1, bt2 = choose_batch(order, bt1, bt2, keys)

        if len(bt1) < len(btch_1):
            btch_1 = bt1
            btch_2 = bt2
            print('第%d次, 批次数为%d' % (i0 + 1, len(btch_1)))
        le = 0
        for bi in btch_1:
            le += len(btch_1[bi])
        print(le)
    return btch_1, btch_2

def compute_dis(order, batch10, goods):
"""
    :param order: 其实就是订单集合，通过订单可以知道需要的货品
    :param batch10: 批次下的订单列表
    :param goods: 当前批次下的货物关于货物列表的索引
    :return: 返回距离和
"""
    order = dict(order)
    batch10 = list(batch10)
    goods = dict(goods)
    dis = 0
    for bat in batch10:
        index_max = -1
        index_min = -1
        for good in order[bat]:
            if index_min == -1 and index_max == -1:
                index_max = index_min = goods[good]
            else:
                index_max = max(index_max, goods[good])
                index_min = min(index_min, goods[good])

        dis += index_max - index_min
    return dis

def solution2(order, batch2, batch1):
    batch1 = dict(batch1)
    batch2 = dict(batch2)
    order = dict(order)
    pre_total = 0
    ans = []
    p = 0
    total = 0
    start0 = time.time()
    for batch in batch2.items():

        good_to_index = {}
        index_to_good = batch[1]
        for index1 in range(len(index_to_good)):
            good = index_to_good[index1]
            good_to_index[good] = index1

        dis = compute_dis(order, batch1[batch[0]], good_to_index)
        print('当前第%d批原始距离为 %d' % (p + 1, dis))
        pre_total += dis
        for i0 in range(200000):
            j = random.randint(0, len(index_to_good) - 1)
            k = random.randint(0, len(index_to_good) - 1)
            if j == k:
                continue

            good1, good2 = index_to_good[j], index_to_good[k]

            temp = good_to_index[good1]
            good_to_index[good1] = good_to_index[good2]
            good_to_index[good2] = temp
            if dis > compute_dis(order, batch1[batch[0]], good_to_index):
                index_to_good[j], index_to_good[k] = good2, good1
                dis = compute_dis(order, batch1[batch[0]], good_to_index)

            else:
                good_to_index[good1], good_to_index[good2] = good_to_index[good2], good_to_index[good1]

        print('迭代后 第%d批原始距离为 %d' % (p + 1, dis))
        total += dis
        ans.append(index_to_good)
        p += 1
    print('原始距离', pre_total)
    print('最终距离', total)
    end0 = time.time()
    print('时间消耗', end0 - start0)
    return ans

def find_min_and_max_index(order_good, good_index):
    order_good = list(order_good)
    good_index = dict(good_index)
    min_index = max_index = -1
    for good_name in order_good:
        if min_index == -1 and max_index == -1:
            min_index = max_index = good_index[good_name]
            continue
        min_index = min(min_index, good_index[good_name])
        max_index = max(max_index, good_index[good_name])

    return min_index, min_index

def right_find(order, batch1, good_index, pos):
    order = dict(order)
    batch1 = list(batch1)
    good_index = dict(good_index)
    ans = ''
    j = int(pos)
    tar = j
    for name in batch1:
        i1, i2 = find_min_and_max_index(order[name], good_index)
        if i1 >= pos and (j > i1 or j == pos):
            j = i1
            ans = name
            tar = i2

    return ans, tar

def left_find(order, batch1, good_index, pos):
    order = dict(order)
    batch1 = list(batch1)
    good_index = dict(good_index)
    ans = ''
    j = pos
    tar = j
    for name in batch1:
        i1, i2 = find_min_and_max_index(order[name], good_index)
        if i1  pos and (j < i1 or j == pos):
            j = i2
            ans = name
            tar = i1
    return ans, tar

def find_people(distance):
"""
    找工人函数
    我们根据方向和工人所处的位置决定，选取哪个工人
    :param distance: 距离数组
    :return: 返回工人的下标
"""

    ans = 0
    distance = list(distance)
    for i1 in range(1, len(distance)):
        if distance[i1] < distance[ans]:
            ans = i1
    return ans

def solution3(order, batch1, batch2, n):
"""
    该函数主要用于求解第三问
    :param order: 订单对应的货品，字典对象
    :param batch1: 分批好的订单，批号对应的订单
    :param batch2: 分批好的订单对应的货品种类，批号对应的种类
    :param n: 工人的数量
    :return: 返回第三问需要的结果
"""

    batch1 = dict(batch1)
    order = dict(order)
    batch2 = dict(batch2)

    ans = []
    peo_dis1 = []
    total_dis0 = []
    for i1 in range(n):
        total_dis0.append(0)

    for i1 in batch1.keys():

        bat_or = list(batch1[i1])
        bat_go = list(batch2[i1])

        good_index = {}

        for ind in range(len(bat_go)):

            good = bat_go[ind]

            good_index[good] = ind

        people = []
        task = []
        peo_dis0 = []

        for p in range(n):
            people.append(0)
            task.append(0)
            peo_dis0.append(0)

        while len(bat_or) > 0:

            p = find_people(peo_dis0)

            pos = people[p]

            name, next_pos = right_find(order, bat_or, good_index, pos)

            if name == '':

                name, next_pos = left_find(order, bat_or, good_index, pos)
            task[p] += 1
            temp = [name, i1 + 1, p + 1, task[p]]
            ans.append(temp)
            bat_or.remove(name)
            print(next_pos)

            dis = abs(pos - next_pos)

            peo_dis0[p] += dis
            people[p] = next_pos

        peo_dis1.append(peo_dis0)
        for ik in range(n):
            total_dis0[ik] += peo_dis0[ik]

    return ans, peo_dis1, total_dis0

if __name__ == "__main__":
    start = time.time()

    filepath = '附件1：订单信息.csv'

    df = pd.read_csv(filepath)
    con = pd.DataFrame(df, columns=['OrderNo', 'ItemNo'])
    orders = {}
    for line in con.index:

        if not con.iloc[line]['OrderNo'] in orders:
            orders[con.iloc[line]['OrderNo']] = []
        orders[con.iloc[line]['OrderNo']].append(con.iloc[line]['ItemNo'])

    s = set(orders.keys())
    print(len(s))

    choose = {}

    for key in orders.keys():
        choose[key] = 0
    big, small = big_and_small(choose, orders)
    print(len(big), len(small))
    b1, b2 = choose_big(big, orders, {}, {})
    b1, b2 = choose_small(orders, b1, b2, small)
    end = time.time()
    print(end - start)
    print(len(b1), len(b2))
    btch1, btch2 = dict(b1), dict(b2)

    file = open('result1.csv', 'w+')
    result1 = 'OrderNo,GroupNo\n'
    for b in btch1.keys():
        for o in btch1[b]:
            result1 += o + ',' + str(b + 1) + '\n'
    print(result1, file=file)
    file.close()
    ans2 = solution2(orders, batch1=btch1, batch2=btch2)

    result2 = 'ItemNo,GroupNo,ShelfNo\n'
    for index in btch2.keys():
        btch2[index] = ans2[index]
        for _i in range(len(ans2[index])):
            result2 += ans2[index][_i] + ',' + str(index + 1) + ',' + str(_i + 1) + '\n'
    file = open('result2.csv', 'w+')
    print(result2, file=file)
    file.close()

    ans3, peo_dis, total_dis = solution3(orders, batch2=btch2, batch1=btch1, n=5)
    result3 = 'OrderNo,GroupNo,WorkerNo,TaskNo\n'
    for a in ans3:
        for index in range(len(a)):
            result3 += str(a[index])
            if index + 1 < len(a):
                result3 += ','
            else:
                result3 += '\n'
    file = open('result3.csv', 'w+')
    print(result3, file=file)
    file.close()

    result4 = ''
    for b in peo_dis:
        for index in range(len(b)):
            result4 += str(b[index])
            if index + 1 < len(b):
                result4 += ','
            else:
                result4 += '\n'
    file = open('员工路程，单批.csv', 'w+')
    print(result4, file=file)
    file.close()
    file = open('员工路径总和.csv', 'w+')
    result4 = ''
    for i in range(len(total_dis)):
        result4 += str(total_dis[i])
        if i + 1 < len(total_dis):
            result4 += ','
        else:
            result4 += '\n'
    print(result4, file=file)
    file.close()

总结

这次其实可以优化的地方蛮多的，比如第一问其实可以使用遗传算法进行优秀批次的保留，然后不断筛选（可是我在实践编程中，出了bug没有实现成功）。

Original: https://blog.csdn.net/weixin_54891898/article/details/124624629
Author: 物联黄同学
Title: 华中杯数学建模 A题简单复盘（附Python源码）

原创文章受到原创版权保护。转载请注明出处：https://www.johngo689.com/549664/

转载文章受原作者版权保护。转载请注明原作者出处！

人工智能

【自取】最近整理的，有需要可以领取学习：

Linux核心资料大放送~

全栈面试题汇总（持续更新&可下载）

一个提高学习100%效率的工具！

【超详细】深度学习面试题目！

LeetCode Python刷题答案下载！

LeetCode Java版刷题答案下载！

LeetCode C++ 版本，抓紧保存！

LeetCode GO语言刷题答案下载！

Yolov5如何更换EIOU / alpha IOU / SIoU？

🌟想了解YOLO系列算法更多教程欢迎订阅我的专栏🌟 对于基础薄弱的同学来说，推荐阅读《目标检测蓝皮书》 📘，里面涵盖了丰富的目标检测实用知识，是你迅速掌握目标检测的理想选择！如果…

人工智能 2023年7月29日
0078
Docker超详细基础教程，快速入门docker首选(万字长文建议收藏)

一、docker概述 1.什么是docker Docker 是一个开源的应用容器引擎，基于 Go 语言并遵从 Apache2.0 协议开源。Docker 可以让开发者打包他们的应…

人工智能 2023年7月21日
0049
自考-知识点总结-数据库系统原理 04735

数据库系统原理第一章数据库系统概述数据库基本概念数据：符号记录（文本或多媒体）数据库（DB）：长期储存、有组织、可共享的数据集合数据库管理系统（DBMS）：建立与管理…

人工智能 2023年6月1日
0069
毕设–基于深度学习的人脸识别（详细步骤）

最近闲来无事，想写一个本人毕设基于深度学习的人脸识别文章。我主要利用两个不同的神经网络进行实现，分别是一个简单三层的卷积神经网络和结构复杂的VGG16神经网络，并比对了两种网络训练…

人工智能 2023年5月23日
0095
百度飞桨数据处理 API 数据格式 HWC CHW 和 PIL 图像处理之间的关系

使用百度飞桨 API 例如：Resize Normalize，处理数据的时候。 Resize：如果输入的图像是 PIL 读取的图像这个数据格式是 HWC ，Resize 就需要 H…

人工智能 2023年6月4日
0095
图像处理（一）图像灰度化的三种方式

图像处理的第一步操作基本都是进行灰度化，二灰度化的方式都很多种，可以根据自己的具体需要进行参数调整，基本分为四大种，分别为 YUV亮度灰度化、最大值灰度化、平均值灰度化、 Ga…

人工智能 2023年6月18日
0049
Python数据可视化 Matplotlib详解（一) —— 折线图与时序数据绘制

目录本小节内容基础代码实例讲解如何多图绘制 * 多行多列的情况多行单列的情况 Matplotlib 绘制时序数据多时序变量绘图如何给数据添加注释 Reference …

人工智能 2023年7月15日
0065
基于Transformer的目标检测领域最新进展

本文为检索该领域文章后做的简单记录，没有模型结构的详细介绍，不过其中两三篇文章已是目前目标检测领域的SOTA（截止到21年8月）！ a、End-to-End Object Det…

人工智能 2023年6月21日
00101
关于分类任务的常见指标一文就够了（P、R、F1、ROC、AUROC、AP）

分类任务相关(Relevant),正类无关(NonRelevant),负类被检索到(Retrieved) （判为P）true positives(TP 正类判定为正类 ,)f…

人工智能 2023年7月1日
0096
VIT(vision transformer)模型介绍+pytorch代码炸裂解析

前言一直对transformer都有很大的兴趣，之前看到有vision transformer，一直没来得及好好看，这两天拿出来吸收了下精华，顺便写个文章记录一哈地址论文：h…

人工智能 2023年5月28日
0095
自然语言处理NLTK（一）：NLTK和语料库

对于文本的研究，由于语言主要是中文，对英语的研究将会少得多，主要是因为应用的问题，而现在对于海外的产品，英语的语言处理将变得越来越重要。事实上，英语语言处理数据将比汉语处理数据更加…

人工智能 2023年5月27日
0058
2022-2028年全球与中国化学抑尘剂行业市场深度调研及投资预测分析

本文研究全球与中国市场化学抑尘剂的发展现状及未来发展趋势，分别从生产和消费的角度分析化学抑尘剂的主要生产地区、主要消费地区以及主要的生产商。重点分析全球与中国市场的主要厂商产品特点…

人工智能 2023年6月11日
0084
Pyts入门之时间序列的分类—理解LearningShapelets算法（四）

简介本章节直接跳过了BOSSVS算法，因为BOSSVS和SAX-VSM的区别也就只在于前面数据预处理的方式了，SAX-VSM用的是分箱的字词转换，而BOSSVS用的是Symbol…

人工智能 2023年7月1日
0075
高斯低通和高斯高通滤波器

1.高斯低通滤波器（GLPF）高斯低通滤波器的二维形式为：其中，D0是截止频率，D(u,v)是距频率矩形中心的距离。高斯滤波器的宽度由参数 D0 表征，决定了平滑程度，而且 D…

人工智能 2023年6月17日
0075
中国人工智能市场运营调研及发展前景预测报告2022-2028年

中国人工智能市场运营调研及发展前景预测报告2022-2028年═━┈┈━══━┈┈━══━┈┈━══━ 【出版机构】: 中商经济研究网第一章人工智能的基本介绍 1.1 人工智能…

人工智能 2023年6月11日
0068
目标跟踪之Pysot系列代码训练（SiamRPNSiamRPN++）

代码地址：https://github.com/STVIR/pysot环境配置参考博客：Siam系列跟踪算法工具包PySOT配置一、数据集准备 Pysot系列跟踪器训练的时候，首…

人工智能 2023年5月26日
00125

2024 年 5 月
一	二	三	四	五	六	日
		1	2	3	4	5
6	7	8	9	10	11	12
13	14	15	16	17	18	19
20	21	22	23	24	25	26
27	28	29	30	31

华中杯 数学建模 A题简单复盘（附Python源码）