Numpy and SIMD

2023年8月26日上午4:39 • Python • 阅读 44

Numpy is by its design a SIMD structure, which is best examplified by the list indexing feature:

Together with ohther python structures, such as dictionary, the SIMD execution can be done, but with a more unintuitive grammatic style:

from numpy import copy

newArray = copy(theArray)
for k, v in d.iteritems(): newArray[theArray==k] = v

numpy.copy — NumPy v1.22 Manual
test code:

#!/usr/bin/env python2.7

from numpy import copy, random, arange

random.seed(0)
data = random.randint(30, size=10**5)

d = {4: 0, 9: 5, 14: 10, 19: 15, 20: 0, 21: 1, 22: 2, 23: 3, 24: 0}
dk = d.keys()
dv = d.values()

def f1(a, d):
    b = copy(a)
    for k, v in d.iteritems():
        b[a==k] = v
    return b

def f2(a, d):
    for i in xrange(len(a)):
        a[i] = d.get(a[i], a[i])
    return a

def f3(a, dk, dv):
    mp = arange(0, max(a)+1)
    mp[dk] = dv
    return mp[a]

a = copy(data)
res = f2(a, d)

assert (f1(data, d) == res).all()
assert (f3(data, dk, dv) == res).all()

Any All in Python – GeeksforGeeks
python – Use a.any() or a.all() – Stack Overflow
==> obviously direct indexing filter by numpy arrays is going to be faster than numpy + dict;
===> use numpy arrays when you can; python has weaker SIMD support in general.

the indices slicing support makes SIMD easier and more versatile in implementation, but depending on the hardware support and numpy low-level implementations, i.e. how the hardware organize physical storage and accesses and how numpy utilize such features, different slicing directions might have drastically different performances.

==> in general assume row-major or C-like array storage structures, meaning slicing lower indices will be more efficient.

==> some ASICs might have much stronger support for higher indices slicing.

Numpy

While python is not structured for SIMD style prarallel programming, since you don’t even have accesses to pointers, Numpy is. And this package has proven to be quite efficient (, power of modular design I guess), so try to base your SIMD coding with Numpy.

Slicing

Numpy slicing is not as straightforward as manipulating the memory buffers directly with pointers in C/C++, but still many slicing options are supported; there are 2 versions:

basic:

array[row][col][level]

this is a successive access, reducing array dimensionality by 1 per dereference action, [], if a non-full range of indices is given.

slicing in this mode is restrictive:

array[:][:col][:level]

is actually

array[:col][:level]

==> use this mode for clear, structured expressions of dereferences.

multi-directional access, or full slicing support

array[row, col, level]

this mode does the same for specified accesses, but differs in slicing behavior from successive derefences;

array[:, :col, :level]

actually works as intended, i.e. “for all rows, take all cols till col; for each of the taken cols, take all levels till level.”

!!!!

Slicing combined with filter/list indexing as introduced in Indexing section is a powerful feature, which strongly mirrors a basic gather/scatter action in most SIMD ISAs, but they can be extremely confusing to use.

e.g.

for a 3D array a:

array([[[0.87330218, 0.348806 , 0.98876196, 0.44153593, 0.35657919],
[0.0591688 , 0.01207211, 0.76808385, 0.5382626 , 0.74737973],
[0.61562341, 0.49494463, 0.99326787, 0.78333718, 0.18965861],
[0.10603183, 0.78535426, 0.54849272, 0.6651616 , 0.99013694]],

[[0.42220155, 0.65080645, 0.92558894, 0.11468048, 0.70492543],
[0.58528903, 0.71053382, 0.96009024, 0.84545703, 0.89357304],
[0.61943998, 0.99428317, 0.54617109, 0.62770748, 0.39451982],
[0.94771556, 0.56667405, 0.18225097, 0.75520699, 0.99649013]],

[[0.05937206, 0.71885611, 0.08577789, 0.82468742, 0.61361646],
[0.13556848, 0.05283339, 0.63987149, 0.91302604, 0.37158879],
[0.37965324, 0.71274351, 0.19897426, 0.48187764, 0.55820695],
[0.20501126, 0.44322089, 0.90804689, 0.55505773, 0.66719231]]])

will give:

array([0.5382626, 0.18965861])

add in slicing:

a[:2, np.array((1,2)), np.array((3,4))]

==>
array([[0.5382626 , 0.18965861],
[0.84545703, 0.39451982]])

while

a[:2][np.array((1,2)), np.array((3,4))]

==>
Traceback (most recent call last):
File “

Maybe try:

==> try with a terminal and toy examples to help;

==> while slicing, except for only the lowest index, use the [,,,,,] mode/notation only, to avoid confusing yourself;

==> of course, you can always reshape whichever array you are dealing with into 1D and treat it as a C pointer.

Grammar

minimum is the elementwise SIMD like comparison

array.min/np.amin is a reduction operation, not elementwise

==> naturally the same applies to max/maximum

additionally, for the position of the extrema, see:

for initialization use np.full():

updates: array.fill()

pad an existing array

reshape and resize: reshape explicitly requires the new shape to be compatible with the old one

==> by choosing order=’C’ / ‘F’ (row major or C-like vs. column major or Fortran-like) while casting arrays from nD to 1D or vice versa, you can achieve interleave/deinterleave effects.

Original: https://blog.csdn.net/maxzcl/article/details/122978521
Author: EverNoob
Title: Numpy and SIMD

原创文章受到原创版权保护。转载请注明出处：https://www.johngo689.com/759936/

转载文章受原作者版权保护。转载请注明原作者出处！

python

【自取】最近整理的，有需要可以领取学习：

Linux核心资料大放送~

全栈面试题汇总（持续更新&可下载）

一个提高学习100%效率的工具！

【超详细】深度学习面试题目！

LeetCode Python刷题答案下载！

LeetCode Java版刷题答案下载！

LeetCode C++ 版本，抓紧保存！

LeetCode GO语言刷题答案下载！

.NET 开源项目推荐之直播控制台解决方案 Macro Deck

流媒体是一个吸引数亿万玩家的严肃行业。最受欢迎的游戏锦标赛的转播获得了数百万的观看次数，从商业角度来看，这也使游戏行业变得有趣。在直播圈有个很受欢迎的直播控制台程序Macro D…

Python 2023年10月19日
0047
Scrapy爬取1——接口数据爬取准备

本文爬取网页：https://spa1.scrape.center/ 爬取流程： 1.检查页面：检查网页源代码，查看数据是在网页HTML源代码中还是调用了接口右键检查页面源代码，…

Python 2023年10月1日
0076
【Python量化】VaR在险价值的计算

此文章首发于微信公众号：Python for Finance 链接：https://mp.weixin.qq.com/s/uaDEnSzoalTaRmZ9GNvR0A 一、VaR的…

Python 2023年9月28日
0050
【机器学习】推荐算法(附例题代码)

往期文章【机器学习】回归分析【机器学习】Logistic回归【机器学习】神经网络【机器学习】支持向量机【机器学习】主成分分析与聚类分析文章目录 * – 推荐算法 &#…

Python 2023年10月24日
0040
设计模式(Python语言)—-简单工厂模式

推荐文章很多小伙伴都发现了，用户自主「申请上首页」的按钮取消了，那博主们写的文章还有上首页曝光的机会吗？我们的回答是”当然有！！！”虽然我们取消了上首页申…

Python 2023年5月24日
0065
Linux用户和用户组配置文件知识详解

✅作者简介：热爱国学的Java后端开发者，修心和技术同步精进。🍎个人主页：Java Fans的博客🍊个人信条：不迁怒，不贰过。小知识，大智慧。💞当前专栏：Java案例分享专栏✨特色…

Python 2023年11月7日
0041
计算机视觉项目-文档扫描OCR识别

😊😊😊 欢迎来到本博客😊😊😊本次博客内容将继续讲解关于OpenCV的相关知识🎉 作者简介：⭐️⭐️⭐️ 目前计算机研究生在读。主要研究方向是人工智能和群智能算法方向。目前熟悉pyt…

Python 2023年8月1日
0071
Scrapy运行发生No module named ‘win32api‘报错解决方案

抵扣说明： 1.余额是钱包充值的虚拟货币，按照1:1的比例进行支付金额的抵扣。2.余额无法直接购买下载，可以购买VIP、付费专栏及课程。 Original: https://blo…

Python 2023年10月5日
0036
Bert不完全手册3. Bert训练策略优化！RoBERTa & SpanBERT

之前看过一条评论说Bert提出了很好的双向语言模型的预训练以及下游迁移的框架，但是它提出的各种训练方式槽点较多，或多或少都有优化的空间。这一章就训练方案的改良，我们来聊聊RoBER…

Python 2023年10月28日
0048
Postgres 日志监控：阻塞，死锁，Checkpoint 优化（译）

原文地址：https://pganalyze.com/blog/postgresql-log-monitoring-101-deadlocks-checkpoints-blocke…

Python 2023年6月16日
0058
自动化框架–pytest

1、pytest安装 pytest是web自动化的一种框架，对测试用例进行管理、执行、记录、出具报告的。pytest是一种外置框架，要想使用必须先安装 2、pytest与unitt…

Python 2023年9月11日
0053
Matplotlib的基本使用

文章目录 1. 什么是Matplotlib 2. 安装和基本使用 3.点线缩写和颜色的选择 4. 设置图的信息 * 4.1 设置线条样式的方法 4. 2 设置轴和标题 &#8211…

Python 2023年9月5日
0060
Pandas之DataFrame对象大总结

一、什么是DataFrame？ DataFrame是一个表格型的数据结构，它含有一组有序的列，每列可以是不同类型的值。DataFrame既有行索引也有列索引，它可以被看做是由Ser…

Python 2023年8月7日
0066
Python自学教程7-字典有哪些常用操作

Python 2023年5月24日
0073
25岁竟要求产品经验10年？我一直以为是个段子，没想到居然是真的

你是不是以为工作3年要求有10年工作经验是个段子？今天告诉你这是真事儿！ 25岁要求工作经验10年！！！你好歹给人家工作5年的机会吧！这样也好说剩下的5年工作经验靠加班！ HR…

Python 2023年9月27日
0047
diffy接口测试demo

diffy接口测试demo diffy原理图，这里就不讲了，可以去github看文档。这里就记录下写的小demo～使用flask编写一个接口脚本作为 primary code，a…

Python 2023年8月10日
0056

2024 年 4 月
一	二	三	四	五	六	日
1	2	3	4	5	6	7
8	9	10	11	12	13	14
15	16	17	18	19	20	21
22	23	24	25	26	27	28
29	30

Numpy and SIMD

Numpy

Slicing

Grammar

大家都在看