pandas基础入门之数据与查看

2023年8月20日下午9:15 • Python • 阅读 40

pandas是使数据分析分析工作变得更快更简单的高级数据结构和操作工具，是数据分析师必须要熟练掌握的，现将pandas学习资料整理如下：

pandas数据与查看

1.1数据抽样

head(n)查看前n行的数据,图例就是演示的展示前2行的数据。

import pandas as pd
import numpy as np

df=pd.DataFrame(np.arange(12).reshape(4,3),index=list("abcd"),columns=['w','y','z'])
print(df,df.head(2))

out:
   w   y   z
a  0   1   2
b  3   4   5
c  6   7   8
d  9  10  11
  w  y  z
a  0  1  2
b  3  4  5

tail(n)查看后n行的数据,图例就是演示的展示后2行的数据。

import pandas as pd
import numpy as np
df=pd.DataFrame(np.arange(12).reshape(4,3),index=list("abcd"),columns=['w','y','z'])
print(df,df.tail(2))

out:
   w   y   z
a  0   1   2
b  3   4   5
c  6   7   8
d  9  10  11
   w   y   z
c  6   7   8
d  9  10  11

sample(n)就是随机取n行，图例就是演示的随机展示2行的数据。

import pandas as pd
import numpy as np
df=pd.DataFrame(np.arange(12).reshape(4,3),index=list("abcd"),columns=['w','y','z'])
print(df,df.sample(2))

out:
   w   y   z
a  0   1   2
b  3   4   5
c  6   7   8
d  9  10  11
   w   y   z
d  9  10  11
a  0   1   2

1.2查看数据属性

shape 查看数据维度，行列数。图例DataFrame是4行3列，因此输出(4,3)。

import pandas as pd
import numpy as np
df=pd.DataFrame(np.arange(12).reshape(4,3),index=list("abcd"),columns=['w','y','z'])
print(df)
print(df.shape)

out:
   w   y   z
a  0   1   2
b  3   4   5
c  6   7   8
d  9  10  11
(4, 3)

index查看行索引。

import pandas as pd
import numpy as np
df=pd.DataFrame(np.arange(12).reshape(4,3),index=list("abcd"),columns=['w','y','z'])
print(df)
print(df.index)

out：
   w   y   z
a  0   1   2
b  3   4   5
c  6   7   8
d  9  10  11
Index(['a', 'b', 'c', 'd'], dtype='object')

columns查看列索引。

import pandas as pd
import numpy as np
df=pd.DataFrame(np.arange(12).reshape(4,3),index=list("abcd"),columns=['w','y','z'])
print(df)
print(df.columns)

out：
   w   y   z
a  0   1   2
b  3   4   5
c  6   7   8
d  9  10  11
Index(['w', 'y', 'z'], dtype='object')

dtypes查看数据类型。常见的有以下5种类型：

object — 代表了字符串类型

int — 代表了整型

float — 代表了浮点数类型

datetime — 代表了时间类型

bool — 代表了布尔类型

import pandas as pd
import numpy as np
df=pd.DataFrame(np.arange(12).reshape(4,3),index=list("abcd"),columns=['w','y','z'])
print(df)
print(df.dtypes)

out：
   w   y   z
a  0   1   2
b  3   4   5
c  6   7   8
d  9  10  11
w    int32
y    int32
z    int32
dtype: object

values查看DataFrame中的数值。数据保存在list中。

import pandas as pd
import numpy as np
df=pd.DataFrame(np.arange(12).reshape(4,3),index=list("abcd"),columns=['w','y','z'])
print(df)
print(df.values)

out：
   w   y   z
a  0   1   2
b  3   4   5
c  6   7   8
d  9  10  11
[[ 0  1  2]
 [ 3  4  5]
 [ 6  7  8]
 [ 9 10 11]]

info整体属性查看。

`ruby
import pandas as pd
import numpy as np
df=pd.DataFrame(np.arange(12).reshape(4,3),index=list(“abcd”),columns=[‘w’,’y’,’z’])
print(df)

print(df.info())

out：
w y z
a 0 1 2
b 3 4 5
c 6 7 8
d 9 10 11

Index: 4 entries, a to d
Data columns (total 3 columns):
# Column Non-Null Count Dtype

Original: https://blog.csdn.net/Liuyan_analysis/article/details/121042257
Author: Liuyan_analysis
Title: pandas基础入门之数据与查看

原创文章受到原创版权保护。转载请注明出处：https://www.johngo689.com/755151/

转载文章受原作者版权保护。转载请注明原作者出处！

python

【自取】最近整理的，有需要可以领取学习：

Linux核心资料大放送~

全栈面试题汇总（持续更新&可下载）

一个提高学习100%效率的工具！

【超详细】深度学习面试题目！

LeetCode Python刷题答案下载！

LeetCode Java版刷题答案下载！

LeetCode C++ 版本，抓紧保存！

LeetCode GO语言刷题答案下载！

python学习：重用父类功能的两种方式

在子类派生的新方法中如何重用父类的功能方式一：指名道姓调用某一个类下的函数=》不依赖于继承关系 class OldboyPeople: def __init__(self,name…

Python 2023年6月9日
0067
前端经典面试题 | Computed 和 Watch 的区别

put put 看上去是方法，但是实际上是计算属性，它会根据你所依赖的数据动态显示新的计算结果。计算结果会被缓存， put 的值在getter执行后是会缓存的，只有在它依赖的属性值…

Python 2023年9月16日
0029
MySQL对指定字段按指定顺序排序FIELD函数

简介：比如我们有一列数据，字段site_code代表数据区域。如下需求：现在我们查询列表，希望以字段 site_code排序，排序的方式为 PH->MY->TH…

Python 2023年6月12日
0075
python 继承（史上最详细版本）

目录继承继承简介继承是一种创建新类的方式，新建的类可称为子类或派生类，父类可称为基类或超类 python支持多继承，新建的类可以支持一个或多个父类 ”’单继承和多继承简单定…

Python 2023年8月2日
0053
爬虫_08_scrapy&持久化存储&管道操作&手动请求发送

08_scrapy&持久化存储&管道操作&手动请求发送简介：所谓的框架其实就是一个被集成了很多功能且具有很强通用性的一个项目模板。学习：学习是学好框架中…

Python 2023年10月5日
0030
Pandas知识点-详解元素级批处理函数applymap和map

Pandas知识点-详解元素级批处理函数applymap和map 在Pandas中，apply()可以对DataFrame和Series按列或行批处理，applymap()和map…

Python 2023年8月9日
0036
测试笔记：学习Pytest框架

pip install pytest -i https://mirrors.aliyun.com/pypi/simple/ 2.1 检索规则运行 test_ 或 * _test….

Python 2023年9月9日
0049
室友吃个泡面的时间，我就用Python代码下载了几千张手机壁纸，简直yyds！

手机壁纸电脑壁纸，对于广大男性同胞来说，最喜欢的不是好看，十几行代码一分钟下载很多，用完不了，来吧，秀！ [En] Mobile wallpaper computer wallpa…

Python 2023年5月24日
0046
初识Django

文章目录前言一、初步使用二、使用Django连接数据库总结前言最近在看《python编程–从入门到实践》这本书，基础和项目都有去做，之前学习了flask框架，本书中最后…

Python 2023年8月4日
0046
手把手教你用Python进行城市公交网络分析与可视化

一、数据查看和预处理数据获取自高德地图API，包含了天津市公交线路和站点名称及其经纬度数据。 import pandas as&#x…

Python 2023年9月6日
0058
go和python的比较,获取当前时间是今年第几个星期

获取当前时间是今年的第几周 golang: import ( "fmt" "time" ) func main() { datetime :…

Python 2023年6月3日
0072
熬夜怒肝，保姆级Python学习路线，起飞！

想当初女朋友编程小白零基础，到如今在互联网大厂做算法工作，就是我带她漂进Python的海洋，从此一去不复返~ 我给她制订的学习路线十分适合萌新，总共分三步：看视频作项目啃厚书…

Python 2023年9月18日
0047
google colab上让 python 视觉化套件 matplotlib 显示中文

在 matplotlib 设定字符参数从 Google API 上下载暂存字体放到咱村文件夹下 !wget ‘https://noto-website-2.storage.goo…

Python 2023年9月4日
0052
pytest篇4-Fixture熟练运用

前言前面的公众号学习了unittest的Fixture,其实pytest的Fixture大同小异，也非常类似。 1、在做web自动化时,使用Fixture的一些前置或后置条件,非…

Python 2023年9月15日
0045
KNN、图像分类、曼哈顿距离、图片像素、python、opencv、最近邻图片分类

KNN、图像分类、曼哈顿距离、图片像素、python、opencv、最近邻图片分类自己实现使用曼哈顿距离计算图像之间的距离，采用最近邻算法对图片经行分类，没有使用sklearn里…

Python 2023年8月26日
0044
Postgresql中最有用的扩展（Extensions）pg_stat_statements（译）

如果您使用Postgres，但尚未使用pg_stat_statements，则必须将其添加到工具箱中。即使您很熟悉，也可能值得回顾一下。 pg_stat_statements是所谓…

Python 2023年6月12日
0062

2024 年 4 月
一	二	三	四	五	六	日
1	2	3	4	5	6	7
8	9	10	11	12	13	14
15	16	17	18	19	20	21
22	23	24	25	26	27	28
29	30

pandas基础入门之数据与查看

1.1数据抽样

1.2查看数据属性

大家都在看