python openpyxl选取部分行和列进行存储 python怎么选取某几列

转载

mob64ca13ffd0f1 2024-06-20 05:43:11

文章标签 python pandas 数据结构数据数组 文章分类 Python 后端开发

Pandas中的Dataframe数据选取（一）

前言

一、df的创建

1. 初期数据定义
2. 定义下标
3. df创建

二、Dataframe中的数据行选取

1. 初期数据定义
2. 行选取（单维度选取）

==①　整数索引切片：前闭后开 [ , )==
==②　标签索引切片：前闭后闭 [ , ]==
==③　布尔数组==
==④　单条件选取==
==⑤　多条件选取==

data = {
    'name': ['NAME0', 'NAME1', 'NAME2', 'NAME3', 'NAME4', 'NAME5', 'NAME6', 'NAME7', 'NAME8', 'NAME9'],
    'age': [0, 1, 2, 3, 4, 5, 6, 7, 8, 9],
    'weight': ["weight0", 101, 102, np.nan, np.nan, 105, np.nan, 107, 108, 109],
    'is_single_dog': ['yes', 'yes', 'no', 'yes', 'no', 'no', 'no', 'yes', 'no', 'no']
}

2. 定义下标

indexs = ['index0', 'index1', 'index2', 'index3', 'index4', 'index5', 'index6', 'index7', 'index8', 'index9']

3. df创建

df = pd.DataFrame(data, index=indexs)

完整代码：

# -*- coding: utf-8 -*-
import numpy as np
import pandas as pd

data = {
    'name': ['NAME0', 'NAME1', 'NAME2', 'NAME3', 'NAME4', 'NAME5', 'NAME6', 'NAME7', 'NAME8', 'NAME9'],

    'age': [0, 1, 2, 3, 4, 5, 6, 7, 8, 9],

    'weight': ["weight0", 101, 102, np.nan, np.nan, 105, np.nan, 107, 108, 109],

    'is_single_dog': ['yes', 'yes', 'no', 'yes', 'no', 'no', 'no', 'yes', 'no', 'no']
}

indexs = ['index0', 'index1', 'index2', 'index3', 'index4', 'index5', 'index6', 'index7', 'index8', 'index9']

df = pd.DataFrame(data, index=indexs)

print(df)

控制台输出结果：

         name  age   weight is_single_dog
index0  NAME0    0  weight0           yes
index1  NAME1    1      101           yes
index2  NAME2    2      102            no
index3  NAME3    3      NaN           yes
index4  NAME4    4      NaN            no
index5  NAME5    5      105            no
index6  NAME6    6      NaN            no
index7  NAME7    7      107           yes
index8  NAME8    8      108            no
index9  NAME9    9      109            no

二、Dataframe中的数据行选取

在Dataframe中选取数据大抵包括3中情况：

    1）行（列）选取（单维度选取）：df[]。这种情况一次只能选取行或者列，即一次选取中，只能为行或者列设置筛选条件（只能为一个维度设置筛选条件）。
    2）区域选取（多维选取）：df.loc[]，df.iloc[]，df.ix[]。这种方式可以同时为多个维度设置筛选条件。
    3）单元格选取（点选取）：df.at[]，df.iat[]。准确定位一个单元格。

1. 初期数据定义

接上记：

data = {
    'name': ['NAME0', 'NAME1', 'NAME2', 'NAME3', 'NAME4', 'NAME5', 'NAME6', 'NAME7', 'NAME8', 'NAME9'],

    'age': [0, 1, 2, 3, 4, 5, 6, 7, 8, 9],

    'weight': ["weight0", 101, 102, np.nan, np.nan, 105, np.nan, 107, 108, 109],

    'is_single_dog': ['yes', 'yes', 'no', 'yes', 'no', 'no', 'no', 'yes', 'no', 'no']
}

indexs = ['index0', 'index1', 'index2', 'index3', 'index4', 'index5', 'index6', 'index7', 'index8', 'index9']

df = pd.DataFrame(data, index=indexs)

2. 行选取（单维度选取）

①　整数索引切片：前闭后开 [ , )

# 选取第一行df数据
df_line_1 = df[0:1]

print(df_line_1)

控制台输出结果：

         name  age   weight is_single_dog
index0  NAME0    0  weight0           yes

# 选取第三行df数据
df_line_3 = df[2:3]

print(df_line_3)

控制台输出结果：

         name  age weight is_single_dog
index2  NAME2    2    102            no

# 选取第二行到第三行df数据
df_line_2_to_3 = df[1:3]

print(df_line_2_to_3)

控制台输出结果：

         name  age weight is_single_dog
index1  NAME1    1    101           yes
index2  NAME2    2    102            no

②　标签索引切片：前闭后闭 [ , ]

# 选取第一行df数据
df_line_1 = df[:'index0']

print(df_line_1)

控制台输出结果：

         name  age   weight is_single_dog
index0  NAME0    0  weight0           yes

# 选取第三行df数据
df_line_3 = df['index2':'index2']

print(df_line_3)

控制台输出结果：

         name  age weight is_single_dog
index2  NAME2    2    102            no

# 选取第二行到第三行df数据
df_line_2_to_3 = df['index1':'index2']

print(df_line_2_to_3)

控制台输出结果：

         name  age weight is_single_dog
index1  NAME1    1    101           yes
index2  NAME2    2    102            no

③　布尔数组

# 选取第一行df数据
df_line_1 = df[[True, False, False, False, False, False, False, False, False, False]]

print(df_line_1)

控制台输出结果：

         name  age   weight is_single_dog
index0  NAME0    0  weight0           yes

# 选取第三行df数据
df_line_3 = df[[False, False, True, False, False, False, False, False, False, False]]

print(df_line_3)

控制台输出结果：

         name  age weight is_single_dog
index2  NAME2    2    102            no

# 选取第二行到第三行df数据
df_line_2_to_3 = df[[False, True, True, False, False, False, False, False, False, False]]

print(df_line_2_to_3)

控制台输出结果：

         name  age weight is_single_dog
index1  NAME1    1    101           yes
index2  NAME2    2    102            no

④　单条件选取

# 条件(age > 8)选取行df数据
df_line_age_over_8 = df[df['age'] > 8]

print(df_line_age_over_8)

控制台输出结果：

         name  age weight is_single_dog
index9  NAME9    9    109            no

⑤　多条件选取

注意事项：多条件选取数据时，单个条件最好用括号括起来，防止出错

"""
选取满足以下条件的df数据行：
 1. age > 5
 2. weight 不为空
 3. is_single_dog = no

"""

df_line_condition_more = df[(df['age'] > 5) & (df['weight'] is not None) & (df['is_single_dog'] == 'no')]

print(df_line_condition_more)

控制台输出结果：

         name  age weight is_single_dog
index6  NAME6    6    NaN            no
index8  NAME8    8    108            no
index9  NAME9    9    109            no

注意事项：NaN的处理之后再说

"""
选取满足以下任意条件的df数据行：
 1. name = NAME9
 2. age > 7 且 age < 9
 3. weight = 107

"""

df_line_condition_more = df[(df['name'] == 'NAME9') | ((df['age'] > 7) & (df['age'] < 9)) | (df['weight'] == 107)]

print(df_line_condition_more)

控制台输出结果：

         name  age weight is_single_dog
index7  NAME7    7    107           yes
index8  NAME8    8    108            no
index9  NAME9    9    109            no

本文章为转载内容，我们尊重原作者对文章享有的著作权。如有内容错误或侵权问题，欢迎原作者联系我们进行内容更正或删除文章。

上一篇：python matplotlib获取坐标 matplotlib设置坐标范围

下一篇：python求数组中重复元素的个数 python 数组重复元素

提问和评论都可以，用心的回复会被更多人看到评论

发布评论

相关文章

官方博客	全部文章	热门标签	班级博客
了解我们	网站地图	意见反馈

鸿蒙开发者社区	51CTO学堂
51CTO	软考资讯

python openpyxl选取部分行和列进行存储 python怎么选取某几列

python openpyxl选取部分行和列进行存储 python怎么选取某几列

Pandas中的Dataframe数据选取（一）

目录

前言

一、df的创建

1. 初期数据定义

2. 定义下标

3. df创建

二、Dataframe中的数据行选取

1. 初期数据定义

2. 行选取（单维度选取）

①　整数索引切片：前闭后开 [ , )

②　标签索引切片：前闭后闭 [ , ]

③　布尔数组

④　单条件选取

⑤　多条件选取

51CTO博客

python openpyxl选取部分行和列进行存储 python怎么选取某几列

python openpyxl选取部分行和列进行存储 python怎么选取某几列

Pandas中的Dataframe数据选取（一）

目录

前言

一、df的创建

1. 初期数据定义

2. 定义下标

3. df创建

二、Dataframe中的数据行选取

1. 初期数据定义

2. 行选取（单维度选取）

① 整数索引切片：前闭后开 [ , )

② 标签索引切片：前闭后闭 [ , ]

③ 布尔数组

④ 单条件选取

⑤ 多条件选取

51CTO博客

①　整数索引切片：前闭后开 [ , )

②　标签索引切片：前闭后闭 [ , ]

③　布尔数组

④　单条件选取

⑤　多条件选取