租房数据分析python 租房数据分析案例分析

转载

云中谁寄锦书来 2024-01-12 13:47:43

文章标签 租房数据分析python r语言 python 数据库数据 文章分类 数据分析人工智能

文章目录

1、导包
2、查看数据
3、重复值和空值处理
4、数据转换类型

4.1、面积数据类型转换
4.2、户型表达方式替换

5、房源数量和位置分布分析
7、户型数量基本分析
8、去掉统计数量较小的值
9、图形展示房屋类型
10、平均租金分析
11、图形可视化
12、面积基本分析

1、导包

import pandas as pd
import numpy as np
import matplotlib.pyplot as plt

2、查看数据

file_data= pd.read_csv("D:\\data\\seaborn-data-master\\链家北京租房数据.csv")
file_data.shape
file_data.info()
file_data.describe()

租房数据分析python 租房数据分析案例分析_数据库

3、重复值和空值处理

file_data.duplicated()   #重复值检测
file_data=file_data.drop_duplicates()   #删除重复值
file_data.shape

file_data=file_data.dropna()  #删除空值

4、数据转换类型

4.1、面积数据类型转换

file_data.head()
file_data["面积(㎡)"].values[0][:-2]

data_new=np.array([])
data_area = file_data["面积(㎡)"].values
for i in data_area:
    data_new = np.append(data_new,np.array(i[:-2]))
data_new

租房数据分析python 租房数据分析案例分析_数据库_02

4.2、户型表达方式替换

house_data=file_data["户型"]
temp_list =[]
for i in house_data:
    new_info = i.replace("房间","室")
    temp_list.append(new_info)
file_data.loc[:,'户型'] = temp_list
file_data

租房数据分析python 租房数据分析案例分析_数据库_03

5、房源数量和位置分布分析

file_data["区域"].unique()
# new_df = pd.DataFrame({'区域':file_data["区域"].unique,"数量":[0]*13})
new_df = pd.DataFrame({"区域":file_data["区域"].unique(), "数量":[0]*13})
new_df
area_count =file_data.groupby(by='区域').count()
new_df["数量"] = area_count.values
new_df.sort_values(by="数量",ascending=False)

租房数据分析python 租房数据分析案例分析_租房数据分析python_04

7、户型数量基本分析

house_data = file_data["户型"]
house_data.head()
def all_house(arr):
    key = np.unique(arr)
    result = {}
    for k in key:
        mask = (arr == k)
        arr_new = arr[mask]
        v = arr_new.size
        result[k] = v

    return result
house_info = all_house(house_data)

租房数据分析python 租房数据分析案例分析_数据_05

8、去掉统计数量较小的值

house_data=dict((key,value) for key,value in house_info.items() if value > 50)
# show_house = pd.DataFrame({"户型":[x for x in house_data.keys()],
#             "数量":[x for x in house_data.values()]})
show_houses = pd.DataFrame({"户型": [x for x in house_data.keys()],
              "数量": [x for x in house_data.values()]})
show_houses

租房数据分析python 租房数据分析案例分析_r语言_06

9、图形展示房屋类型

house_type = show_houses["户型"]
house_type_num = show_houses["数量"]
plt.barh(range(11),house_type_num)
plt.yticks(range(11),house_type)
plt.xlim(0,2500)

plt.title("北京市各区域租房数量统计")
plt.xlabel("数量")
plt.ylabel("房屋类型")

#给每个条上面添加具体数字
for x,y in enumerate(house_type_num):
    #print(x,y)
    plt.text(y + 0.5, x - 0.2, "%s" % y)

plt.show()

租房数据分析python 租房数据分析案例分析_租房数据分析python_07

10、平均租金分析

df_all = pd.DataFrame({"区域":file_data["区域"].unique(),"房租总金额":[0]*13,"总面积":[0]*13})

file_data.head()
sum_price = file_data["价格(元/月)"].groupby(file_data["区域"]).sum()
sum_area = file_data["面积(㎡)"].groupby(file_data["区域"]).sum()

df_all["房租总金额"]=sum_price.values
df_all["总面积"]=sum_area.values
df_all
df_all["每平方米租金(元)"]=round(df_all["房租总金额"]/df_all["总面积"],2)

租房数据分析python 租房数据分析案例分析_租房数据分析python_08

11、图形可视化

num = df_merge["数量"]
price = df_merge["每平方米租金(元)"]
lx = df_merge["区域"]
l = [i for i in range(13)]

fig = plt.figure(figsize=(10, 8), dpi=100)

# 显示折线图
ax1 = fig.add_subplot(111)
ax1.plot(l, price, "or-", label="价格")
for i, (_x, _y) in enumerate(zip(l, price)):
    plt.text(_x+0.2, _y, price[i])
ax1.set_ylim([0, 160])
ax1.set_ylabel("价格")
plt.legend(loc="upper right")

# 显示条形图
ax2 = ax1.twinx()
plt.bar(l, num, label="数量", alpha=0.2, color="green")
ax2.set_ylabel("数量")
plt.legend(loc="upper left")
plt.xticks(l, lx)


plt.show()

租房数据分析python 租房数据分析案例分析_数据_09

12、面积基本分析

#查看房屋最大面积和最小面积
print('房屋最大面积是%d平米'%(file_data['面积(㎡)'].max()))
print('房屋最小面积是%d平米'%(file_data['面积(㎡)'].min()))

print('房屋最高价格是%d元'%(file_data['价格(元/月)'].max()))
print('房屋最低价格是%d元'%(file_data['价格(元/月)'].min()))

#面积划分
area_divide = [1, 30, 50, 70, 90, 120, 140, 160, 1200]
area_cut = pd.cut(list(file_data["面积(㎡)"]), area_divide)

area_cut_num = area_cut.describe()
area_cut_num

area_per = (area_cut_num["freqs"].values)*100

labels  = ['30平米以下', '30-50平米', '50-70平米', '70-90平米',
'90-120平米','120-140平米','140-160平米','160平米以上']

plt.figure(figsize=(20, 8), dpi=100)
# plt.axes(aspect=1)

plt.pie(x=area_per, labels=labels, autopct="%.2f %%")

租房数据分析python 租房数据分析案例分析_数据_10

本文章为转载内容，我们尊重原作者对文章享有的著作权。如有内容错误或侵权问题，欢迎原作者联系我们进行内容更正或删除文章。

上一篇：SpringBoot Filter过滤器写入Redis数据慢于拦截器获取 springboot过滤器返回信息

下一篇：java抓包数据抓包

提问和评论都可以，用心的回复会被更多人看到评论

发布评论

相关文章

官方博客	全部文章	热门标签	班级博客
了解我们	网站地图	意见反馈

鸿蒙开发者社区	51CTO学堂
51CTO	软考资讯

租房数据分析python 租房数据分析案例分析

租房数据分析python 租房数据分析案例分析

文章目录

1、导包

2、查看数据

3、重复值和空值处理

4、数据转换类型

4.1、面积数据类型转换

4.2、户型表达方式替换

5、房源数量和位置分布分析

7、户型数量基本分析

8、去掉统计数量较小的值

9、图形展示房屋类型

10、平均租金分析

11、图形可视化

12、面积基本分析

51CTO博客