时间序列移动平均法python 时间序列移动平均法

转载

蓝月亮 2023-11-03 09:35:37

文章标签 时间序列移动平均法python python 机器学习 numpy 时间序列 文章分类 Python 后端开发

时间序列预测基本方法：移动平均（SMA，EMA，WMA）

移动平均作为时间序列中最基本的预测方法，计算简单却很实用。不仅可以用来做预测，还有一些其他的重要作用，比如平滑序列波动，揭示时间序列的趋势特征。
移动平均简而言之即使用前n个时刻的观测值来预测下一个时刻的取值。移动平均可以分为简单移动平均、加权移动平均、指数移动平均等。注意：移动平均用于预测场景时，尤其是多步预测，需要要求序列相对平稳，没有趋势、季节性的情况。

问题公式化

给定一个时间序列{X_t}，观测值序列为：x₁，x₂，…，x_t
x_t+1为可预测为前n项的平均值，即x_t+1=E({x_t-n+1,x_t-n+2,…,x_t})，n表示窗口大小需要往前推多少时刻。
移动平均能够去除时间序列的短期波动，使得数据变得平滑，从而可以方便的看出序列的趋势特征。原序列波动较大，经过移动平均后，随机波动会明显减少，窗大小n越大，平滑后波动越小，滞后越明显。

简单移动平均（Simple Moving Average，SMA）

$时间序列移动平均法python 时间序列移动平均法_numpy$ $时间序列移动平均法python 时间序列移动平均法_python_02$ = $时间序列移动平均法python 时间序列移动平均法_机器学习_03$ $时间序列移动平均法python 时间序列移动平均法_python_04$ $时间序列移动平均法python 时间序列移动平均法_时间序列移动平均法python_05$
$时间序列移动平均法python 时间序列移动平均法_python_06$ 是窗口大小， $时间序列移动平均法python 时间序列移动平均法_numpy$ 表示 $时间序列移动平均法python 时间序列移动平均法_时间序列_08$ 时刻的移动平均值

SMA案例代码

# 导包
import numpy as np
from pandas import read_csv
import matplotlib.pyplot as plt
import pandas as pd
# 读取数据
df = pd.read_csv('household_power_consumption_days.csv', sep=',', header=0,
                      usecols=[0,1],parse_dates={'date':[0]},index_col=['date'])
# 真实数据列'Global_active_power'
real = df['Global_active_power']

# 取前6个时刻的数据预测下一时刻的数据
rolling = df['Global_active_power'].rolling(window=6)
rolling_mean = rolling.mean()

# 查看真实数据的维度和预测数据的维度
print("real Shape-- ",real.shape)#(1442,)
print("rolling_mean Shape-- ",rolling_mean.shape)#(1442,)

预测结果可视化

# 取最后258个时刻的数据绘图
plt.plot(real[-260:-2], color = 'red', label = 'Real')
plt.plot(rolling_mean[-260:-2], color = 'blue', label = 'Predicted')
plt.xlabel('Time')
plt.legend()
plt.show()

时间序列移动平均法python 时间序列移动平均法_机器学习_09

计算误差

real = real[-260:-2]
pred = rolling_mean[-260:-2]

from sklearn.metrics import mean_absolute_error
from sklearn.metrics import mean_squared_error
# 计算误差
mae = mean_absolute_error(pred,real)
mse = mean_squared_error(pred,real)
print("MAE:",mae) # MAE: 218.50487338501293
print("MSE:",mse) # MSE: 90166.86320972697

加权移动平均（Weighted Moving Average，WMA）

与SMA类似，但是在计算平均数时不是等量齐观的，可以给最近的观测值相对历史观测值更大的权重。比容，最近的用电量对预测日最有影响力，历史用电量随着时间拉长影响力越小。
$时间序列移动平均法python 时间序列移动平均法_机器学习_10$ $时间序列移动平均法python 时间序列移动平均法_python_02$ = $时间序列移动平均法python 时间序列移动平均法_时间序列_12$
$时间序列移动平均法python 时间序列移动平均法_python_06$ 是窗口大小， $时间序列移动平均法python 时间序列移动平均法_机器学习_10$ 表示 $时间序列移动平均法python 时间序列移动平均法_时间序列_08$ 时刻的移动平均值。
权重系数为 $时间序列移动平均法python 时间序列移动平均法_numpy_16$ ，即最近的一个数值权重为 $时间序列移动平均法python 时间序列移动平均法_python_06$ ，次近的为 $时间序列移动平均法python 时间序列移动平均法_机器学习_18$ ，以此类推，直到 $时间序列移动平均法python 时间序列移动平均法_机器学习_19$ 。
$时间序列移动平均法python 时间序列移动平均法_机器学习_10$ $时间序列移动平均法python 时间序列移动平均法_python_02$ = $时间序列移动平均法python 时间序列移动平均法_时间序列移动平均法python_22$

WMA案例代码

# 导包
import numpy as np
from pandas import read_csv
import matplotlib.pyplot as plt
import pandas as pd
# 读取数据
df = pd.read_csv('household_power_consumption_days.csv', sep=',', header=0,
                      usecols=[0,1],parse_dates={'date':[0]},index_col=['date'])
# 真实数据列'Global_active_power'
real = df['Global_active_power']

# 权重函数
def WMA(close, n):
    weights = np.array(range(1, n+1))
    sum_weights = np.sum(weights)

    res = close.rolling(window=n).apply(lambda x: np.sum(weights*x) / sum_weights, raw=False)
    return res

# 取前6个时刻的数据加权预测下一时刻的数据
df['WMA'] = WMA(df['Global_active_power'],6)
rolling_mean = df['WMA']

# 查看真实数据的维度和预测数据的维度
print("real Shape-- ",real.shape)#(1442,)
print("rolling_mean Shape-- ",rolling_mean.shape)#(1442,)

预测结果可视化

# 取最后258个时刻的数据绘图
plt.plot(real[-260:-2], color = 'red', label = 'Real')
plt.plot(rolling_mean[-260:-2], color = 'blue', label = 'Predicted')
plt.xlabel('Time')
plt.legend()
plt.show()

时间序列移动平均法python 时间序列移动平均法_时间序列_23

计算误差

real = real[-260:-2]
pred = rolling_mean[-260:-2]

from sklearn.metrics import mean_absolute_error
from sklearn.metrics import mean_squared_error
# 计算误差
mae = mean_absolute_error(pred,real)
mse = mean_squared_error(pred,real)
print("MAE:",mae) # MAE: 181.05885714285714
print("MSE:",mse) # MSE: 66044.51492577454

指数移动平均（Exponential Moving Average，EMA）

可以看成一种特殊的加权移动平均，也称为指数加权移动平均（EWMA）。与WMA类似，它为以前的数值分配了一系列固定的指数递减权重，即权重系数随着时间呈指数下降。EMA提供了一个更明显的指标，能更快地反映最近的趋势。
$时间序列移动平均法python 时间序列移动平均法_numpy_24$ = $时间序列移动平均法python 时间序列移动平均法_时间序列移动平均法python_25$
$时间序列移动平均法python 时间序列移动平均法_机器学习_26$ 是窗口大小， $时间序列移动平均法python 时间序列移动平均法_时间序列移动平均法python_27$ 为平滑因子，(1-\alpha)^i为呈指数增加的权重，期数离预测时刻越近权重越大。

EMA案例代码

# 导包
import numpy as np
from pandas import read_csv
import matplotlib.pyplot as plt
import pandas as pd
# 读取数据
df = pd.read_csv('household_power_consumption_days.csv', sep=',', header=0,
                      usecols=[0,1],parse_dates={'date':[0]},index_col=['date'])
# 真实数据列'Global_active_power'
real = df['Global_active_power']

# 指数函数，取前6个时刻的数据加权预测下一时刻的数据
df['EMA'] = df['Global_active_power'].ewm(span=6,min_periods=6).mean()
rolling_mean = df['EMA']

# 查看真实数据的维度和预测数据的维度
print("real Shape-- ",real.shape)#(1442,)
print("rolling_mean Shape-- ",rolling_mean.shape)#(1442,)

预测结果可视化

# 取最后258个时刻的数据绘图
plt.plot(real[-260:-2], color = 'red', label = 'Real')
plt.plot(rolling_mean[-260:-2], color = 'blue', label = 'Predicted')
plt.xlabel('Time')
plt.legend()
plt.show()

时间序列移动平均法python 时间序列移动平均法_机器学习_28

计算误差

real = real[-260:-2]
pred = rolling_mean[-260:-2]

from sklearn.metrics import mean_absolute_error
from sklearn.metrics import mean_squared_error
# 计算误差
mae = mean_absolute_error(pred,real)
mse = mean_squared_error(pred,real)
print("MAE:",mae) # MAE: 178.27270483176076
print("MSE:",mse) # MSE: 62814.68751826136

SMA：权重系数一致；

WMA：权重系数随时间间隔线性递减；

EMA：权重系数随时间间隔指数递减。

参考文献:

https://zhuanlan.zhihu.com/p/430537478

本文章为转载内容，我们尊重原作者对文章享有的著作权。如有内容错误或侵权问题，欢迎原作者联系我们进行内容更正或删除文章。

上一篇：Java生成的条形码没有文本条形码文本编码

下一篇：windows版appium能连接iOS真机吗 appium怎么连接手机

提问和评论都可以，用心的回复会被更多人看到评论

发布评论

相关文章

官方博客	全部文章	热门标签	班级博客
了解我们	网站地图	意见反馈

鸿蒙开发者社区	51CTO学堂
51CTO	软考资讯

时间序列移动平均法python 时间序列移动平均法

时间序列移动平均法python 时间序列移动平均法

时间序列预测基本方法：移动平均（SMA，EMA，WMA）

问题公式化

简单移动平均（Simple Moving Average，SMA）

SMA案例代码

加权移动平均（Weighted Moving Average，WMA）

WMA案例代码

指数移动平均（Exponential Moving Average，EMA）

EMA案例代码

SMA：权重系数一致；

WMA：权重系数随时间间隔线性递减；

EMA：权重系数随时间间隔指数递减。

参考文献:

51CTO博客