python机器学习实例

原创

mob64ca12d4a164 2024-07-30 12:20:50 ©著作权

©著作权归作者所有：来自51CTO博客作者mob64ca12d4a164的原创作品，请联系作者获取转载授权，否则将追究法律责任

Python机器学习实例：简单线性回归

机器学习是人工智能领域的一个重要分支，它通过算法让计算机能够从数据中学习并做出预测或决策。Python作为一门流行的编程语言，拥有丰富的库支持机器学习，如scikit-learn、TensorFlow等。本文将通过一个简单的线性回归实例，展示如何使用Python进行机器学习。

线性回归简介

线性回归是一种预测分析方法，用于建立一个或多个自变量（解释变量）与因变量（响应变量）之间的线性关系。其基本形式为：

[ y = \beta_0 + \beta_1 x_1 + \beta_2 x_2 + ... + \beta_n x_n + \epsilon ]

其中，( y ) 是因变量，( x_1, x_2, ..., x_n ) 是自变量，( \beta_0, \beta_1, ..., \beta_n ) 是回归系数，( \epsilon ) 是误差项。

准备数据

我们使用Python的numpy库生成一些模拟数据，作为线性回归的输入。

import numpy as np

# 生成模拟数据
np.random.seed(0)
X = 2 * np.random.rand(100, 1)
y = 4 + 3 * X + np.random.randn(100, 1)

构建线性回归模型

使用scikit-learn库中的LinearRegression类构建线性回归模型。

from sklearn.linear_model import LinearRegression

# 创建线性回归模型实例
model = LinearRegression()

# 训练模型
model.fit(X, y)

预测

使用训练好的模型进行预测。

# 预测
X_new = np.array([[0], [2]])
y_pred = model.predict(X_new)
print(y_pred)

可视化结果

使用matplotlib库将数据点和回归线可视化。

import matplotlib.pyplot as plt

plt.scatter(X, y, color='blue', label='Data points')
plt.plot(X, model.predict(X), color='red', linewidth=2, label='Regression line')
plt.scatter(X_new, y_pred, color='green', label='Predictions')
plt.xlabel('X')
plt.ylabel('y')
plt.legend()
plt.show()

序列图

使用mermaid语法展示线性回归的流程。

sequenceDiagram
    participant User as U
    participant Python as P
    participant scikit-learn as S
    participant Data as D

    U->>P: Import libraries
    P->>D: Generate data
    U->>P: Create model
    P->>S: Use LinearRegression
    U->>P: Train model
    P->>D: Fit on data
    U->>P: Predict
    P->>D: Predict values
    U->>P: Visualize
    P->>D: Plot data and regression line

关系图

使用mermaid语法展示线性回归中变量之间的关系。

erDiagram
    y {
        int id PK
        float value
    }
    x1 {
        int id PK
        float value
    }
    beta0 {
        int id PK
        float value
    }
    beta1 {
        int id PK
        float value
    }
    epsilon {
        int id PK
        float value
    }
    y ||--o{ x1 : "has"
    y ||--o{ beta0 : "intercept"
    y ||--o{ beta1 : "coefficient"
    y }|--|| epsilon : "error"