Python 遍历文件每一行判断是否只有一个换行符详解

精选原创

霍格沃兹测试 2024-06-20 18:09:46 ©著作权

文章标签 Line 换行符 Python 文章分类 软件测试

©著作权归作者所有：来自51CTO博客作者霍格沃兹测试的原创作品，请联系作者获取转载授权，否则将追究法律责任

Python 遍历文件每一行判断是否只有一个换行符详解_Python

前言

在文件处理过程中，判断文件每一行是否只有一个换行符是一个常见需求。作为测试工程师，我们经常需要对文件的格式进行验证，确保数据的完整性和规范性。本文将详细介绍如何使用 Python 遍历文件的每一行，并判断每一行是否只有一个换行符。

需求分析

我们需要编写一个 Python 程序，该程序可以：

打开并读取指定文件。
遍历文件的每一行。
判断每一行是否只有一个换行符。
输出判断结果。

程序设计

文件读取

Python 提供了多种方式读取文件内容，可以使用 open 函数配合 with 语句安全地打开和读取文件。

判断换行符

每一行的末尾如果只有一个换行符，说明该行是有效行；如果有多个换行符或其他字符，说明该行存在异常。我们可以使用字符串操作来实现这一判断。

输出结果

将每一行的判断结果输出，方便用户查看和验证。

代码实现

基础代码

首先，我们编写基础代码来读取文件并遍历每一行：

def check_newline_in_file(file_path):
    with open(file_path, 'r', encoding='utf-8') as file:
        for line_number, line in enumerate(file, start=1):
            if line.endswith('\n') and line.strip() == '':
                print(f"Line {line_number}: Only newline character found.")
            elif line.endswith('\n'):
                print(f"Line {line_number}: Valid line with content.")
            else:
                print(f"Line {line_number}: Invalid line without newline character.")

完整实现

在基础代码上，我们进一步优化，实现对每一行是否只有一个换行符的判断：

def check_newline_in_file(file_path):
    try:
        with open(file_path, 'r', encoding='utf-8') as file:
            for line_number, line in enumerate(file, start=1):
                stripped_line = line.rstrip('\n')
                if stripped_line == '':
                    print(f"Line {line_number}: Only newline character found.")
                else:
                    print(f"Line {line_number}: Content found.")
    except FileNotFoundError:
        print(f"File not found: {file_path}")
    except Exception as e:
        print(f"An error occurred: {e}")

# 使用示例
file_path = 'example.txt'
check_newline_in_file(file_path)

功能扩展

检查多种换行符

在不同操作系统中，换行符可能不同（如 Windows 中是 \r\n，而 Unix/Linux 中是 \n）。我们可以扩展代码来处理不同类型的换行符：

def check_newline_in_file(file_path):
    try:
        with open(file_path, 'rb') as file:
            for line_number, line in enumerate(file, start=1):
                line_str = line.decode('utf-8')
                if line_str.endswith('\n') or line_str.endswith('\r\n'):
                    stripped_line = line_str.rstrip('\r\n')
                    if stripped_line == '':
                        print(f"Line {line_number}: Only newline character found.")
                    else:
                        print(f"Line {line_number}: Content found.")
                else:
                    print(f"Line {line_number}: Invalid line without proper newline character.")
    except FileNotFoundError:
        print(f"File not found: {file_path}")
    except Exception as e:
        print(f"An error occurred: {e}")

# 使用示例
file_path = 'example.txt'
check_newline_in_file(file_path)

保存结果到文件

将判断结果保存到输出文件中，方便后续查看和分析：

def check_newline_in_file(file_path, output_path):
    try:
        with open(file_path, 'rb') as file, open(output_path, 'w', encoding='utf-8') as output_file:
            for line_number, line in enumerate(file, start=1):
                line_str = line.decode('utf-8')
                if line_str.endswith('\n') or line_str.endswith('\r\n'):
                    stripped_line = line_str.rstrip('\r\n')
                    if stripped_line == '':
                        result = f"Line {line_number}: Only newline character found.\n"
                    else:
                        result = f"Line {line_number}: Content found.\n"
                else:
                    result = f"Line {line_number}: Invalid line without proper newline character.\n"
                output_file.write(result)
    except FileNotFoundError:
        print(f"File not found: {file_path}")
    except Exception as e:
        print(f"An error occurred: {e}")

# 使用示例
file_path = 'example.txt'
output_path = 'output.txt'
check_newline_in_file(file_path, output_path)