书生大模型实战营第0节课-Python关卡任务

原创

wx659d2c3d17529 2024-07-26 11:51:10 ©著作权

©著作权归作者所有：来自51CTO博客作者wx659d2c3d17529的原创作品，请联系作者获取转载授权，否则将追究法律责任

书生大模型实战营第0节课-Python关卡任务

1.任务一

请实现一个wordcount函数，统计英文字符串中每个单词出现的次数。返回一个字典，key为单词，value为对应单词出现的次数。 这里有两种实现方法，一开始就想到的列表方法，但是正则表达式部分是忘了

import re

def wordcount(text):
    """
    统计给定文本中每个单词出现的次数。
    
    参数:
    text (str): 需要统计单词数目的文本字符串。
    
    返回:
    dict: 单词及其出现次数的字典。
    """
    # 使用正则表达式来分割文本，提取单词
    words = re.findall(r'\b\w+\b', text.lower())
    
    # 创建一个空字典来存储单词计数
    word_counts = {}
    
    # 遍历单词列表并计算每个单词的出现次数
    for word in words:
        if word in word_counts:
            word_counts[word] += 1
        else:
            word_counts[word] = 1
    
    return word_counts

# 示例使用
text = "This is a sample text. This text contains some words, and this is the second example."
result = wordcount(text)
print(result)

运行结果如下

{'got': 2, 'this': 1, 'panda': 1, 'plush': 1, 'toy': 1, 'for': 3, 'my': 1, 'daughter': 1, 's': 3, 'birthday': 1, 'who': 1, 'loves': 1, 'it': 7, 'and': 3, 'takes': 1, 'everywhere': 1, 'soft': 1, 'super': 1, 'cute': 1, 'its': 1, 'face': 1, 'has': 1, 'a': 3, 'friendly': 1, 'look': 1, 'bit': 1, 'small': 1, 'what': 1, 'i': 4, 'paid': 1, 'though': 1, 'think': 1, 'there': 1, 'might': 1, 'be': 1, 'other': 1, 'options': 1, 'that': 1, 'are': 1, 'bigger': 1, 'the': 1, 'same': 1, 'price': 1, 'arrived': 1, 'day': 1, 'earlier': 1, 'than': 1, 'expected': 1, 'so': 1, 'to': 2, 'play': 1, 'with': 1, 'myself': 1, 'before': 1, 'gave': 1, 'her': 1}

任务二

请使用本地vscode连接远程开发机，将上面你写的wordcount函数在开发机上进行debug，体验debug的全流程，并完成一份debug笔记（需要截图）。 调试断点主要是在正则表达式和循环部分，因为对这部分内容不太熟练.然后是循环这部分检查一下计数过程屏幕截图 2024-07-26 114617.png