python多线程的坑

原创

mob64ca12dedda8 2023-10-02 04:27:34 ©著作权

文章标签 Python 多线程线程间通信 文章分类 Python 后端开发

©著作权归作者所有：来自51CTO博客作者mob64ca12dedda8的原创作品，请联系作者获取转载授权，否则将追究法律责任

Python多线程的坑

引言

多线程是现代软件开发中常用的技术之一，它可以显著提高程序的性能和响应能力。在Python中，我们可以通过threading模块来实现多线程编程。然而，使用多线程编程也会带来一些潜在的问题和坑。本文将介绍一些常见的Python多线程坑，并提供相应的解决方案。

1. 全局解释器锁（GIL）

在Python中，全局解释器锁（Global Interpreter Lock，简称GIL）是一个重要的概念。GIL的存在使得Python的多线程并不能发挥多核CPU的优势。简单来说，GIL在同一时间只允许一个线程执行Python字节码，这意味着在多核CPU上，Python的多线程程序并不能真正实现并行计算。

import threading

def count():
    total = 0
    for _ in range(1000000):
        total += 1

threads = []
for _ in range(10):
    thread = threading.Thread(target=count)
    thread.start()
    threads.append(thread)

for thread in threads:
    thread.join()

上述代码创建了10个线程，每个线程都执行了一个简单的累加操作。然而，由于GIL的存在，这些线程并不能并行执行，最终的结果与单线程执行累加操作的结果是一样的。

2. 线程同步

多线程程序中，线程之间共享资源是常见的情况。然而，如果不进行适当的线程同步，就会出现竞态条件（Race Condition）等问题。Python提供了多种线程同步的机制，例如Lock、Semaphore、Condition等。下面是一个使用Lock进行线程同步的示例：

import threading

total = 0
lock = threading.Lock()

def count():
    global total
    for _ in range(1000000):
        with lock:
            total += 1

threads = []
for _ in range(10):
    thread = threading.Thread(target=count)
    thread.start()
    threads.append(thread)

for thread in threads:
    thread.join()

print(total)

在上述代码中，我们使用了Lock来确保在修改total变量时，只有一个线程能够访问它。这样可以避免多线程之间对total变量的竞争。

3. 线程间通信

在多线程编程中，线程间通信也是一个重要的问题。Python提供了多种线程间通信的机制，例如使用Queue来进行消息传递。下面是一个使用Queue进行线程间通信的示例：

import threading
import queue

q = queue.Queue()

def producer():
    for i in range(10):
        q.put(i)

def consumer():
    while True:
        item = q.get()
        if item is None:
            break
        print(item)

threads = []
producer_thread = threading.Thread(target=producer)
producer_thread.start()
threads.append(producer_thread)

for _ in range(10):
    consumer_thread = threading.Thread(target=consumer)
    consumer_thread.start()
    threads.append(consumer_thread)

for thread in threads:
    thread.join()

在上述代码中，我们使用Queue来实现生产者-消费者模型。其中，producer线程向队列中放入数据，consumer线程从队列中取出数据进行处理。通过使用Queue，我们可以安全地进行线程间通信，避免了数据竞争等问题。

4. 死锁

死锁是多线程编程中常见的问题之一。当多个线程等待彼此释放资源时，就会导致死锁的发生。下面是一个简单的死锁示例：

import threading

lock1 = threading.Lock()
lock2 = threading.Lock()

def func1():
    lock1.acquire()
    lock2.acquire()
    lock2.release()
    lock1.release()

def func2():
    lock2.acquire()
    lock1.acquire()
    lock1.release()
    lock2.release()

thread1 = threading.Thread(target=func1)
thread2 = threading.Thread(target=func2)

thread1.start()
thread2.start()

thread1.join()
thread2.join()
``