链表及linux中的链表

转载

yzs的专栏 2023-01-01 20:58:05 博主文章分类：c/c++

链表是C语言编程中常用的数据结构，比如我们要建一个整数链表，一般可能这么定义：

struct  int_node {
         int  val;
         struct  int_node *next;
};

为了实现链表的插入、删除、遍历等功能，另外要再实现一系列函数，比如：

void  insert_node( struct  int_node **head,  int  val);
 
void  delete_node( struct  int_node *head,  struct  int_node *current);
 
void  access_node( struct  int_node *head)
{
         struct  int_node *node;
         for  (node = head; node != NULL; node = node->next) {
                 // do something here
         }
}

如果我们的代码里只有这么一个数据结构的话，这样做当然没有问题，但是当代码的规模足够大，需要管理很多种链表，难道需要为每一种链表都要实现一套插入、删除、遍历等功能函数吗？

熟悉C++的同学可能会说，我们可以用标准模板库啊，但是，我们这里谈的是C，在C语言里有没有比较好的方法呢？

Mr.Dave在他的博客里介绍了自己的实现，这个实现是个很好的方案，各位不妨可以参考一下。在本文中，我们把目光投向当今开源界最大的C项目--Linux Kernel，看看Linux内核如何解决这个问题。

Linux内核中一般使用双向链表，声明为struct list_head，这个结构体是在include/linux/types.h中定义的，链表的访问是以宏或者内联函数的形式在include/linux/list.h中定义。

struct  list_head {
     struct  list_head *next, *prev;
};

Linux内核为链表提供了一致的访问接口。

void  INIT_LIST_HEAD( struct  list_head *list)；
void  list_add( struct  list_head * new ,  struct  list_head *head)；
void  list_add_tail( struct  list_head * new ,  struct  list_head *head)；
void  list_del( struct  list_head *entry);
int  list_empty( const  struct  list_head *head)；

以上只是从Linux内核里摘选的几个常用接口，更多的定义请参考Linux内核源代码。

我们先通过一个简单的实作来对Linux内核如何处理链表建立一个感性的认识。

#include <stdio.h>
#include "list.h"
 
struct  int_node {
         int  val;
         struct  list_head list;
};
 
int  main()
{
         struct  list_head head, *plist;
         struct  int_node a, b;
 
         a.val = 2;
         b.val = 3;
 
         INIT_LIST_HEAD(&head);
         list_add(&a.list, &head);
         list_add(&b.list, &head);
 
         list_for_each(plist, &head) {
                 struct  int_node *node = list_entry(plist,  struct  int_node, list);
                 printf ( "val = %d\n" , node->val);
         }
 
         return  0;
}

看完这个实作，是不是觉得在C代码里管理一个链表也很简单呢？

代码中包含的头文件list.h是我从Linux内核里抽取出来并做了一点修改的链表处理代码，现附在这里给大家参考，使用的时候只要把这个头文件包含到自己的工程里即可。

链表及linux中的链表_数据结构

代码

list_head通常是嵌在数据结构内使用，在上文的实作中我们还是以整数链表为例，int_node的定义如下：

struct  int_node {
         int  val;
         struct  list_head list;
};

使用list_head组织的链表的结构如下图所示：

链表及linux中的链表_偏移量_02

遍历链表是用宏list_for_each来完成。

#define list_for_each(pos, head) \       

                 for         (pos = (head)->next; prefetch(pos->next), pos != (head); \       

                 pos = pos->next)

在这里，pos和head均是struct list_head。在遍历的过程中如果需要访问节点，可以用list_entry来取得这个节点的基址。

#define list_entry(ptr, type, member) \       

                 container_of(ptr, type, member)

我们来看看container_of是如何实现的。如下图所示，我们已经知道TYPE结构中MEMBER的地址，如果要得到这个结构体的地址，只需要知道MEMBER在结构体中的偏移量就可以了。如何得到这个偏移量地址呢？这里用到C语言的一个小技巧，我们不妨把结构体投影到地址为0的地方，那么成员的绝对地址就是偏移量。得到偏移量之后，再根据ptr指针指向的地址，就可以很容易的计算出结构体的地址。

链表及linux中的链表_偏移量_03