Linux-3.14.12 Memory Management Notes 【kmalloc and kfree Implementation】【Repost】


请给我倒杯茶 2022-08-30 08:56:58 Category: 【Embedded Basics】

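kmalloc() is built on top of the slab/slob/slub allocators. Many sources treat it as the entry point of those allocators, but it is in fact slightly different.

Let us walk through its implementation: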

【file:/include/linux/slab.h】
/**
 * kmalloc - allocate memory
 * @size: how many bytes of memory are required.
 * @flags: the type of memory to allocate.
 *
 * kmalloc is the normal method of allocating memory
 * for objects smaller than page size in the kernel.
 *
 * The @flags argument may be one of:
 *
 * %GFP_USER - Allocate memory on behalf of user. May sleep.
 *
 * %GFP_KERNEL - Allocate normal kernel ram. May sleep.
 *
 * %GFP_ATOMIC - Allocation will not sleep. May use emergency pools.
 *   For example, use this inside interrupt handlers.
 *
 * %GFP_HIGHUSER - Allocate pages from high memory.
 *
 * %GFP_NOIO - Do not do any I/O at all while trying to get memory.
 *
 * %GFP_NOFS - Do not make any fs calls while trying to get memory.
 *
 * %GFP_NOWAIT - Allocation will not sleep.
 *
 * %__GFP_THISNODE - Allocate node-local memory only.
 *
 * %GFP_DMA - Allocation suitable for DMA.
 *   Should only be used for kmalloc() caches. Otherwise, use a
 *   slab created with SLAB_DMA.
 *
 * Also it is possible to set different flags by OR'ing
 * in one or more of the following additional @flags:
 *
 * %__GFP_COLD - Request cache-cold pages instead of
 *   trying to return cache-warm pages.
 *
 * %__GFP_HIGH - This allocation has high priority and may use emergency pools.
 *
 * %__GFP_NOFAIL - Indicate that this allocation is in no way allowed to fail
 *   (think twice before using).
 *
 * %__GFP_NORETRY - If memory is not immediately available,
 *   then give up at once.
 *
 * %__GFP_NOWARN - If allocation fails, don't issue any warnings.
 *
 * %__GFP_REPEAT - If allocation fails initially, try once more before failing.
 *
 * There are other flags available as well, but these are not intended
 * for general use, and so are not documented here. For a full list of
 * potential flags, always refer to linux/gfp.h.
 */
static __always_inline void *kmalloc(size_t size, gfp_t flags)
{
    if (__builtin_constant_p(size)) {
        if (size > KMALLOC_MAX_CACHE_SIZE)
            return kmalloc_large(size, flags);
#ifndef CONFIG_SLOB
        if (!(flags & GFP_DMA)) {
            int index = kmalloc_index(size);

            if (!index)
                return ZERO_SIZE_PTR;

            return kmem_cache_alloc_trace(kmalloc_caches[index],
                    flags, size);
        }
#endif
    }
    return __kmalloc(size, flags);
}

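The size parameter of kmalloc() is the number of bytes requested, and flags carries the allocation flags. kmalloc() accepts many allocation flags; each one stands for specific bits, so they can be combined in many ways.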

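GFP_USER: allocate memory on behalf of user space; may sleep.

GFP_KERNEL: the normal allocation for kernel memory; may sleep.

GFP_ATOMIC: the allocation will not sleep but may use emergency reserves; typically used in interrupt handlers.

GFP_HIGHUSER: allocate from high memory.

GFP_NOIO: do not perform any I/O while allocating.

GFP_NOFS: do not make any filesystem calls while allocating.

GFP_NOWAIT: the allocation must not sleep.

__GFP_THISNODE: allocate only from the memory of the local node.

GFP_DMA: allocate memory suitable for DMA; should only be used with the kmalloc() caches.

__GFP_COLD: request cache-cold pages rather than cache-warm pages.

__GFP_HIGH: the allocation has high priority and may use emergency reserves.

__GFP_NOFAIL: the allocation is not allowed to fail; use this flag with great care.

__GFP_NORETRY: if memory is not immediately available, give up at once without retrying.

__GFP_NOWARN: if the allocation fails, do not issue any warning.

__GFP_REPEAT: if the allocation fails initially, try once more before failing.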
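To make the point about flags being combinable bits more concrete, here is a small user-space sketch. The MODEL_* names and their bit values are illustrative assumptions patterned after linux/gfp.h of this kernel generation, not the kernel's own definitions; the header in your tree is the authority.

```c
#include <assert.h>
#include <stdbool.h>

/* Hypothetical stand-ins for the low-level __GFP_* bits. The values are
 * assumptions for illustration only; the real ones live in linux/gfp.h. */
typedef unsigned int model_gfp_t;

#define MODEL_GFP_WAIT   0x10u   /* allocation may sleep */
#define MODEL_GFP_HIGH   0x20u   /* may use emergency reserves */
#define MODEL_GFP_IO     0x40u   /* may start I/O */
#define MODEL_GFP_FS     0x80u   /* may call into the filesystem */
#define MODEL_GFP_NOWARN 0x200u  /* stay quiet on failure */

/* Composite flags are plain ORs of the bits above. */
#define MODEL_GFP_ATOMIC (MODEL_GFP_HIGH)
#define MODEL_GFP_NOIO   (MODEL_GFP_WAIT)
#define MODEL_GFP_NOFS   (MODEL_GFP_WAIT | MODEL_GFP_IO)
#define MODEL_GFP_KERNEL (MODEL_GFP_WAIT | MODEL_GFP_IO | MODEL_GFP_FS)

bool model_may_sleep(model_gfp_t flags)
{
    return (flags & MODEL_GFP_WAIT) != 0;
}

bool model_may_do_io(model_gfp_t flags)
{
    return (flags & MODEL_GFP_IO) != 0;
}

bool model_may_use_fs(model_gfp_t flags)
{
    return (flags & MODEL_GFP_FS) != 0;
}

/* Small self-check: adding a modifier bit does not change the base behaviour. */
void model_gfp_selfcheck(void)
{
    model_gfp_t quiet_kernel = MODEL_GFP_KERNEL | MODEL_GFP_NOWARN;

    assert(model_may_sleep(quiet_kernel));
    assert(!model_may_sleep(MODEL_GFP_ATOMIC));
}
```

Under these assumptions, the "may sleep" property in the list above is simply the presence of one bit, which is why an atomic allocation differs from a normal kernel one only in which bits are set.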

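The __builtin_constant_p in the if test at the top of the function is a GCC built-in that checks whether a value is a compile-time constant, returning true if it is and false otherwise. So when kmalloc() is called with a constant size larger than KMALLOC_MAX_CACHE_SIZE (that is, larger than the biggest cache kmalloc() can allocate from), the allocation goes through kmalloc_large(). When the size is a constant within that limit, the allocator is not SLOB, and GFP_DMA is not set, kmalloc_index() selects the cache and the object comes directly from kmem_cache_alloc_trace(); a size of zero returns ZERO_SIZE_PTR. All remaining cases go through __kmalloc(). An allocation made through kmalloc_large() follows kmalloc_large()->kmalloc_order()->__get_free_pages() and is ultimately served by the buddy allocator.

The buddy algorithm was analyzed earlier and is not repeated here. Next, look at the implementation of __kmalloc():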
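The branch structure of the inline kmalloc() can be sketched as a plain user-space function. This is only a model: the model_* names are hypothetical, the 8192-byte limit is an assumed stand-in for KMALLOC_MAX_CACHE_SIZE (its real value depends on the allocator and page size), and whether the size is a compile-time constant is passed in as a flag because ordinary code cannot observe it the way __builtin_constant_p does.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>

/* Assumed stand-in for KMALLOC_MAX_CACHE_SIZE (two 4 KiB pages). */
#define MODEL_MAX_CACHE_SIZE ((size_t)8192)

enum model_path {
    MODEL_PATH_LARGE,        /* kmalloc_large(): pages from the buddy allocator */
    MODEL_PATH_FIXED_CACHE,  /* kmem_cache_alloc_trace() on a preselected cache */
    MODEL_PATH_ZERO,         /* ZERO_SIZE_PTR for a constant size of 0 */
    MODEL_PATH_GENERIC       /* __kmalloc(), which decides at run time */
};

/* Mirrors the if/else ladder of the inline kmalloc() shown above. */
enum model_path model_kmalloc_path(size_t size, bool size_is_constant,
                                   bool dma, bool slob)
{
    if (size_is_constant) {
        if (size > MODEL_MAX_CACHE_SIZE)
            return MODEL_PATH_LARGE;
        if (!slob && !dma)
            return size == 0 ? MODEL_PATH_ZERO : MODEL_PATH_FIXED_CACHE;
    }
    return MODEL_PATH_GENERIC;
}

/* Small self-check of the two most common outcomes. */
void model_path_selfcheck(void)
{
    assert(model_kmalloc_path(64, true, false, false) == MODEL_PATH_FIXED_CACHE);
    assert(model_kmalloc_path(64, false, false, false) == MODEL_PATH_GENERIC);
}
```

Note that a large size that is not a compile-time constant still reaches kmalloc_large(), only one step later: __kmalloc() repeats the size check at run time.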

【file:/mm/slub.c】
void *__kmalloc(size_t size, gfp_t flags)
{
    struct kmem_cache *s;
    void *ret;

    if (unlikely(size > KMALLOC_MAX_CACHE_SIZE))
        return kmalloc_large(size, flags);

    s = kmalloc_slab(size, flags);

    if (unlikely(ZERO_OR_NULL_PTR(s)))
        return s;

    ret = slab_alloc(s, flags, _RET_IP_);

    trace_kmalloc(_RET_IP_, ret, size, s->size, flags);

    return ret;
}

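This function also checks whether the request exceeds the maximum cache size and, if so, allocates through kmalloc_large(). Otherwise it calls kmalloc_slab() with the requested size and flags to find a suitable kmem_cache, returns early if that lookup yields NULL or ZERO_SIZE_PTR, and finally allocates the object from the slab through slab_alloc().

Now look at the implementation of kmalloc_slab():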

【file:/mm/slab_common.c】
/*
 * Find the kmem_cache structure that serves a given size of
 * allocation
 */
struct kmem_cache *kmalloc_slab(size_t size, gfp_t flags)
{
    int index;

    if (unlikely(size > KMALLOC_MAX_SIZE)) {
        WARN_ON_ONCE(!(flags & __GFP_NOWARN));
        return NULL;
    }

    if (size <= 192) {
        if (!size)
            return ZERO_SIZE_PTR;

        index = size_index[size_index_elem(size)];
    } else
        index = fls(size - 1);

#ifdef CONFIG_ZONE_DMA
    if (unlikely((flags & GFP_DMA)))
        return kmalloc_dma_caches[index];

#endif
    return kmalloc_caches[index];
}

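If the requested size exceeds KMALLOC_MAX_SIZE, the function returns NULL to signal failure, warning once unless __GFP_NOWARN is set. If the size is zero it returns ZERO_SIZE_PTR. If the size is nonzero and no greater than 192, the size_index_elem macro converts it into a subscript and the index is read from the global size_index array; otherwise the index is computed directly with fls(). Finally, if DMA zone support is configured and GFP_DMA is set, the kmem_cache is taken from kmalloc_dma_caches at that index; otherwise it is taken from kmalloc_caches.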
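The size-to-index mapping described above can be tried out in user space. The sketch below is a model under stated assumptions: the table is recalled from mm/slab_common.c for an 8-byte minimum object size and should be checked against your tree, the subscript expression (size - 1) / 8 is what size_index_elem is assumed to compute, and all model_* names are hypothetical.

```c
#include <assert.h>
#include <stddef.h>

/* Assumed contents of size_index[]: entry i serves sizes 8*i+1 .. 8*(i+1).
 * Index 1 stands for the 96-byte cache, index 2 for the 192-byte cache,
 * and any other index n for the 2^n-byte cache. */
static const signed char model_size_index[24] = {
    3, 4, 5, 5, 6, 6, 6, 6,   /*   8 ..  64 */
    1, 1, 1, 1, 7, 7, 7, 7,   /*  72 .. 128 */
    2, 2, 2, 2, 2, 2, 2, 2    /* 136 .. 192 */
};

/* Position of the highest set bit, counted from 1; 0 when x is 0. */
static int model_fls(unsigned long x)
{
    int pos = 0;

    while (x) {
        pos++;
        x >>= 1;
    }
    return pos;
}

/* Cache index for a nonzero size that does not exceed the maximum. */
int model_cache_index(size_t size)
{
    if (size <= 192)
        return model_size_index[(size - 1) / 8];
    return model_fls((unsigned long)size - 1);
}

/* Object size of the cache behind an index. */
size_t model_cache_size(int index)
{
    if (index == 1)
        return 96;
    if (index == 2)
        return 192;
    return (size_t)1 << index;
}

/* Small self-check: a 100-byte request lands in the 128-byte cache. */
void model_index_selfcheck(void)
{
    assert(model_cache_size(model_cache_index(100)) == 128);
}
```

The table exists because 96 and 192 are not powers of two: below 192 bytes a lookup is needed to slot those two caches in, while above it rounding up to the next power of two with fls(size - 1) is enough.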

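From this we can see that kmalloc() is fairly simple. The memory it returns is contiguous not only in virtual address space but also in physical address space, which sets it apart from memory obtained through vmalloc(), analyzed later.

Now a quick look at kfree(). It is implemented in several places, mainly slab.c, slob.c and slub.c, which is why kmalloc() and kfree() are said to be built on slab/slob/slub. Continuing from the earlier slub analysis, we examine the kfree() in slub.c: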

【file:/mm/slub.c】
void kfree(const void *x)
{
    struct page *page;
    void *object = (void *)x;

    trace_kfree(_RET_IP_, x);

    if (unlikely(ZERO_OR_NULL_PTR(x)))
        return;

    page = virt_to_head_page(x);
    if (unlikely(!PageSlab(page))) {
        BUG_ON(!PageCompound(page));
        kfree_hook(x);
        __free_memcg_kmem_pages(page, compound_order(page));
        return;
    }
    slab_free(page->slab_cache, page, object, _RET_IP_);
}

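The function is straightforward. It first records the call with trace_kfree(). Then if (unlikely(ZERO_OR_NULL_PTR(x))) returns immediately when the pointer is NULL or ZERO_SIZE_PTR. Next, virt_to_head_page(x) translates the virtual address into its head page, and if (unlikely(!PageSlab(page))) tests whether that page is managed by the slab allocator. If it is, the object is released through slab_free(). If it is not, the if branch is taken: BUG_ON() asserts that the page is a compound page, kfree_hook() does the kmemleak bookkeeping before the release (it mainly wraps kmemleak_free()), and __free_memcg_kmem_pages() frees the pages and also takes care of the memory cgroup accounting.

That is all there is to kmalloc() and kfree().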
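The early-return check and the two release paths can be modelled in user space as follows. This is a sketch under assumptions: the sentinel value 16 and the comparison are recalled from linux/slab.h and should be verified there, the model_* names are hypothetical, and the PageSlab() test is replaced by a boolean argument because there is no struct page outside the kernel.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>
#include <stdint.h>

/* Assumed model of ZERO_SIZE_PTR: a small non-NULL sentinel address. */
#define MODEL_ZERO_SIZE_PTR ((void *)(uintptr_t)16)

/* Assumed model of ZERO_OR_NULL_PTR(): one comparison covers both NULL
 * and the sentinel, since both are at or below the sentinel address. */
bool model_zero_or_null_ptr(const void *p)
{
    return (uintptr_t)p <= (uintptr_t)MODEL_ZERO_SIZE_PTR;
}

enum model_free_path {
    MODEL_FREE_NOTHING,  /* NULL or ZERO_SIZE_PTR: return at once */
    MODEL_FREE_SLAB,     /* slab-managed page: slab_free() */
    MODEL_FREE_PAGES     /* compound page from kmalloc_large(): free the pages */
};

/* Mirrors the decision order of kfree() in slub.c. */
enum model_free_path model_kfree_path(const void *x, bool page_is_slab)
{
    if (model_zero_or_null_ptr(x))
        return MODEL_FREE_NOTHING;
    return page_is_slab ? MODEL_FREE_SLAB : MODEL_FREE_PAGES;
}

/* Small self-check: freeing the result of a zero-size request is a no-op. */
void model_kfree_selfcheck(void)
{
    assert(model_kfree_path(MODEL_ZERO_SIZE_PTR, true) == MODEL_FREE_NOTHING);
    assert(model_kfree_path(NULL, false) == MODEL_FREE_NOTHING);
}
```

This also shows why a zero-size allocation is safe to pass back: the sentinel that kmalloc() hands out for size 0 is filtered by the same test that filters NULL.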