1.. highlightlang:: c 2 3 4.. _memory: 5 6***************** 7Memory Management 8***************** 9 10.. sectionauthor:: Vladimir Marangozov <Vladimir.Marangozov@inrialpes.fr> 11 12 13 14.. _memoryoverview: 15 16Overview 17======== 18 19Memory management in Python involves a private heap containing all Python 20objects and data structures. The management of this private heap is ensured 21internally by the *Python memory manager*. The Python memory manager has 22different components which deal with various dynamic storage management aspects, 23like sharing, segmentation, preallocation or caching. 24 25At the lowest level, a raw memory allocator ensures that there is enough room in 26the private heap for storing all Python-related data by interacting with the 27memory manager of the operating system. On top of the raw memory allocator, 28several object-specific allocators operate on the same heap and implement 29distinct memory management policies adapted to the peculiarities of every object 30type. For example, integer objects are managed differently within the heap than 31strings, tuples or dictionaries because integers imply different storage 32requirements and speed/space tradeoffs. The Python memory manager thus delegates 33some of the work to the object-specific allocators, but ensures that the latter 34operate within the bounds of the private heap. 35 36It is important to understand that the management of the Python heap is 37performed by the interpreter itself and that the user has no control over it, 38even if they regularly manipulate object pointers to memory blocks inside that 39heap. The allocation of heap space for Python objects and other internal 40buffers is performed on demand by the Python memory manager through the Python/C 41API functions listed in this document. 42 43.. index:: 44 single: malloc() 45 single: calloc() 46 single: realloc() 47 single: free() 48 49To avoid memory corruption, extension writers should never try to operate on 50Python objects with the functions exported by the C library: :c:func:`malloc`, 51:c:func:`calloc`, :c:func:`realloc` and :c:func:`free`. This will result in mixed 52calls between the C allocator and the Python memory manager with fatal 53consequences, because they implement different algorithms and operate on 54different heaps. However, one may safely allocate and release memory blocks 55with the C library allocator for individual purposes, as shown in the following 56example:: 57 58 PyObject *res; 59 char *buf = (char *) malloc(BUFSIZ); /* for I/O */ 60 61 if (buf == NULL) 62 return PyErr_NoMemory(); 63 ...Do some I/O operation involving buf... 64 res = PyString_FromString(buf); 65 free(buf); /* malloc'ed */ 66 return res; 67 68In this example, the memory request for the I/O buffer is handled by the C 69library allocator. The Python memory manager is involved only in the allocation 70of the string object returned as a result. 71 72In most situations, however, it is recommended to allocate memory from the 73Python heap specifically because the latter is under control of the Python 74memory manager. For example, this is required when the interpreter is extended 75with new object types written in C. Another reason for using the Python heap is 76the desire to *inform* the Python memory manager about the memory needs of the 77extension module. Even when the requested memory is used exclusively for 78internal, highly-specific purposes, delegating all memory requests to the Python 79memory manager causes the interpreter to have a more accurate image of its 80memory footprint as a whole. Consequently, under certain circumstances, the 81Python memory manager may or may not trigger appropriate actions, like garbage 82collection, memory compaction or other preventive procedures. Note that by using 83the C library allocator as shown in the previous example, the allocated memory 84for the I/O buffer escapes completely the Python memory manager. 85 86 87.. _memoryinterface: 88 89Memory Interface 90================ 91 92The following function sets, modeled after the ANSI C standard, but specifying 93behavior when requesting zero bytes, are available for allocating and releasing 94memory from the Python heap: 95 96 97.. c:function:: void* PyMem_Malloc(size_t n) 98 99 Allocates *n* bytes and returns a pointer of type :c:type:`void\*` to the 100 allocated memory, or *NULL* if the request fails. Requesting zero bytes returns 101 a distinct non-*NULL* pointer if possible, as if ``PyMem_Malloc(1)`` had 102 been called instead. The memory will not have been initialized in any way. 103 104 105.. c:function:: void* PyMem_Realloc(void *p, size_t n) 106 107 Resizes the memory block pointed to by *p* to *n* bytes. The contents will be 108 unchanged to the minimum of the old and the new sizes. If *p* is *NULL*, the 109 call is equivalent to ``PyMem_Malloc(n)``; else if *n* is equal to zero, 110 the memory block is resized but is not freed, and the returned pointer is 111 non-*NULL*. Unless *p* is *NULL*, it must have been returned by a previous call 112 to :c:func:`PyMem_Malloc` or :c:func:`PyMem_Realloc`. If the request fails, 113 :c:func:`PyMem_Realloc` returns *NULL* and *p* remains a valid pointer to the 114 previous memory area. 115 116 117.. c:function:: void PyMem_Free(void *p) 118 119 Frees the memory block pointed to by *p*, which must have been returned by a 120 previous call to :c:func:`PyMem_Malloc` or :c:func:`PyMem_Realloc`. Otherwise, or 121 if ``PyMem_Free(p)`` has been called before, undefined behavior occurs. If 122 *p* is *NULL*, no operation is performed. 123 124The following type-oriented macros are provided for convenience. Note that 125*TYPE* refers to any C type. 126 127 128.. c:function:: TYPE* PyMem_New(TYPE, size_t n) 129 130 Same as :c:func:`PyMem_Malloc`, but allocates ``(n * sizeof(TYPE))`` bytes of 131 memory. Returns a pointer cast to :c:type:`TYPE\*`. The memory will not have 132 been initialized in any way. 133 134 135.. c:function:: TYPE* PyMem_Resize(void *p, TYPE, size_t n) 136 137 Same as :c:func:`PyMem_Realloc`, but the memory block is resized to ``(n * 138 sizeof(TYPE))`` bytes. Returns a pointer cast to :c:type:`TYPE\*`. On return, 139 *p* will be a pointer to the new memory area, or *NULL* in the event of 140 failure. This is a C preprocessor macro; p is always reassigned. Save 141 the original value of p to avoid losing memory when handling errors. 142 143 144.. c:function:: void PyMem_Del(void *p) 145 146 Same as :c:func:`PyMem_Free`. 147 148In addition, the following macro sets are provided for calling the Python memory 149allocator directly, without involving the C API functions listed above. However, 150note that their use does not preserve binary compatibility across Python 151versions and is therefore deprecated in extension modules. 152 153:c:func:`PyMem_MALLOC`, :c:func:`PyMem_REALLOC`, :c:func:`PyMem_FREE`. 154 155:c:func:`PyMem_NEW`, :c:func:`PyMem_RESIZE`, :c:func:`PyMem_DEL`. 156 157 158Object allocators 159================= 160 161The following function sets, modeled after the ANSI C standard, but specifying 162behavior when requesting zero bytes, are available for allocating and releasing 163memory from the Python heap. 164 165By default, these functions use :ref:`pymalloc memory allocator <pymalloc>`. 166 167.. warning:: 168 169 The :term:`GIL <global interpreter lock>` must be held when using these 170 functions. 171 172.. c:function:: void* PyObject_Malloc(size_t n) 173 174 Allocates *n* bytes and returns a pointer of type :c:type:`void\*` to the 175 allocated memory, or *NULL* if the request fails. 176 177 Requesting zero bytes returns a distinct non-*NULL* pointer if possible, as 178 if ``PyObject_Malloc(1)`` had been called instead. The memory will not have 179 been initialized in any way. 180 181 182.. c:function:: void* PyObject_Realloc(void *p, size_t n) 183 184 Resizes the memory block pointed to by *p* to *n* bytes. The contents will be 185 unchanged to the minimum of the old and the new sizes. 186 187 If *p* is *NULL*, the call is equivalent to ``PyObject_Malloc(n)``; else if *n* 188 is equal to zero, the memory block is resized but is not freed, and the 189 returned pointer is non-*NULL*. 190 191 Unless *p* is *NULL*, it must have been returned by a previous call to 192 :c:func:`PyObject_Malloc`, :c:func:`PyObject_Realloc` or :c:func:`PyObject_Calloc`. 193 194 If the request fails, :c:func:`PyObject_Realloc` returns *NULL* and *p* remains 195 a valid pointer to the previous memory area. 196 197 198.. c:function:: void PyObject_Free(void *p) 199 200 Frees the memory block pointed to by *p*, which must have been returned by a 201 previous call to :c:func:`PyObject_Malloc`, :c:func:`PyObject_Realloc` or 202 :c:func:`PyObject_Calloc`. Otherwise, or if ``PyObject_Free(p)`` has been called 203 before, undefined behavior occurs. 204 205 If *p* is *NULL*, no operation is performed. 206 207 208In addition, the following macro sets are provided: 209 210* :c:func:`PyObject_MALLOC`: alias to :c:func:`PyObject_Malloc` 211* :c:func:`PyObject_REALLOC`: alias to :c:func:`PyObject_Realloc` 212* :c:func:`PyObject_FREE`: alias to :c:func:`PyObject_Free` 213* :c:func:`PyObject_Del`: alias to :c:func:`PyObject_Free` 214* :c:func:`PyObject_DEL`: alias to :c:func:`PyObject_FREE` (so finally an alias 215 to :c:func:`PyObject_Free`) 216 217 218.. _pymalloc: 219 220The pymalloc allocator 221====================== 222 223Python has a *pymalloc* allocator optimized for small objects (smaller or equal 224to 512 bytes) with a short lifetime. It uses memory mappings called "arenas" 225with a fixed size of 256 KiB. It falls back to :c:func:`malloc` and 226:c:func:`realloc` for allocations larger than 512 bytes. 227 228*pymalloc* is the default allocator of :c:func:`PyObject_Malloc`. 229 230The arena allocator uses the following functions: 231 232* :c:func:`mmap` and :c:func:`munmap` if available, 233* :c:func:`malloc` and :c:func:`free` otherwise. 234 235.. versionchanged:: 2.7.7 236 The threshold changed from 256 to 512 bytes. The arena allocator now 237 uses :c:func:`mmap` if available. 238 239 240.. _memoryexamples: 241 242Examples 243======== 244 245Here is the example from section :ref:`memoryoverview`, rewritten so that the 246I/O buffer is allocated from the Python heap by using the first function set:: 247 248 PyObject *res; 249 char *buf = (char *) PyMem_Malloc(BUFSIZ); /* for I/O */ 250 251 if (buf == NULL) 252 return PyErr_NoMemory(); 253 /* ...Do some I/O operation involving buf... */ 254 res = PyString_FromString(buf); 255 PyMem_Free(buf); /* allocated with PyMem_Malloc */ 256 return res; 257 258The same code using the type-oriented function set:: 259 260 PyObject *res; 261 char *buf = PyMem_New(char, BUFSIZ); /* for I/O */ 262 263 if (buf == NULL) 264 return PyErr_NoMemory(); 265 /* ...Do some I/O operation involving buf... */ 266 res = PyString_FromString(buf); 267 PyMem_Del(buf); /* allocated with PyMem_New */ 268 return res; 269 270Note that in the two examples above, the buffer is always manipulated via 271functions belonging to the same set. Indeed, it is required to use the same 272memory API family for a given memory block, so that the risk of mixing different 273allocators is reduced to a minimum. The following code sequence contains two 274errors, one of which is labeled as *fatal* because it mixes two different 275allocators operating on different heaps. :: 276 277 char *buf1 = PyMem_New(char, BUFSIZ); 278 char *buf2 = (char *) malloc(BUFSIZ); 279 char *buf3 = (char *) PyMem_Malloc(BUFSIZ); 280 ... 281 PyMem_Del(buf3); /* Wrong -- should be PyMem_Free() */ 282 free(buf2); /* Right -- allocated via malloc() */ 283 free(buf1); /* Fatal -- should be PyMem_Del() */ 284 285In addition to the functions aimed at handling raw memory blocks from the Python 286heap, objects in Python are allocated and released with :c:func:`PyObject_New`, 287:c:func:`PyObject_NewVar` and :c:func:`PyObject_Del`. 288 289These will be explained in the next chapter on defining and implementing new 290object types in C. 291 292