This repository has been archived by the owner on Mar 20, 2023. It is now read-only.

CoreNEURON memory allocation routines are confusing #806

Open
olupton opened this issue Apr 26, 2022 · 0 comments · May be fixed by #872

olupton commented Apr 26, 2022

Describe the issue
The current codebase uses a wide range of memory [de]allocation routines: a mix of new/delete and malloc/free, lower-level library calls such as posix_memalign, and various homespun methods such as coreneuron::[de]allocate_unified:

/** @brief Allocate unified memory in GPU builds iff GPU enabled, otherwise new
*/
void* allocate_unified(std::size_t num_bytes);
/** @brief Deallocate memory allocated by `allocate_unified`.
*/
void deallocate_unified(void* ptr, std::size_t num_bytes);

alloc_memory, calloc_memory and free_memory,

and helper structs like MemoryManaged:
class MemoryManaged {
  public:
    void* operator new(size_t len) {
        void* ptr;
        cudaMallocManaged(&ptr, len);
        cudaDeviceSynchronize();
        return ptr;
    }
    void* operator new[](size_t len) {
        void* ptr;
        cudaMallocManaged(&ptr, len);
        cudaDeviceSynchronize();
        return ptr;
    }
    void operator delete(void* ptr) {
        cudaDeviceSynchronize();
        cudaFree(ptr);
    }
    void operator delete[](void* ptr) {
        cudaDeviceSynchronize();
        cudaFree(ptr);
    }
};

Some of these names are not very descriptive, and some of their behaviours change according to compile-time and runtime options. This has led to bugs (see #594, for example) where we end up with mismatched allocations and deallocations.

This should be improved, with more descriptively named methods and better organisation that enforces consistent pairing of allocation and deallocation functions.

Discussion
Note that we need to be able to request a few different types of allocation. For example, in GPU builds, we may need to distinguish between:

  • Host-only memory.
  • Unified memory (accessible from both host and device) even when CORENRN_ENABLE_CUDA_UNIFIED_MEMORY=OFF, for things like Random123 state that require unified memory (Use CUDA unified memory for Random123 state #595).
  • Unified memory if CORENRN_ENABLE_CUDA_UNIFIED_MEMORY=ON, otherwise host memory.

Additionally, the last two should probably return host-only memory in GPU builds where the GPU was not enabled at runtime by passing --gpu or setting coreneuron.gpu = True.

Ideally we would handle these through a more uniform API that makes these distinctions clear.
Right now, {alloc,calloc,free}_memory and MemoryManaged provide the third case (though without checking --gpu), coreneuron::[de]allocate_unified provide the second, and a mix of standard APIs provide the first.
