I have allocated memory for a 3D array using cudaMalloc3D. After the first kernel finishes, I find that I no longer need part of it. For example, in pseudocode:

A = [100,100,100]
kernel() // the data of interest is only in a subrange of A
B = [10:20, 20:100, 50:80] // the part I need; I would like the other entries to be removed
... // new allocations
kernelb()...

I would like to free the rest of the memory (or immediately reuse it for other arrays that I need to allocate now).

I know that I can free the array and reallocate, but that does not seem to be the best option.

P.S. Is there a way to use cudaMallocAsync like cudaMalloc3D? I mean, cudaMalloc3D makes it convenient to work with a 3D array and takes care of the padding.


The current CUDA API does not have realloc functionality. It seems you already know the common workaround: cudaMalloc a smaller array -> cudaMemcpy the subrange into it -> cudaFree the large array.
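In case it helps, here is a minimal sketch of that workaround for the shapes in your pseudocode, assuming float data and that the first index maps to the fastest-varying (x/width) dimension; the kernel calls are just placeholders and error checking is omitted:

```cpp
// Shrink a 100x100x100 float volume to the 10x80x30 sub-block
// [10:20, 20:100, 50:80] by copying it into a smaller pitched
// allocation and freeing the original.
#include <cuda_runtime.h>

int main() {
    // Original 100x100x100 volume (extent width is given in bytes).
    cudaExtent bigExtent = make_cudaExtent(100 * sizeof(float), 100, 100);
    cudaPitchedPtr big;
    cudaMalloc3D(&big, bigExtent);

    // ... kernel() runs here and fills `big` ...

    // Sub-block of interest: x in [10,20), y in [20,100), z in [50,80).
    cudaExtent subExtent = make_cudaExtent(10 * sizeof(float), 80, 30);
    cudaPitchedPtr small;
    cudaMalloc3D(&small, subExtent);

    // Device-to-device copy of just the sub-block.
    cudaMemcpy3DParms p = {};
    p.srcPtr = big;
    p.srcPos = make_cudaPos(10 * sizeof(float), 20, 50); // x offset in bytes, y/z in rows/slices
    p.dstPtr = small;
    p.dstPos = make_cudaPos(0, 0, 0);
    p.extent = subExtent;
    p.kind   = cudaMemcpyDeviceToDevice;
    cudaMemcpy3D(&p);

    // The large buffer can now be released and its memory reused.
    cudaFree(big.ptr);

    // ... allocate new arrays, run kernelb() on `small` ...

    cudaFree(small.ptr);
    return 0;
}
```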

In case you really need realloc-like behavior, you could write your own allocator using the GPU virtual memory management API: https://developer.nvidia.com/blog/introducing-low-level-gpu-virtual-memory-management/
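For completeness, here is a rough sketch of the idea from that post: reserve one virtual address range, back it with several physical chunks via cuMemCreate/cuMemMap, and later unmap and release only the chunks you no longer need, so the surviving data keeps its addresses. The chunk count and sizes below are illustrative, not tied to your 100x100x100 array, and with this driver-level API you would have to handle any 3D pitch/padding yourself:

```cpp
// Hedged sketch of a "shrinkable" allocation built on the CUDA
// virtual memory management (driver) API.
#include <cuda.h>
#include <cstdio>
#include <vector>

#define CHECK(call) do { CUresult r = (call); if (r != CUDA_SUCCESS) { \
    const char* s = nullptr; cuGetErrorString(r, &s); \
    std::printf("CUDA error: %s\n", s); return 1; } } while (0)

int main() {
    CHECK(cuInit(0));
    CUdevice dev;  CHECK(cuDeviceGet(&dev, 0));
    CUcontext ctx; CHECK(cuCtxCreate(&ctx, 0, dev));

    CUmemAllocationProp prop = {};
    prop.type = CU_MEM_ALLOCATION_TYPE_PINNED;
    prop.location.type = CU_MEM_LOCATION_TYPE_DEVICE;
    prop.location.id = dev;

    // Physical chunks must be multiples of the allocation granularity.
    size_t gran = 0;
    CHECK(cuMemGetAllocationGranularity(&gran, &prop,
                                        CU_MEM_ALLOC_GRANULARITY_MINIMUM));

    const size_t numChunks = 4;           // illustrative
    const size_t chunkSize = gran;        // one granule per chunk
    const size_t totalSize = numChunks * chunkSize;

    // Reserve one contiguous virtual address range for the whole array.
    CUdeviceptr base = 0;
    CHECK(cuMemAddressReserve(&base, totalSize, 0, 0, 0));

    // Back every chunk with physical memory and make it accessible.
    CUmemAccessDesc access = {};
    access.location = prop.location;
    access.flags = CU_MEM_ACCESS_FLAGS_PROT_READWRITE;

    std::vector<CUmemGenericAllocationHandle> handles(numChunks);
    for (size_t i = 0; i < numChunks; ++i) {
        CHECK(cuMemCreate(&handles[i], chunkSize, &prop, 0));
        CHECK(cuMemMap(base + i * chunkSize, chunkSize, 0, handles[i], 0));
        CHECK(cuMemSetAccess(base + i * chunkSize, chunkSize, &access, 1));
    }

    // ... run kernel() on the full range [base, base + totalSize) ...

    // "Shrink": return the physical memory of the last two chunks while the
    // first chunks (the data of interest) stay mapped at the same addresses.
    for (size_t i = 2; i < numChunks; ++i) {
        CHECK(cuMemUnmap(base + i * chunkSize, chunkSize));
        CHECK(cuMemRelease(handles[i]));
    }

    // ... run kernelb() on the part that is still mapped ...

    // Final cleanup: unmap/release what is left, then free the reservation.
    for (size_t i = 0; i < 2; ++i) {
        CHECK(cuMemUnmap(base + i * chunkSize, chunkSize));
        CHECK(cuMemRelease(handles[i]));
    }
    CHECK(cuMemAddressFree(base, totalSize));
    CHECK(cuCtxDestroy(ctx));
    return 0;
}
```

Keep in mind that the granularity is typically on the order of a couple of megabytes, so freeing parts of an allocation this way only pays off for fairly large buffers.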