Calling std::merge or std::stable_sort with device_vector iterators as inputs (that is, running a CPU algorithm on data that resides on the GPU) doesn't compile.