Performance of qsort vs std::sort?
Solution 1:
std::clock() is not a viable timing clock. You should use a platform-specific higher resolution timer, like the Windows High Performance Timer. More than that, the way that you call clock() is that first, text is output to the console, which is included in the time. This definitely invalidates the test. In addition, make sure that you compiled with all optimizations.
Finally, I copied and pasted your code, and got 0.016 for qsort and 0.008 for std::sort.
Solution 2:
I am surprised that no one mentions caches.
In your code, you start by touching ary and *ary_copy* so they are resident in the cache at the time of qsort. During qsort, *ary_copy* might get evicted. At the time of std::sort, the elements would have to be fetched from memory or a larger (read slower) cache level. This will of course depend on your cache sizes.
Try to reverse the test, i.e., start by running std::sort.
As some people have pointed out; making the array larger will make the test more fair. The reason is that a large array is less likely to fit in cache.