Can we rely on the reduce-capacity trick?
Is it actually guaranteed anywhere that the following reduce-capacity trick will "work"?
#include <string>

int main() {
    std::string s = "lololololol";
    s = "";                  // capacity still non-zero
    std::string(s).swap(s);  // swap with a temporary copy: is capacity reduced?
}
It doesn't seem to "work" for me (in that the capacity remains non-zero), and I can't find anything in the standard that says anything more than that the "contents" must be swapped between the two [here, identical] objects.
Similarly, for sequence containers:
#include <vector>

int main() {
    std::vector<int> v { 1, 2, 3, 4, 5 };
    v.clear();                    // capacity still non-zero
    std::vector<int>(v).swap(v);  // swap with a temporary copy: is capacity reduced?
}
As far as I'm aware, this "trick" is semi-widely used; perhaps this widespread adoption is misguided?
(Of course, in C++11 we have shrink_to_fit [albeit non-binding] instead, making this kind of moot.)
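For reference, here is a minimal sketch of how the non-binding shrink_to_fit request could be observed on a given implementation. The helper name capacity_around_shrink is hypothetical (it is not part of the standard library), and the only thing the sketch relies on is that capacity() is never smaller than size(); whether the second number actually drops is up to the implementation.

```cpp
#include <cassert>
#include <cstddef>
#include <string>
#include <utility>
#include <vector>

// Hypothetical helper (not a standard facility): records capacity() before
// and after a shrink_to_fit() request on a std::vector or std::string.
template <typename Container>
std::pair<std::size_t, std::size_t> capacity_around_shrink(Container& c) {
    const std::size_t before = c.capacity();
    c.shrink_to_fit();  // C++11; a non-binding request that may be ignored
    // The one thing we can rely on: capacity never falls below size.
    assert(c.capacity() >= c.size());
    return std::make_pair(before, c.capacity());
}
```

Calling it on a cleared vector or an emptied string and printing both numbers shows what a particular library does; the result should not be treated as portable.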
Solution 1:
I've always been taught that there is no guaranteed standard way to lower the capacity. All methods have been (and still are) implementation defined.
§ 23.2.1\8 says:
The expression a.swap(b), for containers a and b of a standard container type other than array, shall exchange the values of a and b without invoking any move, copy, or swap operations on the individual container elements...
This guarantees that the internal pointers of vectors must be swapped.
However, I cannot find anything that guarantees the capacity of a newly created vector.
§ 21.4.2\1 says that one of the basic_string default constructor's postconditions is that capacity() returns an unspecified value.
§ 21.4.2\3 says that one of the basic_string copy constructor's postconditions is that capacity() returns a value at least as big as size().
§ 21.4.6.8\2 says that string::swap runs in constant time, which (effectively) requires that the internal pointers are swapped.
As far as I can tell, a conforming implementation could have string::max_size() { return 4; }, and swapping all internals from one buffer to another would therefore be constant time. (vector can't do that, though.)
Obviously, take this all with a grain of salt. I'm quoting from the C++ draft from Feb 28, '11, and I can't find specifications for vector's copy constructor. Also, not finding evidence for something is not the same as finding evidence against it.
Solution 2:
On his errata page for "Effective STL," Scott Meyers notes:
When string implementations use reference counting, the swap trick using the copy constructor doesn't decrease the capacity, because the copy constructor isn't allocating any memory; it's just adjusting a reference count. A more reliable way to perform shrink-to-fit is to create the temporary string via the range constructor, e.g., string(s.begin(), s.end()).swap(s); This version of the swap trick is safer for vectors, too, because it eliminates any chance that the copy constructor will copy the other vector's excess capacity (which implementations are permitted to do).
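A minimal sketch of the range-constructor variant described in the erratum, for both string and vector. The helper name shrink_by_range_swap is hypothetical, introduced only for illustration; as with the original trick, nothing here guarantees that capacity() ends up equal to size().

```cpp
#include <cassert>
#include <string>
#include <vector>

// Hypothetical helper: build a temporary from the element range (not via the
// copy constructor), then swap it with the original.
inline void shrink_by_range_swap(std::string& s) {
    std::string(s.begin(), s.end()).swap(s);
    assert(s.capacity() >= s.size());  // the only portable postcondition
}

// Same idea for std::vector<T>.
template <typename T>
void shrink_by_range_swap(std::vector<T>& v) {
    std::vector<T>(v.begin(), v.end()).swap(v);
    assert(v.capacity() >= v.size());  // the only portable postcondition
}
```

The contents are preserved in both cases; any reduction in capacity is typical of common implementations but remains an implementation choice.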
As for the 'guarantee,' Meyers notes:
The language police require that I inform you that there's no guarantee that this technique will truly eliminate excess capacity. Implementers are free to give vectors and strings excess capacity if they want to, and sometimes they want to. [Effective STL, Item 17]