user2296145 - 1 year ago 69

C++ Question

Here you can see the code, run it and see the times.

http://rextester.com/MNQZS47293

I get similar results on my machine (using the same version of MSVC), the lookup in the vector is slower than in the std::set.

I would expect the sorted vector version to be faster, due to better locality of the data (more cache friendly).

In the worse case, I would expect them to be similar, because they both perform a binary search, but I cannot understand why the std::set is much faster than the sorted vector version.

Thank you very much

Edit: Sorry, I pasted the wrong link (I modified the code but forgot to copy the link) the old code was using an unordered_set, this code is using a set, and the question remains the same: Why is the binary search over a sorted vector slower than over a set? I've noticed that if the number of elements is large enough, then the sorted vector is faster, but I still cannot understand why the set can outperform the sorted vector for any number of elements.

Answer Source

For the updated question:

`-O1`

and `-O2`

gives the same performance for the two methods.

`-Ox`

slows down the vector version.

Why this is, you need to look at the disassembly or at the details of the `-Ox`

level. It has nothing to do with the algorithmic properties of `set.find`

and `lower_bound/binary_search`

.

Regarding the locality of data. A binary_search and a set::find has for reasonable implementations exactly the same locality of data. The set might even win with the data being read in a left-to-right fashion.