Finding three elements in an array whose sum is closest to a given number
Given an array of integers, A1, A2, ..., An, including negatives and positives, and another integer S. Now we need to find three different integers in the array, whose sum is closest to the given integer S. If there exists more than one solution, any of them is ok.
You can assume all the integers are within int32_t range, and no arithmetic overflow will occur with calculating the sum. S is nothing special but a randomly picked number.
Is there any efficient algorithm other than brute force search to find the three integers?
Solution 1:
Is there any efficient algorithm other than brute force search to find the three integers?
Yep; we can solve this in O(n2) time! First, consider that your problem P
can be phrased equivalently in a slightly different way that eliminates the need for a "target value":
original problem
P
: Given an arrayA
ofn
integers and a target valueS
, does there exist a 3-tuple fromA
that sums toS
?modified problem
P'
: Given an arrayA
ofn
integers, does there exist a 3-tuple fromA
that sums to zero?
Notice that you can go from this version of the problem P'
from P
by subtracting your S/3 from each element in A
, but now you don't need the target value anymore.
Clearly, if we simply test all possible 3-tuples, we'd solve the problem in O(n3) -- that's the brute-force baseline. Is it possible to do better? What if we pick the tuples in a somewhat smarter way?
First, we invest some time to sort the array, which costs us an initial penalty of O(n log n). Now we execute this algorithm:
for (i in 1..n-2) {
j = i+1 // Start right after i.
k = n // Start at the end of the array.
while (k >= j) {
// We got a match! All done.
if (A[i] + A[j] + A[k] == 0) return (A[i], A[j], A[k])
// We didn't match. Let's try to get a little closer:
// If the sum was too big, decrement k.
// If the sum was too small, increment j.
(A[i] + A[j] + A[k] > 0) ? k-- : j++
}
// When the while-loop finishes, j and k have passed each other and there's
// no more useful combinations that we can try with this i.
}
This algorithm works by placing three pointers, i
, j
, and k
at various points in the array. i
starts off at the beginning and slowly works its way to the end. k
points to the very last element. j
points to where i
has started at. We iteratively try to sum the elements at their respective indices, and each time one of the following happens:
- The sum is exactly right! We've found the answer.
- The sum was too small. Move
j
closer to the end to select the next biggest number. - The sum was too big. Move
k
closer to the beginning to select the next smallest number.
For each i
, the pointers of j
and k
will gradually get closer to each other. Eventually they will pass each other, and at that point we don't need to try anything else for that i
, since we'd be summing the same elements, just in a different order. After that point, we try the next i
and repeat.
Eventually, we'll either exhaust the useful possibilities, or we'll find the solution. You can see that this is O(n2) since we execute the outer loop O(n) times and we execute the inner loop O(n) times. It's possible to do this sub-quadratically if you get really fancy, by representing each integer as a bit vector and performing a fast Fourier transform, but that's beyond the scope of this answer.
Note: Because this is an interview question, I've cheated a little bit here: this algorithm allows the selection of the same element multiple times. That is, (-1, -1, 2) would be a valid solution, as would (0, 0, 0). It also finds only the exact answers, not the closest answer, as the title mentions. As an exercise to the reader, I'll let you figure out how to make it work with distinct elements only (but it's a very simple change) and exact answers (which is also a simple change).
Solution 2:
certainly this is a better solution because it's easier to read and therefore less prone to errors. The only problem is, we need to add a few lines of code to avoid multiple selection of one element.
Another O(n^2) solution (by using a hashset).
// K is the sum that we are looking for
for i 1..n
int s1 = K - A[i]
for j 1..i
int s2 = s1 - A[j]
if (set.contains(s2))
print the numbers
set.add(A[i])
Solution 3:
John Feminella's solution has a bug.
At the line
if (A[i] + A[j] + A[k] == 0) return (A[i], A[j], A[k])
We need to check if i,j,k are all distinct. Otherwise, if my target element is 6
and if my input array contains {3,2,1,7,9,0,-4,6}
. If i print out the tuples that sum to 6, then I would also get 0,0,6
as output . To avoid this, we need to modify the condition in this way.
if ((A[i] + A[j] + A[k] == 0) && (i!=j) && (i!=k) && (j!=k)) return (A[i], A[j], A[k])