Loop invariant of Selection Sort

algorithm-analysisalgorithms

 Selection Sort(A[1,2..n]:array of integers)
 1.for i=1  to n-1
 2.for j=i+1 to n
 3.if A[j]<A[i]
 4     swap(A[i],A[j])

I am trying to prove the correctness of this algorithm,I read from my book and here is what is written:We must have 2 invariants for the inner and outer loop

The inner states:Every time we reach line 2,the current A[i] hold the value of a minimum element from A[i ,…,j-1]

The outer states : Everytime we are at line 1,the current subarray A[1,..,i-1] consists of i-1 in number smallest elements from the original array
A'[1,….,n] in sorted order

Now to prove that it is really sorting we use the outer loop with the claim that the part from A[1,..,i-1] is sorted and from the inner loop invariant A[i] is the minimum from A[1,…n] and it is solved ,however what seems unclear to me is how we design the inner invariant ,why do we say A[i] is the minimum from A[i,..,j-1] I tried with some values and it always turns only 1 element and I guess in our final proof we say not A[i] is the minimum in A[i,…j-1] but A[i,..n].Shouldn't we say A[i] is the minimum in A[i,..,j-1] since we have proved that ?

Best Answer

The analysis of the program would become easier if you rewrote the for loops as the equivalent while loops, because then it becomes obvious what is the value of the 'controlled variable' after the loop has finished. In this style, your inner loop is

j = i+1
(* Inv: i+1 <= j <= n+1 and A[i] = min A[i..j-1] *)
while j <= n:
    if a[j] < a[i]:
        swap(A[i], A[j])
    j = j+1

The invariant is true when j = i+1, and it is maintained by the loop body. When the loop terminates, we have j = n+1, and the invariant tells us that A[i] = min A[i..j-1] = min A[i..n]. That is what is needed to justify a claim that A[1..i] contains the smallest i elements of A in sorted order.

The outer loop becomes

i = 1
(* Inv: 1 <= i <= n and A[1..i-1] contains the smallest i-1
        elements of A[1..n] in sorted order *)
while i <= n-1:
    [Inner loop]
    i = i+1

So if the inner loop ends with A[1..i] as described, this turns into A[1..i-1] after taking into account the increment of i. The outer loop can terminate with i = n, because at that point we have selected and sorted n-1 elements of A, and the last remaining element must be the biggest.

It does no harm and aids clarity to make the ranges of variables like i and j explicit in the invariants.

Generally speaking: this sort of analysis becomes easier if you adopt 0-based indexing, because then the fog of +1's and -1's dissipates; but if your book uses 1-based indexing, I guess you're stuck with it. I also like to use the notation A[i..j) for A[i..j-1].

Some languages (dating back to Algol 60) say that the value of the controlled variable is undefined after a for loop, and others encourage us to make the variable local to the loop. Either way, we are discouraged from saying that its value after the loop is n+1, but that assertion is needed to make sense of what the invariant tells us about the state after the loop is finished. That's why I teach my students to mentally rewrite the loop as a while before analysis.

Related Solutions

Merge sort versus quick sort performance

If you look at your code for swapping you:

// If current element is lower than pivot
// then swap it with the element at store_index
// and move the store_index to the right.

But, ~50% of the time that string you just swapped needs to be moved back, which is why faster merge sorts work from both ends at the same time.

Next if you check to see if the first and last elements are the same before doing each of the recursive call you avoid wasting time calling a function only to quickly exit it. This happens 10000000 in your final test which does add noticeable amounts of time.

Use,

if (pivot_index -1 > start) quick_sort(lines, start, pivot_index - 1);

if (pivot_index + 1 < end) quick_sort(lines, pivot_index + 1, end);

You still want an outer function to do an initial if (start < end) but that only needs to happen once so that function can just call an unsafe version of your code without that outer comparison.

Also, picking a random pivot tends to avoid N^2 worst case results, but it's probably not a big deal with your random data set.

Finally, the hidden problem is QuickSort is comparing strings in ever smaller buckets that are ever closer together,

(Edit: So, AAAAA, AAAAB, AAAAC, AAAAD then AAAAA, AAAAB. So, strcmp needs to step though a lot of A's before looking the useful parts of the strings.)

but with Merge sort you look at the smallest buckets first while they are vary random. Mergsorts final passes do compare a lot of strings close to each other, but it's less of an issue then. One way to make Quick sorts faster for strings is to compare the first digits of the outer strings and if there the same ignore them when doing the inner comparisons, but you have to be careful that all strings have enough digits that your not skipping past the null terminator.

Sublinear Extra Space MergeSort

To merge 2 blocks of size M you only need M extra space:

If you have a block from i to j and a block from j to k you first copy the first block to the additional space so i to j is free to receive the sorted array.

Then the merge is implemented as:

extra = new array[j-i]
arraycopy(arr,i,extra,0,j-i)//copy from arr to extra
p =0
while i<k
    if(arr[j]<extra[p])
       arr[i++]=arr[j++]
    else
       arr[i++]=extra[p++]
    if(j==k)break;
    if(i==j)break;
end while
arraycopy(extra,p,arr,i,extra.length-p)//copy remaining from extra

you can see that i only increments without j up to j-i times so i is never larger than j

Best Answer

Related Solutions

Merge sort versus quick sort performance

Sublinear Extra Space MergeSort

Related Topic