
Algorithm Design and Analysis

Ms. Malak Abdullah Alsabban
Department of Computer Engineering, College of Computer and Information Science, Majmaah University, Majmaah, KSA

The Sorting Problem

Input: A sequence of n numbers [a1, a2, …, an].

Output: A permutation (reordering) [a'1, a'2, …, a'n] of the input sequence such that a'1 ≤ a'2 ≤ … ≤ a'n.

An instance of the sorting problem:

Input: A sequence of 6 numbers [31, 41, 59, 26, 41, 58].

Expected output for the given instance: the permutation of the input [26, 31, 41, 41, 58, 59].
Insertion Sort

• An efficient algorithm for sorting a small number of elements.
• Example: sorting a hand of playing cards
  – Start with an empty left hand and the cards face down on the table.
  – Remove one card at a time from the table, and insert it into the correct position in the left hand
    • compare it with each of the cards already in the hand, from right to left
  – The cards held in the left hand are always sorted
    • these cards were originally the top cards of the pile on the table
Insertion Sort

To insert 12, we need to make room for it by moving first 36 and then 24.

[Figures: step-by-step insertion of 12 into the sorted prefix, omitted]
Pseudocode

• An informal, high-level description of a program or other algorithm.
• In pseudocode, we employ whatever expressive method is most clear and concise to specify a given algorithm.
• Pseudocode is not concerned with issues of software engineering. Issues of data abstraction, modularity, and error handling are often ignored in order to convey the essence of the algorithm more concisely.

Pseudocode for insertion sort (the procedure takes the array A as a parameter):

INSERTION-SORT(A)
  for j ← 2 to n
      key ← A[j]
      // Insert A[j] into the sorted sequence A[1 . . j-1]
      i ← j - 1
      while i > 0 and A[i] > key
          A[i + 1] ← A[i]
          i ← i - 1
      A[i + 1] ← key

Proving Loop Invariants

• Proving loop invariants works like induction.
• Initialization (base case):
  – The invariant is true prior to the first iteration of the loop.
• Maintenance (inductive step):
  – If the invariant is true before an iteration of the loop, it remains true before the next iteration.
• Termination:
  – When the loop terminates, the invariant gives us a useful property that helps show that the algorithm is correct.
  – The induction stops when the loop terminates.
Loop Invariant for Insertion Sort

• Initialization:
  – Just before the first iteration, j = 2: the subarray A[1 . . j-1] = A[1] (the element originally in A[1]) is sorted.
Loop Invariant for Insertion Sort

• Maintenance:
  – The inner while loop moves A[j-1], A[j-2], A[j-3], and so on, one position to the right until the proper position for key (which holds the value that started out in A[j]) is found.
  – At that point, the value of key is placed into this position.
Loop Invariant for Insertion Sort

Invariant: at the start of each iteration of the for loop, the elements in A[1 . . j-1] are in sorted order.

• Termination:
  – The outer for loop ends when j = n + 1, i.e., j - 1 = n.
  – Substituting n for j - 1 in the loop invariant:
    • the subarray A[1 . . n] consists of the elements originally in A[1 . . n], but in sorted order.
• The entire array is sorted!
Analyzing algorithms

• Predicting the resources that the algorithm requires.
• Examples of resources:
  – memory
  – communication bandwidth
  – computer hardware
  – computational time (most often our primary concern)
Analysis of Insertion Sort

• The time taken by the INSERTION-SORT procedure depends on the input:
  – sorting a thousand numbers takes longer than sorting three numbers.
  – INSERTION-SORT can take different amounts of time to sort two input sequences of the same size, depending on how nearly sorted they already are.
• Input size: the number of items in the input (array size n).
• Running time: the number of primitive operations or “steps” executed.
Analysis of Insertion Sort

INSERTION-SORT(A)                                            cost   times
  for j ← 2 to n                                             c1     n
      key ← A[j]                                             c2     n-1
      // Insert A[j] into the sorted sequence A[1 . . j-1]   0      n-1
      i ← j - 1                                              c4     n-1
      while i > 0 and A[i] > key                             c5     Σ_{j=2}^{n} t_j
          A[i + 1] ← A[i]                                    c6     Σ_{j=2}^{n} (t_j - 1)
          i ← i - 1                                          c7     Σ_{j=2}^{n} (t_j - 1)
      A[i + 1] ← key                                         c8     n-1

t_j: the number of times the while-loop test is executed for that value of j

T(n) = c1·n + c2(n-1) + c4(n-1) + c5·Σ_{j=2}^{n} t_j + c6·Σ_{j=2}^{n} (t_j - 1) + c7·Σ_{j=2}^{n} (t_j - 1) + c8(n-1)
Best Case Analysis

• The array is already sorted: in the test “while i > 0 and A[i] > key”,
  – A[i] ≤ key the first time the while-loop test is run (when i = j - 1)
  – so t_j = 1
• We can express this running time as an + b for constants a and b (a linear function of n).
• T(n) = Θ(n): the order of growth is linear in n.
Worst Case Analysis

• The array is in reverse sorted order: in the test “while i > 0 and A[i] > key”,
  – A[i] > key is always true in the while-loop test
  – key must be compared with all elements to the left of the j-th position ⇒ compare with j - 1 elements ⇒ t_j = j

Using Σ_{j=1}^{n} j = n(n+1)/2, we have Σ_{j=2}^{n} t_j = Σ_{j=2}^{n} j = n(n+1)/2 - 1 and Σ_{j=2}^{n} (t_j - 1) = Σ_{j=2}^{n} (j - 1) = n(n-1)/2.

• T(n) = an² + bn + c, a quadratic function of n.
• T(n) = Θ(n²): the order of growth is quadratic in n.
• We usually have to concentrate on finding only the worst-case running time. WHY?
  – The worst-case running time of an algorithm gives us an upper bound on the running time for any input.
  – For some algorithms, the worst case occurs fairly often.
  – The “average case” is often roughly as bad as the worst case.
Order of growth

• How the time for computation increases when you increase the input size.
• First, we ignore the actual cost of each statement:
  – the constants ci give us more detail than we really need.
• Then we consider only the leading term of a formula (e.g., an²), since lower-order terms are relatively insignificant for large n.
Insertion Sort - Summary

• Advantages
  – Good running time for “almost sorted” arrays: O(n)
• Disadvantages
  – O(n²) running time in the worst and average cases

To print the execution time of your code:

#include <cstdio>
#include <ctime>

int main(void) {
    clock_t tStart = clock();
    /* Do your stuff here */
    printf("Time taken: %.2fs\n", (double)(clock() - tStart) / CLOCKS_PER_SEC);
    return 0;
}
Designing Algorithms

• We can choose from a wide range of algorithm design techniques.
• In insertion sort, we used the incremental approach: having sorted the subarray A[1 . . j-1], we insert the single element A[j] into its proper place, yielding the sorted subarray A[1 . . j].
  – Design approach: incremental
  – Sorts in place: Yes
  – Running time: Θ(n²)
• Merge Sort
  – Design approach: divide and conquer
  – Sorts in place: No
  – Running time: Let’s see!
Divide-and-Conquer Approach

• Divide the problem into a number of subproblems
  – smaller instances of the same problem.
• Conquer the subproblems
  – Solve the subproblems recursively.
  – If the subproblem sizes are small enough, solve the subproblems in a straightforward manner.
• Combine the solutions of the subproblems
  – to obtain the solution for the original problem.
Merge Sort

• Merge sort follows the divide-and-conquer paradigm.
• To sort an array A[p . . r]:
• Divide
  – Divide the n-element sequence to be sorted into two subsequences of n/2 elements each.
• Conquer
  – Sort the subsequences recursively using merge sort.
  – When the size of a sequence is 1, there is nothing more to do.
• Combine
  – Merge the two sorted subsequences.

Merge Sort

Idea: divide a list of elements in half and recursively sort each sublist, then merge.

Example array (p = 1, r = 8): [5 2 4 7 1 3 2 6]

Alg.: MERGE-SORT(A, p, r)
  if p < r                          // check for base case
      then q ← ⌊(p + r)/2⌋          // divide
           MERGE-SORT(A, p, q)      // conquer
           MERGE-SORT(A, q + 1, r)  // conquer
           MERGE(A, p, q, r)        // combine

• Initial call: MERGE-SORT(A, 1, n)
Example – n Power of 2

Divide (q = 4 at the top level):
[5 2 4 7 1 3 2 6]
[5 2 4 7] [1 3 2 6]
[5 2] [4 7] [1 3] [2 6]
[5] [2] [4] [7] [1] [3] [2] [6]
Example – n Power of 2

Conquer and Merge (bottom-up):
[5] [2] [4] [7] [1] [3] [2] [6]
[2 5] [4 7] [1 3] [2 6]
[2 4 5 7] [1 2 3 6]
[1 2 2 3 4 5 6 7]
Example – n Not a Power of 2

Divide (q = 6 at the top level, then q = 3 and q = 9):
[4 7 2 6 1 4 7 3 5 2 6]
[4 7 2 6 1 4] [7 3 5 2 6]
[4 7 2] [6 1 4] [7 3 5] [2 6]
[4 7] [2] [6 1] [4] [7 3] [5] [2] [6]
[4] [7] [6] [1] [7] [3]
Example – n Not a Power of 2

Conquer and Merge (bottom-up):
[4] [7] [6] [1] [7] [3]
[4 7] [2] [1 6] [4] [3 7] [5] [2 6]
[2 4 7] [1 4 6] [3 5 7] [2 6]
[1 2 4 4 6 7] [2 3 5 6 7]
[1 2 2 3 4 4 5 6 6 7 7]
Merging

• Input: Array A and indices p, q, r such that p ≤ q < r
  – Subarrays A[p . . q] and A[q + 1 . . r] are sorted
• Output: One single sorted subarray A[p . . r]

Example (p = 1, q = 4, r = 8): [2 4 5 7 | 1 2 3 6]
Merging

• Idea for merging (example: [2 4 5 7 | 1 2 3 6]):
  – Two piles of sorted cards, A[p . . q] and A[q+1 . . r]
    • Choose the smaller of the two top cards
    • Remove it and place it in the output pile A[p . . r]
  – Repeat the process until one pile is empty
  – Take the remaining input pile and place it face-down onto the output pile
Example: MERGE(A, 9, 12, 16)

[Figures: successive steps of merging A[9 . . 12] with A[13 . . 16], omitted]
Merge - Pseudocode

Example (p = 1, q = 4, r = 8): [2 4 5 7 | 1 2 3 6], so n1 = 4 and n2 = 4, and the temporary arrays are L = [2 4 5 7 ∞] and R = [1 2 3 6 ∞].

MERGE(A, p, q, r)
  n1 ← q - p + 1
  n2 ← r - q
  create arrays L[1 . . n1 + 1] and R[1 . . n2 + 1]
  for i ← 1 to n1
      do L[i] ← A[p + i - 1]
  for j ← 1 to n2
      do R[j] ← A[q + j]
  L[n1 + 1] ← ∞                  // sentinels
  R[n2 + 1] ← ∞
  i ← 1; j ← 1
  for k ← p to r
      do if L[i] ≤ R[j]
            then A[k] ← L[i]
                 i ← i + 1
            else A[k] ← R[j]
                 j ← j + 1
Loop Invariant of the Merge Algorithm

[Figure omitted. Invariant of the final for loop: at the start of each iteration, A[p . . k-1] contains the k - p smallest elements of L and R, in sorted order.]
Running Time of Merge (using the last for loop)

• Initialization (copying into temporary arrays):
  – Θ(n1 + n2) = Θ(n)
• Adding the elements to the final array:
  – n iterations, each taking constant time ⇒ Θ(n)
• Total time for Merge:
  – Θ(n)
Analyzing Divide-and-Conquer Algorithms

• The recurrence is based on the three steps of the paradigm:
  – T(n): running time on a problem of size n
  – Divide the problem into a subproblems, each of size n/b: takes D(n) time
  – Conquer (solve) the subproblems: takes aT(n/b) time
  – Combine the solutions: takes C(n) time

T(n) = Θ(1)                        if n ≤ c
T(n) = aT(n/b) + D(n) + C(n)       otherwise
MERGE-SORT Running Time

• Divide:
  – just computes the middle of the subarray (q as the average of p and r), which takes constant time: D(n) = Θ(1)
• Conquer:
  – recursively solve 2 subproblems, each of size n/2 ⇒ 2T(n/2)
• Combine:
  – MERGE on an n-element subarray takes Θ(n) time ⇒ C(n) = Θ(n)

T(n) = Θ(1)               if n = 1
T(n) = 2T(n/2) + Θ(n)     if n > 1

By the master theorem, T(n) = Θ(n lg n).
Solve the Recurrence

Let us rewrite the recurrence:

T(n) = c                  if n = 1
T(n) = 2T(n/2) + cn       if n > 1

where the constant c represents the time required to solve a problem of size 1, as well as the per-element time of the divide and combine steps.
Recursion tree

How to construct a recursion tree for the recurrence T(n) = 2T(n/2) + cn:
Part (a) shows T(n), which is progressively expanded in (b)–(d) to form the recursion tree. The fully expanded tree in part (d) has lg n + 1 levels (i.e., it has height lg n, as indicated), and each level contributes a total cost of cn. The total cost, therefore, is cn(lg n + 1), which is Θ(n lg n).
Merge Sort - Discussion

• Running time is insensitive to the input.
• Advantages:
  – Guaranteed to run in Θ(n lg n)
• Disadvantage:
  – Requires Θ(n) extra space
Readings

• Chapter 2