SelectionSort and QuickSort both have worst-case complexity of O(n^2). MergeSort has a worst-case complexity of O(n log n).
The tests in this course may contain any material covered in lecture up until this point, although much stronger emphasis will be placed on material for which you have received graded feedback. This page contains a collection of exercises that you may wish to complete to prepare yourself. We also encourage you to review the lecture notes from topics presented since the previous test.
Make sure to review the sorting algorithms we covered in class and be ready to discuss them.
What is the worst-case time complexity of SelectionSort? QuickSort? MergeSort?
Solution:
SelectionSort and QuickSort both have worst-case complexity of O(n^2).
MergeSort has a worst-case complexity of O(n log n).
Although QuickSort’s worst-case time complexity is worse than MergeSort’s, we use QuickSort as a default in practice. Why?
Solution:
Randomized QuickSort's *expected* worst-case complexity is O(n log n).
Also, QuickSort is an *in-place* sort, meaning that it performs less copying
than MergeSort.
What is an in-place sorting algorithm?
Solution:
An *in-place* sorting algorithm is one in which the contents of the array
are sorted while remaining within the array.
What is wrong with the following QuickSort pseudocode? What would happen if we used this pseudocode to sort an array? How could we fix it?
Function brokenQuickSort(array, left, right) pivot <- partition(array, left, right) brokenQuickSort(array, left, pivot-1) brokenQuickSort(array, pivot+1, right) EndFunction
Solution:
There is no recursive base case in the provided pseudocode. If this code
were executed, it would break the list down into paritions of size 0 and
keep trying to split those size 0 partitions indefinitely. We need to add
a condition that does not try to sort if the array is of size less than two.
Write pseudocode for a partition
function which can be used in the QuickSort algorithm.
Solution:
Any partition function which correctly organizes the array into a "low
section" and a "high section", returning the index of the pivot between them,
is acceptable. Here's the algorithm which was discussed in class:
Function partition(A, start, end)
pivot <- A[end]
left <- start
right <- end-1
While left <= right
If A[left] <= pivot
left = left + 1
ElseIf A[right] > pivot
right = right - 1
Else
swap A[left] and A[right]
swap A[left], A[end]
Return left
EndFunction
Write pseudocode for the merge
function of the MergeSort algorithm.
Solution:
Function merge(a1, s2, a2, s2, result)
Input: sorted arrays a1 of size s1, a2 of size s2
array result
Output: none, but sorted combination stored in result
i <- 0
j <- 0
k <- 0
While i < s1 And j < s2
If a1[i] <= a2[j]
result[k] <- a1[i]
i <- i + 1
k <- k + 1
Else
result[k] <- a2[j]
j <- j + 1
k <- k + 1
EndIf
EndWhile
While i < s1
result[k] <- a1[i]
i <- i + 1
k <- k + 1
EndWhile
While j < s2
result[k] <- a2[j]
j <- j + 1
k <- k + 1
EndWhile
EndFunction
Recall the definition of big-\(O\) notation we used in class: \(f(n)\) is \(O(g(n))\) if there exists \(c>0, n_0\geq 1\) such that for all \(n \geq n_0\), we have \(f(n) \leq c\cdot g(n)\). You should be prepared, given this definition, to prove complexity bounds on functions.
Prove that \(f(n) = 3n^3 + 5n^2 + 8\) is \(O(n^3)\).
Solution:
Prove that \(f(n) = n^2 - 2n +2\) is \(O(n^2)\).
Solution:
Prove that \(f(n) = \frac{n^2+1}{n}\) is \(O(n).\)
Solution:
It's helpful to first simplify \(f(n)\). We have
\begin{align*}
f(n) &= \frac{n^2+1}{n} = \frac{n^2}{n} + \frac{1}{n} = n+\frac{1}{n}\ .
\end{align*}
Note that for any \(n\geq 1\), it holds that \(\frac{1}{n} \leq 1 \leq n\). Choose \(c := 2\) and \(n_0 := 1\). Then for all \(n \geq n_0\), it holds that
\begin{align*}
f(n) &= \frac{n^2+1}{n} \\
&= n + \frac{1}{n} \\
&\leq n+1\\
&\leq n + n \\
&=2 \cdot n \\
&= c \cdot n\ .
\end{align*}
Hence, \(f(n) = O(n)\).
You should be prepared to discuss the data structures we have created in class, the fields in their implementations, and the invariants we rely upon in order to write algorithms for those data structures.
What is an ADT? What is the difference between an ADT and a data structure? Give two examples of each.
Solution:
An *ADT* is a description of how data can be accessed and manipulated. In
other words, an ADT is an interface allowing the caller to store and retrieve
data. A *data structure* is a concrete realization of an ADT; data structures
give specific algorithms to implement the interface that the ADT describes.
Two examples of ADTs are Lists and Stacks. Lists and Stacks are both defined
by a collection of methods that allow a user to store and manipulate data
(e.g. `insertFirst`, `getLast`, etc. for List; `pop`, `push`, etc. for Stack).
Two examples of data structures are linked lists and array lists. Both linked
lists and array lists are concrete implementations of the List ADT; each of
them give implementations for the interface that the List ADT describes.
What is the difference between a List
and a Stack
? When might we use a Stack
instead of a List
?
Solution:
`List` and `Stack` are two different ADTs. The `List` ADT supports a variety
of operations (such as `get` by index) that a `Stack` does not. A `Stack` may
be implemented using a `List` but is not a kind of `List`.
We might use a `Stack` instead of a `List` if the operations provided by
`Stack` are enough for our algorithm. By using a "weaker" data structure, the
algorithm that uses it is easier to verify as correct since it's easier to
understand how the data structure behaves.
What are the big-\(O\) complexities of the following operations? Answer for each of (a) LinkedList
, and (b) ArrayList
. Briefly explain each answer.
Insert to the front of the list.
Insert to the back of the list.
Retrieve the first element of the list.
Retrieve the middle element of the list.
Remove an element from the back of the list.
Solution:
|--------------------------------------------------|
| Operation | ArrayList | LinkedList |
|-------------|-----------------------|------------|
| insertFront | O(n) | O(1) |
| insertLast | O(1) + expandCapacity | O(1) |
| getFirst | O(1) | O(1) |
| get(i) | O(1) | O(n) |
| removeLast | O(1) | O(n) |
|--------------------------------------------------|
Draw a stack diagram of a stack frame in which the following code has been run. Assume the LinkedStack was implemented using a LinkedList.
Stack<int>* stack = new LinkedStack<int>(); stack->push(4); stack->push(5);
Solution: