Implementing Merge Sort in Python

Introduction

Sorting is a fundamental operation in computer science and plays a crucial role in various applications. One of the most efficient and popular sorting algorithms is Merge Sort. Merge Sort follows the “divide and conquer” paradigm, dividing the unsorted list into smaller sublists, sorting them individually, and then merging them back together to obtain the final sorted result. In this article, we will explore the inner workings of Merge Sort and implement it in Python.

Understanding Merge Sort

Merge Sort is based on the concept of recursion and is known for its stable sorting behavior, meaning that it preserves the relative order of elements with equal values. The algorithm consists of two main steps: splitting the list and merging the sublists.

Steps to Implement Merge Sort in Python

Step 1: Splitting the List:

The first step in Merge Sort is to divide the unsorted list into smaller sublists recursively until each sublist contains only one element. This process is known as the “splitting” phase and is crucial to the divide and conquer approach. By breaking down the list into smaller parts, we can easily sort them and then merge them back together.

Step 2: Merging the Sublists:

Once we have individual elements in each sublist, the merging phase begins. In this step, we merge the sublists together, comparing elements from each sublist and arranging them in the correct order. This process is repeated recursively until the entire list is sorted.

Visual Representation of How Merge Sort works

describing visual representation of merge sort and how it works, step by step with an example.

Implementation of Merge Sort in Python:


def merge_sort(arr):
    if len(arr) <= 1:
        return arr
    
    mid = len(arr) // 2
    left_half = arr[:mid]
    right_half = arr[mid:]
    
    left_half = merge_sort(left_half)
    right_half = merge_sort(right_half)
    
    return merge(left_half, right_half)


def merge(left, right):
    merged = []
    i = j = 0
    
    while i < len(left) and j < len(right):
        if left[i] < right[j]:
            merged.append(left[i])
            i += 1
        else:
            merged.append(right[j])
            j += 1
    
    while i < len(left):
        merged.append(left[i])
        i += 1
    
    while j < len(right):
        merged.append(right[j])
        j += 1
    
    return merged

# Testing the merge_sort function
unsorted_list = [6, 2, 8, 3, 1, 9, 5, 7, 4]
sorted_list = merge_sort(unsorted_list)
print(sorted_list)

Output


[1, 2, 3, 4, 5, 6, 7, 8, 9]

Explanation:

The `merge_sort` function recursively splits the input array into two halves, `left_half` and `right_half`, until each sublist contains only one element.
It then merges the sublists by calling the `merge` function, which compares and combines the elements in the correct order.
The merge function uses two pointers, `i` and `j`, to iterate through the left and right sublists.
The elements from the sublists are merged into the `merged` list by comparing them. The smaller element is appended to `merged`, and the respective pointer is incremented.
Finally, the merged list is returned as the sorted result.

Complexity Analysis of Quick Sort

Time Complexity

Merge Sort has a time complexity of O(n log n) in all cases. Here’s how the time complexity is derived:

1. Splitting Phase:

The splitting phase divides the input array into two halves repeatedly until each sublist contains only one element.
This process takes O(log n) time because the input size is halved in each recursion.

2. Merging Phase:

The merging phase combines the sublists together, comparing and arranging the elements in the correct order.
In the worst case, when merging two sublists of size n/2, it takes O(n) time.
Since there are log n levels in the recursion tree (due to the splitting phase), the total merging time is O(n log n).

Hence, the overall time complexity of Merge Sort is O(n log n). This time complexity remains consistent regardless of the input distribution or size.

Space Complexity

Merge Sort has a space complexity of O(n) because it requires additional space for the merging process. Here’s how the space complexity is determined:

1. Splitting Phase:

During the splitting phase, no additional space is required as it only involves dividing the original array into smaller sublists. The space complexity for this phase is O(1).

2. Merging Phase:

In the merging phase, an auxiliary array is used to merge the sublists together.
The size of the auxiliary array is equal to the size of the original array.
Therefore, the space complexity for the merging phase is O(n).

3. Overall Space Complexity:

As the splitting phase does not require any additional space, and the merging phase requires O(n) space, the overall space complexity of Merge Sort is O(n).

It’s important to note that the space complexity of Merge Sort can be a drawback when sorting large datasets. However, compared to other sorting algorithms with the same time complexity, Merge Sort’s stable and predictable behavior makes it a preferred choice in many scenarios.

Conclusion

Merge Sort is a powerful sorting algorithm that guarantees a time complexity of O(n log n), making it efficient for large datasets. By breaking down the sorting process into smaller sub-problems and merging the results, Merge Sort provides a stable and reliable solution for sorting arrays or lists in Python. Understanding the inner workings and implementing Merge Sort can greatly enhance your programming skills and efficiency in solving sorting-related problems.

Implementing Merge Sort in Python

Introduction

Understanding Merge Sort

Steps to Implement Merge Sort in Python

Complexity Analysis of Quick Sort

Time Complexity

Space Complexity

Conclusion

Subhankar Rakshit

Python Data Classes (With Examples)

Python’s Global, Local, and Nonlocal Variables

Python Program to Find the GCD using Euclidean Algorithm

Python Program to Generate Pascal’s Triangle

Python Program to Find the Nth Prime Number

Introduction

Understanding Merge Sort

Steps to Implement Merge Sort in Python

Complexity Analysis of Quick Sort

Time Complexity

Space Complexity

Conclusion

Subhankar Rakshit

You may also like

Trending now