Draft:Powersort

Powersort
Class	Sorting Algorithm
Data structure	Array
Worst-case performance	O(nlogn)
Worst-case space complexity	O(n)

Review waiting, please be patient.

This may take 6 weeks or more, since drafts are reviewed in no specific order. There are 990 pending submissions waiting for review.

If the submission is accepted, then this page will be moved into the article space.
If the submission is declined, then the reason will be posted here.
In the meantime, you can continue to improve this submission by editing normally.

Where to get help

If you need help editing or submitting your draft, please ask us a question at the AfC Help Desk or get live help from experienced editors. These venues are only for help with editing and the submission process, not to get reviews.
If you need feedback on your draft, or if the review is taking a lot of time, you can try asking for help on the talk page of a relevant WikiProject. Some WikiProjects are more active than others so a speedy reply is not guaranteed.

How to improve a draft

Wikipedia:Contributing to Wikipedia – a basic overview on how to edit Wikipedia.
Help:Wikitext – how to use the markup
Help:Referencing for beginners – how to include references
Wikipedia:Article development – how to develop your article
Wikipedia:Writing better articles – how to improve your article
Wikipedia:Verifiability – make sure your article includes reliable third-party sources

You can also browse Wikipedia:Featured articles and Wikipedia:Good articles to find examples of Wikipedia's best writing on topics similar to your proposed article.

Improving your odds of a speedy review

To improve your odds of a faster review, tag your draft with relevant WikiProject tags using the button below. This will let reviewers know a new draft has been submitted in their area of interest. For instance, if you wrote about a female astronomer, you would want to add the Biography, Astronomy, and Women scientists tags.

Add tags to your draft

Editor resources

Easy tools: Citation bot (help) | Advanced: Fix bare URLs

Reviewer tools

Instructions · What links here · Powersort (talk: + · bio) · (log) · Copyvios report · reFill · Citation Bot · (Search: Google, Wikipedia) · Submitted 19 days ago by Sebawild (talk: D · +) · Last edited 4 days ago by Citation bot

The following Wikipedia contributor has declared a personal or professional connection to the subject of this page. Relevant policies and guidelines may include conflict of interest, autobiography, and neutral point of view.

sebawild (talk · contribs) This user has declared a connection. (I am one of the authors of the academic articles about Powersort. I have otherwise no financial or other conflict of interest.)

Powersort

Powersort is an adaptive sorting algorithm designed to optimally exploit existing order in the input data with minimal overhead. Since version 3.11, Powersort is the default list sorting algorithm in CPython^[1] and is also used in PyPy and AssemblyScript. Powersort belongs to the family of merge sort algorithms. More specifically, Powersort builds on Timsort; it is a drop-in replacement for Timsort's suboptimal heuristic merge policy. Unlike the latter, it is derived from first principles (see connection to nearly optimal binary search trees) and offers strong performance guarantees.

Like Timsort, Powersort is a stable sort and comparison based. This property is essential for many applications^[2]. Powersort was proposed by J. Ian Munro and Sebastian Wild^[3].

Algorithm Overview

A merge policy is an order to do binary merges (one pair of adjacent runs in each step) that eventually produces a big sorted list. It can always be represented by a binary tree. The merge cost of an execution is the sum of all produced runs.

Powersort is a stable mergesort variant that adapts to existing runs in the input data, i.e., ranges in the input that are already in order. It maintains a stack of runs yet to be merged and alternates between finding the next run and merging adjacent runs near the top of the run stack.

This non-recursive mode of operation is particularly cache-friendly.

Powersort in action. The “power” of a run boundary corresponds to how deep the blue line can sink down, connecting the midpoints of the two runs, before it “hits” a node of the imaginary, green perfectly balanced binary search tree. Illustration from PyCon US 2023 talk by Sebastian Wild^[4].

Like Timsort, it enforces a minimal run length by “filling up” short runs using insertion sort up to a chosen minimal run length. Each merge step combines two adjacent runs into a single one using a “galloping strategy”: exponential search is used to find the prefix of one run that precedes the minimum in the other run. This can save comparisons compared to a traditional linear merge.

Powersort improves Timsort in terms of the merge policy, i.e., the rule(s) that decides which runs on the run stack are merged before proceeding. Timsort's original policy used a suboptimal heuristic based solely on the lengths of runs; Powersort replaces this with a rule simulating Mehlhorn's algorithm for computing nearly optimal binary search trees with low overhead^[3], thereby achieving optimal adaptivity up to an additive linear term.

The pseudocode below shows a simplified Powersort implementation.

algorithm PowerSort(A[0..n))
    S := stack of runs  // capacity ⌈lg(n)⌉ + 1
    b₁ := 0;  e₁ := FirstRunOf(A[b₁..n)) 
    // A[s₁..e₁) is leftmost run
    while e₁ < n
        b₂ := e₁ + 1;  e₂ := FirstRunOf(A[b₂..n)) 
        // A[s₂..e₂) next run
        P := NodePower(n, b₁, e₁, b₂, e₂) 
        while S.top().power > P
            (b₁, e₁) := Merge(S.pop(), A[b₁..e₁))
        end while
        S.push((A[b₁, e₁), P));
        b₁ := b₂;  e₁ := e₂
    end while  // Now A[b₁..e₁) is the rightmost run
    while ¬S.empty()
        (b₁, e₁) := Merge(S.pop(), A[b₁..e₁))
    end while

algorithm NodePower(n, b₁, e₁, b₂, e₂, n)
    n₁ := e₁ − b₁;  n₂ := e₂ − b₂;  
    a := (b₁ + n₁/2)/n;  b := (b₂ + n₂/2)/n
    p := 0
    while ⌊a · 2^p⌋ == ⌊b · 2^p⌋
        p := p + 1 
    end while
    return p

Adoption

The implementation of Powersort in CPython began with version 3.11, replacing the older Timsort algorithm. The change was motivated by Powersort's superior performance and stability. The core implementation can be found in the CPython source code within the listobject.c file, where the list sorting functions are defined. The detailed merge policies and algorithm are described in listsort.txt.^[5]^[6] The transition to Powersort involved addressing issue #78742 in the CPython repository.^[7]

The PyPy project, known for its high-performance Just-In-Time (JIT) compiler for Python, also integrated Powersort. The relevant commit, identified as `ab2269f3c0ca0265b4ff462f1bda29442182de11`, details the inclusion of Powersort into PyPy's list sorting functions.^[8]

In AssemblyScript, Powersort was integrated to enhance the performance of WebAssembly applications. The relevant pull request, #1904, and the implementation details can be found in the sort.ts file within the AssemblyScript standard library.^[9]^[10] These implementations across different platforms highlight the adaptability and efficiency of Powersort in various programming environments.

Comparison with Timsort

Timsort's original merge policy caused several problems before being replaced by Powersort.

First, as accidentally observed during the formal verification of Timsort, Tim Peters's original formulation did not guarantee the desired height bound for the run stack, leaving both CPython and OpenJDK vulnerable to a stack overflow. This was eventually fixed by adding a forth rule to Timsort, but required two major patches of OpenJDK^[11]. While eventually successful, the correctness proof of Timsort's stack height and the running-time analysis are very complicated.

Further, it was discovered that Timsort's merge policy also has a performance blind spot: specific patterns of run lengths cause Timsort to repeatedly perform very unbalanced merges, resulting in asymptotically 50% overhead^[12].

Powersort simultaneously removes all of these limitations. Despite its advanced theoretical foundation, it's analysis is much easier^[13] and it provably never uses more than $n{\mathcal {H}}+O(n)$ comparisons, where ${\mathcal {H}}=\sum _{i=1}^{r}{\frac {\ell _{i}}{n}}\log _{2}\left({\frac {n}{\ell _{i}}}\right)$ , for $\ell _{1},\ldots ,\ell _{r}$ the lengths of the runs in the input.

Powersort has been shown to outperform Timsort by up to 30% on certain types of input data that contain long runs of sorted elements.^[3] Powersort has further been extended to multiway merging, something that was not possible with Timsort.

Multiway Powersort

Example merge tree for Powersort (top) and 4- way Powersort (bottom) for an input of size n = 16.

Multiway Powersort^[13] is an extension of Powersort that generalizes the binary merging process to k-way merging. This approach can reduce the number of merge operations and hence reduces the amount of memory transfer. It was proposed by William Cawley Gelling, Markus E. Nebel, Benjamin Smith, and Sebastian Wild in 2023.

Multiway Powersort retains the stability and adaptiveness of the original Powersort algorithm^[13], and is just as easy to analyze.

The key differences to normal Powersort are:

The computation of powers uses the basis k instead of 2.
When merging runs, we have to pop all runs from the stack that are tied in power with the topmost run.

Details are shown in the pseudocode below.

algorithms k-wayPowerSort(A[0..n))
    S := empty stack  // capacity (k − 1)⌈log_k(n) + 1⌉
    b₁ := 0;  e₁ = FirstRunOf(A[b_1..n))
    while e₁ < n
        b₂ := e₁;  e₂ := FirstRunOf(A[b_2..n))
        P := Power_k(n, b₁, e₁, b₂, e₂)
        while S.top().power > P
            P':= S.top().power
            L := empty list;  L.append(S.pop())
            while S.top().power == P′
                L.append(S.pop())
            end while
            // merge runs in L with A[b₁..e₁)
            (b₁, e₁) := Merge(L, A[b₁..e₁))
        end while
        S.push((A[b₁, e₁), P))
        b₁ := b₂;  e₁ := e₂
    end while // Now A[b₁..e₁) is the rightmost run
    while ¬S.empty()
        // pop (up to) k − 1 runs, merge with A[b₁..e₁)
        (b₁, e₁) := Merge(S.pop(k − 1), A[b₁..e₁))
    end while

algorithm Power_k(n, b₁, e₁, b₂, e₂)
    n₁ := e₁ − b₁;  n₂ := e₂ − b₂; p := 0
    a := (b₁ + n₁/2)/n;  b := (b₂ + n₂/2)/n
    while ⌊a · k^p⌋ == ⌊b · k^p⌋
        p := p + 1
    end while
    return p

Further Resources

Practical hands-on experience with Powersort can be gained by playing Powersort's Pursuit.
A more graphical approach to explaining Powersort is used in Sebastian Wild's PyCon US 2023 talk.

References

^ James, Mike. "Python Now Uses Powersort". I Programmer. Retrieved 2024-06-21.
^ Peters, Tim (2002). "Timsort". Python Mailing List.
^ ^a ^b ^c Munro, J. Ian; Wild, Sebastian (2018). "Nearly-Optimal Mergesorts: Fast, Practical Sorting Methods That Optimally Adapt to Existing Runs". 26th Annual European Symposium on Algorithms (ESA): 63:1–63:16. arXiv:1805.04154. doi:10.4230/lipics.esa.2018.63.
^ Wild, Sebastian (19 June 2023). "Quicksort, Timsort, Powersort (PyCon US 2023 talk)". YouTube. Retrieved 2024-10-17.
^ "CPython Implementation Details". GitHub. Retrieved 2024-07-30.
^ "CPython List Sort". GitHub. Retrieved 2024-07-30.
^ "Issue #78742: Integrate Powersort". GitHub. Retrieved 2024-07-30.
^ "PyPy Implementation of Powersort". Heptapod. Retrieved 2024-07-30.
^ "AssemblyScript Pull Request #1904". GitHub. Retrieved 2024-07-30.
^ "AssemblyScript Sort Implementation". GitHub. Retrieved 2024-07-30.
^ Auger, Nicolas; Jugé, Vincent; Nicaud, Cyril; Pivoteau, Carine (2018). [DROPS]. doi:10.4230/LIPIcs.ESA.2018.4. ISBN 9783959770811. S2CID 44091254. Retrieved 1 September 2018.
^ Buss, Sam; Knop, Alexander (2019). "Strategies for Stable Merge Sorting". "Strategies for Stable Merge Sorting.". pp. 1272–1290. arXiv:1801.04641. doi:10.1137/1.9781611975482.78. ISBN 978-1-61197-548-2. {{cite book}}: Unknown parameter |DUPLICATE_title= ignored (help)
^ ^a ^b ^c Gelling, William Cawley; Nebel, Markus E.; Smith, Benjamin; Wild, Sebastian (2023). "Multiway Powersort". Symposium on Algorithm Engineering and Experiments (ALENEX 2023): 190–200. arXiv:2209.06909. doi:10.1137/1.9781611977561.ch16. ISBN 978-1-61197-756-1.

[powersort-1] James, Mike. "Python Now Uses Powersort". I Programmer. Retrieved 2024-06-21.

[Timsort-2] Peters, Tim (2002). "Timsort". Python Mailing List.

[MunroWild2018-3] Munro, J. Ian; Wild, Sebastian (2018). "Nearly-Optimal Mergesorts: Fast, Practical Sorting Methods That Optimally Adapt to Existing Runs". 26th Annual European Symposium on Algorithms (ESA): 63:1–63:16. arXiv:1805.04154. doi:10.4230/lipics.esa.2018.63.

[powersort-pycon-4] Wild, Sebastian (19 June 2023). "Quicksort, Timsort, Powersort (PyCon US 2023 talk)". YouTube. Retrieved 2024-10-17.

[5] "CPython Implementation Details". GitHub. Retrieved 2024-07-30.

[6] "CPython List Sort". GitHub. Retrieved 2024-07-30.

[7] "Issue #78742: Integrate Powersort". GitHub. Retrieved 2024-07-30.

[8] "PyPy Implementation of Powersort". Heptapod. Retrieved 2024-07-30.

[9] "AssemblyScript Pull Request #1904". GitHub. Retrieved 2024-07-30.

[10] "AssemblyScript Sort Implementation". GitHub. Retrieved 2024-07-30.

[11] Auger, Nicolas; Jugé, Vincent; Nicaud, Cyril; Pivoteau, Carine (2018). [DROPS]. doi:10.4230/LIPIcs.ESA.2018.4. ISBN 9783959770811. S2CID 44091254. Retrieved 1 September 2018.

[BussKnop2019-12] Buss, Sam; Knop, Alexander (2019). "Strategies for Stable Merge Sorting". "Strategies for Stable Merge Sorting.". pp. 1272–1290. arXiv:1801.04641. doi:10.1137/1.9781611975482.78. ISBN 978-1-61197-548-2. {{cite book}}: Unknown parameter |DUPLICATE_title= ignored (help)

[MultiwayPowersort-13] Gelling, William Cawley; Nebel, Markus E.; Smith, Benjamin; Wild, Sebastian (2023). "Multiway Powersort". Symposium on Algorithm Engineering and Experiments (ALENEX 2023): 190–200. arXiv:2209.06909. doi:10.1137/1.9781611977561.ch16. ISBN 978-1-61197-756-1.

[1]

[2]

[3]

[4]

[5]

[6]

[7]

[8]

[9]

[10]

[11]

[12]

[13]

v t e Sorting algorithms
Theory	Computational complexity theory Big O notation Total order Lists Inplacement Stability Comparison sort Adaptive sort Sorting network Integer sorting X + Y sorting Transdichotomous model Quantum sort
Exchange sorts	Bubble sort Cocktail shaker sort Odd–even sort Comb sort Gnome sort Proportion extend sort Quicksort
Selection sorts	Selection sort Heapsort Smoothsort Cartesian tree sort Tournament sort Cycle sort Weak-heap sort
Insertion sorts	Insertion sort Shellsort Splaysort Tree sort Library sort Patience sorting
Merge sorts	Merge sort Cascade merge sort Oscillating merge sort Polyphase merge sort
Distribution sorts	American flag sort Bead sort Bucket sort Burstsort Counting sort Interpolation sort Pigeonhole sort Proxmap sort Radix sort Flashsort
Concurrent sorts	Bitonic sorter Batcher odd–even mergesort Pairwise sorting network Samplesort
Hybrid sorts	Block merge sort Kirkpatrick–Reisch sort Timsort Introsort Spreadsort Merge-insertion sort
Other	Topological sorting Pre-topological order Pancake sorting Spaghetti sort
Impractical sorts	Stooge sort Slowsort Bogosort