m-ary tree

(Redirected from K-way tree)

In graph theory, an m-ary tree (for nonnegative integers m) (also known as n-ary, k-ary or k-way tree) is an arborescence (or, for some authors, an ordered tree)[1][2] in which each node has no more than m children. A binary tree is an important case where m = 2; similarly, a ternary tree is one where m = 3.

An example of a m-ary tree with m=5

Types of m-ary trees

edit
  • A full m-ary tree is an m-ary tree where within each level every node has 0 or m children.
  • A complete m-ary tree[3][4] (or, less commonly, a perfect m-ary tree[5]) is a full m-ary tree in which all leaf nodes are at the same depth.

Properties of m-ary trees

edit
  • For an m-ary tree with height h, the upper bound for the maximum number of leaves is  .
  • The height h of an m-ary tree does not include the root node, with a tree containing only a root node having a height of 0.
  • The height of a tree is equal to the maximum depth D of any node in the tree.
  • The total number of nodes   in a complete m-ary tree is  , while the height h is

  By the definition of Big-Ω, the maximum depth  

  • The height of a complete m-ary tree with n nodes is  .
  • The total number of possible m-ary tree with n nodes is   (which is a Catalan number).[6]

Traversal methods for m-ary trees

edit

Traversing a m-ary tree is very similar to traversing a binary tree. The pre-order traversal goes to parent, left subtree and the right subtree, and for traversing post-order it goes by left subtree, right subtree, and parent node. For traversing in-order, since there are more than two children per node for m > 2, one must define the notion of left and right subtrees. One common method to establish left/right subtrees is to divide the list of children nodes into two groups. By defining an order on the m children of a node, the first   nodes would constitute the left subtree and   nodes would constitute the right subtree.

Convert a m-ary tree to binary tree

edit
 
An example of conversion of a m-ary tree to a binary tree.m=6

Using an array for representing a m-ary tree is inefficient, because most of the nodes in practical applications contain less than m children. As a result, this fact leads to a sparse array with large unused space in the memory. Converting an arbitrary m-ary tree to a binary tree would only increase the height of the tree by a constant factor and would not affect the overall worst-case time complexity. In other words,   since  .

First, we link all the immediate children nodes of a given parent node together in order to form a link list. Then, we keep the link from the parent to the first (i.e., the leftmost) child and remove all the other links to the rest of the children. We repeat this process for all the children (if they have any children) until we have processed all the internal nodes and rotate the tree by 45 degrees clockwise. The tree obtained is the desired binary tree obtained from the given m-ary tree.

Methods for storing m-ary trees

edit

Arrays

edit
 
An example of storing a m-ary tree with m=3 in an array

m-ary trees can also be stored in breadth-first order as an implicit data structure in arrays, and if the tree is a complete m-ary tree, this method wastes no space. In this compact arrangement, if a node has an index i, its c-th child in range {1,…,m} is found at index  , while its parent (if any) is found at index   (assuming the root has index zero, meaning a 0-based array). This method benefits from more compact storage and better locality of reference, particularly during a preorder traversal. The space complexity of this method is  .

Pointer-based

edit

Each node would have an internal array for storing pointers to each of its   children:

 
Pointer-based implementation of m-ary tree where m=4.

Compared to array-based implementation, this implementation method has superior space complexity of  .

Enumeration of m-ary trees

edit

Listing all possible m-ary trees is useful in many disciplines as a way of checking hypotheses or theories. Proper representation of m-ary tree objects can greatly simplify the generation process. One can construct a bit sequence representation using the depth-first search of a m-ary tree with n nodes indicating the presence of a node at a given index using binary values. For example, the bit sequence x=1110000100010001000 is representing a 3-ary tree with n=6 nodes as shown below.

 
3-ary tree with bit sequence of 1110000100010001000 and Simple Zero Sequence of 004433

The problem with this representation is that listing all bit strings in lexicographic order would mean two successive strings might represent two trees that are lexicographically very different. Therefore, enumeration over binary strings would not necessarily result in an ordered generation of all m-ary trees.[7] A better representation is based on an integer string that indicates the number of zeroes between successive ones, known as Simple Zero Sequence.   is a Simple Zero Sequence corresponding to the bit sequence   where j is the number of zeroes needed at the tail end of the sequence to make the string have the appropriate length. For example,   is the simple zero sequence representation of the above figure. A more compact representation of 00433 is  , which is called zero sequence, which duplicate bases cannot be adjacent. This new representation allows to construct a next valid sequence in  . A simple zero sequence is valid if  That is to say that number of zeros in the bit sequence of a m-ary tree cannot exceed the total number of null pointers (i.e., pointers without any child node attached to them). This summation is putting restriction on   nodes so that there is room for adding the   without creating an invalid structure (i.e. having an available null pointer to attached the last node to it).

The table below shows the list of all valid simple zero sequences of all 3-ary trees with 4 nodes:

222 200 111 033 013
221 132 110 032 012
220 131 105 031 011
213 130 104 030 010
212 123 103 024 006
211 122 102 023 005
210 121 101 022 004
204 120 100 021 003
203 114 042 020 002
202 113 041 015 001
201 112 040 014 000

Starting from the bottom right of the table (i.e., "000"), there is a backbone template that governs the generation of the possible ordered trees starting from "000" to "006". The backbone template for this group ("00X") is depicted below, where an additional node is added in the positions labeled "x".

 

Once one has exhausted all possible positions in the backbone template, a new template will be constructed by shifting the 3rd node one position to the right as depicted below, and the same enumeration would occur until all possible positions labeled "X" is exhausted.

 

Going back to the table of enumeration of all m-ary trees, where   and  , we can easily observe the apparent jump from "006" to "010" can be explained trivially in an algorithmic fashion as depicted below:

 

The pseudocode for this enumeration is given below:[7]

Procedure NEXT(s1, s2, …, sn−1)
    if si = 0 for all i then
        finished
    else
        i ← max {i | si > 0}
        sisi − 1
        if i < n − 1 then
            si ← (i + 1) ⋅ (m − 1) − sum(sj)
        end if
        for ji + 2, i + 3, …, n − 1
            sjk − 1
    end if
end

Loopless enumeration

edit

A generation algorithm that takes   worst-case time are called loopless since the time complexity cannot involve a loop or recursion. Loopless enumeration of m-ary trees is said to be loopless if after initialization, it generates successive tree objects in  . For a given a m-ary tree T with   being one of its nodes and   its  -th child, a left-t rotation at   is done by making   the root node, and making   and all of its subtrees a child of  , additionally we assign the   left most children of   to   and the right most child of   stays attached to it while   is promoted to root, as shown below:

 
Convert an m-ary tree to left-tree
    for i = 1...n:
         for t = 2...m:
             while t child of node at depth i ≠ 1:
                L-t rotation at nodes at depth i
              end while
          end for
     end for

A right-t rotation at d is the inverse of this operation. The left chain of T is a sequence of   nodes such that   is the root and all nodes except   have one child connected to their left most (i.e.,  ) pointer. Any m-ary tree can be transformed to a left-chain tree using sequence of finite left-t rotations for t from 2 to m. Specifically, this can be done by performing left-t rotations on each node   until all of its   sub-tree become null at each depth. Then, the sequence of number of left-t rotations performed at depth i denoted by   defines a codeword of a m-ary tree that can be recovered by performing the same sequence of right-t rotations.

Let the   tuple of   represent the number of L-2 rotations, L-3 rotations, ..., L-m rotations that has occurred at the root (i.e., i=1).Then,   is the number of L-t rotations required at depth i.

Capturing counts of left-rotations at each depth is a way of encoding an m-ary tree. Thus, enumerating all possible legal encoding would helps us to generate all the m-ary trees for a given m and n. But, not all   sequences of m non-negative integers represent a valid m-ary tree. A sequence of   non-negative integers is a valid representation of a m-ary tree if and only if[8]  

Lexicographically smallest code-word representation of a m-ary with n nodes is all zeros and the largest is n−1 ones followed by m−1 zero on its right.

Initialization
    c[i] to zero for all i from 1 to n⋅(k − 1)
    p[i] set to n − 1 for i from 1 to n
    sum ← 0
    jm − 1

Termination Condition
    Terminate when c[1] = n − 1

Procedure NEXT[8]
    sumsum + 1 − c[j + 1]
    c[j] ← c[j] + 1
    if p[q[j]] > p[q[j + 1]] + 1 then
        p[q[j]] ← p[q[j + 1]] + 1
    end if
    p[q[j + c[j]]] ← p[q[j]]
    c[j + 1] ← 0
    if sum = p[q[j]] then
        jj − 1
    else
        p[n] ← sum
        jm − 1
    end if
end

Application

edit

One of the applications of m-ary tree is creating a dictionary for validation of acceptable strings. In order to do that, let m be equal to the number of valid alphabets (e.g., number of letters of the English alphabet) with the root of the tree representing the starting point. Similarly, each of the children can have up to m children representing the next possible character in the string. Thus, characters along the paths can represent valid keys by marking the end character of the keys as "terminal node". For example, in the example below "at" and "and" are valid key strings with "t" and "d" marked as terminal nodes. Terminal nodes can store extra information to be associated with a given key. There are similar ways to building such a dictionary using B-tree, Octree and/or trie.

 

See also

edit

References

edit
  1. ^ Li, Liwu (1998). Java: Data Structures and Programming. Section 8.1.2.1, Multi-Ary Trees: Springer. doi:10.1007/978-3-642-95851-9. ISBN 978-3-642-95853-3. S2CID 708416. Retrieved 20 July 2023.{{cite book}}: CS1 maint: location (link)
  2. ^ Stanley, Richard P. (2011). Enumerative Combinatorics, Volume I. Appendix: Graph Theory Terminology: Cambridge University Press. p. 573. ISBN 978-1-107-60262-5. Retrieved 20 July 2023.
  3. ^ Gross, Jonathan L.; Yellen, Jay; Anderson, Mark (2018). Graph Theory and Its Applications (3rd ed.). Section 3.2, Rooted Trees, Ordered Trees, and Binary Trees: CRC Press. p. 132. ISBN 978-1-4822-4948-4.{{cite book}}: CS1 maint: location (link)
  4. ^ Cormen, Thomas H.; Leiserson, Charles E.; Rivest, Ronald L.; Stein, Clifford (2022). Introduction to Algorithms (4th ed.). Section B.5.3, Binary and positional trees: MIT Press. p. 1174. ISBN 9780262046305. Retrieved 20 July 2023.{{cite book}}: CS1 maint: location (link)
  5. ^ Schumacher, Patrik (2012). "Excursion: Network Theory". The Autopoiesis of Architecture: A New Agenda for Architecture, Vol. II. Wiley. p. 103. ISBN 978-0-470-66616-6.
  6. ^ Graham, Ronald L.; Knuth, Donald E.; Patashnik, Oren (1994). Concrete Mathematics: A Foundation for Computer Science (2nd ed.). AIP.
  7. ^ a b Baronaigien, Dominique Roelants van (2000). "Loop Free Generation of K-ary trees". Journal of Algorithms. 35 (1): 100–107. doi:10.1006/jagm.1999.1073.
  8. ^ a b Korsh, James F (1994). "Loopless generation of k-ary tree sequences". Information Processing Letters. 52 (5). Elsevier: 243–247. doi:10.1016/0020-0190(94)00149-9.
  • Storer, James A. (2001). An Introduction to Data Structures and Algorithms. Birkhäuser Boston. ISBN 3-7643-4253-6.
edit