mlpack

RectangleTree

The RectangleTree class represents a generic multidimensional space partitioning tree. It is heavily templatized to control splitting behavior and other behaviors, and is the actual class underlying trees such as the RTree. In general, the RectangleTree class is not meant to be used directly, and instead one of the numerous variants should be used instead:

The RectangleTree and its variants are capable of inserting points and deleting them. This is different from BinarySpaceTree and other mlpack tree types, where the tree is built entirely in batch at construction time. However, this capability comes with a runtime cost, and so in general the use of RectangleTree with mlpack algorithms will be slower than the batch-construction treesโ€”but, if insert/delete functionality is required, RectangleTree is the only choice.


For users who want to use RectangleTree directly or with custom behavior, the full class is still detailed in the subsections below. RectangleTree supports the TreeType API and can be used with mlpackโ€™s tree-based algorithms, although using custom behavior may require a template typedef.

๐Ÿ”— See also

๐Ÿ”— Template parameters

The RectangleTree class takes six template parameters. The first three of these are required by the TreeType API (see also this more detailed section). The full signature of the class is:

template<typename DistanceType,
         typename StatisticType,
         typename MatType,
         typename SplitType,
         typename DescentType,
         template<typename> class AuxiliaryInformationType>
class RectangleTree;

Note that the TreeType API requires trees to have only three template parameters. In order to use a RectangleTree with its six template parameters with an mlpack algorithm that needs a TreeType, it is easiest to define a template typedef:

template<typename DistanceType, typename StatisticType, typename MatType>
using CustomTree = Rectangle<DistanceType, StatisticType, MatType,
    CustomSplitType, CustomDescentType, CustomAuxiliaryInformationType>

Here, CustomSplitType, CustomDescentType, and CustomAuxiliaryInformationType are the desired splitting and descent strategies and auxiliary information type. This is the way that all RectangleTree variants (such as RTree) are defined.

๐Ÿ”— Constructors

RectangleTrees are constructed by inserting points in a dataset sequentially. The dataset is not permuted during the construction process.






Notes:


๐Ÿ”— Constructor parameters:

name type description default
data MatType Column-major matrix to build the tree on. Pass with std::move(data) to avoid copying the matrix. (N/A)
maxLeafSize size_t Maximum number of points to store in each leaf. 20
minLeafSize size_t Minimum number of points to store in each leaf. 8
maxNumChildren size_t Maximum number of children allowed in each non-leaf node. 5
minNumChildren size_t Minimum number of children in each non-leaf node. 2
dimensionality size_t Dimensionality of points to be held in the tree. (N/A)
ย  ย  ย  ย 
x arma::vec Column vector: point to insert into tree. Should have type matching the column vector type associated with MatType, and must have node.Dataset().n_rows elements. (N/A)
i size_t Index of point in node.Dataset() to delete from node. (N/A)

๐Ÿ”— Basic tree properties

Once a RectangleTree object is constructed, various properties of the tree can be accessed or inspected. Many of these functions are required by the TreeType API.


๐Ÿ”— Accessing members of a tree

See also the developer documentation for basic tree functionality in mlpack.


๐Ÿ”— Accessing data held in a tree


๐Ÿ”— Accessing computed bound quantities of a tree

The following quantities are cached for each node in a RectangleTree, and so accessing them does not require any computation. In the documentation below, ElemType is the element type of the given MatType; e.g., if MatType is arma::mat, then ElemType is double.

Note: for more details on each bound quantity, see the developer documentation on bound quantities for trees.


๐Ÿ”— Other functionality

๐Ÿ”— Bounding distances with the tree

The primary use of trees in mlpack is bounding distances to points or other tree nodes. The following functions can be used for these tasks.


๐Ÿ”— Tree traversals

Like every mlpack tree, the RectangleTree class provides a single-tree and dual-tree traversal that can be paired with a RuleType class to implement a single-tree or dual-tree algorithm.

๐Ÿ”— StatisticType

Each node in a RectangleTree holds an instance of the StatisticType class. This class can be used to store additional bounding information or other cached quantities that a RectangleTree does not already compute.

mlpack provides a few existing StatisticType classes, and a custom StatisticType can also be easily implemented:

Note: this section is still under constructionโ€”not all statistic types are documented yet.

๐Ÿ”— EmptyStatistic

The EmptyStatistic class is an empty placeholder class that is used as the default StatisticType template parameter for mlpack trees.

The class does not hold any members and provides no functionality. See the implementation.

๐Ÿ”— Custom StatisticTypes

A custom StatisticType is trivial to implement. Only a default constructor and a constructor taking a RectangleTree is necessary.

class CustomStatistic
{
 public:
  // Default constructor required by the StatisticType policy.
  CustomStatistic();

  // Construct a CustomStatistic for the given fully-constructed
  // `RectangleTree` node.  Here we have templatized the tree type to make it
  // easy to handle any type of `RectangleTree`.
  template<typename TreeType>
  StatisticType(TreeType& node);

  //
  // Adding any additional precomputed bound quantities can be done; these
  // quantities should be computed in the constructor.  They can then be
  // accessed from the tree with `node.Stat()`.
  //
};

Example: suppose we wanted to know, for each node, the exact time at which it was created. A StatisticType could be created that has a std::time_t member, whose value is computed in the constructor.

๐Ÿ”— SplitType

The SplitType template parameter controls the algorithm used to split each node of a RectangleTree while building. The splitting strategy used can be entirely arbitraryโ€”the SplitType simply needs to split a leaf node and a non-leaf node into children.

mlpack provides several drop-in choices for SplitType, and it is also possible to write a fully custom split:

Note: this section is still under constructionโ€”not all split types are documented yet.

๐Ÿ”— RTreeSplit

The RTreeSplit class implements the original R-tree splitting strategy and can be used with the RectangleTree class. This is the splitting strategy used for the RTree class, and is the same strategy proposed in the original paper (pdf). The strategy works as follows:

For implementation details, see the source code.

๐Ÿ”— RStarTreeSplit

The RStarTreeSplit class implements the improved R*-tree splitting strategy and can be used with the RectangleTree class. This is the splitting strategy used for the RStarTree class, and is the strategy proposed in the R*-tree paper (pdf). The strategy computes, for each possible binary split in each dimension,

The split that minimizes the combined volume and maximizes the overlap is chosen.

In addition, the RStarTreeSplit will sometimes perform forced reinsertion, where points are removed from a node during the splitting process and reinserted into the tree. This can help decrease the overlap between adjacent nodes in the tree, which in turn improves the quality of the tree for search and other tasks.

For implementation details, see the source code.

๐Ÿ”— XTreeSplit

The XTreeSplit class implements the improved splitting strategy for the XTree as described in the X-tree paper (pdf). This strategy is an improved version of the standard RTreeSplit, where the overlap of sibling nodes is minimized.

When overlap cannot be prevented, XTreeSplit will instead create โ€œsuper-nodesโ€ with more children than typically allowed. The split is then deferred until a later time when overlap can be more effectively avoided.

For implementation details, see the source code.

๐Ÿ”— RPlusTreeSplit

The RPlusTreeSplit class implements the splitting policy of the R+-tree. The strategy splits nodes (leaves and non-leaves) by partitioning along the dimension that results in the two children with minimum volume, similar to the kd-tree.

For implementation details, see the source code. Note that RPlusTreeSplit is a template typedef of the general RPlusTreeSplitType<> class.

๐Ÿ”— RPlusPlusTreeSplit

The RPlusPlusTreeSplit class implements the splitting policy of the R++-tree. This class can only be used in a tree that uses RPlusPlusTreeAuxiliaryInformation as the AuxiliaryInformationType.

The splitting strategy splits leaf nodes along an arbitrarily-chosen dimension, and splits non-leaf nodes along the dimension that minimizes the number of descendant nodes that also must be split along that dimension.

For implementation details, see the source code. Note that RPlusPlusTreeSplit is a template typedef of the general RPlusTreeSplitType<> class.

๐Ÿ”— HilbertRTreeSplit<>

The HilbertRTreeSplit<> class is an implementation of the HilbertRTree splitting strategy. This strategy, proposed in the original paper (pdf), has two main differences from the standard RTreeSplit strategy:

When inserting a point, one cooperating sibling node is found. If both the node and its cooperating sibling are full, then all points in the two nodes as well as the point being inserted are ordered by Z-ordering value (also known as Morton ordering), and split evenly into three nodes.

Notes:

๐Ÿ”— Custom SplitTypes

Custom split strategies for a RectangleTree can be implemented via the SplitType template parameter. By default, the RTreeSplit splitting strategy is used, but it is also possible to implement and use a custom SplitType. Any custom SplitType class must implement the following signature:

class SplitType
{
 public:
  // Given the leaf node `tree`, split into multiple nodes.  `TreeType` will be
  // the relevant `RectangleTree` type.  `tree` should be modified directly.
  //
  // `relevels` is an auxiliary array used by some splitting strategies, such as
  // the `RStarTreeSplit`, to indicate whether a node needs to be reinserted
  // into the tree.
  template<typename TreeType>
  static void SplitLeafNode(TreeType* tree, std::vector<bool>& relevels);

  // Given the non-leaf node `tree`, split into multiple nodes.  `TreeType` will
  // be the relevant `RectangleTree` type.  `tree` should be modified directly.
  //
  // `relevels` is an auxiliary array used by some splitting strategies, such as
  // the `RStarTreeSplit`, to indicate whether a node needs to be reinserted
  // into the tree.
  template<typename TreeType>
  static void SplitNonLeafNode(TreeType* tree, std::vector<bool>& relevels);
};

๐Ÿ”— DescentType

The DescentType template parameter controls the algorithm used to assign child points and child nodes to nodes in a RectangleTree. The strategy used can be arbitrary: the DescentType simply needs to return an index of a child to insert a point or node into.

mlpack provides several drop-in choices for DescentType, and it is also possible to write a fully custom split:

Note: this section is still under constructionโ€”not all split types are documented yet.

๐Ÿ”— RTreeDescentHeuristic

The RTreeDescentHeuristic is the default descent strategy for the RectangleTree and is used by the RTree. The strategy is simple: the child node whose volume will increase the least is chosen as the child to insert a point or other node into.

For implementation details, see the source code.

๐Ÿ”— RStarTreeDescentHeuristic

The RStarTreeDescentHeuristic is a descent strategy for the RectangleTree and is used by the RStarTree. The heuristic will always prefer to insert a point or node into a child node whose hyperrectangle bound already contains the point or node to be inserted.

When inserting a point or node into a node whose children are leaves, the strategy will choose to insert into the child where the overall overlap of childrenโ€™s volumes after insertion is minimized.

When inserting a point or node into a node whose children are not leaves, the strategy will choose to insert into the child whose volume is the smallest after insertion.

For implementation details, see the source code.

๐Ÿ”— RPlusTreeDescentHeuristic

The RPlusTreeDescentHeuristic is the descent strategy used by the RPlusTree. When determining which node to insert a point into, the following heuristic is used:

๐Ÿ”— RPlusPlusTreeDescentHeuristic

The RPlusPlusTreeDescentHeuristic is the descent strategy used by the RPlusPlusTree. The strategy chooses the child whose outer bound (held by RPlusPlusTreeAuxiliaryInformation) contains the point to be inserted.

For implementation details, see the source code.

๐Ÿ”— HilbertRTreeDescentHeuristic

The HilbertRTreeDescentHeuristic is the descent strategy used by the HilbertRTree. The strategy depends on the concept of Z-ordering (or Morton ordering): the child node whose minimum Z-ordering value is closest to but greater than the Z-ordering value of the point to be inserted is chosen.

For implementation details, see the source code.

๐Ÿ”— Custom DescentTypes

Custom descent strategies for a RectangleTree can be implemented via the DescentType template parameter. By default, the RTreeDescentHeuristic descent strategy is used, but it is also possible to implement and use a custom DescentType. Any custom DescentType class must implement the following signature:

class DescentType
{
 public:
  // Return a `size_t` indicating which child of `node` should be chosen to
  // insert `point` in.
  //
  // `TreeType` will be the relevant `RectangleTree` type.
  template<typename TreeType>
  static size_t ChooseDescentNode(const TreeType* node, const size_t point);

  // Return a `size_t` indicating which child of `node` should be chosen to
  // insert `insertedNode` in.
  //
  // `TreeType` will be the relevant `RectangleTree` type.
  template<typename TreeType>
  static size_t ChooseDescentNode(const TreeType* node,
                                  const TreeType* insertedNode);
};

๐Ÿ”— AuxiliaryInformationType

The AuxiliaryInformationType template parameter holds any auxiliary information required by the SplitType or DescentType strategies. By default, the NoAuxiliaryInformation class is used, which holds nothing. Different variants of RectangleTrees may use other predefined types for their AuxiliaryInformationTypes:

๐Ÿ”— XTreeAuxiliaryInformation

The XTreeAuxiliaryInformation class is the auxiliary information type used by the XTree class, and is meant to be used with the XTreeSplit splitting strategy. It holds information required to construct super-nodes (a concept specific to X-trees), where splitting is being deferred.

For implementation details, see the source code.

๐Ÿ”— RPlusPlusTreeAuxiliaryInformation

The RPlusPlusTreeAuxiliaryInformation class is used by the RPlusPlusTree to store information required for tree building. In addition to the regular HRectBound that is used to maintain the minimum bounding rectangle of each node, each R++-tree node also maintains an โ€˜outer boundโ€™ that represents the maximum bounding rectangle. This maximum bounding rectangle is used for splitting, instead of the minimum bounding rectangle; this helps prevent overlap in nodes.

For an object auxInfo, the function auxInfo.OuterBound() will return an HRectBound&. If the tree was built with a non-standard MatType, then the type returned will be HRectBound<EuclideanDistance, ElemType>, where ElemType is the element type of the given MatType.

For implementation details, see the source code.

๐Ÿ”— DiscreteHilbertRTreeAuxiliaryInformation

The DiscreteHilbertRTreeAuxiliaryInformation class is used by the HilbertRTree. It stores the largest Z-ordering value of any descendant point of a node. (This can be accessed with the HilbertValue() method.)

For more details, see the source code.

๐Ÿ”— Custom AuxiliaryInformationTypes

Custom AuxiliaryInformationTypes can be implemented and used with the AuxiliaryInformationType template parameter. Any custom AuxiliaryInformationType class must implement the following signature:

// TreeType will be the type of RectangleTree that the auxiliary information
// type is being used in.
template<typename TreeType>
class CustomAuxiliaryInformationType
{
 public:
  // Default constructor is required.
  CustomAuxiliaryInformationType();
  // Construct the object with a tree node that may not yet be constructed.
  CustomAuxiliaryInformationType(TreeType* node);
  // Construct the object with another object and another tree node, optionally
  // making a 'deep copy' instead of just copying pointers where relevant.
  CustomAuxiliaryInformationType(const CustomAuxiliaryInformationType& other,
                                 TreeType* node,
                                 const bool deepCopy = true);

  // Just before a point is inserted into a node, this is called.
  // `node` is the node that will have `node.Dataset().col(point)` inserted into
  // it.
  //
  // Optionally, this method can manipulate `node`.  If so, `true` should be
  // returned to indicate that `node` was changed.  Otherwise, return `false`
  // and the RectangleTree will perform its default behavior.
  bool HandlePointInsertion(TreeType* node, const size_t point);

  // Just before a child node is inserted into a node, this is called.
  // `node` is the node that will have `nodeToInsert` inserted into it as a
  // child.
  //
  // Optionally, this method can manipulate `node`.  If so, `true` should be
  // returned to indicate that `node` was changed.  Otherwise, return `false`
  // and the RectangleTree will perform its default behavior.
  bool HandleNodeInsertion(TreeType* node,
                           TreeType* nodeToInsert,
                           const bool atMaxDepth);

  // Just before a point is deleted from a node, this is called.
  // `node` is the node that will have `node.Dataset().col(point)` deleted from
  // it.
  //
  // Optionally, this method can manipulate `node`.  If so, `true` should be
  // returned to indicate that `node` was changed.  Otherwise, return `false`
  // and the RectangleTree will perform its default behavior.
  bool HandlePointDeletion(TreeType* node, const size_t point);

  // Just before a child node is deleted from a node, this is called.
  // `node` is the node that will have `node.Child(nodeIndex)` deleted from it.
  //
  // Optionally, this method can manipulate `node`.  If so, `true` should be
  // returned to indicate that `node` was changed.  Otherwise, return `false`
  // and the RectangleTree will perform its default behavior.
  bool HandleNodeRemoval(TreeType* node, const size_t nodeIndex);

  // When `node` is changed, this is called so that the auxiliary information
  // can be updated.  If information needs to be propagated upward, return
  // `true` and then `UpdateAuxiliaryInfo(node->Parent())` will be called.
  bool UpdateAuxiliaryInfo(TreeType* node);
};

๐Ÿ”— Example usage

The RectangleTree class is only really necessary when a custom split type or custom descent strategy is intended to be used. For simpler use cases, one of the typedefs of RectangleTree (such as RTree) will suffice.

For this reason, all of the examples below explicitly specify all six template parameters of RectangleTree. Writing a custom splitting strategy, writing a custom descent strategy, and writing a custom auxiliary information type are discussed in the previous sections. Each of the parameters in the examples below can be trivially changed for different behavior.


Build a RectangleTree on the cloud dataset and print basic statistics about the tree.

// See https://datasets.mlpack.org/cloud.csv.
arma::mat dataset;
mlpack::data::Load("cloud.csv", dataset, true);

// Build the rectangle tree with a leaf size of 10.  (This means that leaf nodes
// cannot contain more than 10 points.)
//
// The std::move() means that `dataset` will be empty after this call, and no
// data will be copied during tree building.
mlpack::RectangleTree<mlpack::EuclideanDistance,
                      mlpack::EmptyStatistic,
                      arma::mat,
                      mlpack::RTreeSplit,
                      mlpack::RTreeDescentHeuristic,
                      mlpack::NoAuxiliaryInformation> tree(std::move(dataset));

// Print the bounding box of the root node.
std::cout << "Bounding box of root node:" << std::endl;
for (size_t i = 0; i < tree.Bound().Dim(); ++i)
{
  std::cout << " - Dimension " << i << ": [" << tree.Bound()[i].Lo() << ", "
      << tree.Bound()[i].Hi() << "]." << std::endl;
}
std::cout << std::endl;

// Print the number of children in the root, and the allowable range.
std::cout << "Number of children of root: " << tree.NumChildren()
    << "; allowable range: [" << tree.MinNumChildren() << ", "
    << tree.MaxNumChildren() << "]." << std::endl;

// Print the number of descendant points of the root, and of each of its
// children.
std::cout << "Descendant points of root:        "
    << tree.NumDescendants() << "." << std::endl;
for (size_t i = 0; i < tree.NumChildren(); ++i)
{
  std::cout << "Descendant points of child " << i << ":  "
      << tree.Child(i).NumDescendants() << "." << std::endl;
}
std::cout << std::endl;

// Compute the center of the RectangleTree.
arma::vec center;
tree.Center(center);
std::cout << "Center of tree: " << center.t();

Build two RectangleTrees on subsets of the corel dataset and compute minimum and maximum distances between different nodes in the tree.

// See https://datasets.mlpack.org/corel-histogram.csv.
arma::mat dataset;
mlpack::data::Load("corel-histogram.csv", dataset, true);

// Convenience typedef for the tree type.
using TreeType = mlpack::RectangleTree<mlpack::EuclideanDistance,
                                       mlpack::EmptyStatistic,
                                       arma::mat,
                                       mlpack::RTreeSplit,
                                       mlpack::RTreeDescentHeuristic,
                                       mlpack::NoAuxiliaryInformation>;

// Build trees on the first half and the second half of points.
TreeType tree1(dataset.cols(0, dataset.n_cols / 2));
TreeType tree2(dataset.cols(dataset.n_cols / 2 + 1, dataset.n_cols - 1));

// Compute the maximum distance between the trees.
std::cout << "Maximum distance between tree root nodes: "
    << tree1.MaxDistance(tree2) << "." << std::endl;

// Get the leftmost grandchild of the first tree's root---if it exists.
if (!tree1.IsLeaf() && !tree1.Child(0).IsLeaf())
{
  TreeType& node1 = tree1.Child(0).Child(0);

  // Get the leftmost grandchild of the second tree's root---if it exists.
  if (!tree2.IsLeaf() && !tree2.Child(0).IsLeaf())
  {
    TreeType& node2 = tree2.Child(0).Child(0);

    // Print the minimum and maximum distance between the nodes.
    mlpack::Range dists = node1.RangeDistance(node2);
    std::cout << "Possible distances between two grandchild nodes: ["
        << dists.Lo() << ", " << dists.Hi() << "]." << std::endl;

    // Print the minimum distance between the first node and the first
    // descendant point of the second node.
    const size_t descendantIndex = node2.Descendant(0);
    const double descendantMinDist =
        node1.MinDistance(node2.Dataset().col(descendantIndex));
    std::cout << "Minimum distance between grandchild node and descendant "
        << "point: " << descendantMinDist << "." << std::endl;

    // Which child of node2 is closer to node1?
    const size_t closestIndex = node2.GetNearestChild(node1);
    std::cout << "Child " << closestIndex << " is closest to node1."
        << std::endl;

    // And which child of node1 is further from node2?
    const size_t furthestIndex = node1.GetFurthestChild(node2);
    std::cout << "Child " << furthestIndex << " is furthest from node2."
        << std::endl;
  }
}

Build a RectangleTree on 32-bit floating point data and save it to disk.

// See https://datasets.mlpack.org/corel-histogram.csv.
arma::fmat dataset;
mlpack::data::Load("corel-histogram.csv", dataset);

// Build the RectangleTree using 32-bit floating point data as the matrix
// type.  We will still use the default EmptyStatistic and EuclideanDistance
// parameters.  A leaf size of 100 is used here.
mlpack::RectangleTree<mlpack::EuclideanDistance,
                      mlpack::EmptyStatistic,
                      arma::fmat,
                      mlpack::RTreeSplit,
                      mlpack::RTreeDescentHeuristic,
                      mlpack::NoAuxiliaryInformation> tree(
    std::move(dataset), 100);

// Save the tree to disk with the name 'tree'.
mlpack::data::Save("tree.bin", "tree", tree);

std::cout << "Saved tree with " << tree.Dataset().n_cols << " points to "
    << "'tree.bin'." << std::endl;

Load a 32-bit floating point RectangleTree from disk, then traverse it manually and find the number of leaf nodes with less than 10 points.

// This assumes the tree has already been saved to 'tree.bin' (as in the example
// above).

// This convenient typedef saves us a long type name!
using TreeType = mlpack::RectangleTree<mlpack::EuclideanDistance,
                                       mlpack::EmptyStatistic,
                                       arma::fmat,
                                       mlpack::RTreeSplit,
                                       mlpack::RTreeDescentHeuristic,
                                       mlpack::NoAuxiliaryInformation>;

TreeType tree;
mlpack::data::Load("tree.bin", "tree", tree);
std::cout << "Tree loaded with " << tree.NumDescendants() << " points."
    << std::endl;

// Recurse in a depth-first manner.  Count both the total number of leaves, and
// the number of leaves with less than 10 points.
size_t leafCount = 0;
size_t totalLeafCount = 0;
std::stack<TreeType*> stack;
stack.push(&tree);
while (!stack.empty())
{
  TreeType* node = stack.top();
  stack.pop();

  if (node->NumPoints() < 10)
    ++leafCount;
  ++totalLeafCount;

  for (size_t i = 0; i < node->NumChildren(); ++i)
    stack.push(&node->Child(i));
}

// Note that it would be possible to use TreeType::SingleTreeTraverser to
// perform the recursion above, but that is more well-suited for more complex
// tasks that require pruning and other non-trivial behavior; so using a simple
// stack is the better option here.

// Print the results.
std::cout << leafCount << " out of " << totalLeafCount << " leaves have fewer "
    << "than 10 points." << std::endl;

Build a RectangleTree by iteratively inserting points from the corel dataset, print some information, and then remove a few randomly chosen points.

// See https://datasets.mlpack.org/corel-histogram.csv.
arma::mat dataset;
mlpack::data::Load("corel-histogram.csv", dataset, true);

// This convenient typedef saves us a long type name!
using TreeType = mlpack::RectangleTree<mlpack::EuclideanDistance,
                                       mlpack::EmptyStatistic,
                                       arma::mat,
                                       mlpack::RTreeSplit,
                                       mlpack::RTreeDescentHeuristic,
                                       mlpack::NoAuxiliaryInformation>;

// Create an empty tree of the right dimensionality.
TreeType t(dataset.n_rows);

// Insert points one by one for the first half of the dataset.
for (size_t i = 0; i < dataset.n_cols / 2; ++i)
  t.Insert(dataset.col(i));

std::cout << "After inserting half the points, the root node has "
    << t.NumDescendants() << " descendant points and "
    << t.NumChildren() << " child nodes." << std::endl;

// For the second half, insert the points backwards.
for (size_t i = dataset.n_cols - 1; i >= dataset.n_cols / 2; --i)
  t.Insert(dataset.col(i));

std::cout << "After inserting all the points, the root node has "
    << t.NumDescendants() << " descendant points and "
    << t.NumChildren() << " child nodes." << std::endl;

// Remove three random points.
t.Delete(mlpack::math::RandInt(0, t.NumDescendants()));
std::cout << "After removing 1 point, the root node has " << t.NumDescendants()
    << " descendant points." << std::endl;
t.Delete(mlpack::math::RandInt(0, t.NumDescendants()));
std::cout << "After removing 2 points, the root node has " << t.NumDescendants()
    << " descendant points." << std::endl;
t.Delete(mlpack::math::RandInt(0, t.NumDescendants()));
std::cout << "After removing 3 points, the root node has " << t.NumDescendants()
    << " descendant points." << std::endl;