11.扩充的数据结构、动态有序统计和区间树

Augmenting data structures

Dynamic order statistics

OS-Select(i) - returns i^th smallest item in dynamic set
OS-Rank(x) - returns rank of x in sorted order
Idea: Keep subtree sizes in nodes of a red-black tree

OS-Select(x, i) // ith smallest in subtree rooted at x
    k ← size[left[x]] + 1  // k = rank(x)
    if i = k then return x
    if i < k then return OS-Select(left[x], i)
    else return OS-Select(right[x], i-k)

Analysis: O(lg n)

OS-Rank in CLRS

OS-Rank(T, x)     // rank of x in the tree T
    r = x.left.size + 1
    y = x
    while y ≠ T.root
        if y == y.p.right
            r = r + y.p.left.size + 1
        y = y.p
    return r

Q: Why not keep ranks in nodes instead of subtre suzes?
A: Hard to maintain when r-b tree is modified.

Modifying operations: Insert, delete

Strategy: Update subtree sizes when inserting or deleting
Ex: 像一个红黑树insert的时候，不但要重新算size，还要handle rebalancing. recoloring 不影响size。rotation 最多只发生两次，而且只有少量node的size要改变。

Data-Structure augmentation

Methodology (Ex OS trees)

Choose underlying data structure (red-black tree)
Determine additional information (subtree size)
Verity info can be maintained for the modifying operations(Insert, Delete, rotations)
Develop new operations that use info (DS-Select, OS-Rank)

Usually, mush plays with interactions between steps.

Example: Interval trees

Maintain a set of intervals
e.g. time intervals

time intervals with query operation

Methodology

red-black tree keyed on the low endpoint (我觉得，同时存了high endpoint)
Store in the node the largest value m in the subtree rooted at that node
Modifying ops
Insert: Fix m's on way down
But, also need to handle rotations (takes O(1) time)
Insert time = O(lg n)
Delete similar (书上有但是今晚要自己先想一想)
New opertations

Interval-Search(i)  //Find an interval that overlaps i
    x ← root
    while x ≠ nil and (low[i] > high[int[x]] or low [int[x]] > high[i])     //int[x] = the interval of x
    do // i and int[x] don't overlap
        if left[x] ≠ nil and low [i] ≤ m[left[x]]
            then x ← left[x]
        else x ← right[x]
    return x

List all overlaps: O(k*lg n) "output sensitive" 方法是每找到一个就把它删掉
Best to date = O(k + lg n) 在另一种数据结构中能实现
k 是overlaps的数量

Correctness

Thearem

Let L = {i' left[x]}, R = {i' right[x]}

If search goes right then {i' L: i's overlaps i} = ∅
If search goes left, then:
{i' L: i' overlaps i} = ∅
{i' R: i' overlaps i} = ∅

Proof

Suppose search goes right

If left[x] = nil, done since L = ∅
Otherwise, low[i] > m[left[x]]

Suppose search goes left and {i' L : i' overlaps i} = ∅
Then, low[i] m[left[x]] = high[j] for some j L
Since j L, j doesn't overlap i high[i] < low[j]