张劲声

《算法导论》第三版第11章散列表练习&思考题个人答案

11.1 直接寻址表

11.1-1

解：

DIRECT-ADDRESS-FINDMAX(T)
for i = T.length - 1 to 0
    if T[i] != NIL
        return T[i]

最坏情况 $O (m)$ 。

11.1-2

思路：1代表存在，0代表不存在；插入置位，删除复位。

11.1-3

思路：可以将寻址表的每一个元素指向包含相同关键字的一个双向循环链表。再使用第10章的相关知识完成。

11.1-4

解（来自参考答案）：
We denote the huge array by $T$ and, taking the hint from the book, we also have a stack implemented by an array $S$ . The size of $S$ equals the number of keys actually stored, so that $S$ should be allocated at the dictionary’s maximum size. The stack

has an attribute $S . t o p$ , so that only entries $S [1 . . S . t o p]$ are valid.
The idea of this scheme is that entries of $T$ and $S$ validate each other. If key $k$ is

actually stored in $T$ , then $T [k]$ contains the index, say $j$ , of a valid entry in $S$ , and

$S [j]$ contains the value $k$ . Let us call this situation, in which $\le T[k] \le S.top$ , $S [T [k]] = k$ , and $T [S [j]] = j$ , a validating cycle.
Assuming that we also need to store pointers to objects in our direct-address table, we can store them in an array that is parallel to either $T$ or $S$ . Since $S$ is smaller than $T$ , we’ll use an array $S^{'}$ , allocated to be the same size as $S$ , for these pointers. Thus, if the dictionary contains an object $x$ with key $k$ , then there is a validating cycle and $S^{'} [T [k]]$ points to $x$ .
The operations on the dictionary work as follows:

Initialization: Simply set $S . t o p = 0$ , so that there are no valid entries in the stack.
SEARCH: Given key $k$ , we check whether we have a validating cycle, i.e., whether $\le T [k] \le S.top$ and $S [T [k]] = k$ . If so, we return $S^{'} [T [k]]$ , and otherwise we return $\text{NIL}$ .
INSERT: To insert object $x$ with key $k$ , assuming that this object is not already in the dictionary, we increment $S . t o p$ , set $S [S . t o p] = k$ , set $S^{'} [S . t o p] = x$ , and set $T [k] = S . t o p$ .
DELETE: To delete object $x$ with key $k$ , assuming that this object is in the dictionary, we need to break the validating cycle. The trick is to also ensure that we don’t leave a “hole” in the stack, and we solve this problem by moving the top entry of the stack into the position that we are vacating-and then fixing up that entry’s validating cycle. That is, we execute the following sequence of assignments:

$\begin{aligned} & S[T[k]] = S[S.top] \\ & S'[T[k]] = S'[S.top] \\ & T[S[T[k]]] = T[k] \\ & T[k] = 0 \\ & S.top = S.top - 1 \end{aligned}$
Each of these operation - initialization, $\text{SEARCH}$ , $\text{INSERT}$ , and $\text{DELETE}$ -takes $O (1)$ time.

11.2 散列表

11.2-1

解：对不相同的kl组合求1/m的和，可得 $\frac{n(n-1)}{2m}$ 。

11.2-2

解：中间过程略，最后结果是
0
1→10→19→28
2→20
3→12
4
5→5
6→33→15
7
8→17

11.2-3

解：
查找：期望时间不变，但查找的值越大所需时间越多（如果是单链表升序排列的话）
插入：期望时间不变，所需时间略多（需要执行一次时间复杂度是 $O (1)$ 的插入操作）
删除：期望时间不变，但删除的值越大所需时间越大（如果是单链表升序排列的话）

11.2-4

思路：标志位用来标志该槽位是否被占用，如果没有被占用，两个指针分别指向前一个和后一个空槽位（如同一个双向链表）；如果被占用，一个指针指向保存的元素。
解（来自参考答案）：
The flag in each slot will indicate whether the slot is free.
（每个插槽中的标志将指示插槽是否空闲。）
A free slot is in the free list, a doubly linked list of all free slots in the table. The slot thus contains two pointers.
A used slot contains an element and a pointer (possibly $\text{NIL}$ ) to the next element that hashes to this slot. (Of course, that pointer points to another slot in the table.)
（空闲插槽位于空闲列表中，空闲列表是表中所有空闲插槽的双向链表。因此，槽包含两个指针。已使用的插槽包含一个元素和一个指向下一个散列到此插槽的元素的指针（可能是 $\text {NIL}$ ）。（当然，该指针指向表中的另一个插槽。））
Operations（操作）

Insertion（插入）:

If the element hashes to a free slot, just remove the slot from the free list and store the element there (with a $\text{NIL}$ pointer). The free list must be doubly linked in order for this deletion to run in $O (1)$ time.

If the element hashes to a used slot $j$ , check whether the element $x$ already there “belongs” there (its key also hashes to slot $j$ ).

If so, add the new element to the chain of elements in this slot. To do so, allocate a free slot (e.g., take the head of the free list) for the new element and put this new slot at the head of the list pointed to by the hashed-to slot ( $j$ ).
If not, $E$ is part of another slot’s chain. Move it to a new slot by allocating one from the free list, copying the old slot’s ( $j$ 's) contents (element $x$ and pointer) to the new slot, and updating the pointer in the slot that pointed to $j$ to point to the new slot. Then insert the new element in the now-empty slot as usual.

To update the pointer to $j$ , it is necessary to find it by searching the chain of elements starting in the slot $x$ hashes to.

Deletion（删除）:
Let $j$ be the slot the element $x$ to be deleted hashes to.

If $x$ is the only element in $j$ ( $j$ doesn’t point to any other entries), just free the slot, returning it to the head of the free list.
If $x$ is in $j$ but there’s a pointer to a chain of other elements, move the first pointed-to entry to slot $j$ and free the slot it was in.
If $x$ is found by following a pointer from $j$ , just free $x$ 's slot and splice it out of the chain (i.e., update the slot that pointed to $x$ to point to $x$ 's successor).

Searching（查找）:
Check the slot the key hashes to, and if that is not the desired element, follow the chain of pointers from the slot.
All the operations take expected $O (1)$ times for the same reason they do with the version in the book: The expected time to search the chains is $\alpha)$ regardless of where the chains are stored, and the fact that all the elements are stored in the table means that $\alpha \le 1$ . If the free list were singly linked, then operations that involved removing an arbitrary slot from the free list would not run in $O (1)$ time.

11.2-5

这不是很显然吗。。 $\frac{|U|}{m}>n$ 必然至少有一个槽中有多于n个的元素，鸽笼原理？

11.2-6

思路：最长链长度为L，共有m条链，可以看成一个m行L列的矩阵，只要调用RANDOM(1, m)和RANDOM(1, L)，直到找到一个包含元素的位置，需要mL/n（即L/α）次，再查找该元素即可。

11.3 散列函数

11.3-1

思路：比较链表中元素的散列值和给定关键字的散列值。

11.3-2

解：

sum = 0
for i = 1 to r
    sum = (sum *128 + s[i]) mod m // 使用sum作为散列值

11.3-3

解（来自参考答案）：
First, we observe that we can generate any permutation by a sequence of interchanges of pairs of characters. One can prove this property formally, but informally, consider that both heapsort and quicksort work by interchanging pairs of elements and that they have to be able to produce any permutation of their input array. Thus, it suffices to show that if string $x$ can be derived from string $y$ by interchanging a single pair of characters, then $x$ and $y$ hash to the same value.
（首先，我们观察到我们可以通过一系列字符交换生成任何排列。可以正式地证明这个属性，但是非正式地，考虑堆排序和快速排序都可以通过交换元素对来工作，并且他们必须能够产生输入数组的任何排列。因此，足以证明如果字符串 $x$ 可以通过交换一对字符从字符串 $y$ 派生，那么 $x$ 和 $y$ 将散列到相同的值。）
Let us denote the $i$ th character in $x$ by $x_i$ , and similarly for $y$ . The interpretation of $x$ in radix $2^p$ is $\sum_{i = 0}^{n - 1} x_i 2^{ip}$ , and so $(\sum_{i = 0}^{n - 1} x_i 2^{ip}) \mod (2^p - 1)$ . Similarly, $(\sum_{i = 0}^{n - 1} y_i 2^{ip}) \mod (2^p - 1)$ .
Suppose that $x$ and $y$ are identical strings of $n$ characters except that the characters in positions $a$ and $b$ are interchanged: $x_a = y_b$ and $y_a = x_b$ . Without loss of generality, let $a > b$ . We have
$\Big(\sum_{i = 0}^{n - 1} x_i 2^{ip}\Big) \mod (2^p - 1) - \Big(\sum_{i = 0}^{n - 1} y_i 2^{ip}\Big) \mod (2^p - 1).$
Since $\le h(x)$ , $h(y) < 2^p - 1$ , we have that $2^p - 1) < h(x) - h(y) < 2^p - 1$ . If we show that $h(x) - h(y)) \mod (2^p - 1) = 0$ , then $h (x) = h (y)$ .
Since the sums in the hash functions are the same except for indices $a$ and $b$ , we have
$\begin{aligned} (h(x) - h(y)) \mod (2^p - 1) & = ((x_a 2^{ap} + x_b 2^{bp}) - (y_a 2^{ap} + y_b 2^{bp})) \mod (2^p - 1) \\ & = ((x_a 2^{ap} + x_b 2^{bp}) - (x_b 2^{ap} + x_a 2^{bp})) \mod (2^p - 1) \\ & = ((x_a - x_b)2^{ap} - (x_a - x_b) 2^{bp}) \mod (2^p - 1) \\ & = ((x_a - x_b)(2^{ap} - 2^{bp})) \mod (2^p - 1) \\ & = ((x_a - x_b)2^{bp}(2^{(a - b)p} - 1)) \mod (2^p - 1). \end{aligned}$
By equation $\text{(A.5)}$ ,
$\sum_{i = 0}^{a - b - 1} 2^{pi} = \frac{2^{(a - b)p} - 1}{2^p - 1},$
and multiplying both sides by $s^p - 1$ , we get $2^{(a - b)p} - 1 = \big(\sum_{i = 0}^{a - b - 1} 2^{pi}\big)(2^p - 1)$ . Thus,
$\begin{aligned} (h(x) - h(y))\mod(2^p - 1) & = \Bigg((x_a - x_b)2^{bp}\Bigg(\sum_{i = 0}^{a - b - 1} 2^{pi}\Bigg)(2^p - 1)\Bigg) \mod (2^p - 1) \\ & = 0, \end{aligned}$
since one of the factors is $2^p - 1$ .
We have shown that $h(x) - h(y)) \mod (2^p - 1) = 0$ , and so $h (x) = h (y)$ .

11.3-4

解：
$h (61) = 700$
$h (62) = 318$
$h (63) = 936$
$h (64) = 554$
$h (65) = 172$

11.3-5

解（来自参考答案）：
Let $b = ∣ B ∣$ and $u = ∣ U ∣$ . We start by showing that the total number of collisions is minimized by a hash function that maps $u / b$ elements of $U$ to each of the $b$ values in $B$ . For a given hash function, let $u_j$ be the number of elements that map to $\in B$ . We have $\sum_{j \in B} u_j$ . We also have that the number of collisions for a given value of $\in B$ is $\binom{u_j}{2} = u_j(u_j - 1) / 2$ .
Lemma
The total number of collisions is minimized when $u_j = u / b$ for each $\in B$ .
Proof
If $u_j \le u / b$ , let us call $j$ underloaded, and if $u_j \ge u / b$ , let us call $j$ overloaded. Consider an unbalanced situation in which $u_j \ne u / b$ for at least one value $\in B$ . We can think of converting a balanced situation in which all $u_j$ equal $u / b$ into the unbalanced situation by repeatedly moving an element that maps to an underloaded value to map instead to an overloaded value. (If you think of the values of $B$ as representing buckets, we are repeatedly moving elements from buckets containing at most $u / b$ elements to buckets containing at least $u / b$ elements.)
We now show that each such move increases the number of collisions, so that all the moves together must increase the number of collisions. Suppose that we move an element from an underloaded value $j$ to an overloaded value $k$ , and we leave all other elements alone. Because $j$ is underloaded and $k$ is overloaded, $u_j \le u / b\le u_k$ . Considering just the collisions for values $j$ and $k$ , we have $u_j(u_j - 1) / 2 + u_k(u_k - 1) / 2$ collisions before the move and $u_j - 1)(u_j - 2) / 2 + (u_k + 1)u_k / 2$ collisions afterward. We wish to show that
$u_j(u_j - 1) / 2 + u_k(u_k - 1) / 2 < (u_j - 1)(u_j - 2) / 2 + (u_k + 1)u_k / 2.$
We have the following sequence of equivalent inequalities:
$\begin{aligned} u_j & < u_k + 1 \\ 2u_j & < 2u_k + 2 \\ -u_k & < u_k - 2u_j + 2 \\ u_j^2 - u_j + u_k^2 - u_k & < u_j^2 - 3u_j + 2 + u_k^2 + u_k \\ u_j(u_j - 1) + u_k(u_k - 1) & < (u_j - 1)(u_j - 2) + (u_k + 1)u_k \\ u_j(u_j - 1) / 2 + u_k(u_k - 1) / 2 & < (u_j - 1)(u_j - 2) / 2 + (u_k + 1)u_k / 2. \end{aligned}$
Thus, each move increases the number of collisions. We conclude that the number of collisions is minimized when $u_j = u / b$ for each $\in B$ .
By the above lemma, for any hash function, the total number of collisions must be at least $b (u / b) (u / b - 1) / 2$ . The number of pairs of distinct elements is $\binom{u}{2} = u(u - 1) / 2$ . Thus, the number of collisions per pair of distinct elements must be at least
$\begin{aligned} \frac{b(u / b)(u / b - 1) / 2}{u(u - 1) / 2} & = \frac{u / b - 1}{u - 1} \\ & > \frac{u / b - 1}{u} \\ & = \frac{1}{b} - \frac{1}{u}. \end{aligned}$
Thus, the bound on the probability of a collision for any pair of distinct elements can be no less than $1 / b - 1 / u = 1 / ∣ B ∣ - 1 / ∣ U ∣$ .

11.3-6

证明（来自参考答案）：
Fix $\in \mathbb Z_p$ . By exercise 31.4-4, $h_b(x)$ collides with $h_b(y)$ for at most $n - 1$ other $\in U$ . Since there are a total of $p$ possible values that $h_b$ takes on, the
probability that $h_b(x) = h_b(y)$ is bounded from above by $\frac{n - 1}{p}$ , since this holds for any value of $b$ , $\mathcal H$ is $((n - 1) / p)$ -universal.

11.4 开放寻址法

11.4-1

解：
线性探查：
$\begin{array}{r|ccccccccc} h(k, i) = (k + i) \mod 11 & T_0 & T_1 & T_2 & T_3 & T_4 & T_5 & T_6 & T_7 & T_8 \\ \hline 0 \mod 11 & & 22 & 22 & 22 & 22 & 22 & 22 & 22 & 22 \\ 1 \mod 11 & & & & & & & & 88 & 88 \\ 2 \mod 11 & & & & & & & & & \\ 3 \mod 11 & & & & & & & & & \\ 4 \mod 11 & & & & 4 & 4 & 4 & 4 & 4 & 4 \\ 5 \mod 11 & & & & & 15 & 15 & 15 & 15 & 15 \\ 6 \mod 11 & & & & & & 28 & 28 & 28 & 28 \\ 7 \mod 11 & & & & & & & 17 & 17 & 17 \\ 8 \mod 11 & & & & & & & & & 59 \\ 9 \mod 11 & & & 31 & 31 & 31 & 31 & 31 & 31 & 31 \\ 10 \mod 11 & 10 & 10 & 10 & 10 & 10 & 10 & 10 & 10 & 10 \end{array}$
二次探查：
$\begin{array}{r|ccccccccc} h(k, i) = (k + i + 3i^2) \mod 11 & T_0 & T_1 & T_2 & T_3 & T_4 & T_5 & T_6 & T_7 & T_8 \\ \hline 0 \mod 11 & & 22 & 22 & 22 & 22 & 22 & 22 & 22 & 22 \\ 1 \mod 11 & & & & & & & & & \\ 2 \mod 11 & & & & & & & & 88 & 88 \\ 3 \mod 11 & & & & & & & 17 & 17 & 17 \\ 4 \mod 11 & & & & 4 & 4 & 4 & 4 & 4 & 4 \\ 5 \mod 11 & & & & & & & & & \\ 6 \mod 11 & & & & & & 28 & 28 & 28 & 28 \\ 7 \mod 11 & & & & & & & & & 59 \\ 8 \mod 11 & & & & & 15 & 15 & 15 & 15 & 15 \\ 9 \mod 11 & & & 31 & 31 & 31 & 31 & 31 & 31 & 31 \\ 10 \mod 11 & 10 & 10 & 10 & 10 & 10 & 10 & 10 & 10 & 10 \end{array}$
双重散列：
$\begin{array}{r|ccccccccc} h(k, i) = (k + i(1 + k \mod 10)) \mod 11 & T_0 & T_1 & T_2 & T_3 & T_4 & T_5 & T_6 & T_7 & T_8 \\ \hline 0 \mod 11 & & 22 & 22 & 22 & 22 & 22 & 22 & 22 & 22 \\ 1 \mod 11 & & & & & & & & & \\ 2 \mod 11 & & & & & & & & & 59 \\ 3 \mod 11 & & & & & & & 17 & 17 & 17 \\ 4 \mod 11 & & & & 4 & 4 & 4 & 4 & 4 & 4 \\ 5 \mod 11 & & & & & 15 & 15 & 15 & 15 & 15 \\ 6 \mod 11 & & & & & & 28 & 28 & 28 & 28 \\ 7 \mod 11 & & & & & & & & 88 & 88 \\ 8 \mod 11 & & & & & & & & & \\ 9 \mod 11 & & & 31 & 31 & 31 & 31 & 31 & 31 & 31 \\ 10 \mod 11 & 10 & 10 & 10 & 10 & 10 & 10 & 10 & 10 & 10 \end{array}$

11.4-2

解：

HASH-DELETE(T, k)
i = 0
repeat
    j = h(k, i)
    if T[j] == k
        T[j] = DELETED
        return j 
    i = i + 1
until T[j] == NIL or i == m
error "element not exist"

HASH-INSERT(T, k)
i = 0
repeat
    j = h(k, i)
    if T[j] == NIL or T[j] == DELETED
        T[j] = k
        return j
    else i = i + 1
until i == m
error "hash table oveflow"

11.4-3

解：
$\alpha=3/4$ ：不成功4次，成功约1.848次
$\alpha=7/8$ ：不成功8次，成功约2.377次

11.4-4

证明：假设 $d = \gcd(m, h_2(k))$ , 最小公倍数 $\cdot h_2(k) / d$ 。
因为 $d | h_2(k)$ ，有 $\cdot h_2(k) / d \mod m = 0 \cdot (h_2(k) / d \mod m) = 0$ ，因此 $l + ih_2(k)) \mod m = ih_2(k) \mod m$ ，意味着 $ih_2(k) \mod m$ 有周期 $m / d$ 。

11.4-5

解：
$\begin{aligned} \frac{1}{1 - \alpha} & = 2 \cdot \frac{1}{\alpha} \ln\frac{1}{1 - \alpha} \\ \alpha & = 0.71533. \end{aligned}$

11.5 完全散列

11.5-1

证明（来自参考答案）： $\begin{aligned} p(n, m) & = \frac{m}{m} \cdot \frac{m - 1}{m} \cdots \frac{m - n + 1}{m} \\ & = \frac{m \cdot (m - 1) \cdots (m - n + 1)}{m^n}. \end{aligned}$
$\begin{aligned} (m - i) \cdot (m - n + i) & = (m - \frac{n}{2} + \frac{n}{2} - i) \cdot (m - \frac{n}{2} - \frac{n}{2} + i) \\ & = (m - \frac{n}{2})^2 - (i - \frac{n}{2})^2 \\ & \le (m - \frac{n}{2})^2. \end{aligned}$
$\begin{aligned} p(n, m) & \le \frac{m \cdot (m - \frac{n}{2})^{n - 1}}{m^n} \\ & = (1 - \frac{n}{2m}) ^ {n - 1}. \end{aligned}$
基于式 $\text{(3.12)}$ , $e^x \ge 1 + x$ ,
$\begin{aligned} p(n, m) & \le (e^{-n / 2m})^{n - 1} \\ & = e^{-n(n - 1) / 2m}. \end{aligned}$

思考题

（以下来自参考答案）

11-1（散列最长探查的界）

a.

Since we assume uniform hashing, we can use the same observation as is used in Corollary 11.7: that inserting a key entails an unsuccessful search followed by placing the key into the first empty slot found. As in the proof of Theorem 11.6, if we let $X$ be the random variable denoting the number of probes in an unsuccessful search, then $\Pr\{X \ge i\} \le \alpha^{i - 1}$ . Since $\le m / 2$ , we have $\alpha \le 1 / 2$ . Letting $i = k + 1$ , we have $\Pr\{X > k\} = \Pr\{X \ge k + 1\} \le (1 / 2)^{(k + 1) - 1} = 2^{-k}$ .

b.

Substituting $2\lg n$ into the statement of part (a) yields that the probability that the $i$ th insertion requires more than $2\lg n$ probes is at most $2^{-2\lg n} = (2^{\lg n})^{-2} = n^{-2} = 1 / n^2$ .
We must deal with the possibility that $2\lg n$ is not an integer, however. Then the event that the $i$ th insertion requires more than $2\lg n$ probes is the same as the event that the $i$ th insertion requires more than $\lfloor 2\lg n \rfloor$ probes. Since $\lfloor 2\lg n \rfloor > 2\lg n - 1$ , we have that the probability of this event is at most $2^{-\lfloor 2\lg n \rfloor} < 2^{-(2\lg n - 1)} = 2 / n^2 = O(1 / n^2)$ .

c.

Let the event $A$ be $2\lg n$ , and for $\ldots, n$ , let the event $A_i$ be $X_i > 2\lg n$ . In part (b), we showed that $Pr\{A_i\} = O(1 / n^2)$ for $\ldots, n$ . From how we defined these events, $A_1 \cup A_2 \cup \cdots \cup A_n$ . Using Boole’s inequality, $\text{(C.19)}$ , we have
$\begin{aligned} \Pr\{A\} & \le \Pr\{A_1\} + \Pr\{A_1\} + \cdots + \Pr\{A_n\} \\ & \le n \cdot O(1 / n^2) \\ & = O(1 / n). \end{aligned}$

d.

We use the definition of expectation and break the sum into two parts:
$\begin{aligned} \text E[X] & = \sum_{k = 1}^n k \cdot \Pr\{X = k\} \\ & = \sum_{k = 1}^{\lceil 2\lg n \rceil} k \cdot \Pr\{X = k\} + \sum_{\lceil 2\lg n \rceil + 1}^n k \cdot \Pr\{X = k\} \\ & \le \sum_{k = 1}^{\lceil 2\lg n \rceil} \lceil 2\lg n \rceil \cdot \Pr\{X = k\} + \sum_{\lceil 2\lg n \rceil + 1}^n n \cdot \Pr\{X = k\} \\ & = \lceil 2\lg n \rceil \sum_{k = 1}^{\lceil 2\lg n \rceil} \Pr\{X = k\} + n \sum_{\lceil 2\lg n \rceil + 1}^n \Pr\{X = k\}. \end{aligned}$
Since $X$ takes on exactly one value, we have that $\sum_{k = 1}^{\lceil 2\lg n \rceil} \Pr\{X = k\} = \Pr\{X \le \lceil 2\lg n \rceil\} \le 1$ and $\sum_{k = \lceil 2\lg n \rceil + 1}^n \Pr\{X = k\} \le \Pr\{X > 2\lg n\} = O(1 / n)$ , by part ©. Therefore,
$\begin{aligned} \text E[X] & \le \lceil 2\lg n \rceil \cdot 1 + n \cdot O(1 / n) \\ & = \lceil 2\lg n \rceil + O(1) \\ & = O(\lg n). \end{aligned}$

11-2（链接法中槽大小的界）

a.

A particular key is hashed to a particular slot with probability $1 / n$ . Suppose we select a specific set of $k$ keys. The probability that these $k$ keys are inserted into the slot in question and that all other keys are inserted elsewhere is
$\Big(\frac{1}{n}\Big)^k \Big(1 - \frac{1}{n}\Big)^{n - k}.$
Since there are $\binom{n}{k}$ ways to choose our $k$ keys, we get
$Q_k = \Big(\frac{1}{n}\Big)^k \Big(1 - \frac{1}{n}\Big)^{n - k} \binom{n}{k}.$

b.

For $\ldots, n$ , let $X_i$ be a random variable denoting the number of keys that hash to slot $i$ , and let $A_i$ be the event that $X_i = k$ , i.e., that exactly $k$ keys hash to slot $i$ . From part (a), we have $Pr\{A\} = Q_k$ . Then,
$\begin{aligned} P_k & = \Pr\{M = k\} \\ & = \Pr\Big\{\Big(\max_{1 \le i \le n} X_i\Big) = k\Big\} \\ & = \Pr\{\text{there exists $i$ such that $X_i = k$ and that $X_i\le k$ for $i = 1, 2, \ldots, n$}\} \\ & \le \Pr\{\text{there exists $i$ such that $X_i = k$}\} \\ & = \Pr\{A_1 \cup A_2 \cup \cdots \cup A_n\} \\ & \le \Pr\{A_1\} + \Pr\{A_2\} + \cdots + \Pr\{A_n\} \qquad \text{(by inequality (C.19))} \\ & = nQ_k. \end{aligned}$

c.

We start by showing two facts. First, $1 - 1 / n < 1$ , which implies $1 - 1 / n)^{n - k} < 1$ . Second, $\cdot (n - 1) \cdot (n - 2) \cdots (n - k + 1) < n^k$ . Using these facts, along with the simplification $k! > (k / e)^k$ of equation $\text{(3.18)}$ , we have
$\begin{aligned} Q_k & = \Big(\frac{1}{n}\Big)^k \Big(1 - \frac{1}{n}\Big)^{n - k} \frac{n!}{k!(n - k)!} \\ & < \frac{n!}{n^k k! (n - k)!} & ((1 - 1 / n)^{n - k} < 1) \\ & < \frac{1}{k!} & (n! / (n - k)! < n^k) \\ & < \frac{e^k}{k^k}. & (k! > (k / e)^k) \end{aligned}$

d.

Notice that when $n = 2$ , $\lg\lg n = 0$ , so to be precise, we need to assume that $\ge 3$ .
In part ©, we showed that $Q_k < e^k / k^k$ for any $k$ ; in particular, this inequality holds for $k_0$ . Thus, it suffices to show that $e^{k_0} / k_0^{k_0} < 1 / n^3$ or, equivalently, that $n^3 < k_0^{k_0} / e^{k_0}$ .
Taking logarithms of both sides gives an equivalent condition:
$\begin{aligned} 3\lg n & < k_0(\lg k_0 - \lg e) \\ & = \frac{c\lg n}{\lg\lg n}(\lg c + \lg\lg n - \lg\lg\lg n - \lg e). \end{aligned}$
Dividing both sides by $\lg n$ gives the condition
$\begin{aligned} 3 & < \frac{c}{\lg\lg n} (\lg c + \lg\lg n - \lg\lg\lg n - \lg e) \\ & = c \Big(1 + \frac{\lg c - \lg e}{\lg\lg n} - \frac{\lg\lg\lg n}{\lg\lg n}\Big). \end{aligned}$
Let $x$ be the last expression in parentheses:
$\Big(1 + \frac{\lg c - \lg e}{\lg\lg n} - \frac{\lg\lg\lg n}{\lg\lg n}\Big).$
We need to show that there exists a constant $c > 1$ such that $3 < c x$ .
Noting that $\lim_{n \to \infty} x = 1$ , we see that there exists $n_0$ such that $\ge 1 / 2$ for all $\ge n_0$ . Thus, any constant $c > 6$ works for $\ge n_0$ .
We handle smaller values of $n$ —in particular, $\le n < n_0$ —as follows. Since $n$ is constrained to be an integer, there are a finite number of n in the range $\le n < n_0$ . We can evaluate the expression $x$ for each such value of $n$ and determine a value of $c$ for which $3 < c x$ for all values of $n$ . The final value of $c$ that we use is the larger of

$6$ , which works for all $\ge n_0$ , and
$\max_{3 \le n \le n_0}\{c: 3 < cx\}$ , i.e., the largest value of $c$ that we chose for the range $\le n < n_0$ .

Thus, we have shown that $Q_{k_0} < 1 / n^3$ , as desired.
To see that $P_k < 1 / n^2$ for $\ge k_0$ , we observe that by part (b), $P_k \le nQ_k$ for all $k$ . Choosing $k = k_0$ gives $P_{k_0} \le nQ_{k_0} < n \cdot (1 / n^3) = 1 / n^2$ . For $k > k_0$ , we will show that we can pick the constant $c$ such that $Q_k < 1 / n^3$ for all $\ge k_0$ , and thus conclude that $P_k < 1 / n^2$ for all $\ge k_0$ .
To pick $c$ as required, we let $c$ be large enough that $k_0 > 3 > e$ . Then $e / k < 1$ for all $\ge k_0$ , and so $e^k / k^k$ decreases as $k$ increases. Thus,
$\begin{aligned} Q_k & < e^k / k^k \\ & \le e^{k_0} / k^{k_0} \\ & < 1 / n^3 \end{aligned}$
for $\ge k_0$ .

e.

The expectation of $M$ is
$\begin{aligned} \text E[M] & = \sum_{k = 0}^n k \cdot \Pr\{M = k\} \\ & = \sum_{k = 0}^{k_0} k \cdot \Pr\{M = k\} + \sum_{k = k_0 + 1}^n k \cdot \Pr\{M = k\} \\ & \le \sum_{k = 0}^{k_0} k_0 \cdot \Pr\{M = k\} + \sum_{k = k_0 + 1}^n n \cdot \Pr\{M = k\} \\ & \le k_0 \sum_{k = 0}^{k_0} \Pr\{M = k\} + n \sum_{k = k_0 + 1}^n \Pr\{M = k\} \\ & = k_0 \cdot \Pr\{M \le k_0\} + n \cdot \Pr\{M > k_0\}, \end{aligned}$
which is what we needed to show, since $k_0 = c \lg n / \lg\lg n$ .
To show that $\text E[M] = O(\lg n / \lg\lg n)$ , note that $\Pr\{M \le k_0\} \le 1$ and
$\begin{aligned} \Pr\{M > k_0\} & = \sum_{k = k_0 + 1}^n \Pr\{M = k\} \\ & = \sum_{k = k_0 + 1}^n P_k \\ & < \sum_{k = k_0 + 1}^n 1 / n^2 & \text{(by part (d))} \\ & < n \cdot (1 / n^2) \\ & = 1 / n. \end{aligned}$
We conclude that
$\begin{aligned} \text E[M] & \le k_0 \cdot 1 + n \cdot (1 / n) \\ & = k_0 + 1 \\ & = O(\lg n / \lg\lg n). \end{aligned}$

11-3（二次探查）

a.

From how the probe-sequence computation is specified, it is easy to see that the probe sequence is
$\langle h(k), h(k) + 1, h(k) + 1 + 2, h(k) + 1 + 2 + 3, \ldots, h(k) + 1 + 2 + 3 + \cdots + i, \ldots \rangle,$
where all arithmetic is modulo $m$ . Starting the probe numbers from $0$ , the $i$ th probe is offset (modulo $m$ ) from $h (k)$ by
$\sum_{j = 0}^i j = \frac{i(i + 1)}{2} = \frac{1}{2}i^2 + \frac{1}{2}i.$
Thus, we can write the probe sequence as
$\Big(h(k) + \frac{1}{2} i + \frac{1}{2} i^2 \Big) \mod m,$
which demonstrates that this scheme is a special case of quadratic probing.

b.

Let $h^{'} (k, i)$ denote the ith probe of our scheme. We saw in part (a) that $\mod m$ . To show that our algorithm examines every table position in the worst case, we show that for a given key, each of the first $m$ probes hashes to a distinct value. That is, for any key $k$ and for any probe numbers $i$ and $j$ such that $\le i < j < m$ , we have $\ne h'(k, j)$ . We do so by showing that $h^{'} (k, i) = h^{'} (k, j)$ yields a contradiction.
Let us assume that there exists a key $k$ and probe numbers $i$ and $j$ satsifying $\le i < j < m$ for which $h^{'} (k, i) = h^{'} (k, j)$ . Then
$\mod m,$
which in turn implies that
$\mod m,$
or
$\mod m.$
Since $j (j + 1) / 2 - i (i + 1) / 2 = (j - i) (j + i + 1) / 2$ , we have
$\mod m.$
The factors $j - i$ and $j + i + 1$ must have different parities, i.e., $j - i$ is even if and only if $j + i + 1$ is odd. (Work out the various cases in which $i$ and $j$ are even and odd.) Since $\mod m$ , we have $(j - i) (j + i + 1) / 2 = r m$ for some integer $r$ or, equivalently, $\cdot 2m$ . Using the assumption that $m$ is a power of $2$ , let $m = 2^p$ for some nonnegative integer $p$ , so that now we have $\cdot 2^{p + 1}$ . Because exactly one of the factors $j - i$ and $j + i + 1$ is even, $2^{p + 1}$ must divide one of the factors. It cannot be $j - i$ , since $j - i < m < 2^{p + 1}$ . But it also cannot be $j + i + 1$ , since $\le (m - 1) + (m - 2) + 1 = 2m - 2 < 2^{p + 1}$ . Thus we have derived the contradiction that $2^{p + 1}$ divides neither of the factors $j - i$ and $j + i + 1$ . We conclude that $\ne h'(k, j)$ .

11-4（散列和认证）

a.

The number of hash functions for which $h (k) = h (l)$ is $\frac{m}{m^2}|\mathcal H| = \frac{1}{m}|\mathcal H|$ , therefore the family is universal.

b.

For $\langle 0, 0, \ldots, 0 \rangle$ , $\mathcal H$ could not be $2$ -universal.

c.

Let $\in U$ be fixed, distinct $n$ -tuples. As $a_i$ and $b$ range over $\mathbb Z_p, h'_{ab}(x)$ is equally likely to achieve every value from $1$ to $p$ since for any sequence $a$ , we can let $b$ vary from $1$ to $p - 1$ .
Thus, $\langle h'_{ab}(x), h'_{ab}(y) \rangle$ is equally likely to be any of the $p^2$ sequences, so $\mathcal H$ is $2$ -universal.

d.

Since $\mathcal H$ is $2$ -universal, every pair of $\langle t, t' \rangle$ is equally likely to appear, thus $t^{'}$ could be any value from $\mathbb Z_p$ . Even the adversary knows $\mathcal H$ , since $\mathcal H$ is $2$ -universal, then $\mathcal H$ is universal, the probability of choosing a hash function that $h (k) = h (l)$ is at most $1 / p$ , therefore the probability is at most $1 / p$ .

你可能感兴趣的:(算法)

机器学习与深度学习间关系与区别 ℒℴѵℯ心·动ꦿ໊ོ꫞ 人工智能学习深度学习 python
一、机器学习概述定义机器学习（MachineLearning,ML）是一种通过数据驱动的方法，利用统计学和计算算法来训练模型，使计算机能够从数据中学习并自动进行预测或决策。机器学习通过分析大量数据样本，识别其中的模式和规律，从而对新的数据进行判断。其核心在于通过训练过程，让模型不断优化和提升其预测准确性。主要类型1.监督学习（SupervisedLearning）监督学习是指在训练数据集中包含输入
Goolge earth studio 进阶4——路径修改与平滑陟彼高冈yu Google earth studio 进阶教程旅游
如果我们希望在大约中途时获得更多的城市鸟瞰视角。可以将相机拖动到这里并创建一个新的关键帧。camera_target_clip_7EarthStudio会自动平滑我们的路径，所以当我们通过这个关键帧时，不是一个生硬的角度，而是一个平滑的曲线。camera_target_clip_8路径上有贝塞尔控制手柄，允许我们调整路径的形状。右键单击，我们可以选择“平滑路径”，这是默认的自动平滑算法，或者我们可
基于社交网络算法优化的二维最大熵图像分割智能算法研学社（Jack旭）智能优化算法应用图像分割算法 php 开发语言
智能优化算法应用：基于社交网络优化的二维最大熵图像阈值分割-附代码文章目录智能优化算法应用：基于社交网络优化的二维最大熵图像阈值分割-附代码1.前言2.二维最大熵阈值分割原理3.基于社交网络优化的多阈值分割4.算法结果：5.参考文献：6.Matlab代码摘要：本文介绍基于最大熵的图像分割，并且应用社交网络算法进行阈值寻优。1.前言阅读此文章前，请阅读《图像分割：直方图区域划分及信息统计介绍》htt
121. 买卖股票的最佳时机薄荷糖的味道_fb40
给定一个数组，它的第i个元素是一支给定股票第i天的价格。如果你最多只允许完成一笔交易（即买入和卖出一支股票），设计一个算法来计算你所能获取的最大利润。注意你不能在买入股票前卖出股票。示例1:输入:[7,1,5,3,6,4]输出:5解释:在第2天（股票价格=1）的时候买入，在第5天（股票价格=6）的时候卖出，最大利润=6-1=5。注意利润不能是7-1=6,因为卖出价格需要大于买入价格。示例2:输入:
每日算法&面试题，大厂特训二十八天——第二十天（树）肥学 ⚡算法题⚡面试题每日精进 java 算法数据结构
目录标题导读算法特训二十八天面试题点击直接资料领取导读肥友们为了更好的去帮助新同学适应算法和面试题，最近我们开始进行专项突击一步一步来。上一期我们完成了动态规划二十一天现在我们进行下一项对各类算法进行二十八天的一个小总结。还在等什么快来一起肥学进行二十八天挑战吧！！特别介绍小白练手专栏，适合刚入手的新人欢迎订阅编程小白进阶python有趣练手项目里面包括了像《机器人尬聊》《恶搞程序》这样的有趣文章
回溯算法-重新安排行程 chirou_ 算法数据结构图论 c++图搜索
leetcode332.重新安排行程这题我还没自己ac过，只能现在凭着刚学完的热乎劲把我对题解的理解记下来。本题我认为对数据结构的考察比较多，用什么数据结构去存数据，去读取数据，都是很重要的。classSolution{private:unordered_map>targets;boolbacktracking(intticketNum,vector&result){//1.确定参数和返回值//2
Faiss：高效相似性搜索与聚类的利器网络·魚大数据 faiss
Faiss是一个针对大规模向量集合的相似性搜索库，由FacebookAIResearch开发。它提供了一系列高效的算法和数据结构，用于加速向量之间的相似性搜索，特别是在大规模数据集上。本文将介绍Faiss的原理、核心功能以及如何在实际项目中使用它。Faiss原理：近似最近邻搜索：Faiss的核心功能之一是近似最近邻搜索，它能够高效地在大规模数据集中找到与给定查询向量最相似的向量。这种搜索是近似的，
insert into select 主键自增_mybatis拦截器实现主键自动生成 weixin_39521651 insert into select 主键自增 mybatis delete返回值 mybatis insert返回主键 mybatis insert返回对象 mybatis plus insert返回主键 mybatis plus 插入生成id
前言前阵子和朋友聊天，他说他们项目有个需求，要实现主键自动生成，不想每次新增的时候，都手动设置主键。于是我就问他，那你们数据库表设置主键自动递增不就得了。他的回答是他们项目目前的id都是采用雪花算法来生成，因此为了项目稳定性，不会切换id的生成方式。朋友问我有没有什么实现思路，他们公司的orm框架是mybatis，我就建议他说，不然让你老大把mybatis切换成mybatis-plus。mybat
k均值聚类算法考试例题_k均值算法(k均值聚类算法计算题) 寻找你83497 k均值聚类算法考试例题
?算法：第一步：选K个初始聚类中心，z1(1),z2(1)，…，zK(1)，其中括号内的序号为寻找聚类中心的迭代运算的次序号。聚类中心的向量值可任意设定，例如可选开始的K个.k均值聚类：---------一种硬聚类算法，隶属度只有两个取值0或1，提出的基本根据是“类内误差平方和最小化”准则；模糊的c均值聚类算法：--------一种模糊聚类算法，是.K均值聚类算法是先随机选取K个对象作为初始的聚类
Python实现简单的机器学习算法 master_chenchengg python python 办公效率 python开发 IT
Python实现简单的机器学习算法开篇：初探机器学习的奇妙之旅搭建环境：一切从安装开始必备工具箱第一步：安装Anaconda和JupyterNotebook小贴士：如何配置Python环境变量算法初体验：从零开始的Python机器学习线性回归：让数据说话数据准备：从哪里找数据编码实战：Python实现线性回归模型评估：如何判断模型好坏逻辑回归：从分类开始理论入门：什么是逻辑回归代码实现：使用skl
推荐算法_隐语义-梯度下降 _feivirus_ 算法机器学习和数学推荐算法机器学习隐语义
importnumpyasnp1.模型实现"""inputrate_matrix:M行N列的评分矩阵，值为P*Q.P:初始化用户特征矩阵M*K.Q:初始化物品特征矩阵K*N.latent_feature_cnt:隐特征的向量个数max_iteration:最大迭代次数alpha:步长lamda:正则化系数output分解之后的P和Q"""defLFM_grad_desc(rate_matrix,l
K近邻算法_分类鸢尾花数据集 _feivirus_ 算法机器学习和数学分类机器学习 K近邻
importnumpyasnpimportpandasaspdfromsklearn.datasetsimportload_irisfromsklearn.model_selectionimporttrain_test_splitfromsklearn.metricsimportaccuracy_score1.数据预处理iris=load_iris()df=pd.DataFrame(data=ir
数据结构 | 栈和队列 TT-Kun 数据结构与算法数据结构栈队列 C语言
文章目录栈和队列1.栈：后进先出（LIFO）的数据结构1.1概念与结构1.2栈的实现2.队列：先进先出（FIFO）的数据结构2.1概念与结构2.2队列的实现3.栈和队列算法题3.1有效的括号3.2用队列实现栈3.3用栈实现队列3.4设计循环队列结论栈和队列在计算机科学中，栈和队列是两种基本且重要的数据结构，它们在处理数据存储和访问顺序方面有着独特的规则和应用。本文将详细介绍栈和队列的概念、结构、实
[Python] 数据结构详解及代码 AIAdvocate 算法 python 数据结构链表
今日内容大纲介绍数据结构介绍列表链表1.数据结构和算法简介程序大白话翻译,程序=数据结构+算法数据结构指的是存储,组织数据的方式.算法指的是为了解决实际业务问题而思考思路和方法,就叫:算法.2.算法的5大特性介绍算法具有独立性算法是解决问题的思路和方式,最重要的是思维,而不是语言,其(算法)可以通过多种语言进行演绎.5大特性有输入,需要传入1或者多个参数有输出,需要返回1个或者多个结果有穷性,执行
Python算法L5：贪心算法小熊同学哦 Python算法算法 python 贪心算法
Python贪心算法简介目录Python贪心算法简介贪心算法的基本步骤贪心算法的适用场景经典贪心算法问题1.**零钱兑换问题**2.**区间调度问题**3.**背包问题**贪心算法的优缺点优点：缺点：结语贪心算法（GreedyAlgorithm）是一种在每一步选择中都采取当前最优或最优解的算法。它的核心思想是，在保证每一步局部最优的情况下，希望通过贪心选择达到全局最优解。虽然贪心算法并不总能得到全
【RabbitMQ 项目】服务端：数据管理模块之绑定管理月夜星辉雪 rabbitmq 分布式
文章目录一.编写思路二.代码实践一.编写思路定义绑定信息类交换机名称队列名称绑定关键字：交换机的路由交换算法中会用到没有是否持久化的标志，因为绑定是否持久化取决于交换机和队列是否持久化，只有它们都持久化时绑定才需要持久化。绑定就好像一根绳子，两端连接着交换机和队列，当一方不存在，它就没有存在的必要了定义绑定持久化类构造函数：如果数据库文件不存在则创建，打开数据库，创建binding_table插入
非对称加密算法原理与应用2——RSA私钥加密文件私语茶馆云部署与开发架构及产品灵感记录 RSA2048 私钥加密
作者：私语茶馆1.相关章节（1）非对称加密算法原理与应用1——秘钥的生成-CSDN博客第一章节讲述的是创建秘钥对，并将公钥和私钥导出为文件格式存储。本章节继续讲如何利用私钥加密内容，包括从密钥库或文件中读取私钥，并用RSA算法加密文件和String。2.私钥加密的概述本文主要基于第一章节的RSA2048bit的非对称加密算法讲述如何利用私钥加密文件。这种加密后的文件，只能由该私钥对应的公钥来解密。
粒子群优化 (PSO) 在三维正弦波函数中的应用 subject625Ruben 机器学习人工智能 matlab 算法
在这篇博客中，我们将展示如何使用粒子群优化（PSO）算法求解三维正弦波函数，并通过增加正弦波扰动，使优化过程更加复杂和有趣。本文将介绍目标函数的定义、PSO参数设置以及算法执行的详细过程，并展示搜索空间中的动态过程和收敛曲线。1.目标函数定义我们使用的目标函数是一个三维正弦波函数，定义如下：objectiveFunc=@(x)sin(sqrt(x(1).^2+x(2).^2))+0.5*sin(5
非对称加密算法————RSA理论及详情 hu19930613
转自：https://www.kancloud.cn/kancloud/rsa_algorithm/48484一、一点历史1976年以前，所有的加密方法都是同一种模式：（1）甲方选择某一种加密规则，对信息进行加密；（2）乙方使用同一种规则，对信息进行解密。由于加密和解密使用同样规则（简称"密钥"），这被称为"对称加密算法"（Symmetric-keyalgorithm）。这种加密模式有一个最大弱点
ai绘画工具midjourney怎么下载？附作品管理教程设计师早上好
Midjourney是一款功能强大的AI绘画工具，它使用机器学习技术和深度神经网络等算法，可以生成各种艺术风格的绘画作品。在创意设计、广告宣传等方面有着广泛的应用前景。那么，ai绘画工具midjourney怎么下载？本文将为您介绍Midjourney的下载以及作品的相关管理。一、Midjourney下载Midjourney的下载非常简单，只需打开Midjourney官网（点击“GetMidjour
【加密算法基础——对称加密和非对称加密】 XWWW668899 网络安全服务器笔记
对称加密与非对称加密对称加密和非对称加密是两种基本的加密方法，各自有不同的特点和用途。以下是详细比较：1.对称加密特点密钥:使用相同的密钥进行加密和解密。发送方和接收方必须共享这个密钥。速度:通常速度较快，适合处理大量数据。实现:算法相对简单，计算效率高。常见算法AES(高级加密标准)DES(数据加密标准)3DES(三重数据加密标准)RC4(流密码)应用场景文件加密磁盘加密传输大量数据时的加密2.
【算法练习】IDEA集成leetcode插件实现快速刷 2401_84102892 2024年程序员学习算法 intellij-idea leetcode
============点击右侧边leetcode->设置->配置地址、用户名、密码、存放目录、文件模板用户名要登录后在账号信息里看模板代码1.codefilename!velocityTool.camelC
【加密算法基础——RSA 加密】 XWWW668899 网络服务器笔记 python
RSA加密RSA（Rivest-Shamir-Adleman）加密是非对称加密，一种广泛使用的公钥加密算法，主要用于安全数据传输。公钥用于加密，私钥用于解密。RSA加密算法的名称来源于其三位发明者的姓氏：R:RonRivestS:AdiShamirA:LeonardAdleman这三位计算机科学家在1977年共同提出了这一算法，并发表了相关论文。他们的工作为公钥加密的基础奠定了重要基础，使得安全通
机器学习-聚类算法不良人龍木木机器学习机器学习算法聚类
机器学习-聚类算法1.AHC2.K-means3.SC4.MCL仅个人笔记，感谢点赞关注！1.AHC2.K-means3.SC传统谱聚类：个人对谱聚类算法的理解以及改进4.MCL目前仅专注于NLP的技术学习和分享感谢大家的关注与支持！
生成式地图制图 Bwywb_3 深度学习机器学习深度学习生成对抗网络
生成式地图制图（GenerativeCartography）是一种利用生成式算法和人工智能技术自动创建地图的技术。它结合了传统的地理信息系统（GIS）技术与现代生成模型（如深度学习、GANs等），能够根据输入的数据自动生成符合需求的地图。这种方法在城市规划、虚拟环境设计、游戏开发等多个领域具有应用前景。主要特点：自动化生成：通过算法和模型，系统能够根据输入的地理或空间数据自动生成地图，而无需人工逐
高性能javascript--算法和流程控制海淀萌狗
-for,while和do-while性能相当-避免使用for-in循环，==除非遍历一个属性量未知的对象==es5:for-in遍历的对象便不局限于数组，还可以遍历对象。原因：for-in每次迭代操作会同时搜索实例或者原型属性，for-in循环的每次迭代都会产生更多开销，因此要比其他循环类型慢，一般速度为其他类型循环的1/7。因此，除非明确需要迭代一个属性数量未知的对象，否则应避免使用for-i
深度 Qlearning：在直播推荐系统中的应用 AGI通用人工智能之禅程序员提升自我硅基计算碳基计算认知计算生物计算深度学习神经网络大数据 AIGC AGI LLM Java Python 架构设计 Agent 程序员实现财富自由
深度Q-learning：在直播推荐系统中的应用关键词：深度Q-learning,强化学习,直播推荐系统,个性化推荐1.背景介绍1.1问题的由来随着互联网技术的飞速发展,直播平台如雨后春笋般涌现。面对海量的直播内容,用户很难快速找到自己感兴趣的内容。因此,个性化推荐系统在直播平台中扮演着越来越重要的角色。1.2研究现状目前,主流的个性化推荐算法包括协同过滤、基于内容的推荐等。这些方法在一定程度上缓
JVM源码分析之堆外内存完全解读 HeapDump性能社区
概述广义的堆外内存说到堆外内存，那大家肯定想到堆内内存，这也是我们大家接触最多的，我们在jvm参数里通常设置-Xmx来指定我们的堆的最大值，不过这还不是我们理解的Java堆，-Xmx的值是新生代和老生代的和的最大值，我们在jvm参数里通常还会加一个参数-XX:MaxPermSize来指定持久代的最大值，那么我们认识的Java堆的最大值其实是-Xmx和-XX:MaxPermSize的总和，在分代算法
《算法》四学习——1.1节进阶的Farmer 算法算法笔记
前言买了一本算法4，每天看一点，对每个小结来个学习总结，输出驱动输入。本篇笔记针对第一章基础1.1基础编程模型1.1节总结了相关的语法、语言特性和书中将会用到的库。笔记自己在编码中容易遗漏的点&&优先级比||高在开发中习惯了加括号，所以没注意到这点，教材上也有但是忘记了二分查找中计算mid=left+(right-left)/2这样计算可以有效避免(left+right)/2溢出答疑java无穷大
排序路小白同学
1.冒泡排序冒泡算法是一种基础的排序算法，这种算法会重复的比较数组中相邻的两个元素。如果一个元素比另一个元素大（小），那么就交换这两个元素的位置。重复这一比较直至最后一个元素。这一比较会重复n-1趟，每一趟比较n-j次，j是已经排序好的元素个数。每一趟比较都能找出未排序元素中最大或者最小的那个数字。这就如同水泡从水底逐个飘到水面一样。冒泡排序是一种时间复杂度较高，效率较低的排序方法。其空间复杂度是
分享100个最新免费的高匿HTTP代理IP mcj8089 代理IP 代理服务器匿名代理免费代理IP 最新代理IP
推荐两个代理IP网站： 1. 全网代理IP：http://proxy.goubanjia.com/ 2. 敲代码免费IP：http://ip.qiaodm.com/ 120.198.243.130:80,中国/广东省 58.251.78.71:8088,中国/广东省 183.207.228.22:83,中国/
mysql高级特性之数据分区 annan211 java 数据结构 mongodb 分区 mysql
mysql高级特性 1 以存储引擎的角度分析，分区表和物理表没有区别。是按照一定的规则将数据分别存储的逻辑设计。器底层是由多个物理字表组成。 2 分区的原理分区表由多个相关的底层表实现，这些底层表也是由句柄对象表示，所以我们可以直接访问各个分区。存储引擎管理分区的各个底层表和管理普通表一样(所有底层表都必须使用相同的存储引擎)，分区表的索引只是
JS采用正则表达式简单获取URL地址栏参数 chiangfai js 地址栏参数获取
GetUrlParam:function GetUrlParam(param){ var reg = new RegExp("(^|&)"+ param +"=([^&]*)(&|$)"); var r = window.location.search.substr(1).match(reg); if(r!=null
怎样将数据表拷贝到powerdesigner (本地数据库表) Array_06 powerDesigner
================================================== 1、打开PowerDesigner12，在菜单中按照如下方式进行操作 file->Reverse Engineer->DataBase 点击后，弹出 New Physical Data Model 的对话框 2、在General选项卡中 Model name:模板名字，自
logbackのhelloworld 飞翔的马甲日志 logback
一、概述 1.日志是啥？当我是个逗比的时候我是这么理解的：log.debug()代替了system.out.print(); 当我项目工作时，以为是一堆得.log文件。这两天项目发布新版本，比较轻松，决定好好地研究下日志以及logback。传送门1：日志的作用与方法： http://www.infoq.com/cn/articles/why-and-how-log 上面的作
新浪微博爬虫模拟登陆随意而生新浪微博
转载自：http://hi.baidu.com/erliang20088/item/251db4b040b8ce58ba0e1235 近来由于毕设需要，重新修改了新浪微博爬虫废了不少劲，希望下边的总结能够帮助后来的同学们。现行版的模拟登陆与以前相比，最大的改动在于cookie获取时候的模拟url的请求
synchronized 香水浓 java thread
Java语言的关键字，可用来给对象和方法或者代码块加锁，当它锁定一个方法或者一个代码块的时候，同一时刻最多只有一个线程执行这段代码。当两个并发线程访问同一个对象object中的这个加锁同步代码块时，一个时间内只能有一个线程得到执行。另一个线程必须等待当前线程执行完这个代码块以后才能执行该代码块。然而，当一个线程访问object的一个加锁代码块时，另一个线程仍然
maven 简单实用教程 AdyZhang maven
1. Maven介绍 1.1. 简介 java编写的用于构建系统的自动化工具。目前版本是2.0.9，注意maven2和maven1有很大区别，阅读第三方文档时需要区分版本。 1.2. Maven资源见官方网站；The 5 minute test，官方简易入门文档；Getting Started Tutorial，官方入门文档；Build Coo
Android 通过 intent传值获得null aijuans android
我在通过intent 获得传递兑现过的时候报错，空指针,我是getMap方法进行传值，代码如下 1 2 3 4 5 6 7 8 9 public void getMap(View view){ Intent i =
apache 做代理报如下错误：The proxy server received an invalid response from an upstream baalwolf response
网站配置是apache＋tomcat,tomcat没有报错，apache报错是： The proxy server received an invalid response from an upstream server. The proxy server could not handle the request GET /. Reason: Error reading fr
Tomcat6 内存和线程配置 BigBird2012 tomcat6
1、修改启动时内存参数、并指定JVM时区（在windows server 2008 下时间少了8个小时）在Tomcat上运行j2ee项目代码时，经常会出现内存溢出的情况，解决办法是在系统参数中增加系统参数： window下，在catalina.bat最前面 set JAVA_OPTS=-XX:PermSize=64M -XX:MaxPermSize=128m -Xms5
Karam与TDD bijian1013 Karam TDD
一.TDD 测试驱动开发（Test-Driven Development,TDD）是一种敏捷（AGILE）开发方法论，它把开发流程倒转了过来，在进行代码实现之前，首先保证编写测试用例，从而用测试来驱动开发（而不是把测试作为一项验证工具来使用）。 TDD的原则很简单： a.只有当某个
[Zookeeper学习笔记之七]Zookeeper源代码分析之Zookeeper.States bit1129 zookeeper
public enum States { CONNECTING, //Zookeeper服务器不可用，客户端处于尝试链接状态 ASSOCIATING, //？？？ CONNECTED, //链接建立，可以与Zookeeper服务器正常通信 CONNECTEDREADONLY, //处于只读状态的链接状态，只读模式可以在
【Scala十四】Scala核心八：闭包 bit1129 scala
Free variable A free variable of an expression is a variable that’s used inside the expression but not defined inside the expression. For instance, in the function literal expression (x: Int) => (x
android发送json并解析返回json ronin47 android
package com.http.test; import org.apache.http.HttpResponse; import org.apache.http.HttpStatus; import org.apache.http.client.HttpClient; import org.apache.http.client.methods.HttpGet; import
一份IT实习生的总结 brotherlamp PHP php资料 php教程 php培训 php视频
今天突然发现在不知不觉中自己已经实习了 3 个月了，现在可能不算是真正意义上的实习吧，因为现在自己才大三，在这边撸代码的同时还要考虑到学校的功课跟期末考试。让我震惊的是，我完全想不到在这 3 个月里我到底学到了什么，这是一件多么悲催的事情啊。同时我对我应该 get 到什么新技能也很迷茫。所以今晚还是总结下把，让自己在接下来的实习生活有更加明确的方向。最后感谢工作室给我们几个人这个机会让我们提前出来
据说是2012年10月人人网校招的一道笔试题-给出一个重物重量为X,另外提供的小砝码重量分别为1，3，9。。。3^N。将重物放到天平左侧，问在两边如何添加砝码 bylijinnan java
public class ScalesBalance { /** * 题目： * 给出一个重物重量为X,另外提供的小砝码重量分别为1，3，9。。。3^N。（假设N无限大，但一种重量的砝码只有一个） * 将重物放到天平左侧，问在两边如何添加砝码使两边平衡 * * 分析： * 三进制 * 我们约定括号表示里面的数是三进制，例如 47=(1202
dom4j最常用最简单的方法 chiangfai dom4j
要使用dom4j读写XML文档,需要先下载dom4j包,dom4j官方网站在 http://www.dom4j.org/目前最新dom4j包下载地址:http://nchc.dl.sourceforge.net/sourceforge/dom4j/dom4j-1.6.1.zip 解开后有两个包,仅操作XML文档的话把dom4j-1.6.1.jar加入工程就可以了,如果需要使用XPath的话还需要
简单HBase笔记 chenchao051 hbase
一、Client-side write buffer 客户端缓存请求描述：可以缓存客户端的请求，以此来减少RPC的次数，但是缓存只是被存在一个ArrayList中，所以多线程访问时不安全的。可以使用getWriteBuffer()方法来取得客户端缓存中的数据。默认关闭。二、Scan的Caching 描述： next( )方法请求一行就要使用一次RPC,即使
mysqldump导出时出现when doing LOCK TABLES daizj mysql mysqdump 导数据
　　执行　mysqldump -uxxx -pxxx -hxxx -Pxxxx database tablename > tablename.sql　导出表时，会报 mysqldump: Got error: 1044: Access denied for user 'xxx'@'xxx' to database 'xxx' when doing LOCK TABLES 解决
CSS渲染原理 dcj3sjt126com Web
从事Web前端开发的人都与CSS打交道很多，有的人也许不知道css是怎么去工作的，写出来的css浏览器是怎么样去解析的呢？当这个成为我们提高css水平的一个瓶颈时，是否应该多了解一下呢？一、浏览器的发展与CSS
《阿甘正传》台词 dcj3sjt126com
Part Ⅰ: 《阿甘正传》Forrest Gump经典中英文对白 Forrest: Hello! My names Forrest. Forrest Gump. You wanna Chocolate? I could eat about a million and a half othese. My momma always said life was like a box ochocol
Java处理JSON dyy_gusi json
Json在数据传输中很好用，原因是JSON 比 XML 更小、更快，更易解析。在Java程序中，如何使用处理JSON，现在有很多工具可以处理，比较流行常用的是google的gson和alibaba的fastjson，具体使用如下： 1、读取json然后处理 class ReadJSON { public static void main(String[] args)
win7下nginx和php的配置 geeksun nginx
1. 安装包准备 nginx : 从nginx.org下载nginx-1.8.0.zip php：从php.net下载php-5.6.10-Win32-VC11-x64.zip， php是免安装文件。 RunHiddenConsole: 用于隐藏命令行窗口 2. 配置 # java用8080端口做应用服务器，nginx反向代理到这个端口即可 p
基于2.8版本redis配置文件中文解释 hongtoushizi redis
转载自： http://wangwei007.blog.51cto.com/68019/1548167 在Redis中直接启动redis-server服务时, 采用的是默认的配置文件。采用redis-server xxx.conf 这样的方式可以按照指定的配置文件来运行Redis服务。下面是Redis2.8.9的配置文
第五章常用Lua开发库3-模板渲染 jinnianshilongnian nginx lua
动态web网页开发是Web开发中一个常见的场景，比如像京东商品详情页，其页面逻辑是非常复杂的，需要使用模板技术来实现。而Lua中也有许多模板引擎，如目前我在使用的lua-resty-template，可以渲染很复杂的页面，借助LuaJIT其性能也是可以接受的。如果学习过JavaEE中的servlet和JSP的话，应该知道JSP模板最终会被翻译成Servlet来执行；而lua-r
JZSearch大数据搜索引擎颠覆者 JavaScript
系统简介：大数据的特点有四个层面：第一，数据体量巨大。从TB级别，跃升到PB级别；第二，数据类型繁多。网络日志、视频、图片、地理位置信息等等。第三，价值密度低。以视频为例，连续不间断监控过程中，可能有用的数据仅仅有一两秒。第四，处理速度快。最后这一点也是和传统的数据挖掘技术有着本质的不同。业界将其归纳为4个“V”——Volume，Variety，Value，Velocity。大数据搜索引
10招让你成为杰出的Java程序员 pda158 java 编程框架
如果你是一个热衷于技术的 Java 程序员，那么下面的 10 个要点可以让你在众多 Java 开发人员中脱颖而出。　　 1. 拥有扎实的基础和深刻理解 OO 原则　　对于 Java 程序员，深刻理解 Object Oriented Programming（面向对象编程）这一概念是必须的。没有 OOPS 的坚实基础，就领会不了像 Java 这些面向对象编程语言
tomcat之oracle连接池配置小网客 oracle
tomcat版本7.0 配置oracle连接池方式：修改tomcat的server.xml配置文件： <GlobalNamingResources> <Resource name="utermdatasource" auth="Container" type="javax.sql.DataSou
Oracle 分页算法汇总 vipbooks oracle sql 算法 .net
这是我找到的一些关于Oracle分页的算法，大家那里还有没有其他好的算法没？我们大家一起分享一下！ -- Oracle 分页算法一 select * from ( select page.*,rownum rn from (select * from help) page -- 20 = (currentPag

《算法导论》第三版第11章 散列表 练习&思考题 个人答案