Design a search autocomplete system for a search engine. Users may input a sentence (at least one word and end with a special character ‘#’). For each character they type except ‘#’, you need to return the top 3 historical hot sentences that have prefix the same as the part of sentence already typed. Here are the specific rules:
Your job is to implement the following functions:
The constructor function:
AutocompleteSystem(String[] sentences, int[] times):
This is the constructor. The input is historical data. Sentences is a string array consists of previously typed sentences. Times is the corresponding times a sentence has been typed. Your system should record these historical data.
Now, the user wants to input a new sentence. The following function will provide the next character the user types:
List
The input c is the next character typed by the user. The character will only be lower-case letters (‘a’ to ‘z’), blank space (’ ‘) or a special character (’#’). Also, the previously typed sentence should be recorded in your system. The output will be the top 3 historical hot sentences that have prefix the same as the part of sentence already typed.
Example:
Operation: AutocompleteSystem([“i love you”, “island”,“ironman”, “i love leetcode”], [5,3,2,2])
The system have already tracked down the following sentences and their corresponding times:
"i love you"
: 5
times
"island"
: 3
times
"ironman"
: 2
times
"i love leetcode"
: 2
times
Now, the user begins another search:
Operation: input(‘i’)
Output: [“i love you”, “island”,“i love leetcode”]
Explanation:
There are four sentences that have prefix "i".
Among them, “ironman” and “i love leetcode” have same hot degree. Since ’ ’ has ASCII code 32 and ‘r’ has ASCII code 114, “i love leetcode” should be in front of “ironman”. Also we only need to output top 3 hot sentences, so “ironman” will be ignored.
Operation: input(’ ')
Output: [“i love you”,“i love leetcode”]
Explanation:
There are only two sentences that have prefix "i "
.
Operation: input(‘a’)
Output: []
Explanation:
There are no sentences that have prefix "i a"
.
Operation: input(’#’)
Output: []
Explanation:
The user finished the input, the sentence "i a"
should be saved as a historical sentence in system. And the following input will be counted as a new search.
Note:
简单翻译一下,就是实现一个搜索引擎自动匹配历史搜索字段的类。现在有sentences[],所有搜索过的句子的历史记录;还有times[], 所有sentences的历史搜索次数。现在要实现一个功能,每在搜索引擎中input一个字符,就返回top3最热的搜索记录。
看到单词匹配,首先想到的就是Tier字典树,Node里面多存一个Map
用来记录frequency。每次input操作,都让Tier Tree在当前节点的孩子中找,找到了就令curr = next
,找不到就返回空值,说明没有历史记录的匹配。最后在curr.counts
的map中找到包含当前字符串的搜索历史,和其frequency一起组成Pair存入PriorityQueue,再对PriorityQueue进行多条件的排序,
PriorityQueue
取出Top3即可。
public class AutocompleteSystem {
class TrieNode {
Map<Character, TrieNode> children;
Map<String, Integer> counts;
public TrieNode() {
children = new HashMap<Character, TrieNode>();
counts = new HashMap<String, Integer>();
}
}
class Pair {
String s;
int c;
public Pair(String s, int c) {
this.s = s;
this.c = c;
}
}
TrieNode root;
String prefix;
public AutocompleteSystem(String[] sentences, int[] times) {
root = new TrieNode();
prefix = "";
for (int i = 0; i < sentences.length; i++) {
add(sentences[i], times[i]);
}
}
private void add(String s, int count) {
TrieNode curr = root;
for (char c : s.toCharArray()) {
TrieNode next = curr.children.get(c);
if (next == null) {
next = new TrieNode();
curr.children.put(c, next);
}
curr = next;
curr.counts.put(s, curr.counts.getOrDefault(s, 0) + count);
}
}
public List<String> input(char c) {
if (c == '#') {
add(prefix, 1);
prefix = "";
return new ArrayList<String>();
}
prefix = prefix + c;
TrieNode curr = root;
for (char cc : prefix.toCharArray()) {
TrieNode next = curr.children.get(cc);
if (next == null) {
return new ArrayList<String>();
}
curr = next;
}
PriorityQueue<Pair> pq = new PriorityQueue<>((a, b) -> (a.c == b.c ? a.s.compareTo(b.s) : b.c - a.c));
for (String s : curr.counts.keySet()) {
pq.add(new Pair(s, curr.counts.get(s)));
}
List<String> res = new ArrayList<String>();
for (int i = 0; i < 3 && !pq.isEmpty(); i++) {
res.add(pq.poll().s);
}
return res;
}
}