HashMap - JDK 1.7 和 JDK 1.8

1，HashMap基础

1）HashMap非线程安全，允许null key和null value，hash table不允许。
2）HashMaprehash可能会出现循环链表，get则会死循环；多线程put可能会导致元素丢失。
3）HashMap的几个属性
initialCapacity：HashMap初始化时，Entry数组(buckets)的大小。
loadFactor：负载因子，表示hash表满的程度，默认0.75。
threshold：rehash的阈值。capacity * loadFactor
modCount：HashMap结构修改的次数，用于实现HashMap Iterator的fast-falil快速失败，抛出ConcurrentModificationException 。
4）equals和hashCode方法
重写equals方法一定要重写hashCode(成对重写)：因为HashMap通过hashCode定位table位置，如objA.equals(objB)，map.put(objA, "113")后，map.get(objB)取出来为null。
hashCode相等，equals不同：造成hash冲突，链表过长。
5）bin(bucket、桶): 是一个可以保存多个键值对的数据结构。

2，JDK 1.7下的HashMap

1）底层数据结构，Entry数组 + 链表(当hash冲突很多时，HashMap退化为链表)

image.png

2）HashMap put过程
1.如果Entry数组table为空，初始化数组
2.如果key == null，putForNullKey(value)，放到table[0]的冲突连上。
3.hash(key)获取key的hash值，int i = indexFor(hash,table.length)获取table数组下标。
4.从table[i]开始遍历链表，如果key对应的value存在，值newValue覆盖oldValue，并返回oldValue。
覆盖条件: hash值相等且(key引用相等 || key内容equals) e.hash == hash && (e.key == key || key.equals(e.key))
5.对应位置无元素，则modCount++ -> addEntry(hash,key,value,bucketIndex)放入链表头部 -> return null。
3）HashMap get过程
1.key == null, getForNullKey()去table[0]链表检索。
获取key的hash值，获取对象的table index位置。
遍历table[i]处的链表，通过hash值相等且(key的引用相等或者key的内容equals)。
4）resize扩容过程
要求: length必须为2的幂，方便计算indexFor。length - 1表示形式为0...111，hashCode%length -> hashCode & length-1
原数组中元素，必须rehash到新数组指定位置。扩容条件: ++size > threshold，新数组大小newCap = oldCap << 1。
1.创建新数组new Entry[newCap]
2.将table中元素，transfer到newTable
3.引用替换，table = newTable，修改threshold阈值。
transfer过程

image.png

5）HashMap线程不安全
多线程resize，链表成环，导致get死循环。
多线程resize，导致put元素，get为null。

image.png

image.png

核心原因：线程2 resize完成后，e和next顺序是反的，导致e节点被线程1 transfer了两次。

image.png

2，JDK8 HashMap

1）由数组+链表+红黑树(链表长度大于等于8时转换)组成。为了解决hash冲突频繁，链表过长查询效率低的问题。
根据第一个节点数据类型Node还是TreeNode 来判断该位置下是链表还是红黑树。

image.png

2）put元素过程

image.png

image.png

3） Java8 中，当链表中的元素超过了 8 个以后，会将链表转换为红黑树，在这些位置进行查找的时候可以降低时间复杂度为 O(logN)。
if (binCount >= TREEIFY_THRESHOLD - 1)
treeifyBin(tab, hash);

image.png

4）resize中链表的处理
根据(e.hash & oldCap) == 0 取到第n位，为0则在低链表，为1则在高链表。
低链表在数组中仍在原来的bucket位置。
高链表的位置相对于低链表的偏移为oldCap。

3，java.util.ConcurrentModificationException

1） HashMap的modCount
The number of times this HashMap has been structurally modified。结构被修改的次数
put操作：++modCount;
remove操作：++modCount;
2）for each语法糖增强for循环通过迭代器实现，在使用迭代器遍历元素的时候，对集合进行删除可能会抛出ConcurrentModificationException。

image.png

3）模拟ConcurrentModificationException

public class TestHashMap {

  public static void main(String[] args) {
    Map winMap = Maps.newHashMap();
    winMap.put("000", 2);
    winMap.put("001", 1);
    processFilterRemoveSkill(winMap);
  }

  public static void processFilterRemoveSkill(Map winMap){
//    if(winMap.containsKey("000")){
//      winMap.remove("000");//移除
//    }
    
    //第一次循环modCount=2，remove后modCount=3，此时
    for(String productId : winMap.keySet()){
      if(productId.equals("000")){
        winMap.remove(productId);//移除
      }
    }
  }
}

HashMap - JDK 1.7 和 JDK 1.8

1，HashMap基础

2，JDK 1.7下的HashMap

2，JDK8 HashMap

3，java.util.ConcurrentModificationException

你可能感兴趣的:(HashMap - JDK 1.7 和 JDK 1.8)