HashMap源码理解

时间 2019-11-18

标签 hashmap 源码理解繁體版

原文原文链接

private static int roundUpToPowerOf2(int number) {
    return number >= MAXIMUM_CAPACITY
            ? MAXIMUM_CAPACITY
            : (number > 1) ? Integer.highestOneBit((number - 1) << 1) : 1;
}

Integer.highestOneBit方法是获取不大于自己的2^n的值，那该方法具体含义是：html
获取新的数组容量值，若是给定值大于等于最大的容量，则返回最大容量，不然：若是容量小于等于1，则返回1，不然返回大于等于给定值的最接近的2^n的值java
容量为何要是2^n次方呢？且看以下代码：数组

static int indexFor(int h, int length) {
    // assert Integer.bitCount(length) == 1 : "length must be a non-zero power of 2";
    return h & (length-1);
}

给方法是查找h(ash)在数组的索引位置；那如今看length为何要是2^n次方呢？点此查看app
保证&以后的数据不大于lengththis
length-1以后，最低的n位都是1，那与h的&运算以后的值便是h的最低n位spa
采用length-1而不是直接length是由于2^n最低一位是0，那&运算以后数据都分布在偶数位，不是随机的.net

private void inflateTable(int toSize) {
    // Find a power of 2 >= toSize
    int capacity = roundUpToPowerOf2(toSize);

    threshold = (int) Math.min(capacity * loadFactor, MAXIMUM_CAPACITY + 1);
    table = new Entry[capacity];
    initHashSeedAsNeeded(capacity);
}

扩容：先获取最接近而且>=指定容量的的数值做为扩容后的容量;扩容界限就是取容量*加载因子和最大容量+1的最小值；从新初始化table；扩容以后的table没有任何数据，因此在相关调用操做以后会有数据从新分配操做，比较耗时，因此在初始化一个hashmap的时候最好指定容量避免扩容操做发生code

public V get(Object key) {
    if (key == null)
        return getForNullKey();
    Entry<K,V> entry = getEntry(key);

    return null == entry ? null : entry.getValue();
}

get方法：htm

若是key==null，则单独获取blog

private V getForNullKey() {
    if (size == 0) {
        return null;
    }
    for (Entry<K,V> e = table[0]; e != null; e = e.next) {
        if (e.key == null)
            return e.value;
    }
    return null;
}

若是size==0，则直接返回null；
遍历table，判断key，返回value

final Entry<K,V> getEntry(Object key) {
    if (size == 0) {
        return null;
    }

    int hash = (key == null) ? 0 : hash(key);
    for (Entry<K,V> e = table[indexFor(hash, table.length)];
         e != null;
         e = e.next) {
        Object k;
        if (e.hash == hash &&
            ((k = e.key) == key || (key != null && key.equals(k))))
            return e;
    }
    return null;
}

根据key获取Entry：

获取hash值
获取数组索引
获取数组中索引的第一个entry
遍历entry，若是hash值相等而且（key相等或者equal），则返回当前entry便可

public boolean containsKey(Object key) {
    return getEntry(key) != null;
}

判断key是否存在只是调用getEntry方法判断是否为null便可

public V put(K key, V value) {
    if (table == EMPTY_TABLE) {
        inflateTable(threshold);
    }
    if (key == null)
        return putForNullKey(value);
    int hash = hash(key);
    int i = indexFor(hash, table.length);
    for (Entry<K,V> e = table[i]; e != null; e = e.next) {
        Object k;
        if (e.hash == hash && ((k = e.key) == key || key.equals(k))) {
            V oldValue = e.value;
            e.value = value;
            e.recordAccess(this);
            return oldValue;
        }
    }

    modCount++;
    addEntry(hash, key, value, i);
    return null;
}

put方法：

若是table为空数组，则先扩容到扩容界限(threshold)的数组，若是是默认的初始化方法，则threshold=容量；执行该方法以后。threshold=容量*加载因子；
若是key==null，单独存入该值，putForNullKey
根据hash和length获取table索引，找出第一个entry
遍历entry，若是找到则从新设置新值，返回旧值，不然新添加一个entry（addEntry）

private V putForNullKey(V value) {
    for (Entry<K,V> e = table[0]; e != null; e = e.next) {
        if (e.key == null) {
            V oldValue = e.value;
            e.value = value;
            e.recordAccess(this);
            return oldValue;
        }
    }
    modCount++;
    addEntry(0, null, value, 0);
    return null;
}

存入key==null的value，直接查找table的第一个位置（index=0），若是找到则从新设置新值，返回旧值，不然添加新的entry；
key==null的值所有在table【0】的entry上

void addEntry(int hash, K key, V value, int bucketIndex) {
    if ((size >= threshold) && (null != table[bucketIndex])) {
        resize(2 * table.length);
        hash = (null != key) ? hash(key) : 0;
        bucketIndex = indexFor(hash, table.length);
    }

    createEntry(hash, key, value, bucketIndex);
}

建立新的entry：

先检查是否知足扩容条件：size>=threshold&&null!=table[bucketIndex];
若是知足扩容，计算新的hash和数组索引（bucketIndex）
建立新的entry（createEntry）
注意事项：size是针对hash表里的全部数据的容量，而扩容是指数组扩容

void resize(int newCapacity) {
    Entry[] oldTable = table;
    int oldCapacity = oldTable.length;
    if (oldCapacity == MAXIMUM_CAPACITY) {
        threshold = Integer.MAX_VALUE;
        return;
    }

    Entry[] newTable = new Entry[newCapacity];
    transfer(newTable, initHashSeedAsNeeded(newCapacity));
    table = newTable;
    threshold = (int)Math.min(newCapacity * loadFactor, MAXIMUM_CAPACITY + 1);
}

先获取oldTable的length，若是已经是最大长度，则无需扩容，而且将threshold设为Integer.MAX_VALUE，不可扩容！
建立新的Entry数组，将旧的table数据添加到newTable中（transfer）
设置新的table和threshold

void transfer(Entry[] newTable, boolean rehash) {
    int newCapacity = newTable.length;
    for (Entry<K,V> e : table) {
        while(null != e) {
            Entry<K,V> next = e.next;
            if (rehash) {
                e.hash = null == e.key ? 0 : hash(e.key);
            }
            int i = indexFor(e.hash, newCapacity);
            e.next = newTable[i];
            newTable[i] = e;
            e = next;
        }
    }
}

简单理解为：数据拷贝

获取新的数组长度，遍历table数组；
遍历每一个链表：重定向到新的数组和链表中
如今遍历到[1,0]即table[1]下的第一个entry，计算得出在新的i=5，则该entry的下一个是newTable[5],newTable[5]=entry
从以上能够看出，新进入newTable的数据在后进入的next下

void createEntry(int hash, K key, V value, int bucketIndex) {
    Entry<K,V> e = table[bucketIndex];
    table[bucketIndex] = new Entry<>(hash, key, value, e);
    size++;
}

建立新的Entry：

先根据索引（bucketIndex）获取数组中的元素（e）
建立新的entry，位于链表第一个entry，而e是当前新的entry的next

public void putAll(Map<? extends K, ? extends V> m) {
    int numKeysToBeAdded = m.size();
    if (numKeysToBeAdded == 0)
        return;

    if (table == EMPTY_TABLE) {
        inflateTable((int) Math.max(numKeysToBeAdded * loadFactor, threshold));
    }

    /*
     * Expand the map if the map if the number of mappings to be added
     * is greater than or equal to threshold.  This is conservative; the
     * obvious condition is (m.size() + size) >= threshold, but this
     * condition could result in a map with twice the appropriate capacity,
     * if the keys to be added overlap with the keys already in this map.
     * By using the conservative calculation, we subject ourself
     * to at most one extra resize.
     */
    if (numKeysToBeAdded > threshold) {
        int targetCapacity = (int)(numKeysToBeAdded / loadFactor + 1);
        if (targetCapacity > MAXIMUM_CAPACITY)
            targetCapacity = MAXIMUM_CAPACITY;
        int newCapacity = table.length;
        while (newCapacity < targetCapacity)
            newCapacity <<= 1;
        if (newCapacity > table.length)
            resize(newCapacity);
    }

    for (Map.Entry<? extends K, ? extends V> e : m.entrySet())
        put(e.getKey(), e.getValue());
}

将一个集合添加到现有集合中：

先获取要添加结合的元素数量
若是现有集合是空集合，扩容：当前threshold和根据添加元素容量计算的新容量的最大值
假如添加的集合元素数量>threshold，则判断当前table是否须要扩容
此处是保守估计新的集合添加以后的容量：可是能够保证最多只有一次调用resize方法！
若是numKeysToBeAdded <=threshold，即便在put方法致使扩容也至多有一次：扩容至两倍，那threshold也会变为两倍
若是numKeysToBeAdded >threshold，若是：targetCapacity>table.length，则在put方法可能会致使resize，不然newCapacity一定大于table.length，在此处resize，put方法就不会resize了

参考文章：http://blog.csdn.net/eson_15/article/details/51158865

我的博客：http://www.whereta.com