HashBasedTable的key可以重复吗 hashmap的key可以重复吗

转载

编程小匠人之魂 2024-08-13 16:12:41

HashMap

HashMap是面试必问的知识点之一，也是java开发最常用的一种数据模型，HashMap属于复合结构，以key-value形式存储数据，其中key是不允许重复的但是允许为空，value是可以重复或为空的，在jdk1.8前，它的结构为数组+链表，在jdk1.8后变成了数组+单向链表+红黑树
当JVM存储HashMap的K-V时，首先计算key的hash值，以此来确定插入的数组位置，但是这一hash值的位置可能已经存在有值，这时就顺延到同一hash值的元素的后面形成了单向链表，同一个链表上的Hash值是相同的，所以说数组存放的是链表！他的结构图：

红黑树的阈值是8，当链表大于等于8时链表变成了红黑树结构，大大减少了查找的时间。
这是源码的解释：

1/**
 2     * The bin count threshold for using a tree rather than list for a
 3     * bin.  Bins are converted to trees when adding an element to a
 4     * bin with at least this many nodes. The value must be greater
 5     * than 2 and should be at least 8 to mesh with assumptions in
 6     * tree removal about conversion back to plain bins upon
 7     * shrinkage.
 8     */
 9    static final int TREEIFY_THRESHOLD = 8;
10
11    /**
12     * The bin count threshold for untreeifying a (split) bin during a
13     * resize operation. Should be less than TREEIFY_THRESHOLD, and at
14     * most 6 to mesh with shrinkage detection under removal.
15     */
16    static final int UNTREEIFY_THRESHOLD = 6;

当长度低于6时会由红黑树转成链表,TreeNodes占用空间是普通Nodes的两倍,所以只有当bin包含足够多的节点时才会转成TreeNodes,而是否足够多就是由TREEIFY_THRESHOLD的值决定的,当bin中节点数变少时，又会转成普通的bin,这样就解析了为什么不是一开始就将其转换为TreeNodes，而是需要一定节点数才转为TreeNodes，说白了就是trade-off，空间和时间的权衡

hashmap是如何get值呢？

1public V get(Object key) {
 2        Node<K,V> e;
 3        return (e = getNode(hash(key), key)) == null ? null : e.value;
 4    }
 5      /**
 6     * Implements Map.get and related methods
 7     *
 8     * @param hash hash for key
 9     * @param key the key
10     * @return the node, or null if none
11     */
12    final Node<K,V> getNode(int hash, Object key) {
13        Node<K,V>[] tab;//Entry对象数组
14    Node<K,V> first,e; //在tab数组中经过散列的第一个位置
15    int n;
16    K k;
17    /*找到插入的第一个Node，方法是hash值和n-1相与，tab[(n - 1) & hash]*/
18    //也就是说在一条链上的hash值相同的
19        if ((tab = table) != null && (n = tab.length) > 0 &&(first = tab[(n - 1) & hash]) != null) {
20    /*检查第一个Node是不是要找的Node*/
21            if (first.hash == hash && // always check first node
22                ((k = first.key) == key || (key != null && key.equals(k))))//判断条件是hash值要相同，key值要相同
23                return first;
24      /*检查first后面的node*/
25            if ((e = first.next) != null) {
26                if (first instanceof TreeNode)
27                    return ((TreeNode<K,V>)first).getTreeNode(hash, key);
28                /*遍历后面的链表，找到key值和hash值都相同的Node*/
29                do {
30                    if (e.hash == hash &&
31                        ((k = e.key) == key || (key != null && key.equals(k))))
32                        return e;
33                } while ((e = e.next) != null);
34            }
35        }
36        return null;
37    }

get(key)方法时获取key的hash值，计算hash&(n-1)得到在链表数组中的位置first=tab[hash&(n-1)],先判断first的key是否与参数key相等，不等就遍历后面的链表找到相同的key值返回对应的Value值

HashMap是基于hashing的原理，我们使用put(key, value)存储对象到HashMap中,使用get(key)从HashMap中获取对象
给put(key, value)方法传递键和值时，它先调用key.hashCode()方法，返回的hashCode值，用于找到bucket位置，来储存Entry对象
Map提供了一些常用方法，如keySet()、entrySet()等方法
keySet()方法返回值是Map中key值的集合；entrySet()的返回值也是返回一个Set集合，此集合的类型为Map.Entry
hashmap查询的时间复杂度,（最好O（1）,最差O（n）,如果是红黑O（logn））
默认的负载因子0.75f：0.75是平衡了时间和空间等因素,负载因子越小桶的数量越多,读写的时间复杂度越低（极限情况O(1), 哈希碰撞的可能性越小）;负载因子越大桶的数量越少，读写的时间复杂度越高(极限情况O(n), 哈希碰撞可能性越高)