前端学数据结构之字典和散列表

时间 2020-04-16

标签前端数据结构字典列表繁體版

原文原文链接

字典

集合表示一组互不相同的元素（不重复的元素）。在字典中，存储的是[键，值]对，其中键名是用来查询特定元素的。字典和集合很类似，集合以[值，值]的形式存储元素，字典则是以[键，值]的形式来存储元素。字典也称做映射。node

字典如同ES6中的Map类。

散列表

1、哈希表app

一、概念函数

哈希表(Hash Table)也叫散列表，是依据关键码值（Key Value）而直接进行访问的数据结构。它经过把关键码值映射到哈希表中的一个位置来訪问记录，以加快查找的速度。

这个映射函数就作散列函数。存放记录的数组叫作散列表。this

二、散列存储的基本思路spa

以数据中每个元素的keyword为自变量。经过散列函数H（k）计算出函数值，以该函数值做为一块连续存储空间的的单元地址，将该元素存储到函数值相应的单元中。

三、哈希表查找的时间复杂度code

哈希表存储的是键值对，其查找的时间复杂度与元素数量多少无关。哈希表在查找元素时是经过计算哈希码值来定位元素的位置从而直接访问问元素的，所以，哈希表查找的时间复杂度为O（1）。

2、常用的哈希函数blog

1.直接寻址法

取keyword或者keyword的某个线性函数值做为哈希地址,即H(Key)=Key或者H(Key)=a*Key+b(a,b为整数),这样的散列函数也叫作自身函数.假设H(Key)的哈希地址上已经有值了,那么就往下一个位置找,知道找到H(Key)的位置没有值了就把元素放进去.

2.数字分析法

分析一组数据,比方一组员工的出生年月,这时咱们发现出生年月的前几位数字通常都一样,所以,出现冲突的几率就会很是大,但是咱们发现年月日的后几位表示月份和详细日期的数字区别很是大,假设利用后面的几位数字来构造散列地址,则冲突的概率则会明显减小.所以数字分析法就是找出数字的规律,尽量利用这些数据来构造冲突概率较低的散列地址.

3.平方取中法

取keyword平方后的中间几位做为散列地址.一个数的平方值的中间几位和数的每一位都有关。

所以，有平方取中法获得的哈希地址同keyword的每一位都有关。是的哈希地址具备较好的分散性。

该方法适用于keyword中的每一位取值都不够分散或者较分散的位数小于哈希地址所需要的位数的状况。

4.折叠法

折叠法即将keyword切割成位数一样的几部分,最后一部分位数可以不一样,而后取这几部分的叠加和(注意:叠加和时，去除进位)做为散列地址.数位叠加可以有移位叠加和间界叠加两种方法.移位叠加是将切割后的每一部分的最低位对齐,而后相加;间界叠加是从一端向还有一端沿切割界来回折叠,而后对齐相加.

5.随机数法

选择一个随机数,去keyword的随机值做为散列地址,通常用于keyword长度不一样的场合.

6.除留余数法

取keyword被某个不大于散列表表长m的数p除后所得的余数为散列地址.即H(Key)=Key MOD p,p<=m.不只可以对keyword直接取模,也可在折叠、平方取中等运算以后取模。对p的选择很是重要，通常取素数或m，若p选得很差。则很是easy产生冲突。

通常p取值为表的长度为表的长度。

HashTable类

也叫HashMap类，是Dictionary类的一种散列表实现方式

散列值比较大，能够用算出的散列值除以一个数取余

散列值可能相同，存在冲突，处理冲突

处理冲突有几种方法：分离连接、线性探查和双散列法

分离连接

分离连接法包括为散列表的每个位置建立一个链表并将元素存储在里面。它是解决冲突的最简单的方法，可是它在HashTable实例以外还须要额外的存储空间

// 链表
function LinkedList() {

    let Node = function(element){

        this.element = element;
        this.next = null;
    };

    let length = 0;
    let head = null;

    this.append = function(element){

        let node = new Node(element),
            current;

        if (head === null){
            head = node;
        } else {

            current = head;
            while(current.next){
                current = current.next;
            }

            current.next = node;
        }

        length++; 
    };

    this.isEmpty = function() {
        return length === 0;
    };

    this.getHead = function(){
        return head;
    };
    
    
    this.toString = function(){

        let current = head,
            string = '';

        while (current) {
            string += current.element + (current.next ? ', ' : '');
            current = current.next;
        }
        return string;

    };
}


function HashTableSeparateChaining(){
    // 散列表
    var table = [];
    // 使用append方法向LinkedList实例中添加一个ValuePair实例（键和值）
    var ValuePair = function(key, value){
     this.key = key;
     this.value = value;
     this.toString = function() {
      return '[' + this.key + ' - ' + this.value + ']'; 
      }
    };
    // 散列函数
    var loseloseHashCode = function (key) {
        var hash = 0;
        for (var i = 0; i < key.length; i++) {
            hash += key.charCodeAt(i);
        }
        return hash % 37;
    };

    var hashCode = function(key){
        return loseloseHashCode(key);
    };
    
    // 添加
    this.put = function(key, value){
        var position = hashCode(key);
        console.log(position + ' - ' + key);

        if (table[position] == undefined) {
            table[position] = new LinkedList();
        }
        table[position].append(new ValuePair(key, value));
    };
    // 查找
    this.get = function(key) {
        var position = hashCode(key);

        if (table[position] !== undefined  && !table[position].isEmpty()){

            //获取头节点
            var current = table[position].getHead();

            do {
                if (current.element.key === key){
                    return current.element.value;
                }
                current = current.next;
            } while(current);
        }
        return undefined;
    };
    
    // 删除
    this.remove = function(key){

        var position = hashCode(key);

        if (table[position] !== undefined){

           //获取头节点
            var current = table[position].getHead();

            do {
                if (current.element.key === key){
                    table[position].remove(current.element);
                    if (table[position].isEmpty()){
                        table[position] = undefined;
                    }
                    return true;
                }
                current = current.next;
            } while(current);
        }

        return false;
    };

    // 表里全部数据
    this.print = function() {
        for (var i = 0; i < table.length; ++i) {
            if (table[i] !== undefined) {
               console.log(i+'----'+table[i].toString());
            }
        }
    };
}

验证

var hash = new HashTableSeparateChaining();
hash.put('Gandalf', 'gandalf@email.com');
hash.put('John', 'johnsnow@email.com');
hash.put('Tyrion', 'tyrion@email.com');
hash.put('Aaron', 'aaron@email.com');
hash.put('Donnie', 'donnie@email.com');
hash.put('Ana', 'ana@email.com');
hash.put('Jonathan', 'jonathan@email.com');
hash.put('Jamie', 'jamie@email.com');
hash.put('Sue', 'sue@email.com');
hash.put('Mindy', 'mindy@email.com');
hash.put('Paul', 'paul@email.com');
hash.put('Nathan', 'nathan@email.com');

hash.print();

线性探查

另外一种解决冲突的方法是线性探查。当想向表中某个位置加入一个新元素的时候，若是索引为index的位置已经被占据了，就尝试index+1的位置。若是index+1的位置也被占据了，就尝试index+2的位置

function HashLinearProbing(){

    var table = [];
    var ValuePair = function(key, value){
        this.key = key;
        this.value = value;

        this.toString = function() {
            return '[' + this.key + ' - ' + this.value + ']';
        }
    };
    // 散列函数
    var loseloseHashCode = function (key) {
        var hash = 0;
        for (var i = 0; i < key.length; i++) {
            hash += key.charCodeAt(i);
        }
        return hash % 37;
    };

    var hashCode = function(key){
        return loseloseHashCode(key);
    };
    
    // 添加
    this.put = function(key, value){
        var position = hashCode(key);
        console.log(position + ' - ' + key);

        if (table[position] == undefined) {
            table[position] = new ValuePair(key, value);
        } else {
            var index = ++position;
            while (table[index] != undefined){
                index++;
            }
            table[index] = new ValuePair(key, value);
        }
    };

    // 删除
    this.remove = function(key){
        var position = hashCode(key);

        if (table[position] !== undefined){
            if (table[position].key === key) {
                table[position] = undefined;
            } else {
                var index = ++position;
                while (table[index] === undefined || table[index].key !== key){
                    index++;
                }
                if (table[index].key === key) {
                    table[index] = undefined;
                }
            }
        }
    };

    // 输出
    this.print = function() {
        for (var i = 0; i < table.length; ++i) {
            if (table[i] !== undefined) {
                console.log(i + ' -> ' + table[i].toString());
            }
        }
    };
}

5 -> [Jonathan - jonathan@email.com]
6 -> [Jamie - jamie@email.com]
7 -> [Sue - sue@email.com]
10 -> [Nathan - nathan@email.com]
13 -> [Donnie - donnie@email.com]
14 -> [Ana - ana@email.com]
16 -> [Tyrion - tyrion@email.com]
17 -> [Aaron - aaron@email.com]
19 -> [Gandalf - gandalf@email.com]
29 -> [John - johnsnow@email.com]
32 -> [Mindy - mindy@email.com]
33 -> [Paul - paul@email.com]

更好的散列函数

个能够实现的比“loselose”更好的散列函数是djb2：

var djb2HashCode = function (key) {
 var hash = 5381; //{1}
 for (var i = 0; i < key.length; i++) { //{2}
     hash = hash * 33 + key.charCodeAt(i); //{3}
 }
 return hash % 1000; //{4}
};