HashMap 常见应用：实现 SQL JOIN

时间 2019-11-07

标签 hashmap 常见应用实现 sql join 栏目 SQL 繁體版

原文原文链接

在个人上一篇文章中，讲到了我本身初步认识 HashMap 的一个经验分享：HashMap 浅析 —— LeetCode Two Sum 刷题总结。做为一个 CRUD 工程师，平时不多接触到基础组件的涉及，那么是否是很难有机会用到 HashMap 呢？java

今天，就举一个常见的查询例子，来看看咱们如何使用 HashMap 来提升代码的效率。算法

已知一个 Student 类：编程

public class Student {
    private Long id;

    private String name;

    public Student(Long id, String name) {
        this.id = id;
        this.name = name;
    }

    // ---Getters And Setters---
}

和一个 Score 类：segmentfault

public class Score {
    private Long studentId;

    private String mathScore;

    private String englishScore;

    public Score(Long studentId, String mathScore, String englishScore) {
        this.studentId = studentId;
        this.mathScore = mathScore;
        this.englishScore = englishScore;
    }

    // ---Getters And Setters---
}

咱们须要把 Student 和 Score 合并到一块儿，即类 Report：微服务

public class Report {
    private Long studentId;

    private String studentName;

    private String mathScore;

    private String englishScore;

    public Report(Long studentId, String studentName, String mathScore, String englishScore) {
        this.studentId = studentId;
        this.studentName = studentName;
        this.mathScore = mathScore;
        this.englishScore = englishScore;
    } 
}

看类的属性咱们就明白了，这里其实至关于在 Student 和 Score 之间作一个 Join，获得 Report。这是咱们在编程中常见的场景（例如查询了订单中心，用户中心，支付中心，合并各个中心返回的结果造成一个表单，由于各个中心是独立的微服务，没法使用 SQL JOIN）。测试

现有两个 List：List<Student> 和 List<Score>：this

List<Student> students = Arrays.asList(
        new Student(1L, "Angle"),
        new Student(2L, "Baby")
);

List<Score> scores = Arrays.asList(
        new Score(1L, "90", "87"),
        new Score(2L, "92", "78")
);

在学会使用 HashMap 以前，我可能会作一次双重循环：spa

List<Report> reports = new ArrayList<>();
for (Student student : students) {
    for (Score score : scores) {
        if (!student.getId().equals(score.getStudentId())) {
            continue;
        }

        reports.add(
                new Report(student.getId(), student.getName(), score.getMathScore(), score.getEnglishScore())
        );
        break;
    }
}

时间复杂度最差的状况下是O(n * m)。code

可是使用 HashMap 来改善程序，就能获得不错的效果：blog

Map<Long, Student> map = new HashMap<>();
for (Student student : students) {
    map.put(student.getId(), student);
}

List<Report> reports = new ArrayList<>();
for (Score score : scores) {
    Student student = map.get(score.getStudentId());
    if(student == null){ // 避免 NPE
        continue;
    }
    reports.add(
            new Report(student.getId(), student.getName(), score.getMathScore(), score.getEnglishScore())
    );
}

双重循环，变成了两次循环，时间复杂度是O(n + m)。

显然要比前面的方法效果要好一些。笔者写了测试代码分别测试两个方法的效率，在 10w 数据下，执行时间以下：

差距好像挺大。想了解为何 HashMap 可以获得如此好的效果，能够看个人这篇文章：HashMap 浅析 —— LeetCode Two Sum 刷题总结。若是读者有更好的解法欢迎留言交流，笔者水平有限，在算法上研究很少。

10w 数据的测试源码见下方，各位读者能够自行试验下效果：

package com.xiangyu.demo.hashmap;

import com.xiangyu.java.hashmap.Report;
import com.xiangyu.java.hashmap.Score;
import com.xiangyu.java.hashmap.Student;
import org.junit.Before;
import org.junit.Test;

import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

public class HashMapTest {
    private List<Student> students = new ArrayList<>();

    private List<Score> scores = new ArrayList<>();

    @Before
    public void before() {
        // 每一个list 里放 10w 数据
        for (long i = 0; i < 100000; i++) {
            students.add(new Student(i, "test"));
            scores.add(new Score(i, "95", "95"));
        }
    }

    @Test
    public void TestHashMap() {
        Map<Long, Student> map = new HashMap<>();
        for (Student student : students) {
            map.put(student.getId(), student);
        }

        List<Report> reports = new ArrayList<>();
        for (Score score : scores) {
            Student student = map.get(score.getStudentId());
            if (student == null) {
                continue;
            }
            reports.add(
                    new Report(student.getId(), student.getName(), score.getMathScore(), score.getEnglishScore())
            );
        }
        System.out.println(reports.size());
    }

    @Test
    public void testFor2() {
        List<Report> reports = new ArrayList<>();
        for (Student student : students) {
            for (Score score : scores) {
                if (!student.getId().equals(score.getStudentId())) {
                    continue;
                }

                reports.add(
                        new Report(student.getId(), student.getName(), score.getMathScore(), score.getEnglishScore())
                );
                break;
            }
        }
        System.out.println(reports.size());
    }
}