ThreadLocal 深度解析

时间 2019-11-07

原文原文链接

一.对ThreadLocal的理解二.深刻解析ThreadLocal类三.ThreadLocal的应用场景算法

对ThreadLocal的理解

ThreadLocal是一个本地线程副本变量工具类。主要用于将私有线程和该线程存放的副本对象作一个映射，各个线程之间的变量互不干扰，在高并发场景下，能够实现无状态的调用，特别适用于各个线程依赖不一样的变量值完成操做的场景。数据库

咱们先写一个例子编程

class ConnectionManager {
     
    private static Connection connect = null;
     
    public static Connection openConnection() {
        if(connect == null){
            connect = DriverManager.getConnection();
        }
        return connect;
    }
     
    public static void closeConnection() {
        if(connect!=null)
            connect.close();
    }
}
复制代码

假设有这样一个数据库连接管理类，这段代码在单线程中使用是没有任何问题的，可是若是在多线程中使用呢？很显然，在多线程中使用会存在线程安全问题：第一，这里面的2个方法都没有进行同步，极可能在openConnection方法中会屡次建立connect；第二，因为connect是共享变量，那么必然在调用connect的地方须要使用到同步来保障线程安全，由于极可能一个线程在使用connect进行数据库操做，而另一个线程调用closeConnection关闭连接。数组

因此出于线程安全的考虑，必须将这段代码的两个方法进行同步处理，而且在调用connect的地方须要进行同步处理。安全

这样将会大大影响程序执行效率，由于一个线程在使用connect进行数据库操做的时候，其余线程只有等待。bash

那么你们来仔细分析一下这个问题，这地方到底需不须要将connect变量进行共享？事实上，是不须要的。假如每一个线程中都有一个connect变量，各个线程之间对connect变量的访问其实是没有依赖关系的，即一个线程不须要关心其余线程是否对这个connect进行了修改的。服务器

到这里，可能会有朋友想到，既然不须要在线程之间共享这个变量，能够直接这样处理，在每一个须要使用数据库链接的方法中具体使用时才建立数据库连接，而后在方法调用完毕再释放这个链接。好比下面这样：session

class ConnectionManager {
     
    private  Connection connect = null;
     
    public Connection openConnection() {
        if(connect == null){
            connect = DriverManager.getConnection();
        }
        return connect;
    }
     
    public void closeConnection() {
        if(connect!=null)
            connect.close();
    }
}
 
 
class Dao{
    public void insert() {
        ConnectionManager connectionManager = new ConnectionManager();
        Connection connection = connectionManager.openConnection();
         
        //使用connection进行操做
         
        connectionManager.closeConnection();
    }
}
复制代码

这样处理确实也没有任何问题，因为每次都是在方法内部建立的链接，那么线程之间天然不存在线程安全问题。因为在方法中须要频繁地开启和关闭数据库链接，这样会致使服务器压力很是大，严重影响程序执行性能。多线程

这种状况下咱们可使用ThreadLocal，由于ThreadLocal在每一个线程中对该变量会建立一个副本，即每一个线程内部都会有一个该变量，且在线程内部任何地方均可以使用，线程之间互不影响，这样一来就不存在线程安全问题，也不会严重影响程序执行性能。并发

可是要注意，虽然ThreadLocal可以解决上面说的问题，可是因为在每一个线程中都建立了副本，因此要考虑它对资源的消耗，好比内存的占用会比不使用ThreadLocal要大。

下图为ThreadLocal的内部结构图

从上面的结构图，咱们已经窥见ThreadLocal的核心机制：

每一个Thread线程内部都有一个Map。
Map里面存储线程本地对象（key）和线程的变量副本（value）
可是，Thread内部的Map是由ThreadLocal维护的，由ThreadLocal负责向map获取和设置线程的变量值。

因此对于不一样的线程，每次获取副本值时，别的线程并不能获取到当前线程的副本值，造成了副本的隔离，互不干扰。

.深刻解析ThreadLocal类

Thread线程内部的Map在类中描述以下：

public class Thread implements Runnable {
    /* ThreadLocal values pertaining to this thread. This map is maintained
     * by the ThreadLocal class. */
    ThreadLocal.ThreadLocalMap threadLocals = null;
}

复制代码

咱们先了解一下ThreadLocal类提供以下几个核心方法：

public T get() { }
public void set(T value) { }
public void remove() { }
protected T initialValue() { }
复制代码

get()方法用于获取当前线程的副本变量值。
set()方法用于设置当前线程的副本变量值。
initialValue()为当前线程初始副本变量值。
remove()方法移除当前前程的副本变量值。

先看下get方法的实现：

/**
 * Returns the value in the current thread's copy of this * thread-local variable. If the variable has no value for the * current thread, it is first initialized to the value returned * by an invocation of the {@link #initialValue} method. * * @return the current thread's value of this thread-local
 */
public T get() {
    Thread t = Thread.currentThread();
    ThreadLocalMap map = getMap(t);
    if (map != null) {
        ThreadLocalMap.Entry e = map.getEntry(this);
        if (e != null)
            return (T)e.value;
    }
    return setInitialValue();
}
复制代码

第一句是取得当前线程，而后经过getMap(t)方法获取到一个map，map的类型为ThreadLocalMap。而后接着下面获取到<key,value>键值对，注意这里获取键值对传进去的是 this，而不是当前线程t。若是获取成功，则返回value值。若是map为空，则调用setInitialValue方法返回value。咱们上面的每一句来仔细分析：首先看一下getMap方法中作了什么：

ThreadLocalMap getMap(Thread t) {
    return t.threadLocals;
}
复制代码

可能你们没有想到的是，在getMap中，是调用当期线程t，返回当前线程t中的一个成员变量threadLocals。那么咱们继续取Thread类中取看一下成员变量threadLocals是什么：

ThreadLocal.ThreadLocalMap threadLocals = null;
复制代码

实际上就是一个ThreadLocalMap，这个类型是ThreadLocal类的一个内部类，咱们继续取看ThreadLocalMap的实现：

static class ThreadLocalMap {

        /**
         * The entries in this hash map extend WeakReference, using
         * its main ref field as the key (which is always a
         * ThreadLocal object).  Note that null keys (i.e. entry.get()
         * == null) mean that the key is no longer referenced, so the
         * entry can be expunged from table.  Such entries are referred to
         * as "stale entries" in the code that follows.
         */
        static class Entry extends WeakReference<ThreadLocal<?>> {
            /** The value associated with this ThreadLocal. */
            Object value;

            Entry(ThreadLocal<?> k, Object v) {
                super(k);
                value = v;
            }
        }
复制代码

能够看到ThreadLocalMap的Entry继承了WeakReference，而且使用ThreadLocal做为键值。而后再继续看setInitialValue方法的具体实现：

private T setInitialValue() {
    T value = initialValue();
    Thread t = Thread.currentThread();
    ThreadLocalMap map = getMap(t);
    if (map != null)
        map.set(this, value);
    else
        createMap(t, value);
    return value;
}

protected T initialValue() {
    return null;
}

复制代码

若是map不为空，设置键值对为空，再建立Map，看一下createMap的实现：

/**
     * Create the map associated with a ThreadLocal. Overridden in
     * InheritableThreadLocal.
     *
     * @param t the current thread
     * @param firstValue value for the initial entry of the map
     */
    void createMap(Thread t, T firstValue) {
        t.threadLocals = new ThreadLocalMap(this, firstValue);
    }
复制代码

ThreadLocal为每一个线程建立变量的副本的过程：

首先，在每一个线程Thread内部有一个ThreadLocal.ThreadLocalMap类型的成员变量threadLocals，这个threadLocals就是用来存储实际的变量副本的，键值为当前ThreadLocal变量，value为变量副本（即T类型的变量）。

初始时，在Thread里面，threadLocals为空，当经过ThreadLocal变量调用get()方法或者set()方法，就会对Thread类中的threadLocals进行初始化，而且以当前ThreadLocal变量为键值，以ThreadLocal要保存的副本变量为value，存到threadLocals。

而后在当前线程里面，若是要使用副本变量，就能够经过get方法在threadLocals里面查找。

下面经过一个例子来证实经过ThreadLocal能达到在每一个线程中建立变量副本的效果：

public class Test {
    ThreadLocal<Long> longLocal = new ThreadLocal<Long>();
    ThreadLocal<String> stringLocal = new ThreadLocal<String>();
     
    public void set() {
        longLocal.set(Thread.currentThread().getId());
        stringLocal.set(Thread.currentThread().getName());
    }
     
    public long getLong() {
        return longLocal.get();
    }
     
    public String getString() {
        return stringLocal.get();
    }
     
    public static void main(String[] args) throws InterruptedException {
        final Test test = new Test();
        test.set();
        System.out.println(test.getLong());
        System.out.println(test.getString());
   
        Thread thread1 = new Thread(){
            public void run() {
                test.set();
                System.out.println(test.getLong());
                System.out.println(test.getString());
            };
        };
        thread1.start();
        thread1.join();
         
        System.out.println(test.getLong());
        System.out.println(test.getString());
    }
}

复制代码

输出结果以下图

从这段代码的输出结果能够看出，在main线程中和thread1线程中，longLocal保存的副本值和stringLocal保存的副本值都不同。最后一次在main线程再次打印副本值是为了证实在main线程中和thread1线程中的副本值确实是不一样的。

总结一下：

1）实际的经过ThreadLocal建立的副本是存储在每一个线程本身的threadLocals中的； 2）为什么threadLocals的类型ThreadLocalMap的键值为ThreadLocal对象，由于每一个线程中可有多个threadLocal变量，就像上面代码中的longLocal和stringLocal； 3）在进行get以前，必须先set，不然会报空指针异常；

若是想在get以前不须要调用set就能正常访问的话，必须重写initialValue()方法。由于在上面的代码分析过程当中，咱们发现若是没有先set的话，即在map中查找不到对应的存储，则会经过调用setInitialValue方法返回i，而在setInitialValue方法中，有一个语句是T value = initialValue()，而默认状况下，initialValue方法返回的是null。

看下面这个例子：

public class Test {
    ThreadLocal<Long> longLocal = new ThreadLocal<Long>();
    ThreadLocal<String> stringLocal = new ThreadLocal<String>();
 
    public void set() {
        longLocal.set(Thread.currentThread().getId());
        stringLocal.set(Thread.currentThread().getName());
    }
     
    public long getLong() {
        return longLocal.get();
    }
     
    public String getString() {
        return stringLocal.get();
    }
     
    public static void main(String[] args) throws InterruptedException {
        final Test test = new Test();
         
        System.out.println(test.getLong());
        System.out.println(test.getString());
 
        Thread thread1 = new Thread(){
            public void run() {
                test.set();
                System.out.println(test.getLong());
                System.out.println(test.getString());
            };
        };
        thread1.start();
        thread1.join();
         
        System.out.println(test.getLong());
        System.out.println(test.getString());
    }
}
复制代码

在main线程中，没有先set，直接get的话，运行时会报空指针异常。可是若是改为下面这段代码，即重写了initialValue方法：

public class Test {
    ThreadLocal<Long> longLocal = new ThreadLocal<Long>(){
        protected Long initialValue() {
            return Thread.currentThread().getId();
        };
    };
    ThreadLocal<String> stringLocal = new ThreadLocal<String>(){;
        protected String initialValue() {
            return Thread.currentThread().getName();
        };
    };
     
    public void set() {
        longLocal.set(Thread.currentThread().getId());
        stringLocal.set(Thread.currentThread().getName());
    }
     
    public long getLong() {
        return longLocal.get();
    }
     
    public String getString() {
        return stringLocal.get();
    }
     
    public static void main(String[] args) throws InterruptedException {
        final Test test = new Test();
 
        test.set();
        System.out.println(test.getLong());
        System.out.println(test.getString());
     
         
        Thread thread1 = new Thread(){
            public void run() {
                test.set();
                System.out.println(test.getLong());
                System.out.println(test.getString());
            };
        };
        thread1.start();
        thread1.join();
         
        System.out.println(test.getLong());
        System.out.println(test.getString());
    }
}
复制代码

就能够直接不用先set而直接调用get了。

set()方法

/**
 * Sets the current thread's copy of this thread-local variable * to the specified value. Most subclasses will have no need to * override this method, relying solely on the {@link #initialValue} * method to set the values of thread-locals. * * @param value the value to be stored in the current thread's copy of
 *        this thread-local.
 */
public void set(T value) {
    Thread t = Thread.currentThread();
    ThreadLocalMap map = getMap(t);
    if (map != null)
        map.set(this, value);
    else
        createMap(t, value);
}

ThreadLocalMap getMap(Thread t) {
    return t.threadLocals;
}

void createMap(Thread t, T firstValue) {
    t.threadLocals = new ThreadLocalMap(this, firstValue);
}

复制代码

set方法过程： 1.获取当前线程的成员变量map 2.map非空，则从新将ThreadLocal和新的value副本放入到map中。 3.map空，则对线程的成员变量ThreadLocalMap进行初始化建立，并将ThreadLocal和value副本放入map中。

remove()方法

/**
 * Removes the current thread's value for this thread-local * variable. If this thread-local variable is subsequently * {@linkplain #get read} by the current thread, its value will be * reinitialized by invoking its {@link #initialValue} method, * unless its value is {@linkplain #set set} by the current thread * in the interim. This may result in multiple invocations of the * <tt>initialValue</tt> method in the current thread. * * @since 1.5 */ public void remove() { ThreadLocalMap m = getMap(Thread.currentThread()); if (m != null) m.remove(this); } ThreadLocalMap getMap(Thread t) { return t.threadLocals; } 复制代码

ThreadLocalMap

ThreadLocalMap是ThreadLocal的内部类，没有实现Map接口，用独立的方式实现了Map的功能，其内部的Entry也独立实现。

在ThreadLocalMap中，也是用Entry来保存K-V结构数据的。可是Entry中key只能是ThreadLocal对象，这点被Entry的构造方法已经限定死了。

static class Entry extends WeakReference<ThreadLocal> {
    /** The value associated with this ThreadLocal. */
    Object value;

    Entry(ThreadLocal k, Object v) {
        super(k);
        value = v;
    }
}

复制代码

Entry继承自WeakReference（弱引用，生命周期只能存活到下次GC前），但只有Key是弱引用类型的，Value并不是弱引用。

ThreadLocalMap的成员变量：

static class ThreadLocalMap {
    /**
     * The initial capacity -- MUST be a power of two.
     */
    private static final int INITIAL_CAPACITY = 16;

    /**
     * The table, resized as necessary.
     * table.length MUST always be a power of two.
     */
    private Entry[] table;

    /**
     * The number of entries in the table.
     */
    private int size = 0;

    /**
     * The next size value at which to resize.
     */
    private int threshold; // Default to 0
}

复制代码

Hash冲突怎么解决

和HashMap的最大的不一样在于，ThreadLocalMap结构很是简单，没有next引用，也就是说ThreadLocalMap中解决Hash冲突的方式并不是链表的方式，而是采用线性探测的方式，所谓线性探测，就是根据初始key的hashcode值肯定元素在table数组中的位置，若是发现这个位置上已经有其余key值的元素被占用，则利用固定的算法寻找必定步长的下个位置，依次判断，直至找到可以存放的位置。

ThreadLocalMap解决Hash冲突的方式就是简单的步长加1或减1，寻找下一个相邻的位置。

/**
 * Increment i modulo len.
 */
private static int nextIndex(int i, int len) {
    return ((i + 1 < len) ? i + 1 : 0);
}

/**
 * Decrement i modulo len.
 */
private static int prevIndex(int i, int len) {
    return ((i - 1 >= 0) ? i - 1 : len - 1);
}

复制代码

显然ThreadLocalMap采用线性探测的方式解决Hash冲突的效率很低，若是有大量不一样的ThreadLocal对象放入map中时发送冲突，或者发生二次冲突，则效率很低。

因此这里引出的良好建议是：每一个线程只存一个变量，这样的话全部的线程存放到map中的Key都是相同的ThreadLocal，若是一个线程要保存多个变量，就须要建立多个ThreadLocal，多个ThreadLocal放入Map中时会极大的增长Hash冲突的可能。

ThreadLocalMap的问题

因为ThreadLocalMap的key是弱引用，而Value是强引用。这就致使了一个问题，ThreadLocal在没有外部对象强引用时，发生GC时弱引用Key会被回收，而Value不会回收，若是建立ThreadLocal的线程一直持续运行，那么这个Entry对象中的value就有可能一直得不到回收，发生内存泄露。

如何避免泄漏 既然Key是弱引用，那么咱们要作的事，就是在调用ThreadLocal的get()、set()方法时完成后再调用remove方法，将Entry节点和Map的引用关系移除，这样整个Entry对象在GC Roots分析后就变成不可达了，下次GC的时候就能够被回收。

若是使用ThreadLocal的set方法以后，没有显示的调用remove方法，就有可能发生内存泄露，因此养成良好的编程习惯十分重要，使用完ThreadLocal以后，记得调用remove方法。

ThreadLocal<Session> threadLocal = new ThreadLocal<Session>();
try {
    threadLocal.set(new Session(1, "Misout的博客"));
    // 其它业务逻辑
} finally {
    threadLocal.remove();
}

复制代码

三.ThreadLocal的应用场景

最多见的ThreadLocal使用场景为用来解决数据库链接、Session管理等。

private static ThreadLocal<Connection> connectionHolder
= new ThreadLocal<Connection>() {
public Connection initialValue() {
    return DriverManager.getConnection(DB_URL);
}
};
 
public static Connection getConnection() {
return connectionHolder.get();
}
复制代码

private static final ThreadLocal<Session> threadLocal = new ThreadLocal<Session>();

//获取Session
public static Session getCurrentSession(){
    Session session =  threadLocal.get();
    //判断Session是否为空，若是为空，将建立一个session，并设置到本地线程变量中
    try {
        if(session ==null&&!session.isOpen()){
            if(sessionFactory==null){
                rbuildSessionFactory();// 建立Hibernate的SessionFactory
            }else{
                session = sessionFactory.openSession();
            }
        }
        threadLocal.set(session);
    } catch (Exception e) {
        // TODO: handle exception
    }

    return session;
}

复制代码

private static final ThreadLocal threadSession = new ThreadLocal();
 
public static Session getSession() throws InfrastructureException {
    Session s = (Session) threadSession.get();
    try {
        if (s == null) {
            s = getSessionFactory().openSession();
            threadSession.set(s);
        }
    } catch (HibernateException ex) {
        throw new InfrastructureException(ex);
    }
    return s;
}
复制代码

每一个线程访问数据库都应当是一个独立的Session会话，若是多个线程共享同一个Session会话，有可能其余线程关闭链接了，当前线程再执行提交时就会出现会话已关闭的异常，致使系统异常。此方式能避免线程争抢Session，提升并发下的安全性。

使用ThreadLocal的典型场景正如上面的数据库链接管理，线程会话管理等场景，只适用于独立变量副本的状况，若是变量为全局共享的，则不适用在高并发下使用。

总结

每一个ThreadLocal只能保存一个变量副本，若是想要一个线程可以保存多个副本，就须要建立多个ThreadLocal。
ThreadLocal内部的ThreadLocalMap键为弱引用，会有内存泄漏的风险。
适用于无状态，副本变量独立后不影响业务逻辑的高并发场景。若是若是业务逻辑强依赖于副本变量，则不适合用ThreadLocal解决，须要另寻解决方案。