netty4.x ByteBuf 基本机制及其骨架实现

时间 2020-02-06

标签 netty4.x netty bytebuf 基本机制及其骨架实现栏目 Netty 繁體版

原文原文链接

概述

netty 是一个 NIO 框架，在 JDK API 已提供相对直接的 NIO Library 的状况下，几乎不多的软件系统会直接用 NIO 进行编程，也不多有开发者会直接使用 NIO 技术开发网络相关的程序。由于 native nio library 已饱受诟病，API 难用，容易出错，存在一些声称解决但还没解决的 bug（bug id = 6403933，JDK 1.7 声称解决了该 Bug，但实际上只是下降了该 bug 发生的几率），使用 native nio library 来开发可靠性、鲁棒性高的网络程序，工做量以及出错率都要更高，使用 netty 框架，就是为了解决这些问题。java

ByteBuffer 的忏悔

基于 NIO 非阻塞模型的编程，基本上是面向数据容器编程，BIO 与 NIO 除了它们在阻塞 IO 线程方面有所不一样外，它们在操做数据方面是有一些共性的，那就是从网络流中读数据，并放入一个容器中。
对于 BIO 来讲，大多数时候，这个容器就是一个字节数组编程

byte[] buf = new byte[8196];
int cnt = 0;
while ((cnt = input.read(buf)) != -1) {
    //......
}

在这里，容器就是指 buf 这个字节数组。而在 NIO 中，容器是指 ByteBuffer，因为 NIO 编程的复杂性，须要解决相似于 TCP 半包问题等，所以对这个容器的要求不只仅是“存储数据”那么简单，还但愿这个容器能提供另外的功能，这是 ByteBuffer 存在的缘由，它提供了一些方便的 API，让开发者操做底层的字节数组。
然而 ByteBuffer 存在几个不得人心的缺点：设计模式

API 功能有限
长度固定
读写时的手工操做

从 ByteBuffer 的源码里能够看到，它用 4 个下标来辅助管理本身身上的数据，参见它的父类 java.nio.Buffer
数组

capacity 是 ByteBuffer 的总容量，一旦设定不能改变，就像一个水缸同样，水缸的大小永远是你看到它的时候那么大，它不会变大也不会变小，最多能装多少水是肯定的；
在这里假设一种先向 ByteBuffer 写数据后再读出来的场景。往 ByteBuffer 里写入数据时，写入多少数据，position 这个下标就会增长多少，换言之，在往 ByteBuffer 写数据时，position 指向的是下一个能够写入的位置，而 limit 此时会和 capacity 同样大。开始读数据时，第一个，要知道从哪里读，第二个，要知道读到哪里为止，为此 ByteBuffer 提供了一个 flip() 方法，这个 flip() 方法将 limit 置为 position 位置，此时 limit 表明要读到哪里为止，再将 position 位置置为 0，此时 position 表明要从哪里开始读。微信

所以，在读的时候，读取 position 到 limit 之间的数据，就能读到上一次写入的数据。但不得不说，这种方法显得有点笨拙，不太人性化，这意味着在编写代码的时候，要时刻谨记写完数据后，读数据以前，要先调用 flip 方法，这种“不著名”的潜规则，容易让开发者趟坑。网络

ByteBuf 让人耳目一新

netty 中的 ByteBuf 采用了新的作法，只用两个下标来辅助管理数据，分别是 readerIndex 和 writerIndexapp

readerIndex 表明当前读取的位置，writerIndex 表明下一个能够写入的位置，写入一部分数据后，writerIndex 往右移动，而 readerIndex 和 writeIndex 之间的数据就变为可读的了。框架

若是原先写入了 N 个长度的数据，接下来读取 M (M < N)个长度的数据，那么读取后 ByteBuf 就变成下面的样子ide

咱们再也不须要那笨拙的 flip 方法了，只须要关注 readerIndex 与 writerIndex。工具

ByteBuf 谱系

在 netty4.x 中，ByteBuf 是一个抽象类，但它也是在十分抽象，由于它定义的全部方法都是抽象方法，若是换我来想，我会想怎么不定义为一个 Interface 呢，ByteBuf 类也加了一个注解

@SuppressWarnings("ClassMayBeInterface")

但这么作其实无伤大雅，留着抽象类的身份，猜想是考虑到了之后可能增长工具类方法或者公共方法。ByteBuf 下的子类以下图所示：

除了 AbstractByteBuf 类，其它直接的子类都给人一种有“特殊做用”的感受，好比说 EmptyByteBuf。最主要的类仍是 AbstractByteBuf 类，它定义了大多数 ByteBuf 功能的公共逻辑代码，在 netty 应用程序的开发中，用到的 ByteBuf 的功能，以及 ByteBuf 的具体实例，都跟它有关。

特殊做用的类

与 AbstractByteBuf 同级的类有 EmptyByteBuf，UnreleasableByteBuf，SwappedByteBuf 以及 ReplayingDecoderBuffer。想要解读这些有特殊做用的类，须要先了解字节序和引用计数。

字节序

两个计算机系统之间通讯，经过网络发送字节数据，双方必须为字节数据的顺序达成一致的协议，不然将没法对数据进行正确的解析，不一样的计算机体系结构有不一样的字节序，字节序可分为大端字节序(big-endian)和小端字节序(little-endian)。

小端就是低位字节排放在内存的低地址端，高位字节排放在内存的高地址端
大端就是高位字节排放在内存的低地址端，低位字节排放在内存的高地址端

以数字 0x12 34 56 78为例，在大端模式下，其存储的形式为：

低地址 -----------------> 高地址
0x12  |  0x34  |  0x56  |  0x78

小端模式下，其存储形式为：

低地址 ------------------> 高地址
0x78  |  0x56  |  0x34  |  0x12

通常状况下，基于 TCP 的网络通讯约定采用大端字节序，而机器 CPU 的字节序则各有各的不一样。

SwappedByteBuf 与字节序

SwappedByteBuf 这个类的命名并无直接地反映出类的做用，在 ByteBuf 类中定义了一个方法，用于设置该 ByteBuf 中的数据采用的是哪一种字节序存储数据：

public abstract ByteBuf order(ByteOrder endianness);

netty 中的 ByteBuf 默认是使用 big-endian 的，若是须要修改字节序，意味着读写数据的时候要进行顺序的转换，通常状况下咱们会直接在 ByteBuf 的读写方法里去作修改，但那样意味着要修改不少个方法，netty 的作法是为每一个 ByteBuf 集成一个 SwappedByteBuf，做为自身的字节序包装器。以 AbstractByteBuf 的 order 方法为例：

@Override
    public ByteBuf order(ByteOrder endianness) {
        if (endianness == null) {
            throw new NullPointerException("endianness");
        }
        if (endianness == order()) {
            return this;
        }

        SwappedByteBuf swappedBuf = this.swappedBuf;
        if (swappedBuf == null) {
            this.swappedBuf = swappedBuf = new SwappedByteBuf(this);
        }
        return swappedBuf;
    }

AbstractByteBuf 组合了一个 SwappedByteBuf 实例，当它的 order 方法被调用来设置字节序时，若是设置的字节序与自身的字节序不一样，那么就将本身披上 SwappedByteBuf 外套，返回自身。接下来看 SwappedByteBuf 的具体实现，能够发现，SwappedByteBuf 里维护了被它包装的 ByteBuf，以及新的 ByteOrder。

public final class SwappedByteBuf extends ByteBuf {

    private final ByteBuf buf;
    private final ByteOrder order;

    public SwappedByteBuf(ByteBuf buf) {
        if (buf == null) {
            throw new NullPointerException("buf");
        }
        this.buf = buf;
        if (buf.order() == ByteOrder.BIG_ENDIAN) {
            order = ByteOrder.LITTLE_ENDIAN;
        } else {
            order = ByteOrder.BIG_ENDIAN;
        }
    }

    ......  
}

与字节序无关的操做，都 delegate 给原来的 buf，例如：

@Override
    public int capacity() {
        return buf.capacity();
    }

而与字节序有关的操做，则根据当前的字节序，对数据进行反排序处理，例如 writeInt 方法：

@Override
    public ByteBuf writeInt(int value) {
        buf.writeInt(ByteBufUtil.swapInt(value));
        return this;
    }

/**
     * Toggles the endianness of the specified 32-bit integer.
     */
    public static int swapInt(int value) {
        return Integer.reverseBytes(value);
    }

一样，除了写数据相关的方法，读数据相关的方法也是这么处理的。

引用计数

netty 中 ByteBuf 用来做为数据的容器，是一种频繁被建立和销毁的对象，ByteBuf 须要的内存空间，能够在 JVM Heap 中申请分配，也能够在 Direct Memory 中申请，其中在 Direct Memory 中分配的 ByteBuf，其建立和销毁的代价比在 JVM Heap 中的更高，但抛开哪一个代价高哪一个代价低不说，光是频繁建立和频繁销毁这一点，就已奠基了效率不高的基调。
netty 中支持 ByteBuf 的池化，而引用计数就是实现池化的关键技术点，不过并不是只有池化的 ByteBuf 才有引用计数，非池化的也会有引用计数。
ByteBuf 类实现了 ReferenceCounted 接口，该接口标记一个类是一个引用计数管理对象。

public abstract class ByteBuf implements ReferenceCounted, Comparable<ByteBuf>

ReferenceCounted 接口定义了这几个方法：

public interface ReferenceCounted {
    int refCnt();

    ReferenceCounted retain();

    ReferenceCounted retain(int increment);

    boolean release();

    boolean release(int decrement);
}

每个引用计数对象，都维护了自身的引用计数，当第一次被建立时，引用计数为1，经过 refCnt() 方法能够获得当前的引用计数，retain() retain(int increment) 增长自身的引用计数，而 release() 和 release(int increment) 则减小当前的引用计数，若是引用计数达到 0，而且当前的 ByteBuf 被释放成功，那这两个方法的返回值为 true。须要注意的是，各类不一样类型的 ByteBuf 本身决定机子的释放方式，若是是池化的 ByteBuf，那么就会进池子，若是不是池化的，则销毁底层的字节数组引用或者释放对应的堆外内存。
经过 AbstractReferenceCountedByteBuf 这个类的 release 方法实现，能够看出大概的执行逻辑：

@Override
    public final boolean release() {
        for (;;) {
            int refCnt = this.refCnt;
            if (refCnt == 0) {
                throw new IllegalReferenceCountException(0, -1);
            }

            if (refCntUpdater.compareAndSet(this, refCnt, refCnt - 1)) {
                if (refCnt == 1) {
                    deallocate();
                    return true;
                }
                return false;
            }
        }
    }

释放对象的方法定义在 deallocate() 方法里，而它是个抽象方法。
对于非池化的 heap ByteBuf 来讲，释放对象实际上就是释放底层字节数组的引用：

@Override
    protected void deallocate() {
        array = null;
    }

对于非池化的 direct ByteBuf 来讲，释放对象实际上就是释放堆外内存：

@Override
    protected void deallocate() {
        ByteBuffer buffer = this.buffer;
        if (buffer == null) {
            return;
        }

        this.buffer = null;

        if (!doNotFree) {
            PlatformDependent.freeDirectBuffer(buffer);
        }

        if (leak != null) {
            leak.close();
        }
    }

对于池化的 ByteBuf 来讲，就是把本身归还到对象池里：

@Override
    protected final void deallocate() {
        if (handle >= 0) {
            final long handle = this.handle;
            this.handle = -1;
            memory = null;
            chunk.arena.free(chunk, handle);
            if (leak != null) {
                leak.close();
            } else {
                recycle();
            }
        }
    }

UnreleasableByteBuf 与引用计数

顾名思义，这个类就是不可释放的 ByteBuf，它也是一个包装器模式的引用，被它包装的 ByteBuf 不会受引用计数的影响，不会被释放，它对 ReferenceCounted 接口的实现以下所示：

@Override
    public ByteBuf retain(int increment) {
        return this;
    }

    @Override
    public ByteBuf retain() {
        return this;
    }

    @Override
    public boolean isReadable(int size) {
        return buf.isReadable(size);
    }

    @Override
    public boolean isWritable(int size) {
        return buf.isWritable(size);
    }

    @Override
    public int refCnt() {
        return buf.refCnt();
    }

可见它直接忽略了对 retain 和 release 方法的调用效果，这种“不可释放的 ByteBuf”在什么状况下会用到呢，在一些静态的具备固定内容而且内容不改变的 ByteBuf 时候会用到，由于很是经常使用，因此不须要释放，会更有效率。例如在处理 HTTP 协议时候，常常须要返回带有回车换行的数据，这里回车换行就能够定义为一个静态的 ByteBuf，而且不容许释放。这有点相似于设计模式中单例模式的那个“单例”。

EmptyByteBuf

EmptyByteBuf 是一个没有任何内容，也不容许读或者写的 ByteBuf，它存在的目的是为了在调用 ByteBufAllocator 建立新 ByteBuf 的时候，若是指定容量大小为0，则返回一个 EmptyByteBuf，这里仅仅是单例模式的一个运用

ReplayingDecoderBuffer

这个 ByteBuf 专用于 ReplayingDecoder，这个 decoder 主要是为了完成对一段已知长度报文进行全包获取，由于这个场景在网络编程中太经常使用了，所以 netty 单独实现了一个 ReplayingDecoder 来应对这种场景。这里暂时不深刻讲解 ReplayingDecoder。

ByteBuf 骨架实现

AbstractByteBuf 是 ByteBuf 的骨架实现，它实现了大部分与 ByteBuf 有关的功能方法，把不肯定的行为留为抽象方法，交给它的实现者去实现。

setter 与 getter

为了实践面向对象封装的特性，见过太多类在定义其变量的 setter 和 getter 方法时，清一色地使用 setXXX(int xxx)和 getXXX()。不过 netty 的编码风格中，它的 setter 和 getter 方法是这样的：

public ByteBuf readerIndex(int readerIndex); // setter
public int readerIndex(); //getter

方法名同名，但参数列表和返回值不同。而且对于 setter 类方法来讲，它支持更加 modern 的作法，那就是方法的链式调用，setter 后返回自身，立马能够进行下一次方法调用。
但在 AbstractByteBuf 中仍是有以 set 开头的的方法的，好比说：

@Override
    public ByteBuf setIndex(int readerIndex, int writerIndex) {
        if (readerIndex < 0 || readerIndex > writerIndex || writerIndex > capacity()) {
            throw new IndexOutOfBoundsException(String.format(
                    "readerIndex: %d, writerIndex: %d (expected: 0 <= readerIndex <= writerIndex <= capacity(%d))",
                    readerIndex, writerIndex, capacity()));
        }
        this.readerIndex = readerIndex;
        this.writerIndex = writerIndex;
        return this;
    }

而其它的 set 开头的方法，则不能说它是 setter 了，由于这些方法其实是在操做数据，为某个下标位置填入数据，例如:

public ByteBuf setByte(int index, int value);

读取数据

AbstractByteBuf 中有两类读取数据的方法，一类以 get 开头，例如 getInt()，另外一类以 read 开头，例如readInt()。这二者的区别是，get 不会致使 readerIndex 的增长，而 read 会致使 readerIndex 的增长；另外一个区别是，read 只能读取已经被写入的数据，也就是说，读取的位置不能超过 writeIndex，而 get 却能够在任意位置读取，只要不超过 capacity 就能够。经过如下代码能够看出这两点区别：

@Override
    public int getInt(int index) {
        checkIndex(index, 4);
        return _getInt(index);
    }

    protected final void checkIndex(int index, int fieldLength) {
        ensureAccessible();
        if (fieldLength < 0) {
            throw new IllegalArgumentException("length: " + fieldLength + " (expected: >= 0)");
        }
        if (index < 0 || index > capacity() - fieldLength) {
            throw new IndexOutOfBoundsException(String.format(
                    "index: %d, length: %d (expected: range(0, %d))", index, fieldLength, capacity()));
        }
    }

    @Override
    public int readInt() {
        checkReadableBytes(4);
        int v = _getInt(readerIndex);
        readerIndex += 4;
        return v;
    }

    protected final void checkReadableBytes(int minimumReadableBytes) {
        ensureAccessible();
        if (minimumReadableBytes < 0) {
            throw new IllegalArgumentException("minimumReadableBytes: " + minimumReadableBytes + " (expected: >= 0)");
        }
        if (readerIndex > writerIndex - minimumReadableBytes) {
            throw new IndexOutOfBoundsException(String.format(
                    "readerIndex(%d) + length(%d) exceeds writerIndex(%d): %s",
                    readerIndex, minimumReadableBytes, writerIndex, this));
        }
    }

在这里也体现了出了先前提到的引用计数的做用，在读取的时候，会调用 ensureAccessible() 方法来肯定当前本身的引用计数是多少。若是是 0，则这次读取时非法的。

protected final void ensureAccessible() {
        if (refCnt() == 0) {
            throw new IllegalReferenceCountException(0);
        }
    }

一样，并不是只有读取数据才会判断引用计数，写入数据的时候也会判断引用计数。
真正读取数据的方法，定义成了抽象方法，供不一样的实现者去实现，例如 _getInt() 方法，Heap ByteBuf 的实现是直接读取底层的数组：

@Override
    protected int _getInt(int index) {
        return  (array[index]     & 0xff) << 24 |
                (array[index + 1] & 0xff) << 16 |
                (array[index + 2] & 0xff) <<  8 |
                 array[index + 3] & 0xff;
    }

而 Direct ByteBuf，则是委托给了 ByteBuffer ：

@Override
    protected int _getInt(int index) {
        return buffer.getInt(index);
    }

写入数据

与读取数据同样，写入数据也分为改变 writerIndex 和不改变 writerIndex 的方法，分别是 write 开头和 set 开头。其中 set 开头的方法和读取数据时的 get 开头的方法同样，都只是检查一下有没有超过 capacity，并不会去检查 writerIndex 或者是 readerIndex，至关于说这些方法能够在任意一个地方写入数据，只要不超过 capacity，以下所示：

@Override
    public ByteBuf setInt(int index, int value) {
        checkIndex(index, 4);
        _setInt(index, value);
        return this;
    }

而 write 开头的方法的调用，则会对应着 writerIndex 的增加：

@Override
    public ByteBuf writeInt(int value) {
        ensureWritable(4);
        _setInt(writerIndex, value);
        writerIndex += 4;
        return this;
    }

注意在这里，写入操做还伴随着对是否有足够的空间写入的肯定，继而伴随着 ByteBuf 的动态扩容。

ByteBuf 动态扩容机制

若是当前没有足够的空间写入数据了，ByteBuffer 会直接报错，而 ByteBuf 则会进行动态扩容，其扩容的主要逻辑在如下的方法里：

@Override
    public ByteBuf ensureWritable(int minWritableBytes) {
        if (minWritableBytes < 0) {
            throw new IllegalArgumentException(String.format(
                    "minWritableBytes: %d (expected: >= 0)", minWritableBytes));
        }

        if (minWritableBytes <= writableBytes()) {
            return this;
        }

        if (minWritableBytes > maxCapacity - writerIndex) {
            throw new IndexOutOfBoundsException(String.format(
                    "writerIndex(%d) + minWritableBytes(%d) exceeds maxCapacity(%d): %s",
                    writerIndex, minWritableBytes, maxCapacity, this));
        }

        // Normalize the current capacity to the power of 2.
        int newCapacity = calculateNewCapacity(writerIndex + minWritableBytes);

        // Adjust to the new capacity.
        capacity(newCapacity);
        return this;
    }

首先的前提是，扩容虽好，但并不意味着能够无限扩容，所以有一个 maxCapaciy 变量限制着：你能够扩容，但不能够无限扩容，我容许你走进个人世界，但不容许你在个人世界里走来走去。
扩容的逻辑主要分为两块：

计算新的容量
扩展至新容量

计算新容量的方法以下所示：

private int calculateNewCapacity(int minNewCapacity) {
        final int maxCapacity = this.maxCapacity;
        final int threshold = 1048576 * 4; // 4 MiB page

        if (minNewCapacity == threshold) {
            return threshold;
        }

        // If over threshold, do not double but just increase by threshold.
        if (minNewCapacity > threshold) {
            int newCapacity = minNewCapacity / threshold * threshold;
            if (newCapacity > maxCapacity - threshold) {
                newCapacity = maxCapacity;
            } else {
                newCapacity += threshold;
            }
            return newCapacity;
        }

        // Not over threshold. Double up to 4 MiB, starting from 64.
        int newCapacity = 64;
        while (newCapacity < minNewCapacity) {
            newCapacity <<= 1;
        }

        return Math.min(newCapacity, maxCapacity);
    }

计算新容量的逻辑很简单，若是指望的新容量不超过 4MB，则从 64 字节开始，一直翻倍，直到超过时望的新容量，此时新的容量不大于 4MB，而且是 64 的倍数。若是指望的新容量已经超过了 4MB，那么就再增长 4 MB 的倍数，至因而1倍仍是2倍仍是N倍，由指望的容量决定。
计算完新的容量，接下来就须要把 ByteBuf 的容量扩展至新的容量，扩展容量对于不一样类型的 ByteBuf 来讲，其实现方式也不同，例如对于 Heap ByteBuf 来讲，扩容就意味着数组拷贝，以下所示：

@Override
    public ByteBuf capacity(int newCapacity) {
        ensureAccessible();
        if (newCapacity < 0 || newCapacity > maxCapacity()) {
            throw new IllegalArgumentException("newCapacity: " + newCapacity);
        }

        int oldCapacity = array.length;
        if (newCapacity > oldCapacity) {
            byte[] newArray = new byte[newCapacity];
            System.arraycopy(array, readerIndex(), newArray, readerIndex(), readableBytes());
            setArray(newArray);
        } else if (newCapacity < oldCapacity) {
            byte[] newArray = new byte[newCapacity];
            int readerIndex = readerIndex();
            if (readerIndex < newCapacity) {
                int writerIndex = writerIndex();
                if (writerIndex > newCapacity) {
                    writerIndex(writerIndex = newCapacity);
                }
                System.arraycopy(array, readerIndex, newArray, readerIndex, writerIndex - readerIndex);
            } else {
                setIndex(newCapacity, newCapacity);
            }
            setArray(newArray);
        }
        return this;
    }

这是 ByteBuf 比 ByteBuffer 更好的一个地方，既有 maxCapacity 防止无限扩容，又能在容许的范围内动态扩展容量，开发者无须关心。至于扩展的梯段为何是 4MB，还没办法知道这个值是怎么来的，应该是通过大量的测试或者以经验来判断的。

丢弃一部分数据

前面提到一张图，当写入数据后读取一部分数据，被读取后的那一部分，实际上就变成了能够丢弃的数据了，不然就会有一种“占着茅坑不拉shi”的感受了，白白占用了大量的空间

AbstractByteBuf 提供了方法来对这些数据进行丢弃，原理其实就是将有效的数据移位，重置 readerIndex 和 writerIndex，对于 Heap ByteBuf 来讲，这一般也意味着数组拷贝。

@Override
    public ByteBuf discardReadBytes() {
        ensureAccessible();
        if (readerIndex == 0) {
            return this;
        }

        if (readerIndex != writerIndex) {
            setBytes(0, this, readerIndex, writerIndex - readerIndex);
            writerIndex -= readerIndex;
            adjustMarkers(readerIndex);
            readerIndex = 0;
        } else {
            adjustMarkers(readerIndex);
            writerIndex = readerIndex = 0;
        }
        return this;
    }

一般，数组拷贝是一个关于性能的敏感词，过多的数组拷贝，意味着效率低，所以除非能确承认以丢弃的数据占整个 ByteBuf 的大部分，不然不要轻易去显式丢弃那些已经读取的数据。