Java知识探究一:关于IO类库

时间 2019-11-08

标签 java 知识探究关于栏目 Java 繁體版

原文原文链接

通过组织考察，令我突然发觉本身在最经常使用的Java中也有不少不明白的地方，实为平身一大憾事，今天特地抽时间将这些点滴记录下来，与你们一块儿分享java

第一批想整理的知识点以下：算法

Java的IO探究，IO的整个结构与发展，顺带附上公司某小工写的断点续传代码学习。
Java的异常机制，关于编译时异常和运行时异常的探究。
JavaCommon包的理解，尤为是collection包的一些小见解，其实容器嘛，什么样的Utils也逃不出一些基本的范畴，好比存、取、排序、安全性、校验等等等。

闲话很少说，先开始今天的主题，研究一下IO的整个结构vim

从体系结构上划分，IO系统总共分为两大模块， IO和NIO（非阻塞），IO诞生于JDK1.4以前，JDK1.4时，产生了NIO，而且借用NIO重构了部分IO的代码，好比FileInputStream中增长了对NIO进行支持的getChannel()方法，再好比Reader和FileReader基本用nio所有重写了。windows

1、Think in IO数组

IO从实现上，大体分为字节流和字符流两种：缓存

字节流。对文件的读写操纵以字节为单位，说的直白一点，就是操做byte，byte数组。对应无符号整数的话，就是read方法的正常返回值范围在[0，255]之间，范围有限的返回值有不少优势，比较有表明性的一个就是能够流来作一个简单的zip实现，算法的话，采用huffman树。固然，一个一个字节操做的话，效率不高，利用Buffer则效率提升很多。可是字节流有个问题，那就是在操做文本文件的时候，对于编码会有不少多余的代码，例子以下

FileInputStream is = new FileInputStream("F:\\books\\base\\vim经常使用指令.txt");
        byte[] buff = new byte[BUFFER_SIZE];
        int readSize = 0;
        while ((readSize = is.read(buff)) != -1)
        {
            System.out.println(readSize);
            if(readSize<1024){
                byte[] tmp = new byte[readSize];
                System.arraycopy(buff, 0, tmp, 0, readSize);
                System.out.print(new String(tmp, "GBK"));
            }else{
                System.out.print(new String(buff, "GBK"));
            }
        }

字符流。以字符做为单元进行操做，Reader内部实现其实就是以char或者char数组做为缓存容器的。操做文本文件时候方便许多。编码采用系统默认的编码格式。找了很久才找到代码的说+_+，代码隐藏的很深，从Reader找到ImputStreamReader，再到StreamDecoder再到nio包中的Charset，最终是优先获取系统中的环境变量，System.getProperties()也能够获取，windows7中文版的话，获取到的是“ file.encoding=GB18030”

/**
     * Returns the default charset of this Java virtual machine.
     *
     * <p> The default charset is determined during virtual-machine startup and
     * typically depends upon the locale and charset of the underlying
     * operating system.
     *
     * @return  A charset object for the default charset
     *
     * @since 1.5
     */
    public static Charset defaultCharset() {
        if (defaultCharset == null) {
        synchronized (Charset.class) {
        java.security.PrivilegedAction pa =
            new GetPropertyAction("file.encoding");
        String csn = (String)AccessController.doPrivileged(pa);
        Charset cs = lookup(csn);
        if (cs != null)
            defaultCharset = cs;
                else 
            defaultCharset = forName("UTF-8");
            }
    }
    return defaultCharset;
    }

下面详细叙述一下字节流安全

1、InputStream 和 OutputStream 是两个 abstact 类，对于字节为导向的 stream 都扩展这两个鸡肋（基类 ^_^ ） ; app

FileInputStream，打开本地文件的流，经常使用，有3个构造方法
public FileInputStream(File file)
public FileInputStream(String name)
public FileInputStream(FileDescriptor fdObj) 值得强调，这个构造是不能直接用的，FileDescriptor 至关于打开文件的句柄，能够用一个文件流建立另外一个，这样建立的流至关因而一个。一个流关闭的话，另外一个也不能读取。
PipedInputStream，必须与PipedOutputStream一块儿使用，必须是两个或者多个线程中使用，相似生产者消费者模型， PipedOutputStream将数据写到共享的buffer数组中，通知PipedInputStream读取。
有两点注意事项：async

a）使用PipedInputStream的read方法时候要注意，若是缓冲区没有数据的话，会阻塞当前线程，在主线程中运行的话，会卡住不动。ide

b）PipedOutputStream所在的线程若是中止，那么PipedOutputStream所使用的资源也会回收，会形成pipe 的“broken”，PipedInputStream的read方法也会报错。

“A pipe is said to be broken if a thread that was providing data bytes to the connected piped output stream is no longer alive. ”
FilterInputStream，自己是不能被实例化的，是BufferedInputStream等的父类，其实不建立这个类也能够实现它的子类，这个类内部的方法几乎所有都是复用父类的方法。其实它存在的意义更可能是表明一个抽象，意思是在InputStream的基础之上对返回数据进行了从新包装或者处理，处理缘由可能各不相同，因而又了各不相同的子类。
LineNumberInputStream，这个类是字节流和字符流转换中的失败产物，已经肯定为被废弃，废弃的理由是在字节流中强制的判断读取换行，不考虑编码方面的问题。先无论功能能不能实现，首先从抽象层次上面就有欠缺。挪到字符流里面就皆大欢喜。对应的有LineNumberReader这个类可使用。具体参见LineNumberReader详解。
DataInputStream，直接读取目标文件的byte，拼接或转化byte为其余基本类型，好比下面方法
```
public final int readInt() throws IOException {
        int ch1 = in.read();
        int ch2 = in.read();
        int ch3 = in.read();
        int ch4 = in.read();
        if ((ch1 | ch2 | ch3 | ch4) < 0)
            throw new EOFException();
        return ((ch1 << 24) + (ch2 << 16) + (ch3 << 8) + (ch4 << 0));
    }
```
对于基本类型能够这样转化，可是对于float和double，各自用了Float类和Double类中的native方法进行转化，想来与操做系统底层有关系。
```
public final double readDouble() throws IOException {
    return Double.longBitsToDouble(readLong());
    }
```
惟一实现的比较复杂的是readUTF方法，须要读取所有数据，必须是符合格式的，须要用DataOutputStream的writeUTF进行对应的写。DataInputStream在实际运用中，仍是应该与DataOutputStream一块儿使用，否则的话，意义不是十分大。

BufferedInputStream，初始化一个8192大小的缓存，提升效率用，调用API上面没有任何不一样，只是减小了直接读取系统数据的次数。内部持有一个普通的inputStream，只有缓冲区空了之后，才真正调用inputStream的read去写满缓冲区，因此直接用BufferedInputStream的read方法能够提升效率。
有点意思的是这个类里面用了一个AtomicReferenceFieldUpdater对象来进行对volatile类型缓冲byte数组的更新和替换，这个类的compareAndSet方法带有原子性质的比较和更新。

/**
     * Atomic updater to provide compareAndSet for buf. This is
     * necessary because closes can be asynchronous. We use nullness
     * of buf[] as primary indicator that this stream is closed. (The
     * "in" field is also nulled out on close.)
     */
    private static final 
        AtomicReferenceFieldUpdater<BufferedInputStream, byte[]> bufUpdater = 
        AtomicReferenceFieldUpdater.newUpdater
        (BufferedInputStream.class,  byte[].class, "buf");// 建立原子更新器
...
/**
     * Fills the buffer with more data, taking into account
     * shuffling and other tricks for dealing with marks.
     * Assumes that it is being called by a synchronized method.
     * This method also assumes that all data has already been read in,
     * hence pos > count.
     */
    private void fill() throws IOException {
        byte[] buffer = getBufIfOpen();
    if (markpos < 0)
        pos = 0;        /* no mark: throw away the buffer */
    else if (pos >= buffer.length)    /* no room left in buffer */
        if (markpos > 0) {    /* can throw away early part of the buffer */
        int sz = pos - markpos;
        System.arraycopy(buffer, markpos, buffer, 0, sz);
        pos = sz;
        markpos = 0;
        } else if (buffer.length >= marklimit) {
        markpos = -1;    /* buffer got too big, invalidate mark */
        pos = 0;    /* drop buffer contents */
        } else {        /* grow buffer */
        int nsz = pos * 2;
        if (nsz > marklimit)
            nsz = marklimit;
        byte nbuf[] = new byte[nsz];
        System.arraycopy(buffer, 0, nbuf, 0, pos);
                if (!bufUpdater.compareAndSet(this, buffer, nbuf)) {//进行更新比较， 若是buf对象和buffer相同， 那么进行更新，不一样的话，不更新
                    // Can't replace buf if there was an async close.
                    // Note: This would need to be changed if fill()
                    // is ever made accessible to multiple threads.
                    // But for now, the only way CAS can fail is via close.
                    // assert buf == null;
                    throw new IOException("Stream closed");
                }
                buffer = nbuf;
        }
        count = pos;
    int n = getInIfOpen().read(buffer, pos, buffer.length - pos);
        if (n > 0)
            count = n + pos;
    }

PushBackInputStream，特色是unread()方法，做用是在读取流的过程当中自行添加入字节或者字节数组，进行从新读取，小说中随机插入的广告url却是能够用这个实现，冷不丁的在读取过程当中插入一个urlbyte数组，倒也方便。

ByteArrayInputStream，特色是内存操做，读取的数据所有都在缓存数组中，构造方法以下

public ByteArrayInputStream(byte buf[])
public ByteArrayInputStream(byte buf[], int offset, int length)

SequenceInputStream，构造时候能见多个流进行拼接，依次进行read，其中包含的流会自动进行关闭，在调用时候进行关闭

public int read() throws IOException {
    if (in == null) {
        return -1;
    }
    int c = in.read();
    if (c == -1) {
        nextStream();// 读完一个流之后， 自动变动下一个，可是这个方法不是线程安全的， 两个一块儿调，后果十分严重
        return read();
    }
    return c;
    }

/**
     *  Continues reading in the next stream if an EOF is reached.
     */
    final void nextStream() throws IOException {
    if (in != null) {
        in.close();
    }

        if (e.hasMoreElements()) {
            in = (InputStream) e.nextElement();
            if (in == null)
                throw new NullPointerException();
        }
        else in = null;

    }

StringBufferInputStream，这个类已经被废弃，缘由是错误的对字节流进行向字符流的转化，忽略了编码问题。值得一提的是，这个类里基本全部部分方法都是线程安全的。swing的某个类中还引用了这个方法。
ObjectInputStream，这个类能够说的比较多

2、OutputStream，基本每一个InputStream都有一个对应的OutputStream，来实现对应的功能，基本全都是抽象方法。

FileOutputStream，FileDescriptor至关于句柄，既然是句柄，就会有多个流可能使用之，因此FileDescriptor有incrementAndGetUseCount方法，用来线程安全的进行引用计数器+1的操做。另外值得注意的是，FileOutputStream还有追加写入的构造方法

public FileOutputStream(File file, boolean append)
        throws FileNotFoundException
    {
        String name = (file != null ? file.getPath() : null);
    SecurityManager security = System.getSecurityManager();
    if (security != null) {
        security.checkWrite(name);
    }
        if (name == null) {
            throw new NullPointerException();
        }
    fd = new FileDescriptor();
        fd.incrementAndGetUseCount();
        this.append = append;
    if (append) {
        openAppend(name);
    } else {
        open(name);
    }
    }

PipedOutputStream，须要与InputStream进行配合使用，不在赘述