定义

Java 中字符串，用来定义一串字符。说白了就是一串 char 的数组。

public final class String
    implements java.io.Serializable, Comparable, CharSequence {
    /** The value is used for character storage. */
    private final char value[];

    /** Cache the hash code for the string */
    private int hash; // Default to 0
    ...
}

实现三个接口：

Serializable：序列化。
Comparable：实现 compareTo() 方法进行比较，A>B 返回大于正整数、A
CharSequence：抽象 length()、subSequence()、charAt()。toString() 等方法。常用的实现此接口的类有 StringBuffer、StringBuilder。

字符串的拼接

"+"

Java 中使用 "+" 进行字符串的拼接，其实是编译器对这种方式进行了处理。其原理还是通过转换为 StringBuilder 或 StringBuffer，并通过其 append() 方法进行拼接。

String s = "a"; int i = 1; String result = s+i;

编译后

String s = "a"; int i = 1; String result = new StringBuilder().append(s).append(i).toString();

另外，如果拼接的双方为可知的字符串，编译器会优化处理。由于字符串的不可变性，编译器会自动拼接。

System.out.println("a" + "b");

编译后

System.out.println("ab");

"+" 的效率

由于编译器隐式创建，如果在遍历中大量使用拼接，会造成频繁创建对象。

String s = "a"; for (int i = 0; i < 1000; i++) { s += "b"; }

编译后

String s = "a"; for (int i = 0; i < 1000; i++) { s += new StringBuilder().append("b").toString(); } System.out.print(s);

这个过程会创建 1000 个 StringBuilder 对象，无疑会造成内存浪费。
优化后的做法是创建一个 StringBuilder 对象进行拼接。

StringBuilder stringBuilder = new StringBuilder("a"); for (int i = 0; i < 1000; i++) { stringBuilder.append("b"); } System.out.print(stringBuilder.toString());

内存区域

在Java的内存分配中，总共3种常量池，分别是Class常量池、运行时常量池、字符串常量池。

String 类型的数据就存放在字符串常量池中，是在 Class 文件经过验证和准备阶段之后存入。在 HotSpot VM 环境字符串存放在常量池的 StringTable 中，是一个 Hash 表。

JDK 1.7 之后把字符串存放在了堆中，可能是为了方便在更大的内存中分配和使用字符串。之前是存放在方法区的常量池中。

由于存放区域的改变，直接影响了 "==" 符号对比的结果。看下面代码：

// JDK 1.8 环境 String s1 = "String"; String s2 = "String"; String s3 = new String("String"); System.out.println(s1 == s2);//true System.out.println(s1 == s3);//false

在 JDK 1.8 的环境下，可以看到 s1 == s3 为 false，这是由 s1 和 s3 在堆内存中指向的地址不一致造成的。
而在 JDK 1.6 的环境下， s1 == s3 可能为 true，因为它们都指向常量区相同字符串的地址。

intern方法

直接声明的字符串会直接存放在常量池中，如果想在运行期向常量池存放字符串，可以使用 intern 方法。

这是一个 native 方法，调用后如果常量池存在该字符串则直接返回。如果不存在，在常量池创建并返回该字符串。

在批量创建字符串的时候，使用 intern 方法虽然比直接生成浪费点时间，但是会大大减少在堆内存中创建字符串的数量，节省内存空间。

==

对于字符串来说，equals 比较的是字符串的内容。而 == 比较的是两者的内存地址。

// JDK 1.8 环境 String s0 = new String("1") + new String("x"); String s2 = s0.intern(); String s1 = "1x"; System.out.println(s2 == s0);//true System.out.println(s2 == s1);//true System.out.println(s0 == s1);//true System.out.println("==="); String s3 = new String("1") + new String("x"); String s5 = s3.intern(); String s4 = "1x"; System.out.println(s5 == s3);//false s3 在堆内存有自己指向的地址，s5 指向的是字符串 1x 的地址 System.out.println(s5 == s4);//true System.out.println(s3 == s4);//flase 同理，不同地址不匹配 System.out.println(s5 == s0);//true System.out.println(s4 == s0);//true System.out.println("==="); System.out.println(s3 == s0);//false

为什么字符串不可变

从定义上说，String 是一个字符数组 final char[]，final 类型的不可变。

为什么会设计成不可变：为了效率和安全。

效率：既然是不可变的，那么在常量池中进行分配和搜索就会快很多。

安全：两种情况下的例子

作为参数传递，如果一个字符串传到了方法中，进行了拼接。由于不可变性，该字符串还是原来的信息。但是如果传入了 StringBuilder 对象，进行 append，方法结束后该字符串的信息就发生了变化。

引用指向的变化，可能会影响原信息。

public static void main(String[] args) { StringBuilder s1 = new StringBuilder("a"); StringBuilder s2 = s1; s2.append("b"); System.out.println(s1);//ab System.out.println(s2);//ab }

可以看到原来的 s1，被 s2 引用以后再拼接，原来的 s1 的数据也发生的变化。这是由于 s1 指向了一个字符串，s2 修改了引用的字符串，所以导致他们的数据都发生了变化。

StringBuilder、StringBuffer

在这里插入图片描述

区别：

String 不可变，StringBuilder、StringBuffer 可变；主要是因为后两者维护的是可变数组。

速度：StringBuilder > StringBuffer > String ；StringBuilder 线程不安全，StringBuffer 加了 synchronized 所以会慢一些。最慢的 String 创建、修改和拼接都是需要新建的。

StringBuilder 线程不安全，StringBuffer 线程安全。

append 原理 JDK1.8

StringBuilder 和 StringBuffer 使用 append() 方法时会调用父类的 append 方法。

AbstractStringBuilder # append

1. 父类 append 方法 public AbstractStringBuilder append(String str) { if (str == null) return appendNull(); int len = str.length(); // 2. 确认是否需要扩容 ensureCapacityInternal(count + len); // 3. 进行复制 str.getChars(0, len, value, count); count += len; return this; }

ensureCapacityInternal 方法判断如果缓存长度不够，进行扩容并将原数据复制进扩容后的数组。

AbstractStringBuilder # ensureCapacityInternal

private void ensureCapacityInternal(int minimumCapacity) { // 超出缓存容量 if (minimumCapacity - value.length > 0) { value = Arrays.copyOf(value, newCapacity(minimumCapacity)); } }

AbstractStringBuilder # newCapacity

private int newCapacity(int minCapacity) { // 直接扩容为原数组长度两倍+2 int newCapacity = (value.length << 1) + 2; // 如果扩容后长度不够，直接等于需要扩容的长度 if (newCapacity - minCapacity < 0) { newCapacity = minCapacity; } return (newCapacity <= 0 || MAX_ARRAY_SIZE - newCapacity < 0) ? hugeCapacity(minCapacity) : newCapacity; }

最后调用拼接数据相应类的 getChars 方法进行数据拷贝，完成拼接。

String # getChars

public void getChars(int srcBegin, int srcEnd, char dst[], int dstBegin) { if (srcBegin < 0) { throw new StringIndexOutOfBoundsException(srcBegin); } // 追加字符串长度 > 扩容后数组长度 if (srcEnd > value.length) { throw new StringIndexOutOfBoundsException(srcEnd); } // 0 > 追加字符串长度 if (srcBegin > srcEnd) { throw new StringIndexOutOfBoundsException(srcEnd - srcBegin); } // 将指定数组拷贝到新数组 System.arraycopy(value, srcBegin, dst, dstBegin, srcEnd - srcBegin); }

StringBuffer 方法与之类似，只不过加了 synchronized 关键字实现同步。

Java String：再学字符串

定义

字符串的拼接

"+"

"+" 的效率

内存区域

intern方法

==

为什么字符串不可变

StringBuilder、StringBuffer

append 原理 JDK1.8

你可能感兴趣的:(Java String：再学字符串)