Java 正则表达式基础

正则表达式是一种用于模式匹配或替换的规范。编程时经常遇到字符串查找,匹配,或者屏蔽特殊字符等等,如果采用正则表达式,则可以起到事半功倍的效果。

一、正则表达式基本语法。

1、.----句点符号

句点符号可以匹配任意单个字符,如a-z, 0-9,!,@,#,$等特殊字符,即通常意义上所说的通配符。

Pattern pattern = Pattern.compile( "leng" );
Matcher matcher = pattern.matcher( "len." );
boolean ret = matcher.matches( );
System.out.println(ret); //输出为true


2、[]----方括号

.句点符号会匹配所有的字符,匹配范围太广。[]方括号可以限定匹配范围,即只能匹配[]中限定的字符。如[ade]、 [a-z]。

Pattern pattern = Pattern.compile( "l[abc]ng" );
Matcher matcher = pattern.matcher( "leng" );
boolean ret = matcher.matches( );
System.out.println(ret); //输出为false


3、|---或符号

|符号进行的是或运算,即匹配|符号连接的字符。如(a|bc|de)


Pattern pattern = Pattern.compile( "l(a|bc|de)ng" );
Matcher matcher = pattern.matcher( "ldeng" );
boolean ret = matcher.matches( );
System.out.println(ret); //输出为true


4、匹配次数符号

*----表示匹配0次或多次

+----表示匹配1次或多次

?----表示匹配0次或1次

{n}----表示匹配n次(n为非负整数)

{m,n}----表示匹配最少m次,最多n次


Pattern pattern = Pattern.compile( "leng*" );
Matcher matcher = pattern.matcher( "len" );
boolean ret = matcher.matches( );
System.out.println(ret); //输出为true


Pattern pattern = Pattern.compile( "leng+" );
Matcher matcher = pattern.matcher( "len" );
boolean ret = matcher.matches( );
System.out.println(ret); //输出为false


Pattern pattern = Pattern.compile( "leng?" );
Matcher matcher = pattern.matcher( "len" );
boolean ret = matcher.matches( );
System.out.println(ret); //输出为true


Pattern pattern = Pattern.compile( "leng{1}" );
Matcher matcher = pattern.matcher( "leng" );
boolean ret = matcher.matches( );
System.out.println(ret); //输出为true


Pattern pattern = Pattern.compile( "leng{1,3}" );
Matcher matcher = pattern.matcher( "leng" );
boolean ret = matcher.matches( );
System.out.println(ret); //输出为true


5、^符号

1) 如果^符号用在[]方括号内部,则表示否定,即不匹配方括号内限定的字符,如[^abc]。

Pattern pattern = Pattern.compile( "[^b]*" );
Matcher matcher = pattern.matcher( "leng" );
boolean ret = matcher.matches( );
System.out.println(ret); //输出为true


2)如果^符号用在[]方括号外部,则表示起始,即匹配以^符号引导的开始字符,如^leng.

Pattern pattern = Pattern.compile( "^eng" );
Matcher matcher = pattern.matcher( "leng" );
boolean ret = matcher.matches( ); //输出为false


6、$符号

$符号用来匹配以$符号引导的结束字符,如eng$


Pattern pattern = Pattern.compile( ".*ang$" );
Matcher matcher = pattern.matcher( "lengxuegang" );
boolean ret = matcher.matches( );
System.out.println(ret); //输出为true


7、Java中常用的特殊符号

\d----表示数字:[0-9]

\D----表示非数字:[^0-9]

\w----表示字符:[a-zA-Z0-9]

\W----表示非字符:[^a-zA-Z0-9]

\s----表示空白字符:[\t\n\r\f]

\S----表示非空白字符:[^\t\n\r\f]


二、示例

1、字符串匹配。匹配以age结尾的字符串

Pattern pattern = Pattern.compile( ".*age$" );
Matcher matcher = pattern.matcher( "Java program language" );
boolean ret = matcher.matches( );
System.out.println(ret); //输出为true


2、字符串分割。将字符串以某个或某几个字符进行拆分

Pattern pattern = Pattern.compile( "[.!?]" );
String[] array = pattern.split( "Java program language. I like it!" );
for ( int i = 0; i < array.length; i++ ) {
    System.out.println( array[i].trim( ) );
}

输出为:

Java program language
I like it


3、字符串替换

1)首次替换

Pattern pattern = Pattern.compile( "Java" );
Matcher matcher = pattern.matcher( "Java program language. I like Java programming");
System.out.println(matcher.replaceFirst( "C++" ));

输出为:

C++ program language. I like Java programming

2)全部替换

Pattern pattern = Pattern.compile( "Java" );
Matcher matcher = pattern.matcher( "Java program language. I like Java programming" );
System.out.println(matcher.replaceAll( "C++" ));

输出为:

C++ program language. I like C++ programming

你可能感兴趣的:(java,编程,c,正则表达式,String)