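List of Java regular-expression constructs
This article is meant purely as a reference; if it is not to your taste, please don't downvote it.
Summary of regular-expression constructs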
Construct | Matches

Characters
x | The character x
\\ | The backslash character
\0n | The character with octal value 0n (0 <= n <= 7)
\0nn | The character with octal value 0nn (0 <= n <= 7)
\0mnn | The character with octal value 0mnn (0 <= m <= 3, 0 <= n <= 7)
\xhh | The character with hexadecimal value 0xhh
\uhhhh | The character with hexadecimal value 0xhhhh
\t | The tab character ('\u0009')
\n | The newline (line feed) character ('\u000A')
\r | The carriage-return character ('\u000D')
\f | The form-feed character ('\u000C')
\a | The alert (bell) character ('\u0007')
\e | The escape character ('\u001B')
\cx | The control character corresponding to x

Character classes
[abc] | a, b, or c (simple class)
[^abc] | Any character except a, b, or c (negation)
[a-zA-Z] | a through z or A through Z, inclusive (range)
[a-d[m-p]] | a through d, or m through p: [a-dm-p] (union)
[a-z&&[def]] | d, e, or f (intersection)
[a-z&&[^bc]] | a through z, except for b and c: [ad-z] (subtraction)
[a-z&&[^m-p]] | a through z, and not m through p: [a-lq-z] (subtraction)
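
The union, intersection, and subtraction forms are easier to see when run against single characters. The following is a minimal sketch; the class name `CharClassDemo` and the helper `matches` are made up for illustration and are not part of the original table.

```java
import java.util.regex.Pattern;

// Hypothetical demo class; only Pattern.matches comes from the JDK.
final class CharClassDemo {
    // Returns true when the whole input matches the regex.
    static boolean matches(String regex, String input) {
        return Pattern.matches(regex, input);
    }

    public static void main(String[] args) {
        System.out.println(matches("[a-d[m-p]]", "n"));    // union, same set as [a-dm-p]: true
        System.out.println(matches("[a-d[m-p]]", "e"));    // e is in neither range: false
        System.out.println(matches("[a-z&&[def]]", "e"));  // intersection: true
        System.out.println(matches("[a-z&&[^bc]]", "b"));  // subtraction removes b: false
        System.out.println(matches("[a-z&&[^m-p]]", "q")); // same set as [a-lq-z]: true
    }
}
```

Note that every backslash in a regex has to be doubled inside a Java string literal; these particular patterns happen not to need any.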

Predefined character classes
. | Any character (may or may not match line terminators)
\d | A digit: [0-9]
\D | A non-digit: [^0-9]
\s | A whitespace character: [ \t\n\x0B\f\r]
\S | A non-whitespace character: [^\s]
\w | A word character: [a-zA-Z_0-9]
\W | A non-word character: [^\w]

POSIX character classes (US-ASCII only)
\p{Lower} | A lower-case alphabetic character: [a-z]
\p{Upper} | An upper-case alphabetic character: [A-Z]
\p{ASCII} | All ASCII: [\x00-\x7F]
\p{Alpha} | An alphabetic character: [\p{Lower}\p{Upper}]
\p{Digit} | A decimal digit: [0-9]
\p{Alnum} | An alphanumeric character: [\p{Alpha}\p{Digit}]
\p{Punct} | Punctuation: One of !"#$%&'()*+,-./:;<=>?@[\]^_`{|}~

java.lang.Character classes (simple java character type)
\p{javaLowerCase} | Equivalent to java.lang.Character.isLowerCase()
\p{javaUpperCase} | Equivalent to java.lang.Character.isUpperCase()
\p{javaWhitespace} | Equivalent to java.lang.Character.isWhitespace()
\p{javaMirrored} | Equivalent to java.lang.Character.isMirrored()
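
A quick way to see the difference between the US-ASCII POSIX classes and the java.lang.Character classes is to test a non-ASCII letter against both. This is only a sketch under default flags (no UNICODE_CHARACTER_CLASS); `PredefinedClassDemo` and `matches` are illustrative names.

```java
import java.util.regex.Pattern;

// Hypothetical demo class; only Pattern.matches comes from the JDK.
final class PredefinedClassDemo {
    // Returns true when the whole input matches the regex.
    static boolean matches(String regex, String input) {
        return Pattern.matches(regex, input);
    }

    public static void main(String[] args) {
        String eAcute = "\u00e9"; // lower-case e with acute accent, outside US-ASCII

        System.out.println(matches("\\d+", "2024"));                // true
        System.out.println(matches("\\w+", "user_1"));              // true
        System.out.println(matches("\\D", "7"));                    // false
        System.out.println(matches("\\p{Punct}", "~"));             // true
        System.out.println(matches("\\p{Lower}", eAcute));          // false: ASCII only by default
        System.out.println(matches("\\p{javaLowerCase}", eAcute));  // true: Character.isLowerCase
    }
}
```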

Classes for Unicode blocks and categories
\p{InGreek} | A character in the Greek block (simple block)
\p{Lu} | An uppercase letter (simple category)
\p{Sc} | A currency symbol
\P{InGreek} | Any character except one in the Greek block (negation)
[\p{L}&&[^\p{Lu}]] | Any letter except an uppercase letter (subtraction)
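
The block and category classes can be checked the same way. A small sketch, with `UnicodeClassDemo` and `matches` as illustrative names; the test characters are written as Unicode escapes so the source stays ASCII.

```java
import java.util.regex.Pattern;

// Hypothetical demo class; only Pattern.matches comes from the JDK.
final class UnicodeClassDemo {
    // Returns true when the whole input matches the regex.
    static boolean matches(String regex, String input) {
        return Pattern.matches(regex, input);
    }

    public static void main(String[] args) {
        String alpha = "\u03b1"; // Greek small letter alpha
        String euro = "\u20ac";  // euro sign

        System.out.println(matches("\\p{InGreek}", alpha));        // true
        System.out.println(matches("\\P{InGreek}", "a"));          // true: not in the Greek block
        System.out.println(matches("\\p{Lu}", "A"));               // true
        System.out.println(matches("\\p{Sc}", euro));              // true
        System.out.println(matches("[\\p{L}&&[^\\p{Lu}]]", "a"));  // true: letter, not uppercase
        System.out.println(matches("[\\p{L}&&[^\\p{Lu}]]", "A"));  // false
    }
}
```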

Boundary matchers
^ | The beginning of a line
$ | The end of a line
\b | A word boundary
\B | A non-word boundary
\A | The beginning of the input
\G | The end of the previous match
\Z | The end of the input but for the final terminator, if any
\z | The end of the input
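
One detail the table does not spell out: by default ^ and $ only match at the start and end of the whole input, and they become per-line anchors only under Pattern.MULTILINE. The sketch below illustrates that along with \b, \Z, and \z; `BoundaryDemo`, `count`, and `found` are illustrative names.

```java
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical demo class; the helpers wrap standard Pattern/Matcher calls.
final class BoundaryDemo {
    // Counts how many times the regex is found in the input under the given flags.
    static int count(String regex, int flags, String input) {
        Matcher m = Pattern.compile(regex, flags).matcher(input);
        int n = 0;
        while (m.find()) {
            n++;
        }
        return n;
    }

    // Returns true when the regex is found anywhere in the input.
    static boolean found(String regex, String input) {
        return Pattern.compile(regex).matcher(input).find();
    }

    public static void main(String[] args) {
        String twoLines = "one\ntwo";
        System.out.println(count("^\\w+", 0, twoLines));                  // 1: ^ is start of input
        System.out.println(count("^\\w+", Pattern.MULTILINE, twoLines));  // 2: ^ is start of each line
        System.out.println(count("\\A\\w+", Pattern.MULTILINE, twoLines)); // 1: \A ignores MULTILINE

        System.out.println(found("\\bcat\\b", "a cat sat"));    // true
        System.out.println(found("\\bcat\\b", "concatenate"));  // false: no word boundary around cat

        System.out.println(found("end\\Z", "the end\n"));  // true: \Z allows a final terminator
        System.out.println(found("end\\z", "the end\n"));  // false: \z is the very end
    }
}
```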

Greedy quantifiers
X? | X, once or not at all
X* | X, zero or more times
X+ | X, one or more times
X{n} | X, exactly n times
X{n,} | X, at least n times
X{n,m} | X, at least n but not more than m times
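
The counted forms are simple to verify with whole-input matches. A minimal sketch; `CountedQuantifierDemo` and `matches` are illustrative names.

```java
import java.util.regex.Pattern;

// Hypothetical demo class; only Pattern.matches comes from the JDK.
final class CountedQuantifierDemo {
    // Returns true when the whole input matches the regex.
    static boolean matches(String regex, String input) {
        return Pattern.matches(regex, input);
    }

    public static void main(String[] args) {
        System.out.println(matches("colou?r", "color"));  // true: u once or not at all
        System.out.println(matches("ab*", "a"));          // true: b zero or more times
        System.out.println(matches("ab+", "a"));          // false: b needs at least one
        System.out.println(matches("a{2}", "aaa"));       // false: exactly two
        System.out.println(matches("a{2,}", "aaaa"));     // true: at least two
        System.out.println(matches("a{2,3}", "aaaa"));    // false: at most three
    }
}
```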

Reluctant quantifiers
X?? | X, once or not at all
X*? | X, zero or more times
X+? | X, one or more times
X{n}? | X, exactly n times
X{n,}? | X, at least n times
X{n,m}? | X, at least n but not more than m times

Possessive quantifiers
X?+ | X, once or not at all
X*+ | X, zero or more times
X++ | X, one or more times
X{n}+ | X, exactly n times
X{n,}+ | X, at least n times
X{n,m}+ | X, at least n but not more than m times
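
The three quantifier families share the same descriptions in the table, so the difference only shows up in how much input each one takes. Greedy takes as much as possible and backtracks, reluctant takes as little as possible, and possessive takes as much as possible and never gives anything back. A small sketch, with `QuantifierModeDemo` and `firstMatch` as illustrative names:

```java
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical demo class; the helper wraps standard Pattern/Matcher calls.
final class QuantifierModeDemo {
    // Returns the first substring the regex finds, or null when nothing is found.
    static String firstMatch(String regex, String input) {
        Matcher m = Pattern.compile(regex).matcher(input);
        return m.find() ? m.group() : null;
    }

    public static void main(String[] args) {
        String input = "xfooxxxxxxfoo";
        System.out.println(firstMatch(".*foo", input));   // greedy: xfooxxxxxxfoo
        System.out.println(firstMatch(".*?foo", input));  // reluctant: xfoo
        System.out.println(firstMatch(".*+foo", input));  // possessive: null, .*+ keeps the final foo
    }
}
```

The possessive form fails here because `.*+` consumes the whole input and refuses to backtrack, so nothing is left for `foo` to match.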

Logical operators
XY | X followed by Y
X|Y | Either X or Y
(X) | X, as a capturing group
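
Concatenation, alternation, and capturing groups usually appear together. The sketch below reads back what each group captured; `GroupDemo` and `group` are illustrative names.

```java
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical demo class; the helper wraps standard Pattern/Matcher calls.
final class GroupDemo {
    // Returns capturing group n when the whole input matches, otherwise null.
    // Group 0 is the entire match.
    static String group(String regex, String input, int n) {
        Matcher m = Pattern.compile(regex).matcher(input);
        return m.matches() ? m.group(n) : null;
    }

    public static void main(String[] args) {
        System.out.println(group("gray|grey", "gray", 0));   // gray: either alternative
        System.out.println(group("(gr)(a|e)y", "grey", 1));  // gr
        System.out.println(group("(gr)(a|e)y", "grey", 2));  // e
        System.out.println(group("(gr)(a|e)y", "groy", 0));  // null: o is neither a nor e
    }
}
```

Groups are numbered by the position of their opening parenthesis, counting from 1.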

Back references
\n | Whatever the nth capturing group matched
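
A common use of a back reference is spotting an accidentally repeated word. This is a sketch only; `BackrefDemo` and `repeatedWord` are illustrative names, and the pattern is one possible choice rather than a canonical one.

```java
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical demo class; the helper wraps standard Pattern/Matcher calls.
final class BackrefDemo {
    // \1 must match exactly the text that group 1 captured.
    private static final Pattern REPEATED = Pattern.compile("\\b(\\w+)\\s+\\1\\b");

    // Returns the first word that appears twice in a row, or null if there is none.
    static String repeatedWord(String text) {
        Matcher m = REPEATED.matcher(text);
        return m.find() ? m.group(1) : null;
    }

    public static void main(String[] args) {
        System.out.println(repeatedWord("this is is a test"));  // is
        System.out.println(repeatedWord("this is a test"));     // null
    }
}
```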

Quotation
\ | Nothing, but quotes the following character
\Q | Nothing, but quotes all characters until \E
\E | Nothing, but ends quoting started by \Q
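
Quoting matters whenever the text to match contains metacharacters. The sketch below compares an unquoted pattern, a \Q...\E pattern, and the JDK helper Pattern.quote; `QuoteDemo` and `matches` are illustrative names.

```java
import java.util.regex.Pattern;

// Hypothetical demo class; Pattern.matches and Pattern.quote come from the JDK.
final class QuoteDemo {
    // Returns true when the whole input matches the regex.
    static boolean matches(String regex, String input) {
        return Pattern.matches(regex, input);
    }

    public static void main(String[] args) {
        String literal = "1+1=2";

        System.out.println(matches("a\\.b", "a.b"));                 // true: \. is a literal dot
        System.out.println(matches("a\\.b", "axb"));                 // false
        System.out.println(matches(literal, literal));               // false: + acts as a quantifier
        System.out.println(matches("\\Q1+1=2\\E", literal));         // true: everything is literal
        System.out.println(matches(Pattern.quote(literal), literal)); // true
    }
}
```

Pattern.quote is usually the safer option for user-supplied text, since it also handles input that itself contains \E.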