What is the use of Pattern.quote method?
I'm trying to understand Pattern.quote
using the following code:
String pattern = Pattern.quote("1252343% 8 567 hdfg gf^$545");
System.out.println("Pattern is : "+pattern);
produces the output:
Pattern is : \Q1252343% 8 567 hdfg gf^$545\E
What are \Q
and \E
here? The documentation description says :
Returns a literal pattern
String
for the specifiedString
.This method produces a
String
that can be used to create aPattern
that would match the strings
as if it were a literal pattern.Metacharacters or escape sequences in the input sequence will be given no special meaning.
But Pattern.quote
's return type is String
and not a compiled Pattern
object.
Why is this method required and what are some usage examples?
Solution 1:
\Q
means "start of literal text" (i.e. regex "open quote")\E
means "end of literal text" (i.e. regex "close quote")
Calling the Pattern.quote()
method wraps the string in \Q...\E
, which turns the text is into a regex literal. For example, Pattern.quote(".*")
would match a dot and then an asterisk:
System.out.println("foo".matches(".*")); // true
System.out.println("foo".matches(Pattern.quote(".*"))); // false
System.out.println(".*".matches(Pattern.quote(".*"))); // true
The method's purpose is to not require the programmer to have to remember the special terms \Q
and \E
and to add a bit of readability to the code - regex is hard enough to read already. Compare:
someString.matches(Pattern.quote(someLiteral));
someString.matches("\\Q" + someLiteral + "\\E"));
Referring to the javadoc:
Returns a literal pattern String for the specified String.
This method produces a String that can be used to create a Pattern that would match the string s as if it were a literal pattern.
Metacharacters or escape sequences in the input sequence will be given no special meaning.
Solution 2:
The Pattern.quote
method quotes part of a regex pattern to make regex interpret it as string literals.
Say you have some user input in your search program, and you want to regex for it. But this input may have unsafe characters so you can use
Pattern pattern = Pattern.compile(Pattern.quote(userInput));
This method does not quote a Pattern
but, as you point out, wraps a String
in regex quotes.
Solution 3:
\Q
and \E
, among all others, are thoroughly documented on the java.util.regex.Pattern Javadoc page. They mean "begin Quote", "End quote" and demark a region where all the chars have the literal meaning. The way to use the return of Pattern.quote
is to feed it to Pattern.compile
, or any other method that accepts a pattern string, such as String.split
.
Solution 4:
If you compile the String returned by Pattern.quote
, you'll get a Pattern
which matches the literal string that you quote
d.
\Q
and \E
mark the beginning and end of the quoted part of the string.
Solution 5:
Regex collides frequently with normal strings. Say I want a regex to search for a certain string that is only known at runtime. How can we be sure that the string doesn't have regex meaning eg(".*.*.*"
)? We quote it.