Regex: ?: notation (Question mark and colon notation) [duplicate]

I have the following Java regex, which I didn't write and I am trying to modify:

^class-map(?:(\\s+match-all)|(\\s+match-any))?(\\s+[\\x21-\\x7e]{1,40})$
           ^                                 ^

It's similar to this one.

Note the first question mark. Does it mean that the group is optional? There is already a question mark after the corresponding ). Does the colon have a special meaning in regex?

The regex compiles fine, and there are already JUnit tests that show how it works. It's just that I'm a bit confused about why the first question mark and colon are there.


Solution 1:

(?: starts a non-capturing group. It's no different to ( unless you're retrieving groups from the regex after use. See What is a non-capturing group? What does a question mark followed by a colon (?:) mean?.

Solution 2:

A little late to this thread - just to build on ryanp's answer.

Assuming you have the string aaabbbccc

Regular Expression

(a)+(b)+(c)+

This would give you the following 3 groups that matched:

['a', 'b', 'c']

Regular Expression with non-capturing parenthesis

Use the ?: in the first group

(?:a)+(b)+(c)+

and you would get the following groups that matched:

['b', 'c']

Hence why it is called "non-capturing parenthesis"

Example use case:

Sometime you use parenthesis for other things. For example to set the bounds of the | or operator:

"New (York|Jersey)"

In this case, you are only using the parenthesis for the or | switch, and you don't really want to capture this data. Use the non-capturing parenthesis to indicate that:

"New (?:York|Jersey)"