Regex: ?: notation (Question mark and colon notation)

(?: starts a non-capturing group. It's no different to ( unless you're retrieving groups from the regex after use. See What is a non-capturing group? What does a question mark followed by a colon (?:) mean?.


A little late to this thread - just to build on ryanp's answer.

Assuming you have the string aaabbbccc

Regular Expression

(a)+(b)+(c)+

This would give you the following 3 groups that matched:

['a', 'b', 'c']

Regular Expression with non-capturing parenthesis

Use the ?: in the first group

(?:a)+(b)+(c)+

and you would get the following groups that matched:

['b', 'c']

Hence why it is called "non-capturing parenthesis"

Example use case:

Sometime you use parenthesis for other things. For example to set the bounds of the | or operator:

"New (York|Jersey)"

In this case, you are only using the parenthesis for the or | switch, and you don't really want to capture this data. Use the non-capturing parenthesis to indicate that:

"New (?:York|Jersey)"

Tags:

Java

Regex