Groups and backreferences
Groups group multiple patterns as a whole, and capturing groups provide extra submatch information when using a regular expression pattern to match against a string. Backreferences refer to a previously captured group in the same regular expression.
Try it
Types
Characters | Meaning |
---|---|
(x) |
Capturing group:
Matches
A regular expression may have multiple capturing groups. In results,
matches to capturing groups typically in an array whose members are in
the same order as the left parentheses in the capturing group. This is
usually just the order of the capturing groups themselves. This
becomes important when capturing groups are nested. Matches are
accessed using the index of the result's elements ( Capturing groups have a performance penalty. If you don't need the matched substring to be recalled, prefer non-capturing parentheses (see below).
|
(?<Name>x) |
Named capturing group:
Matches "x" and stores it on
the groups property of the returned matches under the name specified
by
For example, to extract the United States area code from a phone
number, we could use |
(?:x) |
Non-capturing group:
Matches "x" but does not remember
the match. The matched substring cannot be recalled from the resulting
array's elements ( |
(?flags:x) , (?:flags-flags:x) |
Modifier:
Enables or disables the specified flags only to the enclosed pattern. Only the |
\n |
Backreference:
Where "n" is a positive integer. Matches the same substring matched by
the nth capturing group in the regular expression
(counting left parentheses). For example,
|
\k<Name> |
Named backreference:
A back reference to the last substring matching the
Named capture group specified by
For example,
Note: |
Examples
Using groups
In this example, we match two words in a structured format by using capturing groups to remember them. \w+
matches one or more word characters, and the parentheses ()
create a capturing group. The g
flag is used to match all occurrences.
const personList = `First_Name: John, Last_Name: Doe
First_Name: Jane, Last_Name: Smith`;
const regexpNames = /First_Name: (\w+), Last_Name: (\w+)/g;
for (const match of personList.matchAll(regexpNames)) {
console.log(`Hello ${match[1]} ${match[2]}`);
}
See more examples in the capturing group reference.
Using named groups
This example is the same as above, but we use named capturing groups to remember the matched words instead. This way, we can access the matched words by their meanings.
const personList = `First_Name: John, Last_Name: Doe
First_Name: Jane, Last_Name: Smith`;
const regexpNames =
/First_Name: (?<firstname>\w+), Last_Name: (?<lastname>\w+)/g;
for (const match of personList.matchAll(regexpNames)) {
console.log(`Hello ${match.groups.firstname} ${match.groups.lastname}`);
}
See more examples in the named capturing group reference.
Using groups and back references
In this example, we first match a single or double quote character with ['"]
, remember it, match an arbitrary number of characters with .*?
(*?
is a non-greedy quantifier), until we match the remembered quote character again with \1
. The \1
is a backreference to the first capturing group, which matches the same type of quote. The result will therefore be two strings: "'"
and '"'
.
const quote = `Single quote "'" and double quote '"'`;
const regexpQuotes = /(['"]).*?\1/g;
for (const match of quote.matchAll(regexpQuotes)) {
console.log(match[0]);
}
See more examples in the backreference reference.
Using groups and match indices
By providing the d
flag, the indices of each capturing group is returned. This is especially useful if you are correlating each matched group with the original text — for example, to provide compiler diagnostics.
const code = `function add(x, y) {
return x + y;
}`;
const functionRegexp =
/(function\s+)(?<name>[$_\p{ID_Start}][$\u200c\u200d\p{ID_Continue}]*)/du;
const match = functionRegexp.exec(code);
const lines = code.split("\n");
lines.splice(
1,
0,
" ".repeat(match.indices[1][1] - match.indices[1][0]) +
"^".repeat(match.indices.groups.name[1] - match.indices.groups.name[0]),
);
console.log(lines.join("\n"));
// function add(x, y) {
// ^^^
// return x + y;
// }