Intl.Segmenter() constructor
Baseline 2024
Newly available
Since April 2024, this feature works across the latest devices and browser versions. This feature might not work in older devices or browsers.
The Intl.Segmenter()
constructor creates Intl.Segmenter
objects.
Try it
Syntax
Parameters
locales
Optional-
A string with a BCP 47 language tag or an
Intl.Locale
instance, or an array of such locale identifiers. The runtime's default locale is used whenundefined
is passed or when none of the specified locale identifiers is supported. For the general form and interpretation of thelocales
argument, see the parameter description on theIntl
main page. options
Optional-
An object containing the following properties, in the order they are retrieved (all of them are optional):
localeMatcher
-
The locale matching algorithm to use. Possible values are
"lookup"
and"best fit"
; the default is"best fit"
. For information about this option, see Locale identification and negotiation. granularity
-
How granularly should the input be split. Possible values are:
"grapheme"
(default)-
Split the input into segments at grapheme cluster (user-perceived character) boundaries, as determined by the locale.
"word"
-
Split the input into segments at word boundaries, as determined by the locale.
"sentence"
-
Split the input into segments at sentence boundaries, as determined by the locale.
Return value
A new Intl.Segmenter
instance.
Exceptions
RangeError
-
Thrown if
locales
oroptions
contain invalid values.
Examples
Basic usage
The following example shows how to count words in a string using the Japanese language (where splitting the string using String
methods would have given an incorrect result).
const text = "吾輩は猫である。名前はたぬき。";
const japaneseSegmenter = new Intl.Segmenter("ja-JP", { granularity: "word" });
console.log(
[...japaneseSegmenter.segment(text)].filter((segment) => segment.isWordLike)
.length,
);
// 8, as the text is segmented as '吾輩'|'は'|'猫'|'で'|'ある'|'。'|'名前'|'は'|'たぬき'|'。'
Specifications
Specification |
---|
ECMAScript Internationalization API Specification # sec-intl-segmenter-constructor |
Browser compatibility
BCD tables only load in the browser