regex library

Constants

astralLetterAndMarks → const String
Unicode's astral planes consist of the supplementary code points that are outside of the Basic Multilingual Plane (BMP).
astralNumerals → const String
The astral planes in Unicode comprise supplementary code points that reside outside of the Basic Multilingual Plane (BMP).
atSigns → const String
bmpLetterAndMarks → const String
This pattern is designed to capture a wide range of letters, marks, and other characters from various scripts and symbol sets that reside within the BMP.
bmpNumerals → const String
This pattern is tailored to identify numerals from a variety of scripts and regions present within the BMP.
codePoint → const String
This pattern identifies two main groups of characters:
cyrillicLettersAndMarks → const String
A string representing the Unicode range for Cyrillic letters and marks.
directionalMarkersGroup → const String
A string containing Unicode directional markers.
emoji → const String
endHashtag → const String
hashSigns → const String
hashtagAlpha → const String
hashtagAlphaNumeric → const String
This pattern captures characters that are typically present in hashtags.
hashtagBoundary → const String
This pattern captures the beginning (^), end ($), and certain specific characters that can be considered as boundaries for hashtags. These boundaries help in accurately extracting hashtags from text.
hashtagConcatChars → const String
hashtagEmoji → const String
hashtagSpecialChars → const String
This pattern captures a variety of special characters that might appear within hashtags across multiple languages and scripts.
invalidCharsGroup → const String
A string containing Unicode representations of characters considered invalid in certain contexts.
invalidDomainChars → const String
invalidDomainChars is a constant RegExp pattern string, designed to match any characters that are considered invalid within a domain name context.
latinAccentChars → const String
A string representing the Unicode range for Latin letters with accents and diacritical marks.
nonBmpCodePairs → const String
In the UTF-16 encoding scheme, characters outside the Basic Multilingual Plane (BMP) are represented using a pair of 16-bit units called surrogate pairs. This pattern is designed to identify these pairs:
punct → const String
spacesGroup → const String
A string representing the Unicode range for various space or whitespace characters.
validAsciiDomain → const String
validCctld → const String
validCctld is a constant RegExp pattern string constructed to match a wide range of valid country code top-level domains (ccTLDs). The string encompasses an extensive list of ccTLDs representing various countries and territories, utilizing different scripts and alphabets.
validDomain → const String
validDomain is a constant RegExp pattern string used to validate domain names by combining several smaller pattern strings, each representing a different part or type of domain.
validDomainChars → const String
validDomainChars is a constant RegExp pattern string intended to match any character that is not included in the invalidDomainChars string, representing valid characters allowed in a domain name.
validDomainName → const String
validGeneralUrlPathChars → const String
validGtld → const String
validHashtag → const String
validMention → const String
validMentionPrecedingChars → const String
validPortNumber → const String
A string representing a regular expression pattern used to match valid port numbers in a URL.
validPunycode → const String
validSubdomain → const String
validUrl → const String
validUrlBalancedParens → const String
Allow URL paths to contain up to two nested levels of balanced parens
validUrlPath → const String
validUrlPathEndingChars → const String
Valid end-of-path characters (so /foo. does not gobble the period).
validUrlPrecedingChars → const String
validUrlQueryChars → const String
validUrlQueryEndingChars → const String

Functions

getFirstValidDomain(String source) String
isValidUrl(String input) bool