hashtagSpecialChars top-level constant
String
const hashtagSpecialChars
This pattern captures a variety of special characters that might appear within hashtags across multiple languages and scripts.
While underscores (_
) and hyphen (-
) are standard in many hashtag
implementations, the inclusion of characters from different Unicode
blocks ensures broader compatibility with non-Latin scripts and
specialized punctuation.
For instance, characters such as:
- U+200C (ZERO WIDTH NON-JOINER) and U+200D (ZERO WIDTH JOINER) are used in complex script ligature formation.
- U+05BE (HEBREW PUNCTUATION MAQAF) is a hyphen-like mark used in Hebrew.
- U+30FB (KATAKANA MIDDLE DOT) serves as a separator in Japanese Katakana. ... and several others cater to unique linguistic and typographic needs.
To understand the exact role of each character, one might need to refer to Unicode documentation using the provided hex values.
Implementation
const hashtagSpecialChars = '$hashtagConcatChars'
r'\u200c\u200d\ua67e\u05be\u05f3\u05f4\uff5e\u301c\u309b\u309c\u30a0\u30fb\u3003\u0f0b\u0f0c\xb7';