"If a worker wants to do his job well, he must first sharpen his tools." - Confucius, "The Analects of Confucius. Lu Linggong"
Front page > Programming > Why Does Python Regex "\d" Match Only Odd-Ordered Digits in "12345"?

Why Does Python Regex "\d" Match Only Odd-Ordered Digits in "12345"?

Posted on 2025-03-04
Browse:288

Why Does Python Regex

Regex Matches Digit: Uncovering the True Nature of "\d"

In Python's regex world, the enigmatic character class "\d" has raised questions about its precise matching behavior. Let's delve into its cryptic nature.

In the string "123", "\d" captures both "1" and "3" but curiously excludes "2". This peculiar behavior stems from a subtle nuance in the definition of a "digit."

Python's "\d" does not exclusively match digits from the Western numeric system (i.e., "0-9"). It also encompasses an extended range of Unicode character classes within the category of "Digit" or "Nd." These include, for instance, Eastern Arabic numerals such as "٠" and "١".

Thus, when matching digits in Python regex expressions, it's crucial to recognize that "\d" casts a wider net than [0-9]. This nuanced understanding unravels the mystery of why, in a sequence like "12345," only odd-ordered digits (i.e., "1", "3", and "5") align with "\d" matches, leaving even-ordered digits ("2" and "4") untouched.

Latest tutorial More>

Disclaimer: All resources provided are partly from the Internet. If there is any infringement of your copyright or other rights and interests, please explain the detailed reasons and provide proof of copyright or rights and interests and then send it to the email: [email protected] We will handle it for you as soon as possible.

Copyright© 2022 湘ICP备2022001581号-3