Before you keep reading...
Runestone Academy can only continue if we get support from individuals like you. As a student you are well aware of the high cost of textbooks. Our mission is to provide great books to you for free, but we ask that you consider a $10 donation, more if you can or less if $10 is a burden.
Before you keep reading...
Making great stuff takes time and $$. If you appreciate the book you are reading now and want to keep quality materials free for other students please consider a donation to Runestone Academy. We ask that you consider a $10 donation, but if you can give more thats great, if $10 is too much for your budget we would be happy with whatever you can afford as a show of support.
While this only scratched the surface of regular expressions, we have learned a bit about the language of regular expressions. They are search strings with special characters in them that communicate your wishes to the regular expression system as to what defines “matching” and what is extracted from the matched strings. Here are some of those special characters and character sequences:
^ Matches the beginning of the string.
$ Matches the end of the string.
. Matches any character (a wildcard).
\s Matches a whitespace character.
\S Matches a non-whitespace character (opposite of s).
* Applies to the immediately preceding character(s) and indicates to match
zero or more times.
*? Applies to the immediately preceding character(s) and indicates to
match zero or more times in “non-greedy mode”.
+ Applies to the immediately preceding character(s) and indicates to match
one or more times.
+? Applies to the immediately preceding character(s) and indicates to
match one or more times in “non-greedy mode”.
? Applies to the immediately preceding character(s) and indicates to match
zero or one time.
?? Applies to the immediately preceding character(s) and indicates to
match zero or one time in “non-greedy mode”.
[aeiou] Matches a single character as long as that character is in the
specified set. In this example, it would match “a”, “e”, “i”, “o”, or
“u”, but no other characters.
[a-z0-9] You can specify ranges of characters using the minus sign. This
example is a single character that must be a lowercase letter or a
[^A-Za-z] When the first character in the set notation is a caret, it
inverts the logic. This example matches a single character that is
anything other than an uppercase or lowercase letter.
( ) When parentheses are added to a regular expression, they are ignored
for the purpose of matching, but allow you to extract a particular
subset of the matched string rather than the whole string when using
\b Matches an empty string, but only at a word boundary. Must be used in a raw string (r”string”) so that it isn’t changed to a backspace.
\B Matches an empty string, but only when it is not at the beginning or end of a word.
\d Matches any decimal digit; equivalent to the set [0-9].
\D Matches any non-digit character; equivalent to the set [^0-9].