SlideShare a Scribd company logo
February 28, 2012           [MUST HAVE REGULAR EXPRESSION PATTERNS]

Regular expressions are strings of special characters that can be used to search and find character patterns. Within
the scope of BGP in Cisco IOS regular expressions can be used in show commands and AS-Path access-lists to
match BGP prefixes based on the information contained in their AS-Path.

In order to understand how to build regular expressions we first need to know what the character definitions
arefor the regex function of IOS. The below table illustrates the regex characters and their usage. This information is
contained in the Cisco IOS documentation under the Appendix of Cisco IOS Terminal Services Configuration Guide,
Release 12.2.

+------------------------------------------------------+
| CHAR | USAGE                                      |
+------------------------------------------------------|
| ^ | Start of string                               |
|------|-----------------------------------------------|
| $ | End of string                                   |
|------|-----------------------------------------------|
| [] | Range of characters                                |
|------|-----------------------------------------------|
| - | Used to specify range ( i.e. [0-9] )                   |
|------|-----------------------------------------------|
| ( )| Logical grouping                                 |
|------|-----------------------------------------------|
| .| Any single character                               |
|------|-----------------------------------------------|
| * | Zero or more instances                               |
|------|-----------------------------------------------|
| + | One or more instance                                 |
|------|-----------------------------------------------|
| ? | Zero or one instance                                |
|------|-----------------------------------------------|
| _ | Comma, open or close brace, open or close |
| | parentheses, start or end of string, or space |
+------------------------------------------------------+
Some commonly used regular expressions include:

+-------------+---------------------------+
| Expression | Meaning                      |
|-------------+---------------------------|
| .*        | Anything                  |
|-------------+---------------------------|
| ^$          | Locally originated routes |
|-------------+---------------------------|
February 28, 2012       [MUST HAVE REGULAR EXPRESSION PATTERNS]

| ^100_| Learned from AS100                 |
|-------------+---------------------------|
| _100$        | Originated in AS100 |
|-------------+---------------------------|
| _100_| Any instance of AS100 |
|-------------+---------------------------|
| ^[0-9]+$| Directly connected ASes|
+-------------+---------------------------+
Let’s break some of the above expressions down step-by-step. The first one “.*” says to match any single character
(“.”), and then find zero or more instances of that single character (“*”). This means zero or more instances or any
character, which effectively means anything.

The next string “^$” says to match the beginning of the string (“^”), and then immediately match the end of the string
(“$”). This means that the string is null. Within the scope of BGP the only time that the AS-Path is null is when you
are looking at a route within your own AS that you or one of your iBGP peers has originated. Hence this matches
locally originated routes.

The next string “^100_” says to match the beginning of the string (“^”), the literal characters 100, and then a comma,
an open or close brace, an open or close, a parentheses, the start or end of the string, or a space (“_”). This means
that the string must start with the number 100 followed by any non-alphanumeric character. In the scope of BGP this
means that routes which are learned from the AS 100 will be matched, as 100 will be the first AS in the path when AS
100 is sending us routes.

The next string “_100$” is the exact opposite of the previous one. This string says to start with any non-alphanumeric
character (“_”), followed by the literal characters 100, followed by the end of the string (“$”). This means that AS 100
is the last AS in the path, or in other words that the prefix in question was originated by AS 100.

The next string “_100_” is the combination of the two previous strings with some extra matches. This string means
that the literal characters 100 are set between any two non-alphanumeric characters. The first of these could be the
start of the string, which would match routes learned from AS 100, while the second of these could be the end of the
string, which would match routes originated in AS 100. Another case could be that the underscores represent
spaces, in which the string would match any other AS path information as long as “ 100 ” is included somewhere.
This would match any routes which transit AS 100, and therefore “_ASN_” is generally meant to match routes that
transit a particular AS as defined by the number “ASN”.

The final string “^[0-9]+$” is a little more complicated match. Immediately we can see that the string starts (“^”), and
we can see later that it ends (“$”). In the middle we see a range of numbers 0-9 in brackets, followed by the plus sign.
The numbers in brackets mean that any number from zero to nine can be matched, or in other words, any number.
Next we have the plus sign which means one or more instances. This string “[0-9]+” therefore means one or more
instance of any number, or in other words any number including numbers with multiple characters (i.e. 1, 12, 123,
February 28, 2012       [MUST HAVE REGULAR EXPRESSION PATTERNS]

1234, 12345678, etc.). When we combine these all together this string means routes originated in any directly
connected single AS, or in other words, the routes directly originated by the peers of your AS.

Now let’s look at a more complicated match, and using the above character patterns we will see how we can
construct the expression step by step. Suppose we have the following topology below, where we are looking at the
network from the perspective of AS 100.

+--------+ +--------+ +--------+ +--------+
| AS 200 |-| AS 201 |-| AS 202 |-| AS 203 |
+--------+ +--------+ +--------+ +--------+

       +--------+ +--------+ +--------+
       | AS 300 |-| AS 301 |-| AS 302 |
       +--------+ +--------+ +--------+ -+--------+
>--| AS 100 |
               +--------+ +--------+ / -+--------+
               | AS 400 |-| AS 401 | / /
               +--------+ +--------+/ /
                                /
                       +--------+ /
                       | AS 500 |/
                       +--------+
AS 100 peers with ASes 203, 302, 401, and 500, who each have peers as diagramed above. AS 100 wants to match
routes originated from its directly connected customers (ASes 203, 302, 401, and 500) in addition to routes originated
from their directly connected customers (ASes 202, 301, and 400). The easiest way to create this regular expression
would be to think about what we are first trying to match, and then write out all possibilities of these matches. In our
case these possibilities are:

203
203 202
302
302 301
401
401 400
500
Now we could simply create an expression with multiple lines (7 lines to be exact) that would match all of the possible
AS paths, but suppose that AS 100 wants to keep this match as flexible as possible so that it will apply to any other
ASes in the future. Now let’s try to generalize the above AS-Path information into a regex.

First off we know that each of the matches is going to start and going to end. This means that the first character we
will have is “^” and the last character is “$”. Next we know that between the “^” and “$” there will be either one AS or
February 28, 2012       [MUST HAVE REGULAR EXPRESSION PATTERNS]

two ASes. We don’t necessarily know what numbers these ASes will be, so for the time being let’s use the
placeholder “X”. Based on this our new possible matches are:

^X$
^X X$
Next let’s reason out what X can represent. Since X is only one single AS, there will be no spaces, commas,
parentheses, or any other special type characters. In other words, X must be a number. However, since we don’t
know what the exact path is, we must take into account that X may be a number with more than one character (i.e.
10, 123, or 10101). This essentially equates to one or more instance of any number zero through nine. In regular
expression syntax our two matches would therefore now read:

^[0-9]+$
^[0-9]+ [0-9]+$
This expressions reads that we either have a number consisting of one or more characters zero through nine, or a
number consisting of one or more characters zero through nine followed by a space and then another number
consisting of one or more characters zero through nine. This brings our expression down to two lines as opposed to
our original seven, but let’s see how we can combine the above two as well. To combine them, first let us compare
what is different between them.

^[0-9]+$
^[0-9]+ [0-9]+$
From looking at the expressions it is evident that the sequence “ [0-9]+” is the difference. In the first case “ [0-9]+”
does not exist in the expression. In the second case “ [0-9]+” does exist in the expression. In other words, “ [0-9]+” is
either true or false. True or false (0 or 1) is represented by the character “?” in regex syntax. Therefore we can
reduce our expression to:

^[0-9]+ [0-9]+?$
At this point we run into a problem with the order of operations of the regex. As denoted above the question mark will
apply only to the plus sign, and not to the range [0-9]. Instead, we want the question mark to apply to the string “ [0-
9]+” as a whole. Therefore this string needs to be grouped together using parentheses. Parentheses are used in
regular expressions as simply a logical grouping. Therefore our final expression reduces to:

^[0-9]+( [0-9]+)?$
Note that to match a question mark in IOS, the escape sequence CTRL-Vor ESC-Qmust be entered first, otherwise
the IOS parser will interpret the question mark as an attempt to invoke the context sensitive help.

More Related Content

PPTX
Alg2 lesson 7-8
PPTX
Alg2 lesson 7-8
DOCX
Surviving The Stump The Chump Interview Questions
PDF
Cisco ASA Firewall Interview Question "aka Stump-the-Chump" Question # 01
PPT
Cisco Exam # 642 611 Mpls Study Notes
PPTX
DBodle QoS Exam Study Notes
DOCX
Basic BGP Trouble Shooting Candidate Screening
Alg2 lesson 7-8
Alg2 lesson 7-8
Surviving The Stump The Chump Interview Questions
Cisco ASA Firewall Interview Question "aka Stump-the-Chump" Question # 01
Cisco Exam # 642 611 Mpls Study Notes
DBodle QoS Exam Study Notes
Basic BGP Trouble Shooting Candidate Screening

Similar to Regular Expression Patterns (20)

PPTX
String function in my sql
PPTX
Regular expression examples
PPT
Intro To TSQL - Unit 2
PPT
Intro to tsql unit 2
PPTX
Crossword puzzle Using Backtracking in Java
PPT
Regular Expressions 2007
PPT
Php String And Regular Expressions
PPT
CS540-2-lecture2 Lexical analyser of .ppt
PPT
Introduction to Perl
PDF
DEE 431 Introduction to MySql Slide 6
PPTX
String in programming language in c or c++
PPTX
Pythonlearn-11-Regex.pptx
PDF
Lecture 10.pdf
PPT
Regular Expression in Action
PPT
Form validation server side
PPT
Sql select
PPTX
Arrays & Strings.pptx
PPT
Regex Basics
PPTX
String function in my sql
Regular expression examples
Intro To TSQL - Unit 2
Intro to tsql unit 2
Crossword puzzle Using Backtracking in Java
Regular Expressions 2007
Php String And Regular Expressions
CS540-2-lecture2 Lexical analyser of .ppt
Introduction to Perl
DEE 431 Introduction to MySql Slide 6
String in programming language in c or c++
Pythonlearn-11-Regex.pptx
Lecture 10.pdf
Regular Expression in Action
Form validation server side
Sql select
Arrays & Strings.pptx
Regex Basics
Ad

More from Duane Bodle (7)

PPTX
OSPF Beyond Stump-the-Chump_Interview_Questions - Part 01 -
DOCX
SIP PRIMER
DOCX
Project Business Case and Capital Justification for Implementation of Applica...
PPT
OSPF LSA Types Explained
PDF
Troubleshooting BGP
DOC
Study Notes BGP Exam
DOCX
Cisco BGP Exam 642-661 Review Notes
OSPF Beyond Stump-the-Chump_Interview_Questions - Part 01 -
SIP PRIMER
Project Business Case and Capital Justification for Implementation of Applica...
OSPF LSA Types Explained
Troubleshooting BGP
Study Notes BGP Exam
Cisco BGP Exam 642-661 Review Notes
Ad

Regular Expression Patterns

  • 1. February 28, 2012 [MUST HAVE REGULAR EXPRESSION PATTERNS] Regular expressions are strings of special characters that can be used to search and find character patterns. Within the scope of BGP in Cisco IOS regular expressions can be used in show commands and AS-Path access-lists to match BGP prefixes based on the information contained in their AS-Path. In order to understand how to build regular expressions we first need to know what the character definitions arefor the regex function of IOS. The below table illustrates the regex characters and their usage. This information is contained in the Cisco IOS documentation under the Appendix of Cisco IOS Terminal Services Configuration Guide, Release 12.2. +------------------------------------------------------+ | CHAR | USAGE | +------------------------------------------------------| | ^ | Start of string | |------|-----------------------------------------------| | $ | End of string | |------|-----------------------------------------------| | [] | Range of characters | |------|-----------------------------------------------| | - | Used to specify range ( i.e. [0-9] ) | |------|-----------------------------------------------| | ( )| Logical grouping | |------|-----------------------------------------------| | .| Any single character | |------|-----------------------------------------------| | * | Zero or more instances | |------|-----------------------------------------------| | + | One or more instance | |------|-----------------------------------------------| | ? | Zero or one instance | |------|-----------------------------------------------| | _ | Comma, open or close brace, open or close | | | parentheses, start or end of string, or space | +------------------------------------------------------+ Some commonly used regular expressions include: +-------------+---------------------------+ | Expression | Meaning | |-------------+---------------------------| | .* | Anything | |-------------+---------------------------| | ^$ | Locally originated routes | |-------------+---------------------------|
  • 2. February 28, 2012 [MUST HAVE REGULAR EXPRESSION PATTERNS] | ^100_| Learned from AS100 | |-------------+---------------------------| | _100$ | Originated in AS100 | |-------------+---------------------------| | _100_| Any instance of AS100 | |-------------+---------------------------| | ^[0-9]+$| Directly connected ASes| +-------------+---------------------------+ Let’s break some of the above expressions down step-by-step. The first one “.*” says to match any single character (“.”), and then find zero or more instances of that single character (“*”). This means zero or more instances or any character, which effectively means anything. The next string “^$” says to match the beginning of the string (“^”), and then immediately match the end of the string (“$”). This means that the string is null. Within the scope of BGP the only time that the AS-Path is null is when you are looking at a route within your own AS that you or one of your iBGP peers has originated. Hence this matches locally originated routes. The next string “^100_” says to match the beginning of the string (“^”), the literal characters 100, and then a comma, an open or close brace, an open or close, a parentheses, the start or end of the string, or a space (“_”). This means that the string must start with the number 100 followed by any non-alphanumeric character. In the scope of BGP this means that routes which are learned from the AS 100 will be matched, as 100 will be the first AS in the path when AS 100 is sending us routes. The next string “_100$” is the exact opposite of the previous one. This string says to start with any non-alphanumeric character (“_”), followed by the literal characters 100, followed by the end of the string (“$”). This means that AS 100 is the last AS in the path, or in other words that the prefix in question was originated by AS 100. The next string “_100_” is the combination of the two previous strings with some extra matches. This string means that the literal characters 100 are set between any two non-alphanumeric characters. The first of these could be the start of the string, which would match routes learned from AS 100, while the second of these could be the end of the string, which would match routes originated in AS 100. Another case could be that the underscores represent spaces, in which the string would match any other AS path information as long as “ 100 ” is included somewhere. This would match any routes which transit AS 100, and therefore “_ASN_” is generally meant to match routes that transit a particular AS as defined by the number “ASN”. The final string “^[0-9]+$” is a little more complicated match. Immediately we can see that the string starts (“^”), and we can see later that it ends (“$”). In the middle we see a range of numbers 0-9 in brackets, followed by the plus sign. The numbers in brackets mean that any number from zero to nine can be matched, or in other words, any number. Next we have the plus sign which means one or more instances. This string “[0-9]+” therefore means one or more instance of any number, or in other words any number including numbers with multiple characters (i.e. 1, 12, 123,
  • 3. February 28, 2012 [MUST HAVE REGULAR EXPRESSION PATTERNS] 1234, 12345678, etc.). When we combine these all together this string means routes originated in any directly connected single AS, or in other words, the routes directly originated by the peers of your AS. Now let’s look at a more complicated match, and using the above character patterns we will see how we can construct the expression step by step. Suppose we have the following topology below, where we are looking at the network from the perspective of AS 100. +--------+ +--------+ +--------+ +--------+ | AS 200 |-| AS 201 |-| AS 202 |-| AS 203 | +--------+ +--------+ +--------+ +--------+ +--------+ +--------+ +--------+ | AS 300 |-| AS 301 |-| AS 302 | +--------+ +--------+ +--------+ -+--------+ >--| AS 100 | +--------+ +--------+ / -+--------+ | AS 400 |-| AS 401 | / / +--------+ +--------+/ / / +--------+ / | AS 500 |/ +--------+ AS 100 peers with ASes 203, 302, 401, and 500, who each have peers as diagramed above. AS 100 wants to match routes originated from its directly connected customers (ASes 203, 302, 401, and 500) in addition to routes originated from their directly connected customers (ASes 202, 301, and 400). The easiest way to create this regular expression would be to think about what we are first trying to match, and then write out all possibilities of these matches. In our case these possibilities are: 203 203 202 302 302 301 401 401 400 500 Now we could simply create an expression with multiple lines (7 lines to be exact) that would match all of the possible AS paths, but suppose that AS 100 wants to keep this match as flexible as possible so that it will apply to any other ASes in the future. Now let’s try to generalize the above AS-Path information into a regex. First off we know that each of the matches is going to start and going to end. This means that the first character we will have is “^” and the last character is “$”. Next we know that between the “^” and “$” there will be either one AS or
  • 4. February 28, 2012 [MUST HAVE REGULAR EXPRESSION PATTERNS] two ASes. We don’t necessarily know what numbers these ASes will be, so for the time being let’s use the placeholder “X”. Based on this our new possible matches are: ^X$ ^X X$ Next let’s reason out what X can represent. Since X is only one single AS, there will be no spaces, commas, parentheses, or any other special type characters. In other words, X must be a number. However, since we don’t know what the exact path is, we must take into account that X may be a number with more than one character (i.e. 10, 123, or 10101). This essentially equates to one or more instance of any number zero through nine. In regular expression syntax our two matches would therefore now read: ^[0-9]+$ ^[0-9]+ [0-9]+$ This expressions reads that we either have a number consisting of one or more characters zero through nine, or a number consisting of one or more characters zero through nine followed by a space and then another number consisting of one or more characters zero through nine. This brings our expression down to two lines as opposed to our original seven, but let’s see how we can combine the above two as well. To combine them, first let us compare what is different between them. ^[0-9]+$ ^[0-9]+ [0-9]+$ From looking at the expressions it is evident that the sequence “ [0-9]+” is the difference. In the first case “ [0-9]+” does not exist in the expression. In the second case “ [0-9]+” does exist in the expression. In other words, “ [0-9]+” is either true or false. True or false (0 or 1) is represented by the character “?” in regex syntax. Therefore we can reduce our expression to: ^[0-9]+ [0-9]+?$ At this point we run into a problem with the order of operations of the regex. As denoted above the question mark will apply only to the plus sign, and not to the range [0-9]. Instead, we want the question mark to apply to the string “ [0- 9]+” as a whole. Therefore this string needs to be grouped together using parentheses. Parentheses are used in regular expressions as simply a logical grouping. Therefore our final expression reduces to: ^[0-9]+( [0-9]+)?$ Note that to match a question mark in IOS, the escape sequence CTRL-Vor ESC-Qmust be entered first, otherwise the IOS parser will interpret the question mark as an attempt to invoke the context sensitive help.