[Cialug] Regular Expression for Pathnames

Ron Houk houk.ron at gmail.com
Thu Nov 27 11:04:25 CST 2014


Hi guys,
I'm just in the process of trying to learn more of the command line. Though
I'm not usually a fan of the O'Reilly line of books I'm finding "Classic
Shell Scripting" by Robbins & Beebe has a good section on regular
expressions. I just ordered for the library I work at and I'm thinking I
might have to get myself a copy.
On Nov 25, 2014 10:37 AM, "kristau" <kristau at gmail.com> wrote:

> Sounds like an excellent CIALUG meeting topic. . . if we can find a regex
> wizard to present, that is!
>
> On Tue, Nov 25, 2014 at 8:45 AM, jim kraai <jimgkraai at gmail.com> wrote:
>
> > there needs to be a regex intro/refresher class every few years
> >
> > On Mon, Nov 24, 2014 at 10:49 PM, Todd Walton <tdwalton at gmail.com>
> wrote:
> >
> > > Fabulous! I knew there was answer. Thanks, guys!
> > >
> > > --
> > > Todd
> > >
> > >
> > > On Mon, Nov 24, 2014 at 1:43 PM, Daniel A. Ramaley <
> > > daniel.ramaley at drake.edu
> > > > wrote:
> > >
> > > > I realized right after sending it that the way your problem is
> stated,
> > > > you might want to match different numbers of slashes. To match
> strings
> > > > that have between 1 and 3 (inclusive) slashes, change the part
> between
> > > > curly braces to be "1,3" instead of just "3":
> > > >
> > > >         egrep '^(/[^/]+){1,3}$'
> > > >
> > > > Explaining what this bit of line noise means:
> > > >
> > > > ^       Match beginning of line.
> > > > ()      Groups stuff and treats it as one object.
> > > > /       Match a literal "/".
> > > > []      Matches a single character provided inside the brackets.
> > > >         For example, "[abcde]" will match 5 lowercase letters.
> > > > ^       If used as the first character in square brackets, this
> > > >         inverts the characters matched.
> > > > /       Match "/". Note that it is inverted due to the preceding
> > > >         "^" and so therefore means match anything *except* "/".
> > > > +       Match the [] expression 1 or more times.
> > > > {}      Counter. How many times we should match.
> > > > 1,3     Match between 1 and 3 times, inclusive.
> > > > $       Match end of line.
> > > >
> > > > You can find more precise explanation in regex documentation; this is
> > > > just off the top of my head. The most important thing with reading
> > > > regexes is figuring out how they are grouped. The basic pattern here
> is
> > > > the (foo){bar} which just means "match foo, bar number of times".
> > > >
> > > > On 2014-11-24 at 10:58:24 Ron Houk wrote:
> > > > > Wow. Your solution is a lot more elegant. I'm still trying to learn
> > > > > this stuff. :)
> > > > >
> > > > > On Nov 24, 2014 10:35 AM, "Daniel A. Ramaley"
> > > > > <daniel.ramaley at drake.edu>
> > > > > wrote:
> > > > > > This should work for your purposes if all the data looks like the
> > > > > > samples. Set the number between curly braces to whatever you
> need.
> > > > > > (Note that your sample data didn't have any matches for just 2
> > > > > > slashes, but does for 3 slashes.)
> > > > > >
> > > > > >         egrep '^(/[^/]+){3}$'
> > > > > >
> > > > > > On 2014-11-24 at 10:23:07 Todd Walton wrote:
> > > > > > > If I have a text file full of pathnames, like:
> > > > > > >
> > > > > > > /var/log/folder1
> > > > > > > /var/log/folder2
> > > > > > > /home/todd/mydir
> > > > > > > /var/log/folder1/fileh
> > > > > > > /var/log/folder1/foldersub/fileh
> > > > > > >
> > > > > > > ...etc, what's the regular expression to find where a string
> has
> > > > > > > exactly two (or however many) forward slashes to the left of
> it?
> > > > > > > I
> > > > > > > have a 360,000 line list of path names, and I'd like to find
> > where
> > > > > > > a
> > > > > > > certain string falls early in the path.  I'm really only
> > > > > > > interested
> > > > > > > in paths where it's in the top three or four directories.
> > > > > > >
> > > > > > > --
> > > > > > > Todd
> > > > > > > _______________________________________________
> > > > > > > Cialug mailing list
> > > > > > > Cialug at cialug.org
> > > > > > > http://cialug.org/mailman/listinfo/cialug
> > > > > >
> > > > > > __
> > > > > > Daniel A. Ramaley  |  Network Engineer 2
> > > > > > Drake Technology Services (DTS) | Drake University
> > > > > >
> > > > > > T: +1 515 271-4540
> > > > > > F: +1 515 271-1938
> > > > > > E: daniel.ramaley at drake.edu
> > > > > >
> > > > > > _______________________________________________
> > > > > > Cialug mailing list
> > > > > > Cialug at cialug.org
> > > > > > http://cialug.org/mailman/listinfo/cialug
> > > > __
> > > > Daniel A. Ramaley  |  Network Engineer 2
> > > > Drake Technology Services (DTS) | Drake University
> > > >
> > > > T: +1 515 271-4540
> > > > F: +1 515 271-1938
> > > > E: daniel.ramaley at drake.edu
> > > >
> > > > _______________________________________________
> > > > Cialug mailing list
> > > > Cialug at cialug.org
> > > > http://cialug.org/mailman/listinfo/cialug
> > > >
> > > _______________________________________________
> > > Cialug mailing list
> > > Cialug at cialug.org
> > > http://cialug.org/mailman/listinfo/cialug
> > >
> > _______________________________________________
> > Cialug mailing list
> > Cialug at cialug.org
> > http://cialug.org/mailman/listinfo/cialug
> >
>
>
>
> --
> Tired programmer
> Coding late into the night
> The core dump follows
> _______________________________________________
> Cialug mailing list
> Cialug at cialug.org
> http://cialug.org/mailman/listinfo/cialug
>


More information about the Cialug mailing list