43BSDReno/share/doc/usd/13.edadv/ae2 - annotate

Annotation of 43BSDReno/share/doc/usd/13.edadv/ae2, revision 1.1

1.1     ! root        1: .\"    @(#)ae2 6.1 (Berkeley) 5/22/86
        !             2: .\"
        !             3: .NH
        !             4: SPECIAL CHARACTERS
        !             5: .PP
        !             6: The editor
        !             7: .UL ed
        !             8: is the primary interface to the system
        !             9: for many people, so
        !            10: it is worthwhile to know
        !            11: how to get the most out of
        !            12: .UL ed
        !            13: for the least effort.
        !            14: .PP
        !            15: The next few sections will discuss
        !            16: shortcuts
        !            17: and labor-saving devices.
        !            18: Not all of these will be instantly useful
        !            19: to any one person, of course,
        !            20: but a few will be,
        !            21: and the others should give you ideas to store
        !            22: away for future use.
        !            23: And as always,
        !            24: until you try these things,
        !            25: they will remain theoretical knowledge,
        !            26: not something you have confidence in.
        !            27: .SH
        !            28: The List command `l'
        !            29: .PP
        !            30: .UL ed
        !            31: provides two commands for printing the contents of the lines
        !            32: you're editing.
        !            33: Most people are familiar with
        !            34: .UL p ,
        !            35: in combinations like
        !            36: .P1
        !            37: 1,$p
        !            38: .P2
        !            39: to print all the lines you're editing,
        !            40: or
        !            41: .P1
        !            42: s/abc/def/p
        !            43: .P2
        !            44: to change 
        !            45: `abc'
        !            46: to
        !            47: `def'
        !            48: on the current line.
        !            49: Less familiar is the
        !            50: .ul
        !            51: list
        !            52: command
        !            53: .UL l
        !            54: (the letter `\fIl\|\fR'),
        !            55: which gives slightly more information than
        !            56: .UL p .
        !            57: In particular,
        !            58: .UL l
        !            59: makes visible characters that are normally invisible,
        !            60: such as tabs and backspaces.
        !            61: If you list a line that contains some of these,
        !            62: .UL l
        !            63: will print each tab as
        !            64: .UL \z\(mi>
        !            65: and each backspace as
        !            66: .UL \z\(mi< .\(dg
        !            67: .FS
        !            68: \(dg These composite characters are created by overstriking a minus
        !            69: and a > or <, so they only appear as < or > on display terminals.
        !            70: .FE
        !            71: This makes it much easier to correct the sort of typing mistake
        !            72: that inserts extra spaces adjacent to tabs,
        !            73: or inserts a backspace followed by a space.
        !            74: .PP
        !            75: The
        !            76: .UL l
        !            77: command
        !            78: also `folds' long lines for printing:
        !            79: any line that exceeds 72 characters is printed on multiple lines;
        !            80: each printed line except the last is terminated by a backslash 
        !            81: .UL \*e ,
        !            82: so you can tell it was folded.
        !            83: This is useful for printing long lines on short terminals.
        !            84: .PP
        !            85: Occasionally the
        !            86: .UL l
        !            87: command will print, within a line, a string of digits preceded by a backslash,
        !            88: such as \*e07 or \*e16.
        !            89: These combinations are used to make visible characters that normally don't print,
        !            90: like form feed or vertical tab or bell.
        !            91: Each such combination is a single character.
        !            92: When you see such characters, be wary:
        !            93: they may have surprising meanings when printed on some terminals.
        !            94: Often their presence means that your finger slipped while you were typing;
        !            95: you almost never want them.
        !            96: .SH
        !            97: The Substitute Command `s'
        !            98: .PP
        !            99: Most of the next few sections will be taken up with a discussion
        !           100: of the
        !           101: substitute
        !           102: command
        !           103: .UL s .
        !           104: Since this is the command for changing the contents of individual
        !           105: lines,
        !           106: it probably has the most complexity of any 
        !           107: .UL ed 
        !           108: command,
        !           109: and the most potential for effective use.
        !           110: .PP
        !           111: As the simplest place to begin,
        !           112: recall the meaning of a trailing
        !           113: .UL g
        !           114: after a substitute command.
        !           115: With
        !           116: .P1
        !           117: s/this/that/
        !           118: .P2
        !           119: and
        !           120: .P1
        !           121: s/this/that/g
        !           122: .P2
        !           123: the
        !           124: first
        !           125: one replaces the
        !           126: .ul
        !           127: first
        !           128: `this' on the line
        !           129: with `that'.
        !           130: If there is more than one `this' on the line,
        !           131: the second form
        !           132: with the trailing
        !           133: .UL g
        !           134: changes
        !           135: .ul
        !           136: all
        !           137: of them.
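The difference between the two forms can also be checked outside the editor. The sketch below uses Python's re module as a stand-in for the substitute command. The helper name substitute and its every flag are invented for this illustration, and Python's pattern syntax only approximates that of ed, so take it as a rough model and not as a description of how ed works internally.

```python
import re

def substitute(pattern, replacement, line, every=False):
    """Rough model of ed's s/pattern/replacement/ (or .../g if every=True).

    Hypothetical helper for illustration only; Python's re syntax is
    similar to, but not identical with, the patterns ed accepts.
    """
    count = 0 if every else 1   # in re.sub, count=0 means "replace all"
    return re.sub(pattern, replacement, line, count=count)

line = "this and this and this"
first_only = substitute("this", "that", line)               # like s/this/that/
all_of_them = substitute("this", "that", line, every=True)  # like s/this/that/g
print(first_only)
print(all_of_them)
```

Without the flag only the leftmost `this' changes; with it, every one on the line does.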
        !           138: .PP
        !           139: Either form of the
        !           140: .UL s
        !           141: command can be followed by
        !           142: .UL p
        !           143: or
        !           144: .UL l
        !           145: to `print' or `list' (as described in the previous section)
        !           146: the contents of the line:
        !           147: .P1
        !           148: s/this/that/p
        !           149: s/this/that/l
        !           150: s/this/that/gp
        !           151: s/this/that/gl
        !           152: .P2
        !           153: are all legal, and mean slightly different things.
        !           154: Make sure you know what the differences are.
        !           155: .PP
        !           156: Of course, any
        !           157: .UL s
        !           158: command can be preceded by one or two `line numbers'
        !           159: to specify that the substitution is to take place
        !           160: on a group of lines. 
        !           161: Thus
        !           162: .P1
        !           163: 1,$s/mispell/misspell/
        !           164: .P2
        !           165: changes the 
        !           166: .ul
        !           167: first
        !           168: occurrence of
        !           169: `mispell' to `misspell' on every line of the file.
        !           170: But
        !           171: .P1
        !           172: 1,$s/mispell/misspell/g
        !           173: .P2
        !           174: changes 
        !           175: .ul
        !           176: every
        !           177: occurrence in every line
        !           178: (and this is more likely to be what you wanted in this
        !           179: particular case).
        !           180: .PP
        !           181: You should also notice that if you add a
        !           182: .UL p
        !           183: or
        !           184: .UL l
        !           185: to the end of any of these substitute commands,
        !           186: only the last line that got changed will be printed,
        !           187: not all the lines.
        !           188: We will talk later about how to print all the lines
        !           189: that were modified.
        !           190: .SH
        !           191: The Undo Command `u'
        !           192: .PP
        !           193: Occasionally you will make a substitution in a line,
        !           194: only to realize too late that it was a ghastly mistake.
        !           195: The `undo' command
        !           196: .UL u
        !           197: lets you `undo' the last substitution:
        !           198: the last line that was substituted can be restored to
        !           199: its previous state by typing the command
        !           200: .P1
        !           201: u
        !           202: .P2
        !           203: .SH
        !           204: The Metacharacter `\*.'
        !           205: .PP
        !           206: As you have undoubtedly noticed
        !           207: when you use
        !           208: .UL ed ,
        !           209: certain characters have unexpected meanings
        !           210: when they occur in the left side of a substitute command,
        !           211: or in a search for a particular line.
        !           212: In the next several sections, we will talk about
        !           213: these special characters,
        !           214: which are often called `metacharacters'.
        !           215: .PP
        !           216: The first one is the period `\*.'.
        !           217: On the left side of a substitute command,
        !           218: or in a search with `/.../',
        !           219: `\*.' stands for
        !           220: .ul
        !           221: any
        !           222: single character.
        !           223: Thus the search
        !           224: .P1
        !           225: /x\*.y/
        !           226: .P2
        !           227: finds any line where `x' and `y' occur separated by
        !           228: a single character, as in
        !           229: .P1
        !           230: x+y
        !           231: x\-y
        !           232: x\*(BLy
        !           233: x\*.y
        !           234: .P2
        !           235: and so on.
        !           236: (We will use \*(BL to stand for a space whenever we need to
        !           237: make it visible.)
        !           238: .PP
        !           239: Since `\*.' matches a single character,
        !           240: that gives you a way to deal with funny characters
        !           241: printed by
        !           242: .UL l .
        !           243: Suppose you have a line that, when printed with the
        !           244: .UL l
        !           245: command, appears as
        !           246: .P1
        !           247:  ....   th\*e07is   ....
        !           248: .P2
        !           249: and you want to get rid of the 
        !           250: \*e07
        !           251: (which represents the bell character, by the way).
        !           252: .PP
        !           253: The most obvious solution is to try
        !           254: .P1
        !           255: s/\*e07//
        !           256: .P2
        !           257: but this will fail. (Try it.)
        !           258: The brute force solution, which most people would now take,
        !           259: is to re-type the entire line.
        !           260: This is guaranteed to work, and is actually quite a reasonable tactic
        !           261: if the line in question isn't too big,
        !           262: but for a very long line,
        !           263: re-typing is a bore.
        !           264: This is where the metacharacter `\*.' comes in handy.
        !           265: Since `\*e07' really represents a single character,
        !           266: if we say
        !           267: .P1
        !           268: s/th\*.is/this/
        !           269: .P2
        !           270: the job is done.
        !           271: The `\*.' matches the mysterious character between the `h' and the `i',
        !           272: .ul
        !           273: whatever it is.
        !           274: .PP
        !           275: Bear in mind that since `\*.' matches any single character,
        !           276: the command
        !           277: .P1
        !           278: s/\*./,/
        !           279: .P2
        !           280: converts the first character on a line into a `,',
        !           281: which very often is not what you intended.
        !           282: .PP
        !           283: As is true of many characters in
        !           284: .UL ed ,
        !           285: the `\*.' has several meanings, depending
        !           286: on its context.
        !           287: This line shows all three:
        !           288: .P1
        !           289: \&\*.s/\*./\*./
        !           290: .P2
        !           291: The first `\*.' is a line number,
        !           292: the number of
        !           293: the line we are editing,
        !           294: which is called `line dot'.
        !           295: (We will discuss line dot more in Section 3.)
        !           296: The second `\*.' is a metacharacter
        !           297: that matches any single character on that line.
        !           298: The third `\*.' is the only one that really is
        !           299: an honest literal period.
        !           300: On the
        !           301: .ul
        !           302: right
        !           303: side of a substitution, `\*.'
        !           304: is not special.
        !           305: If you apply this command to the line
        !           306: .P1
        !           307: Now is the time\*.
        !           308: .P2
        !           309: the result will
        !           310: be
        !           311: .P1
        !           312: \&\*.ow is the time\*.
        !           313: .P2
        !           314: which is probably not what you intended.
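If you want to convince yourself of this behavior without touching a real file, here is a small sketch that uses Python's re module in place of ed. That substitution is an assumption made for illustration: the two pattern languages agree on what a period means, but they are not the same program.

```python
import re

line = "Now is the time."

# An unescaped period in the pattern matches any single character, so it
# is the first character of the line that gets replaced, not the final
# period. count=1 mimics a substitute command without a trailing g.
surprise = re.sub(r".", ".", line, count=1)
print(surprise)

# The same property removes an invisible character, such as the bell
# (octal 07), without your having to type it.
noisy = "th\x07is"
cleaned = re.sub(r"th.is", "this", noisy, count=1)
print(cleaned)
```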
        !           315: .SH
        !           316: The Backslash `\*e'
        !           317: .PP
        !           318: Since a period means `any character',
        !           319: the question naturally arises of what to do
        !           320: when you really want a period.
        !           321: For example, how do you convert the line
        !           322: .P1
        !           323: Now is the time\*.
        !           324: .P2
        !           325: into
        !           326: .P1
        !           327: Now is the time?
        !           328: .P2
        !           329: The backslash `\*e' does the job.
        !           330: A backslash turns off any special meaning that the next character
        !           331: might have; in particular,
        !           332: `\*e\*.' converts the `\*.' from a `match anything'
        !           333: into a period, so
        !           334: you can use it to replace
        !           335: the period in
        !           336: .P1
        !           337: Now is the time\*.
        !           338: .P2
        !           339: like this:
        !           340: .P1
        !           341: s/\*e\*./?/
        !           342: .P2
        !           343: The pair of characters `\*e\*.' is considered by
        !           344: .UL ed
        !           345: to be a single real period.
        !           346: .PP
        !           347: The backslash can also be used when searching for lines
        !           348: that contain a special character.
        !           349: Suppose you are looking for a line that contains
        !           350: .P1
        !           351: \&\*.PP
        !           352: .P2
        !           353: The search
        !           354: .P1
        !           355: /\*.PP/
        !           356: .P2
        !           357: isn't adequate, for it will find
        !           358: a line like
        !           359: .P1
        !           360: THE APPLICATION OF ...
        !           361: .P2
        !           362: because the `\*.' matches the letter `A'.
        !           363: But if you say
        !           364: .P1
        !           365: /\*e\*.PP/
        !           366: .P2
        !           367: you will find only lines that contain `\*.PP'.
        !           368: .PP
        !           369: The backslash can also be used to turn off special meanings for
        !           370: characters other than `\*.'.
        !           371: For example, consider finding a line that contains a backslash.
        !           372: The search
        !           373: .P1
        !           374: /\*e/
        !           375: .P2
        !           376: won't work,
        !           377: because the `\*e' isn't a literal `\*e', but instead means that the second `/'
        !           378: no longer \%delimits the search.
        !           379: But by preceding a backslash with another one,
        !           380: you can search for a literal backslash.
        !           381: Thus
        !           382: .P1
        !           383: /\*e\*e/
        !           384: .P2
        !           385: does work.
        !           386: Similarly, you can search for a forward slash `/' with
        !           387: .P1
        !           388: /\*e//
        !           389: .P2
        !           390: The backslash turns off the meaning of the immediately following `/' so that
        !           391: it doesn't terminate the /.../ construction prematurely.
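The searches above can be imitated with a short sketch in Python, whose re module treats the backslash in much the same way. The sample lines and the helper name matching are made up for the illustration. Note one difference: Python has no /.../ delimiters, so a slash needs no backslash there.

```python
import re

# Sample lines, invented for this illustration.
lines = ["THE APPLICATION OF ...", ".PP", "a \\ b", "a/b"]

def matching(pattern):
    # Return the lines on which a context search for pattern would succeed.
    return [text for text in lines if re.search(pattern, text)]

loose = matching(r".PP")       # unescaped period also matches the A in APP
exact = matching(r"\.PP")      # escaped period matches only a real period
backslash = matching(r"\\")    # two backslashes match one literal backslash
slash = matching(r"/")         # no delimiter in Python, so no escape needed
print(loose, exact, backslash, slash)
```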
        !           392: .PP
        !           393: As an exercise, before reading further, find two substitute commands each of which will
        !           394: convert the line
        !           395: .P1
        !           396: \*ex\*e\*.\*ey
        !           397: .P2
        !           398: into the line
        !           399: .P1
        !           400: \*ex\*ey
        !           401: .P2
        !           402: .PP
        !           403: Here are several solutions;
        !           404: verify that each works as advertised.
        !           405: .P1
        !           406: s/\*e\*e\*e\*.//
        !           407: s/x\*.\*./x/
        !           408: s/\*.\*.y/y/
        !           409: .P2
        !           410: .PP
        !           411: A couple of miscellaneous notes about
        !           412: backslashes and special characters.
        !           413: First, you can use any character to delimit the pieces
        !           414: of an
        !           415: .UL s
        !           416: command: there is nothing sacred about slashes.
        !           417: (But you must use slashes for context searching.)
        !           418: For instance, in a line that contains a lot of slashes already, like
        !           419: .P1
        !           420: //exec //sys.fort.go // etc...
        !           421: .P2
        !           422: you could use a colon as the delimiter;
        !           423: to delete all the slashes, type
        !           424: .P1
        !           425: s:/::g
        !           426: .P2
        !           427: .PP
        !           428: Second, if # and @ are your character erase and line kill characters,
        !           429: you have to type \*e# and \*e@ to enter a literal # or @;
        !           430: this is true whether you're talking to
        !           431: .UL ed
        !           432: or any other program.
        !           433: .PP
        !           434: When you are adding text with
        !           435: .UL a
        !           436: or
        !           437: .UL i
        !           438: or
        !           439: .UL c ,
        !           440: backslash is not special, and you should only put in
        !           441: one backslash for each one you really want.
        !           442: .SH
        !           443: The Dollar Sign `$'
        !           444: .PP
        !           445: The next metacharacter, the `$', stands for `the end of the line'.
        !           446: As its most obvious use, suppose you have the line
        !           447: .P1
        !           448: Now is the
        !           449: .P2
        !           450: and you wish to add the word `time' to the end.
        !           451: Use the $ like this:
        !           452: .P1
        !           453: s/$/\*(BLtime/
        !           454: .P2
        !           455: to get
        !           456: .P1
        !           457: Now is the time
        !           458: .P2
        !           459: Notice that a space is needed before `time' in
        !           460: the substitute command,
        !           461: or you will get
        !           462: .P1
        !           463: Now is thetime
        !           464: .P2
        !           465: .PP
        !           466: As another example, replace the second comma in
        !           467: the following line with a period without altering the first:
        !           468: .P1
        !           469: Now is the time, for all good men,
        !           470: .P2
        !           471: The command needed is
        !           472: .P1
        !           473: s/,$/\*./
        !           474: .P2
        !           475: The $ sign here provides context to make specific which comma we mean.
        !           476: Without it, of course, the
        !           477: .UL s
        !           478: command would operate on the first comma to produce
        !           479: .P1
        !           480: Now is the time\*. for all good men,
        !           481: .P2
        !           482: .PP
        !           483: As another example, to convert
        !           484: .P1
        !           485: Now is the time\*.
        !           486: .P2
        !           487: into
        !           488: .P1
        !           489: Now is the time?
        !           490: .P2
        !           491: as we did earlier, we can use
        !           492: .P1
        !           493: s/\*.$/?/
        !           494: .P2
        !           495: .PP
        !           496: Like `\*.', the `$'
        !           497: has multiple meanings depending on context.
        !           498: In the line
        !           499: .P1
        !           500: $s/$/$/
        !           501: .P2
        !           502: the first `$' refers to the
        !           503: last line of the file,
        !           504: the second refers to the end of that line,
        !           505: and the third is a literal dollar sign,
        !           506: to be added to that line.
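The end-of-line examples can be tried with the following sketch, which again borrows Python's re module as an approximation of ed (an assumption for illustration; the strings hold no trailing newline, which keeps the two in agreement here).

```python
import re

# '$' in the pattern matches the empty spot at the end of the line,
# so "replacing" it appends text.
appended = re.sub(r"$", " time", "Now is the", count=1)

# ',$' means a comma that is the last character on the line.
last_comma = re.sub(r",$", ".", "Now is the time, for all good men,", count=1)

# Without the '$' the first comma is the one that changes.
first_comma = re.sub(r",", ".", "Now is the time, for all good men,", count=1)

# On the replacement side a dollar sign is an ordinary character.
literal = re.sub(r"$", "$", "total", count=1)

print(appended, last_comma, first_comma, literal, sep="\n")
```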
        !           507: .SH
        !           508: The Circumflex `^'
        !           509: .PP
        !           510: The circumflex (or hat or caret)
        !           511: `^' stands for the beginning of the line.
        !           512: For example, suppose you are looking for a line that begins
        !           513: with `the'.
        !           514: If you simply say 
        !           515: .P1
        !           516: /the/
        !           517: .P2
        !           518: you will in all likelihood find several lines that contain `the' in the middle before
        !           519: arriving at the one you want.
        !           520: But with
        !           521: .P1
        !           522: /^the/
        !           523: .P2
        !           524: you narrow the context, and thus arrive at the desired one
        !           525: more easily.
        !           526: .PP
        !           527: The other use of `^' is of course to enable you to insert
        !           528: something at the beginning of a line:
        !           529: .P1
        !           530: s/^/\*(BL/
        !           531: .P2
        !           532: places a space at the beginning of the current line.
        !           533: .PP
        !           534: Metacharacters can be combined. To search for a
        !           535: line that contains 
        !           536: .ul
        !           537: only 
        !           538: the characters
        !           539: .P1
        !           540: \&\*.PP
        !           541: .P2
        !           542: you can use the command
        !           543: .P1
        !           544: /^\*e\*.PP$/
        !           545: .P2
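A small sketch of the circumflex, alone and combined with the other metacharacters, using Python's re module as a stand-in for ed. The sample lines are invented for the illustration.

```python
import re

# Sample lines, invented for this illustration.
lines = ["the start", "in the middle", ".PP", ".PP extra", "x.PP"]

# '^the' finds only lines that begin with "the".
starts_with_the = [t for t in lines if re.search(r"^the", t)]

# '^\.PP$' finds lines that consist of exactly ".PP" and nothing else.
only_pp = [t for t in lines if re.search(r"^\.PP$", t)]

# Substituting for '^' inserts text at the beginning of the line.
indented = re.sub(r"^", " ", "the start", count=1)

print(starts_with_the, only_pp, repr(indented))
```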
        !           546: .SH
        !           547: The Star `*'
        !           548: .PP
        !           549: Suppose you have a line that looks like this:
        !           550: .P1
        !           551: \fItext \fR x                y \fI text \fR
        !           552: .P2
        !           553: where 
        !           554: .ul
        !           555: text 
        !           556: stands
        !           557: for lots of text,
        !           558: and there is some indeterminate number of spaces between the
        !           559: .UL x
        !           560: and the
        !           561: .UL y .
        !           562: Suppose the job is to replace all the spaces between
        !           563: .UL x
        !           564: and
        !           565: .UL y
        !           566: by a single space.
        !           567: The line is too long to retype, and there are too many spaces
        !           568: to count.
        !           569: What now?
        !           570: .PP
        !           571: This is where the metacharacter `*'
        !           572: comes in handy.
        !           573: A character followed by a star
        !           574: stands for as many consecutive occurrences of that
        !           575: character as possible.
        !           576: To refer to all the spaces at once, say
        !           577: .P1
        !           578: s/x\*(BL*y/x\*(BLy/
        !           579: .P2
        !           580: The construction
        !           581: `\*(BL*'
        !           582: means
        !           583: `as many spaces as possible'.
        !           584: Thus `x\*(BL*y' means `an x, as many spaces as possible, then a y'.
        !           585: .PP
        !           586: The star can be used with any character, not just space.
        !           587: If the original example was instead
        !           588: .P1
        !           589: \fItext \fR x--------y \fI text \fR
        !           590: .P2
        !           591: then all `\-' signs can be replaced by a single space
        !           592: with the command
        !           593: .P1
        !           594: s/x-*y/x\*(BLy/
        !           595: .P2
        !           596: .PP
        !           597: Finally, suppose that the line was
        !           598: .P1
        !           599: \fItext \fR x\*.\*.\*.\*.\*.\*.\*.\*.\*.\*.\*.\*.\*.\*.\*.\*.\*.\*.y \fI text \fR
        !           600: .P2
        !           601: Can you see what trap lies in wait for the unwary?
        !           602: If you blindly type
        !           603: .P1
        !           604: s/x\*.*y/x\*(BLy/
        !           605: .P2
        !           606: what will happen?
        !           607: The answer, naturally, is that it depends.
        !           608: If there are no other x's or y's on the line,
        !           609: then everything works, but it's blind luck, not good management.
        !           610: Remember that `\*.' matches
        !           611: .ul
        !           612: any
        !           613: single character?
        !           614: Then `\*.*' matches as many single characters as possible,
        !           615: and unless you're careful, it can eat up a lot more of the line
        !           616: than you expected.
        !           617: If the line was, for example, like this:
        !           618: .P1
        !           619: \fItext  \fRx\fI  text  \fR x\*.\*.\*.\*.\*.\*.\*.\*.\*.\*.\*.\*.\*.\*.\*.\*.y \fI  text  \fRy\fI  text  \fR
        !           620: .P2
        !           621: then saying
        !           622: .P1
        !           623: s/x\*.*y/x\*(BLy/
        !           624: .P2
        !           625: will take everything from the
        !           626: .ul
        !           627: first
        !           628: `x' to the
        !           629: .ul
        !           630: last
        !           631: `y',
        !           632: which, in this example, is undoubtedly more than you wanted.
        !           633: .PP
        !           634: The solution, of course, is to turn off the special meaning of
        !           635: `\*.' with
        !           636: `\*e\*.':
        !           637: .P1
        !           638: s/x\*e\*.*y/x\*(BLy/
        !           639: .P2
        !           640: Now everything works, for `\*e\*.*' means `as many
        !           641: .ul
        !           642: periods
        !           643: as possible'.
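To watch the trap spring, try this sketch, which models the two substitutions with Python's re module (an approximation of ed chosen for illustration). The filler words aaa, bbb, ccc and ddd are made up and deliberately contain no x or y.

```python
import re

line = "aaa x bbb x........y ccc y ddd"

# '.*' takes as much as it can: from the first x to the last y.
greedy = re.sub(r"x.*y", "x y", line, count=1)

# '\.*' takes only periods, so just the intended stretch is replaced.
periods = re.sub(r"x\.*y", "x y", line, count=1)

print(greedy)
print(periods)
```

The first result has lost everything between the first `x' and the last `y'; the second keeps the surrounding text intact.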
        !           644: .PP
        !           645: There are times when the pattern `\*.*' is exactly what you want.
        !           646: For example, to change
        !           647: .P1
        !           648: Now is the time for all good men ....
        !           649: .P2
        !           650: into
        !           651: .P1
        !           652: Now is the time\*.
        !           653: .P2
        !           654: use `\*.*' to eat up everything from the `for' onward:
        !           655: .P1
        !           656: s/\*(BLfor\*.*/\*./
        !           657: .P2
        !           658: .PP
        !           659: There are a couple of additional pitfalls associated with `*' that you should be aware of.
        !           660: Most notable is the fact that `as many as possible' means
        !           661: .ul
        !           662: zero
        !           663: or more.
        !           664: The fact that zero is a legitimate possibility is
        !           665: sometimes rather surprising.
        !           666: For example, if our line contained
        !           667: .P1
        !           668: \fItext \fR xy \fI text \fR x             y \fI text \fR
        !           669: .P2
        !           670: and we said
        !           671: .P1
        !           672: s/x\*(BL*y/x\*(BLy/
        !           673: .P2
        !           674: the
        !           675: .ul
        !           676: first
        !           677: `xy' matches this pattern, for it consists of an `x',
        !           678: zero spaces, and a `y'.
        !           679: The result is that the substitute acts on the first `xy',
        !           680: and does not touch the later one that actually contains some intervening spaces.
        !           681: .PP
        !           682: The way around this, if it matters, is to specify a pattern like
        !           683: .P1
        !           684: /x\*(BL\*(BL*y/
        !           685: .P2
        !           686: which says `an x, a space, then as many more spaces as possible, then a y',
        !           687: in other words, one or more spaces.
        !           688: .PP
        !           689: The other startling behavior of `*' is again related to the fact
        !           690: that zero is a legitimate number of occurrences of something
        !           691: followed by a star. The command
        !           692: .P1
        !           693: s/x*/y/g
        !           694: .P2
        !           695: when applied to the line
        !           696: .P1
        !           697: abcdef
        !           698: .P2
        !           699: produces
        !           700: .P1
        !           701: yaybycydyeyfy
        !           702: .P2
        !           703: which is almost certainly not what was intended.
        !           704: The reason for this behavior is that zero is a legal number
        !           705: of matches,
        !           706: and there are no x's at the beginning of the line
        !           707: (so that gets converted into a `y'),
        !           708: nor between the `a' and the `b'
        !           709: (so that gets converted into a `y'), nor ...
        !           710: and so on.
        !           711: Make sure you really want zero matches;
        !           712: if not, in this case write
        !           713: .P1
        !           714: s/xx*/y/g
        !           715: .P2
        !           716: `xx*' is one or more x's.
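.PP
The same surprise can be reproduced outside the editor.
The following is a rough sketch using Python's re module, which is an assumption of convenience and not part of ed; the function names are hypothetical.
Python's re.sub replaces every match by default, which plays the role of the g suffix.

```python
import re

def star_everywhere(line):
    # Like ed's  s/x*/y/g : zero x's match at every position in the line.
    return re.sub(r"x*", "y", line)

def one_or_more(line):
    # Like ed's  s/xx*/y/g : an x followed by zero or more x's.
    return re.sub(r"xx*", "y", line)

print(star_everywhere("abcdef"))
print(one_or_more("abcdef"))
print(one_or_more("axxxbxc"))
```

A line with no x's is left alone by the second pattern, which is usually what was wanted.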
        !           717: .SH
        !           718: The Brackets `[ ]'
        !           719: .PP
        !           720: Suppose that you want to delete any numbers
        !           721: that appear
        !           722: at the beginning of all lines of a file.
        !           723: You might first think of trying a series of commands like
        !           724: .P1
        !           725: 1,$s/^1*//
        !           726: 1,$s/^2*//
        !           727: 1,$s/^3*//
        !           728: .P2
        !           729: and so on,
        !           730: but this is clearly going to take forever if the numbers are at all long.
        !           731: Unless you want to repeat the commands over and over until
        !           732: finally all numbers are gone,
        !           733: you must get all the digits on one pass.
        !           734: This is the purpose of the brackets [ and ].
        !           735: .PP
        !           736: The construction
        !           737: .P1
        !           738: [0123456789]
        !           739: .P2
        !           740: matches any single digit;
        !           741: the whole thing is called a `character class'.
        !           742: With a character class, the job is easy.
        !           743: The pattern `[0123456789]*' matches zero or more digits (an entire number), so
        !           744: .P1
        !           745: 1,$s/^[0123456789]*//
        !           746: .P2
        !           747: deletes all digits from the beginning of all lines.
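.PP
As a quick check of what the character class does, here is a small sketch in Python.
The list of lines stands in for the editor's buffer, and strip_leading_digits is a name invented for this example; treat it as an illustration, not as a description of how ed works inside.

```python
import re

def strip_leading_digits(lines):
    # Like ed's  1,$s/^[0123456789]*//  applied to every line in the buffer.
    return [re.sub(r"^[0123456789]*", "", line) for line in lines]

buffer = ["123 first", "second", "40two 7"]
print(strip_leading_digits(buffer))
```

Only digits at the very beginning of a line disappear; the `7' later in the third line is untouched, because the `^' anchors the match.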
        !           748: .PP
        !           749: Any characters can appear within a character class,
        !           750: and just to confuse the issue there are essentially no special characters
        !           751: inside the brackets;
        !           752: even the backslash doesn't have a special meaning.
        !           753: To search for special characters, for example, you can say
        !           754: .P1
        !           755: /[\*.\*e$^[]/
        !           756: .P2
        !           757: Within [...], the `[' is not special.
        !           758: To get a `]' into a character class,
        !           759: make it the first character.
        !           760: .PP
        !           761: It's a nuisance to have to spell out the digits,
        !           762: so you can abbreviate them as
        !           763: [0\-9];
        !           764: similarly, [a\-z] stands for the lower case letters,
        !           765: and
        !           766: [A\-Z] for upper case.
        !           767: .PP
        !           768: As a final frill on character classes, you can specify a class
        !           769: that means `none of the following characters'.
        !           770: This is done by beginning the class with a `^':
        !           771: .P1
        !           772: [^0-9]
        !           773: .P2
        !           774: stands for `any character 
        !           775: .ul
        !           776: except
        !           777: a digit'.
        !           778: Thus you might find the first line that doesn't begin with a tab or space
        !           779: by a search like
        !           780: .P1
        !           781: /^[^(space)(tab)]/
        !           782: .P2
        !           783: .PP
        !           784: Within a character class,
        !           785: the circumflex has a special meaning 
        !           786: only if it occurs at the beginning.
        !           787: Just to convince yourself, verify that
        !           788: .P1
        !           789: /^[^^]/
        !           790: .P2
        !           791: finds a line that doesn't begin with a circumflex.
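.PP
If you would rather convince yourself away from the terminal, this sketch in Python mimics both searches.
It makes some assumptions: the buffer is a plain list of strings, the scan always starts at line 1 (a real ed search starts after dot and wraps around), and both function names are hypothetical.

```python
import re

def first_unindented(lines):
    # Like ed's  /^[^(space)(tab)]/ : the first character is neither
    # a space nor a tab. Returns a 1-based line number, or None.
    pattern = re.compile(r"^[^ \t]")
    for number, line in enumerate(lines, start=1):
        if pattern.search(line):
            return number
    return None

def begins_without_circumflex(line):
    # Like ed's  /^[^^]/ : only the first '^' inside the brackets negates;
    # the second one is an ordinary character.
    return re.search(r"^[^^]", line) is not None

print(first_unindented(["  a", "\tb", "c", "d"]))
print(begins_without_circumflex("^x"), begins_without_circumflex("x^"))
```

Note that an empty line matches neither pattern, since a negated class still has to match one character.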
        !           792: .SH
        !           793: The Ampersand `&'
        !           794: .PP
        !           795: The ampersand `&' is used primarily to save typing.
        !           796: Suppose you have the line
        !           797: .P1
        !           798: Now is the time
        !           799: .P2
        !           800: and you want to make it
        !           801: .P1
        !           802: Now is the best time
        !           803: .P2
        !           804: Of course you can always say
        !           805: .P1
        !           806: s/the/the best/
        !           807: .P2
        !           808: but it seems silly to have to repeat the `the'.
        !           809: The `&' is used to eliminate the repetition.
        !           810: On the
        !           811: .ul
        !           812: right
        !           813: side of a substitute, the ampersand means `whatever
        !           814: was just matched', so you can say
        !           815: .P1
        !           816: s/the/& best/
        !           817: .P2
        !           818: and the `&' will stand for `the'.
        !           819: Of course this isn't much of a saving if the thing
        !           820: matched is just `the', but if it is something truly long or awful,
        !           821: or if it is something like `.*'
        !           822: which matches a lot of text,
        !           823: you can save some tedious typing.
        !           824: There is also much less chance of making a typing error
        !           825: in the replacement text.
        !           826: For example, to parenthesize a line,
        !           827: regardless of its length,
        !           828: .P1
        !           829: s/\*.*/(&)/
        !           830: .P2
        !           831: .PP
        !           832: The ampersand can occur more than once on the right side:
        !           833: .P1
        !           834: s/the/& best and & worst/
        !           835: .P2
        !           836: makes
        !           837: .P1
        !           838: Now is the best and the worst time
        !           839: .P2
        !           840: and
        !           841: .P1
        !           842: s/\*.*/&? &!!/
        !           843: .P2
        !           844: converts the original line into
        !           845: .P1
        !           846: Now is the time? Now is the time!!
        !           847: .P2
        !           848: .PP
        !           849: To get a literal ampersand, naturally the backslash is used to turn off the special meaning:
        !           850: .P1
        !           851: s/ampersand/\*e&/
        !           852: .P2
        !           853: converts the word into the symbol.
        !           854: Notice that `&' is not special on the left side
        !           855: of a substitute, only on the
        !           856: .ul
        !           857: right 
        !           858: side.
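.PP
For comparison, here is how the same substitutions might look in Python.
This is a sketch under a loud assumption: Python's re module spells `whatever was just matched' as \g<0> rather than `&', and a bare `&' in its replacement text is already literal, so no backslash is needed there.
The function names are invented for the example, and count=1 imitates an s command without g.

```python
import re

LINE = "Now is the time"

def insert_best(line):
    # Like ed's  s/the/& best/
    return re.sub(r"the", r"\g<0> best", line, count=1)

def best_and_worst(line):
    # Like ed's  s/the/& best and & worst/ : the match is reused twice.
    return re.sub(r"the", r"\g<0> best and \g<0> worst", line, count=1)

def parenthesize(line):
    # Like ed's  s/.*/(&)/
    return re.sub(r".*", r"(\g<0>)", line, count=1)

def word_to_symbol(line):
    # Like ed's  s/ampersand/\&/ ; in Python '&' needs no escape here.
    return re.sub(r"ampersand", "&", line, count=1)

print(insert_best(LINE))
print(best_and_worst(LINE))
print(parenthesize(LINE))
print(word_to_symbol("salt ampersand pepper"))
```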
        !           859: .SH
        !           860: Substituting Newlines
        !           861: .PP
        !           862: .UL ed
        !           863: provides a facility for splitting a single line into two or more shorter lines by `substituting in a newline'.
        !           864: As the simplest example, suppose a line has gotten unmanageably long
        !           865: because of editing (or merely because it was unwisely typed).
        !           866: If it looks like
        !           867: .P1
        !           868: \fItext \fR   xy  \fI text \fR
        !           869: .P2
        !           870: you can break it between the `x' and the `y' like this:
        !           871: .P1
        !           872: s/xy/x\*e
        !           873: y/
        !           874: .P2
        !           875: This is actually a single command,
        !           876: although it is typed on two lines.
        !           877: Bearing in mind that `\*e' turns off special meanings,
        !           878: it seems relatively intuitive that a `\*e' at the end of
        !           879: a line would make the newline there
        !           880: no longer special.
        !           881: .PP
        !           882: You can in fact make a single line into several lines
        !           883: with this same mechanism.
        !           884: As a large example, consider underlining the word `very'
        !           885: in a long line
        !           886: by splitting `very' onto a separate line,
        !           887: and preceding it by the
        !           888: .UL roff
        !           889: or
        !           890: .UL nroff
        !           891: formatting command `.ul'.
        !           892: .P1
        !           893: \fItext \fR a very big \fI text \fR
        !           894: .P2
        !           895: The command
        !           896: .P1
        !           897: s/\*(BLvery\*(BL/\*e
        !           898: \&.ul\*e
        !           899: very\*e
        !           900: /
        !           901: .P2
        !           902: converts the line into four shorter lines,
        !           903: preceding the word `very' by the
        !           904: line
        !           905: `.ul',
        !           906: and eliminating the spaces around the `very',
        !           907: all at the same time.
        !           908: .PP
        !           909: When a newline is substituted
        !           910: in, dot is left pointing at the last line created.
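.PP
The effect of the `very' example can be imitated in Python, as a sketch only.
Here the substituted newlines are written as \\n inside the replacement string, and the result is split into a list to stand in for the separate lines that ed would create; underline_very is a hypothetical name.

```python
import re

def underline_very(line):
    # Like ed's multi-line  s/ very /\<newline>.ul\<newline>very\<newline>/
    # The spaces around 'very' are consumed by the match.
    replaced = re.sub(r" very ", "\n.ul\nvery\n", line, count=1)
    return replaced.split("\n")

for piece in underline_very("text a very big text"):
    print(piece)
```

One line goes in and four come out, with `.ul' on a line of its own just before `very'.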
        !           911: .PP
        !           912: .SH
        !           913: Joining Lines
        !           914: .PP
        !           915: Lines may also be joined together,
        !           916: but this is done with the
        !           917: .UL j
        !           918: command
        !           919: instead of
        !           920: .UL s .
        !           921: Given the lines
        !           922: .P1
        !           923: Now is
        !           924: \*(BLthe time
        !           925: .P2
        !           926: and supposing that dot is set to the first of them,
        !           927: then the command
        !           928: .P1
        !           929: j
        !           930: .P2
        !           931: joins them together.
        !           932: No blanks are added,
        !           933: which is why we carefully showed a blank 
        !           934: at the beginning of the second line.
        !           935: .PP
        !           936: All by itself,
        !           937: a
        !           938: .UL j
        !           939: command
        !           940: joins line dot to line dot+1,
        !           941: but any contiguous set of lines can be joined.
        !           942: Just specify the starting and ending line numbers.
        !           943: For example,
        !           944: .P1
        !           945: 1,$jp
        !           946: .P2
        !           947: joins all the lines into one big one
        !           948: and prints it.
        !           949: (More on line numbers in Section 3.)
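.PP
A rough model of what j does, written in Python under the assumption that the buffer is simply a list of strings and that line numbers are 1-based and inclusive, as they are in ed.
The name join_lines is made up for the sketch.

```python
def join_lines(lines, first, last):
    # Like ed's  first,lastj : the lines are concatenated with nothing
    # added between them, and the result replaces the whole range.
    joined = "".join(lines[first - 1:last])
    return lines[:first - 1] + [joined] + lines[last:]

print(join_lines(["Now is", " the time"], 1, 2))
print(join_lines(["a", "b", "c", "d"], 2, 3))
```

Because nothing is inserted between the pieces, the blank at the start of the second line is what keeps `is' and `the' apart.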
        !           950: .SH
        !           951: Rearranging a Line with \*e( ... \*e)
        !           952: .PP
        !           953: (This section should be skipped on first reading.)
        !           954: Recall that `&' is a shorthand that stands for whatever
        !           955: was matched by the left side of an
        !           956: .UL s
        !           957: command.
        !           958: In much the same way you can capture separate pieces
        !           959: of what was matched;
        !           960: the only difference is that you have to specify
        !           961: on the left side just what pieces you're interested in.
        !           962: .PP
        !           963: Suppose, for instance, that 
        !           964: you have a file of lines that consist of names in the form
        !           965: .P1
        !           966: Smith, A. B.
        !           967: Jones, C.
        !           968: .P2
        !           969: and so on,
        !           970: and you want the initials to precede the name, as in
        !           971: .P1
        !           972: A. B. Smith
        !           973: C. Jones
        !           974: .P2
        !           975: It is possible to do this with a series of editing commands,
        !           976: but it is tedious and error-prone.
        !           977: (It is instructive to figure out how it is done, though.)
        !           978: .PP
        !           979: The alternative
        !           980: is to `tag' the pieces of the pattern (in this case,
        !           981: the last name, and the initials),
        !           982: and then rearrange the pieces.
        !           983: On the left side of a substitution,
        !           984: if part of the pattern is enclosed between
        !           985: \*e( and \*e),
        !           986: whatever matched that part is remembered,
        !           987: and available for use on the right side.
        !           988: On the right side,
        !           989: the symbol `\*e1' refers to whatever
        !           990: matched the first \*e(...\*e) pair,
        !           991: `\*e2' to the second \*e(...\*e),
        !           992: and so on.
        !           993: .PP
        !           994: The command
        !           995: .P1
        !           996: 1,$s/^\*e([^,]*\*e),\*(BL*\*e(\*.*\*e)/\*e2\*(BL\*e1/
        !           997: .P2
        !           998: although hard to read, does the job.
        !           999: The first \*e(...\*e) matches the last name,
        !          1000: which is any string up to the comma;
        !          1001: this is referred to on the right side with `\*e1'.
        !          1002: The second \*e(...\*e) is whatever follows
        !          1003: the comma and any spaces,
        !          1004: and is referred to as `\*e2'.
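.PP
To see the tagged pieces at work outside the editor, here is a sketch in Python.
One difference must be stated plainly: Python's re module writes the grouping as plain ( and ) where ed uses backslashed parentheses, although \\1 and \\2 on the right side mean the same thing.
The function name initials_first is an invention for this example.

```python
import re

def initials_first(line):
    # Like ed's  s/^\([^,]*\), *\(.*\)/\2 \1/
    # Group 1: everything up to the comma (the last name).
    # Group 2: whatever follows the comma and any spaces (the initials).
    return re.sub(r"^([^,]*), *(.*)", r"\2 \1", line, count=1)

for name in ["Smith, A. B.", "Jones, C.", "Madonna"]:
    print(initials_first(name))
```

A line with no comma does not match the pattern at all, so it passes through unchanged.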
        !          1005: .PP
        !          1006: Of course, with any editing sequence this complicated,
        !          1007: it's foolhardy to simply run it and hope.
        !          1008: The global commands 
        !          1009: .UL g
        !          1010: and 
        !          1011: .UL v
        !          1012: discussed in Section 4
        !          1013: provide a way for you to print exactly those
        !          1014: lines which were affected by the
        !          1015: substitute command,
        !          1016: and thus verify that it did what you wanted
        !          1017: in all cases.