.net - Regex - Match Last Occurance -
i have text file full of names, want match them via regex.
each name ends following text: fsa fwb fcc, eg:
">dave smith\u0012\/a>\u0012\/div>\u0012div class=\"fsa fwb fcc i want use following expression match names:
""">.+?""fsa fwb fcc" aka match text "> fsa fwb fcc, can parse excess matched myself.
however "> occurs throughout file, starts matching earlier. have wondered how match last occurance of something, in case, ">, end specified.
thanks all, stan.
description
it looks you're ending string literally fsa fwb fcc, , beginning of substring you're interested in starts directly after last "> before end string.
this expression will:
- find substring between last
">, nextfsa fwb fcc
">((?:(?!">).)*)fsa\sfwb\sfcc

sample text
">sometext">a dave smith\u0012\/a>\u0012\/div>\u0012div class=\"fsa fwb fcc ">sometext">b dave smith\u0012\/a>\u0012\/div>\u0012div class=\"fsa fwb fcc ">sometext">c dave smith\u0012\/a>\u0012\/div>\u0012div class=\"fsa fwb fcc matches found:
[0][0] = ">a dave smith\u0012\/a>\u0012\/div>\u0012div class=\"fsa fwb fcc [0][1] = dave smith\u0012\/a>\u0012\/div>\u0012div class=\" [1][0] = ">b dave smith\u0012\/a>\u0012\/div>\u0012div class=\"fsa fwb fcc [1][1] = b dave smith\u0012\/a>\u0012\/div>\u0012div class=\" [2][0] = ">c dave smith\u0012\/a>\u0012\/div>\u0012div class=\"fsa fwb fcc [2][1] = c dave smith\u0012\/a>\u0012\/div>\u0012div class=\" or
if want go further , capture last "> through \u0012 before fsa fwb fcc ... i.e. actual name , not markup text, have @ expression
">((?:(?!">).)*?)\\u0012(?:(?!">).)*fsa\sfwb\sfcc

sample text
">sometext">a dave smith\u0012\/a>\u0012\/div>\u0012div class=\"fsa fwb fcc ">sometext">b dave smith\u0012\/a>\u0012\/div>\u0012div class=\"fsa fwb fcc ">sometext">c dave smith\u0012\/a>\u0012\/div>\u0012div class=\"fsa fwb fcc matches found
[0][0] = ">a dave smith\u0012\/a>\u0012\/div>\u0012div class=\"fsa fwb fcc [0][1] = dave smith [1][0] = ">b dave smith\u0012\/a>\u0012\/div>\u0012div class=\"fsa fwb fcc [1][1] = b dave smith [2][0] = ">c dave smith\u0012\/a>\u0012\/div>\u0012div class=\"fsa fwb fcc [2][1] = c dave smith
Comments
Post a Comment