Tested in PowerGREP and works fine. It means this regexp is perl-compatible.
You just need to escape some special C++ sumbols (if there are any) with a
backslash to make it friendly with C++ compiler.
I find this regexp better than above-suggested because it catches not only
URLs ending with extensions listed bu any sequence of symbols that are not
single or double quotes - thanks to negated character class [^'\"] repeated
with plus ['\"]+.
... I must admit I haven't tested it, because I don't understand it. My understanding of regular expressions is very basic. I don't know what \w or \s is. It...
hi tilman, this is absolutely wonderful. i tested version 1.2g(beta), and have the following comments: 1. it crashed when i copied your regex with the \ and...
... I suspect that your newsreader broke the long line. I did notice that some errors do crash the regexp :-( That happens when you use "external" software......
hi tilman, got good results with: Javascript=javascript:.*\(['"](.*[^'](\?.*)?)['"](.*) it found all URLs and top domains. it did not catch: javascript:openWin...
... ('http://udp.intercea.co.uk/deutsch/bus_popup.htm','432','287'); ... By using my regexp which I have already posted to this group. If you have lost that my...
hi eugeny, using your regex from an earlier mail: javascript:\w+\s*\(\s*['\"]((?:ftp|https?)://[^'\"]+?)['\"](?:\s*,[^,] +?\s*)*\s*\); xenu crashes after a...
... ['\"] is equal to ['"] ... That is correct. But question mark followed by colon in a regex construct (?:something) has another meaning. (option1|option2)...
Hi eugeny, Tnx, no i am reading the powergrep stuff. Let me cut this discussion short: did you test your regex with xenu 1.2g beta? If I do, it keeps crashing,...
Hi, Is there anyone on this list who has used Xenu's Link Sleuth to check huge websites (300000 links) ? I'm asking because one user claims it always crashes...
... javascript:\w+\s*\(\s*['"]((?:ftp|https?)://[^'"]+?)['"](?:\s*,[^,]+?\s*)*\s *\); ... There are different regex libraries , each with its own pecularities...
Hi eugeny, Thanks a million, i will try it out today. I want to understand how the regex fits into Xenu code, doesn't Tilmans regex require three parts: 1. the...
... No, my regular expression is to be set in the INI file. Which you did when testing. I might update Xenu for other reasons, like more flexibility, and a...
guys, i think we are getting near the end of this javascript link saga. eugeny, i get perfect results with your simplified regex: javascript:[_a-z0-9]+ *\(...
this page http://www.quickbrowse.com/whatsnew.cgi contains this piece of HTML code <a href="javascript:popup('/freelimits.html',400,300)">Some features</a> So,...
Hi eugeny, I noticed this code isn't matched by your regex: javascript: function('URL') When there's a spacer between "javascript" and "function", so I have...
I realize I hate these approval delays. Is there a way that postings from people who have made on-topic posts get through immediately? Or people who have been...
... "New" are those who joined after you have made you group restricted to post to. They do not become "old" automatically after some time. You have to make...
... First, my regexp has this part : (?:ftp|https?):// It means it will catch any string that starts either from ftp:// http:// https:// So as long as this...
Frank , I strongly recommend you to download a trial version of PowerGREP from www.powergrep.com It contains an excellent manual in PDF format on building...
... "\w" stands for wORD symbol, i.e. any letter or any digit "\w+" will catch any alphanumeric *word*, like "dog", "cat" or "a123bc" So in our example \w+...