Is there a regex to match a string that contains A but does not contain B [duplicate]
My problem is that i want to check the browserstring with pure regex.
Mozilla/5.0 (Linux; U; Android 3.0; en-us; Xoom Build/HRI39) AppleWebKit/534.13 (KHTML, like Gecko) Version/4.0 Safari/534.13
-> should match
Mozilla/5.0 (Linux; U; Android 2.2.1; en-us; Nexus One Build/FRG83) AppleWebKit/533.1 (KHTML, like Gecko) Version/4.0 Mobile Safari/533.1
should not match
my tried solution is: /?((?<=Android)(?:[^])*?(?=Mobile))/i
but it matches exactly wrong.
Solution 1:
You use look ahead assertions to check if a string contains a word or not.
If you want to assure that the string contains "Android" at some place you can do it like this:
^(?=.*Android).*
You can also combine them, to ensure that it contains "Android" at some place AND "Mobile" at some place:
^(?=.*Android)(?=.*Mobile).*
If you want to ensure that a certain word is NOT in the string, use the negative look ahead:
^(?=.*Android)(?!.*Mobile).*
This would require the word "Android to be in the string and the word "Mobile" is not allowed in the string. The .*
part matches then the complete string/row when the assertions at the beginning are true.
See it here on Regexr
Solution 2:
With some implementations of regular expressions, you can use a negative lookbehind assertion.
Per the docs, a negative lookbehind written as (?<!...)
matches only if the current position in the string is not preceded by a match for ...
Here's a Python interactive script showing how to use negative lookbehind with your sample strings:
>>> s = "Mozilla/5.0 (Linux; U; Android 3.0; en-us; Xoom Build/HRI39) AppleWebKit/534.13 (KHTML, like Gecko) Version/4.0 Safari/534.13"
>>> bool(re.search(r'Android.*(?<! Mobile) Safari', s))
True
>>> t = "Mozilla/5.0 (Linux; U; Android 2.2.1; en-us; Nexus One Build/FRG83) AppleWebKit/533.1 (KHTML, like Gecko) Version/4.0 Mobile Safari/533.1"
>>> bool(re.search(r'Android.*(?<! Mobile) Safari', t))
False
Solution 3:
I'd just break it up
if ((m/Android/i) && (m/Safari/i) && !(m/Mobile Safari/i))
That said, depending on regex flaviour, you could combine that
if ((m/Android/i) && (m/(?<!Mobile )Safari/i))
or even
if (m/Android.*(?<!Mobile )Safari/i)
FYI see Lookahead/lookbehind
Update Tested these fine with Perl5 regex flavour (arguably the most popular flavour):
perl -ne 'print "$. yes\n" if m/Android.*(?<!Mobile )Safari/i'
Shows:
1 yes
for the given input in the OP