Page 1 of 1

text crawler doesn't do multi line searches

Posted: Fri Aug 21, 2009 9:54 am
by future systemz
Thanks for a great tool.

I have found that text crawler doesn't do multi line searches, eg the following expression <html.*ead> whould match all of the below text.

<html xmlns="http://www.w3.org/1999/xhtml" xml:lang="en" lang="en">
<head>




thanks for any advice

Posted: Fri Aug 21, 2009 10:32 am
by future systemz
I tried this:

<html(.|\r\n)*?head>

its rather ugly but I guess its better than nothing :)

what do you think DV?

Posted: Mon Aug 24, 2009 8:48 am
by DV
The dot doesn't match across newlines in this regular expression implementation. You could use
<html[\s\S]*ead>
which would match.
This behaviour will be improved in TC 2.0...