I've been playing around with RegExBuddy for over an hour trying to figure out what I thought would be a trivial RegEx. I am looking for a RegEx statement that will let me extract the HTML content from just between the body tags from a XHTML document.
<!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.0 Strict//EN"
Example paragraph content
Would this work ?
Of course, you need to add the necessary
\s in order to take into account
< body ...> (element with spaces), as in:
On second thought, I am not sure why I needed a negative look-ahead... This should also work (for a well-formed xhtml document):