Wednesday, March 24, 2010

Remove image tags from an html page

On my company's wiki site, there is an internal phone book . I wanted to be able to use it offline, so I saved it to my local computer. The web paged saved with a folder and attached images (normal behavior). But I wanted to remove the images.

I used Scite's replace function with regular expressions, I first removed all line breaks, then I used the simple regular expression
 <img[^>]*>
to match for all images and replace them with no text.

No comments: