gae网址抓取问题
问题是这样我想抓取cn.bing.com 的背景图片地址,来用作我一个网页的的背景。我已经获取到了整个网页的代码。用什么方法能简单的把背景图片的URL 提取出来呢?我要提取的代码:
...L{_background:none}#bgDiv{opacity:1;background-image:url(http://s.cn.bing.net/az/hprichbg/rb/Kitty_ZH-CN7073082266_1366x768.jpg);}#hp_ctrls{hei...
刚接触python, 还不是很了解。可以使用正则表达式吗?使用正则表达式怎么做?哪位高手帮解决以下 正则表达式 gae python url
[解决办法]
Python 2.7.3 (default, Apr 10 2012, 23:31:26) [MSC v.1500 32 bit (Intel)] on win32
Type "copyright", "credits" or "license()" for more information.
>>> import re
>>> t = '...L{_background:none}#bgDiv{opacity:1;background-image:url(http://s.cn.bing.net/az/hprichbg/rb/Kitty_ZH-CN7073082266_1366x768.jpg);}#hp_ctrls{hei...'
>>> m = re.search(r'background-image:url\(([^)]+)', t, re.I)
>>> m.groups(1)[0]
'http://s.cn.bing.net/az/hprichbg/rb/Kitty_ZH-CN7073082266_1366x768.jpg'
>>>