如何用python提取信息入库
Jul 24 18:03:11 192.168.0.13 [login] time:20120724 18:03:11 user:大坏蛋 ip:211.143.50.33 result:succeed channel:10086 ua:{"HTTP_USER_AGENT":"-","HTTP_X_FORWARDED_FOR":"211.143.50.33","clientip":"211.143.50.33","HTTP_X_REAL_IP":"-"} domain:service.xxx.com
Jul 24 18:03:11 192.168.0.13 [login] time:20120724 18:03:11 user:as1984 ip:218.205.245.21 result:succeed channel:10086 ua:{"HTTP_USER_AGENT":"-","HTTP_X_FORWARDED_FOR":"218.205.245.21","clientip":"218.205.245.21","HTTP_X_REAL_IP":"-"} domain:service.xxx.com
上面是2行信息
需要提取以下几个字段后面的信息入库:
time:
user:
ip:
result:
ua:
domain:
user会是中文
请教如何用python实现?
[解决办法]
import repat = re.compile(r'time:(.*?) user:(.*?) ip:(.*?) result:(.*?) ua:(.*?) domain:(.*)', re.I)with open('test.txt') as fd: for line in fd: mat = pat.search(line) if mat: print ' | '.join(mat.groups())