*******************
            *  异常处理与调式         *
            *******************



***常见错误:***

1) 名字没有定义,NameError

In [1]: print a
---------------------------------------------------------------------------
NameError                                 Traceback (most recent call last)
in ()
----> 1 print a

NameError: name 'a' is not defined


2) 分母为零,ZeroDivisionError

In [2]: 10/0
---------------------------------------------------------------------------
ZeroDivisionError                         Traceback (most recent call last)
in ()
----> 1 10/0

ZeroDivisionError: integer division or modulo by zero


3) 文件不存在,IOError

In [3]: open("westos")
---------------------------------------------------------------------------
IOError                                   Traceback (most recent call last)
in ()
----> 1 open("westos")

IOError: [Errno 2] No such file or directory: 'westos'


4) 语法错误,SyntaxError

In [4]: for i in [1,2,3]
  File "", line 1
    for i in [1,2,3]
                    ^
SyntaxError: invalid syntax


5) 索引超出范围,IndexError

In [5]: a = [1,2,3]

In [6]: a[3]
---------------------------------------------------------------------------
IndexError                                Traceback (most recent call last)
in ()
----> 1 a[3]

IndexError: list index out of range

In [7]: t =(1,2,3)

In [8]: t[3]
---------------------------------------------------------------------------
IndexError                                Traceback (most recent call last)
in ()
----> 1 t[3]

IndexError: tuple index out of range

In [9]: t[1:9]            ###切片的时候,若超出范围,则默认为全部,不报错
Out[9]: (2, 3)


####python异常处理机制:try......except......finally######

例:
#!/usr/bin/env python
#coding:utf-8
try:                ###将可能发生错误的部分放在try下###
    print "staring......"
    li = [1,2,3]
    print a
    print li[3]
except IndexError:        ###捕获指定的异常###
    print 'index out of list length'
except NameError:        ###捕获指定的异常###
    print 'name is not define'
finally:            ###不管是否异常,一定会执行该代码块###
    print 'end......'



执行结果:

staring......            
name is not define
end......

###由结果可以看出,一旦捕获到异常就不会执行下面的语句,而是到了finally,如上例,捕获到NameError后,下一条语句就不再执行,因此,并没有去捕获IndexError,结果也只是输出NameError的打印内容和finally的打印内容#####


###当没有错误的时候可以加一条条件语句,显示没有错误####

#!/usr/bin/env python
#coding:utf-8

try:
    print "staring......"
    li = [1,2,3]
    a = 1
    print a
    print li[2]
except IndexError:
    print 'index out of list length'
except NameError:
    print 'name is not define'
else:                    ####如果没有异常,则执行该代码块###
    print "No Error"

finally:
    print 'end......'


执行结果:

staring......
1
3
No Error
end......




####可以将异常给变量,这样就可以自己打印错误####
#!/usr/bin/env python
#coding:utf-8

try:
    print "staring......"
    li = [1,2,3]
    print a
    print li[2]
except IndexError,e:        ###将错误赋给e###
    print e            ###会自动打印错误类型###
except NameError,e:
    print e
else:
    print "No Error"

finally:
    print 'end......'

执行结果:
staring......
name 'a' is not defined
end......



####在异常不知道的情况下,可以用BaseException,其实异常就是一个类,而BaseException是所有异常的父类####

#!/usr/bin/env python
#coding:utf-8

try:
    print "staring......"
    li = [1,2,3]
    print li[3]
    print a
except BaseException as e:
    print e
except BaseException as e:
    print e
else:
    print "No Error"

finally:
    print 'end......'


执行结果:
staring......
list index out of range
end......


####为了减少捕获异常的次数,可以将异常处理机制放在main函数下####

def func1(s):
    return func2(s)*2
def func2(s):
    return 10/s
def main():
    try:
        print func1("10")
    except TypeError,e:
        print e

main()



执行结果:
unsupported operand type(s) for /: 'int' and 'str'





####将错误信息写入文件###


import logging                ###导入logging模块###
logging.basicConfig(filename='err.log') ###使用basicConfig方法###
def func1(s):
    return func2(s)*2
def func2(s):
    return 10/s
def main():
    try:
        print func1("10")
    except Exception as e:
        logging.exception(e)        ###将异常信息写入err.log文件###

main()


err.log文件的内容:
ERROR:root:unsupported operand type(s) for /: 'int' and 'str'
Traceback (most recent call last):
  File "/home/kiosk/PycharmProjects/pythonbasic/py5.1/error.py", line 48, in main
    print func1("10")
  File "/home/kiosk/PycharmProjects/pythonbasic/py5.1/error.py", line 43, in func1
    return func2(s)*2
  File "/home/kiosk/PycharmProjects/pythonbasic/py5.1/error.py", line 45, in func2
    return 10/s
TypeError: unsupported operand type(s) for /: 'int' and 'str'

####由此可见,以将错误的信息导入到了err.log文件###



####抛出异常和自定义异常###


a = 1                ###抛出异常###
if a == 1:
    raise NameError


执行结果:

Traceback (most recent call last):
  File "/home/kiosk/PycharmProjects/pythonbasic/py5.1/error.py", line 57, in
    raise NameError
NameError



class MyError(BaseException):            ###自定义异常###
    pass

a = 1
if a == 1:
    raise MyError


执行结果:
charmProjects/pythonbasic/py5.1/error.py
Traceback (most recent call last):
  File "/home/kiosk/PycharmProjects/pythonbasic/py5.1/error.py", line 60, in
    raise MyError
__main__.MyError


注意:一定不能够即捕获异常有抛出异常


#####调试-断言####

断言失败:
def foo(s):
    n = int(s)
    return 10 / n
def main():
    foo('0')
assert foo(5) == 1        ###断言失败,assert 语句本身抛出AssertionError###
print 'hello'


执行结果:

Traceback (most recent call last):
  File "/home/kiosk/PycharmProjects/pythonbasic/py5.1/error.py", line 68, in
    assert foo(5) == 1
AssertionError

断言成功:

def foo(s):
    n = int(s)
    return 10 / n
def main():
    foo('0')
assert foo(5) == 2        ###断言成功,就执行下面的语句###
print 'hello'


执行结果:

hello

Python 解释器执行时可以用 -O 参数来关闭 assert,把所有的 assert 语句
当成 pass
例:
[kiosk@foundation38 py5.1]$ python -O error.py
hello



####将错误记录到文件里###

import logging
logging.basicConfig(filename='logging.log',level=logging.WARNING)    ###level=logging.WARNING定义日志级别为WARNING
def foo(s):
    n = int(s)
    logging.info('n=%d'% n)    ###日志级别为info的内容###
    logging.warning('n=%d...warn' % n)
    return 10 / n
def main():
    foo('0')
main()


执行后logging.log的内容为:
WARNING:root:n=0...warn        ###由此可见,并没有将日志级别为info的内容写入文件,因为,在最开始已经定义了级别为WARNING,除非级别比WARNING大的会记录到文件里###

例如:一开始定义日志级别为info

import logging
logging.basicConfig(filename='logging.log',level=logging.INFO)
def foo(s):
    n = int(s)
    logging.info('n=%d'% n)
    logging.warning('n=%d...warn' % n)
    return 10 / n
def main():
    foo('0')
main()

则执行后logging.log的内容为:
INFO:root:n=0
WARNING:root:n=0...warn

###日志级别:debug,info,warning,error####



###调试-pdb#####

pdb让程序以单步方式运行,随时查看运行状态。n 可以单步执行代码,p 变量名 来查看变量,q 结束调试,退出程序
[kiosk@foundation38 py5.1]$ python -m pdb error.py
> /home/kiosk/PycharmProjects/pythonbasic/py5.1/error.py(3)()
-> _author_ = "xiao"
(Pdb) n
> /home/kiosk/PycharmProjects/pythonbasic/py5.1/error.py(64)()
-> def foo(s):
(Pdb) n
> /home/kiosk/PycharmProjects/pythonbasic/py5.1/error.py(69)()
-> def main():
(Pdb)




####pdb.set_trace()####
import pdb
n1 = 1
n = int(n1)
print n
pdb.set_trace()
print 'world'
pdb.set_trace()
print 'hello'
s = 2
print s


调试:
1
> /home/kiosk/PycharmProjects/pythonbasic/py5.1/error.py(81)()
-> print 'world'
(Pdb) n
world
> /home/kiosk/PycharmProjects/pythonbasic/py5.1/error.py(82)()
-> pdb.set_trace()
(Pdb) c            ###继续###
> /home/kiosk/PycharmProjects/pythonbasic/py5.1/error.py(83)()
-> print 'hello'
(Pdb) c
hello
2






            *****************
            *   正则表达式           *
            *****************


###re.findall(p,text)###

将能匹配上的全返回,会返回一个 list

In [13]: s = 'redhat linux hello world'

In [14]: r = 'linux'

In [15]: import re        ###导入re模块###

In [16]: re.findall(r,s)    ###如果r在s里,则返回###
Out[16]: ['linux']

In [17]: re.findall('red',s)    
Out[17]: ['red']

In [18]: re.findall('westos',s)    ###如果不在,则返回空###
Out[18]: []




In [1]: s = 'python linux hello wor\\ld'

In [2]: r = 'wor\\l'

In [3]: import re

In [4]: re.findall(r,s)        ###因为\\是特殊字符,会认为是一个\,所以会找不到###
Out[4]: []

In [5]: r1 = r'wor\\l'        ###在前面加上r,就不会认为是特殊字符,就可以找到####

In [6]: re.findall(r1,s)
Out[6]: ['wor\\l']

In [7]:


####基本模式####

1 字面模式: 就是字面长量,就代表其本身
2 . :匹配任何字符
3 \d:匹配任何十进制数
4 \D:匹配任何非数字字符
5 \s:匹配任何空白字符
6 \S:匹配任何非空间字符
7 \w:匹配任何字母数字字符
8 \W:匹配任何非字母数字自符
9 ^ 开头 $ 结尾
10 \ 转义字符





###次数的匹配###

次数的匹配 , 匹配其前面的字符出现的次数 :
* 0 次或多次
+ 一次或多次
? 零次或一次
{n} 出现 n 次
{m,n} 出现 m 到 n 次 :{0,}相当于*,{1,}相当于+,{0,1}匹配一次或零次,相当于?




练习:
1 判断一个字符串是否是合法的 Email 的方法(要求以.com结尾的为合法的)


In [16]: r = r'\w+@\w+\.com'

In [17]: import re

In [18]: re.findall(r,'[email protected]  [email protected]')
Out[18]: ['[email protected]']





2 判断满足029-1234567这样要求的电话号码的方法


In [19]: r = r'^\d{3}-\d{7}'

In [20]: re.findall(r,'029-1234567  1234-1234567')
Out[20]: ['029-1234567']



3 判断变量是否合法


In [30]: r = r'^[_a-zA-Z]\w*$'

In [31]: re.findall(r,'_a-1')
Out[31]: []

In [32]: re.findall(r,'a')
Out[32]: ['a']

In [33]: re.findall(r,'a1')
Out[33]: ['a1']

In [34]: re.findall(r,'_a1')
Out[34]: ['_a1']

In [35]: re.findall(r,'1a1')
Out[35]: []




####编译re.compile()###

当我们在 Python 中使用正则表达式时,re 模块内部会干两件事情:
1. 编译正则表达式,如果正则表达式的字符串本身不合法,会报错;
2. 用编译后的正则表达式去匹配字符串。
重复使用几千次,出于效率的考虑,我们可以预编译该正则表达式。


In [36]: r = r't.p'

In [37]: p = re.compile(r)

In [38]: print p
<_sre.SRE_Pattern object at 0x1f6a3c0>

In [39]: re.findall(p,'top tap tab')
Out[39]: ['top', 'tap']




###re.match(p,text)###
re.match(p,text) :p 为正则表达式模式, text 要查找的字符串,会返回一个match 对象
一定要用group()查看返回值
In [40]: re.match(p,'top tap tab')
Out[40]: <_sre.SRE_Match at 0x23d0850>

In [41]: a = re.match(p,'top tap tab')

In [42]: a.group()        ###显示匹配到的第一个字符,即在‘top tap tab’中查看第一个是否满足p的条件,满足则匹配成功返回,不满足则报错###
Out[42]: 'top'

In [43]: a.start()        ###显示匹配的开始###
Out[43]: 0

In [44]: a.end()        ###显示匹配的结束###
Out[44]: 3

In [46]: a.span()        ###显示匹配的全部长度###
Out[46]: (0, 3)



####当字符串中的第一项不满足匹配条件时,报错####
In [16]: a = re.match(p,'tab top tap')

In [17]: a.group()
---------------------------------------------------------------------------
AttributeError                            Traceback (most recent call last)
in ()
----> 1 a.group()

AttributeError: 'NoneType' object has no attribute 'group'

###一般用条件判断语句来描述是否匹配到,如果字符串的第一个字符是所匹配的,则打印找到匹配,如果不是,则打印没有匹配###

In [54]: if a:
   ....:     print 'Match found:%s'% a.group()
   ....: else:
   ....:     print 'no match'
   ....:
no match



###re.search(p,text)######

只要在 text 中匹配到了 p 就返回,只返回第一个匹配到的,与re.match(p,text)的区别在与re.match(p,text)只在text的第一个查看是否匹配到p,而re.search(p,text)是在全部text中查找,只要text中有匹配到p的就返回

In [33]: p
Out[33]: re.compile(r'hello')

In [34]: re.search(p,'westos redhat hello linux')
Out[34]: <_sre.SRE_Match at 0x2033f38>

In [35]: a = re.search(p,'westos redhat hello linux')    ###hello并不在第一个,但是仍然匹配到,并且返回了####

In [36]: a.group()
Out[36]: 'hello'

In [37]:

小练习:要求以.com或者.cn结尾的为合法的邮件,判断字符串是否为合法的邮件
In [32]: email = r'\w+@\w+(\.com|\.cn)'

In [33]: a = re.search(email,'[email protected] [email protected]')

In [34]: a.group()
Out[34]: '[email protected]'



####re.finditer(p,text)####
找到re匹配的所有子串,并把它们作为一个迭代器返回

In [1]: import re

In [2]: a = re.fi
re.findall   re.finditer  

In [2]: a = re.finditer(r'hello', 'hello westos')

In [3]: a.next().group()
Out[3]: 'hello'



###re.sub(p,s,text)###

替换,将 p 匹配到的字符替换为 s

In [4]: s = 'hello westos'
In [6]: re.sub(r'wes..s','world','hello westos')   ###第一个字符指被替换的字符,第二个字符指要替换成的字符,第三个字符指被替换的字符串###
Out[6]: 'hello world'

In [7]: re.subn(r'wes..s','world','hello westos')    ###subn显示替换的次数###
Out[7]: ('hello world', 1)

In [8]: re.subn(r'wes..s','world','hello westos westos')
Out[8]: ('hello world world', 2)


####re.split(p,text)###
按照 p 匹配,并且以匹配到的字符为分隔符切割 text, 返回一个切割后的 list

In [10]: re.split(r'[\+\*]','123+34*18')    ###以+和*作为分割,将'123+34*18'切割###
Out[10]: ['123', '34', '18']




小练习:从http://172.25.254.252下提取图片

#!/usr/bin/env python
#coding:utf-8
import re
import urllib,urllib2

def getHtml(url):
    try:
        page = urllib.urlopen(url)    ###打开网址###
        html = page.read()        ###读出网址的内容###
    except urllib2.URLError,e:
        print 'Download error...%s'% e
    return html
def getImg(html):
    img_re = r'src="(.+\.jpg)"'
    img_recom = re.compile(img_re)
    imglist = re.findall(img_recom,html)
    x = 1
    for imgurl in imglist:
        urllib.urlretrieve(imgurl,"%s.png" %x)
        x += 1
html = getHtml('http://172.25.254.252/')
getImg(html)