Dave Python Exercise 17 -- Regular Expressions

1. First, an excerpt from the online documentation on special characters

A regular expression (or RE) specifies a set of strings that matches it; the functions in this module let you check if a particular string matches a given regular expression (or if a given regular expression matches a particular string, which comes down to the same thing).

Regular expressions can be concatenated to form new regular expressions; if A and B are both regular expressions, then AB is also a regular expression. In general, if a string p matches A and another string q matches B, the string pq will match AB. This holds unless A or B contain low precedence operations; boundary conditions between A and B; or have numbered group references. Thus, complex expressions can easily be constructed from simpler primitive expressions like the ones described here. For details of the theory and implementation of regular expressions, consult the Friedl book referenced above, or almost any textbook about compiler construction.

A brief explanation of the format of regular expressions follows. For further information and a gentler presentation, consult the Regular Expression HOWTO.

Regular expressions can contain both special and ordinary characters. Most ordinary characters, like 'A', 'a', or '0', are the simplest regular expressions; they simply match themselves. You can concatenate ordinary characters, so last matches the string 'last'. (In the rest of this section, we’ll write RE’s in this special style, usually without quotes, and strings to be matched 'in single quotes'.)

Some characters, like '|' or '(', are special. Special characters either stand for classes of ordinary characters, or affect how the regular expressions around them are interpreted. Regular expression pattern strings may not contain null bytes, but can specify the null byte using the \number notation, e.g., '\x00'.

The special characters are:

'.'

(Dot.) In the default mode, this matches any character except a newline. If the DOTALL flag has been specified, this matches any character including a newline.

'^'

(Caret.) Matches the start of the string, and in MULTILINE mode also matches immediately after each newline.

'$'

Matches the end of the string or just before the newline at the end of the string, and in MULTILINE mode also matches before a newline. foo matches both ‘foo’ and ‘foobar’, while the regular expression foo$ matches only ‘foo’. More interestingly, searching for foo.$ in 'foo1\nfoo2\n' matches ‘foo2’ normally, but ‘foo1’ in MULTILINE mode; searching for a single $ in 'foo\n' will find two (empty) matches: one just before the newline, and one at the end of the string.
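The '$' behaviour described above can be checked directly. This is a minimal sketch that reuses the sample strings from the text; the variable names are illustrative only.

```python
import re

text = 'foo1\nfoo2\n'

# Default mode: '$' matches only at the very end of the string
# or just before a trailing newline.
default_match = re.search(r'foo.$', text).group()

# MULTILINE mode: '$' also matches before every newline,
# so the first line is found first.
multiline_match = re.search(r'foo.$', text, re.MULTILINE).group()

# A bare '$' finds two empty matches in 'foo\n':
# one just before the newline and one at the end of the string.
dollar_positions = [m.start() for m in re.finditer(r'$', 'foo\n')]

print(default_match, multiline_match, dollar_positions)
```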

'*'

Causes the resulting RE to match 0 or more repetitions of the preceding RE, as many repetitions as are possible. ab* will match ‘a’, ‘ab’, or ‘a’ followed by any number of ‘b’s.

'+'

Causes the resulting RE to match 1 or more repetitions of the preceding RE. ab+ will match ‘a’ followed by any non-zero number of ‘b’s; it will not match just ‘a’.

'?'

Causes the resulting RE to match 0 or 1 repetitions of the preceding RE. ab? will match either ‘a’ or ‘ab’.

*?, +?, ??

The '*', '+', and '?' qualifiers are all greedy; they match as much text as possible. Sometimes this behaviour isn’t desired; if the RE <.*> is matched against '<H1>title</H1>', it will match the entire string, and not just '<H1>'. Adding '?' after the qualifier makes it perform the match in non-greedy or minimal fashion; as few characters as possible will be matched. Using .*? in the previous expression will match only '<H1>'.
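A small sketch of the greedy versus non-greedy difference, using the same '<H1>title</H1>' string as the paragraph above. The variable names are made up for the example.

```python
import re

html = '<H1>title</H1>'

# Greedy: '.*' runs to the last '>' in the string.
greedy = re.match(r'<.*>', html).group()

# Non-greedy: '.*?' stops at the first '>' that lets the match succeed.
lazy = re.match(r'<.*?>', html).group()

print(greedy)
print(lazy)
```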

{m}

Specifies that exactly m copies of the previous RE should be matched; fewer matches cause the entire RE not to match. For example, a{6} will match exactly six 'a' characters, but not five.

{m,n}

Causes the resulting RE to match from m to n repetitions of the preceding RE, attempting to match as many repetitions as possible. For example, a{3,5} will match from 3 to 5 'a' characters. Omitting m specifies a lower bound of zero, and omitting n specifies an infinite upper bound. As an example, a{4,}b will match aaaab or a thousand 'a' characters followed by a b, but not aaab. The comma may not be omitted or the modifier would be confused with the previously described form.

{m,n}?

Causes the resulting RE to match from m to n repetitions of the preceding RE, attempting to match as few repetitions as possible. This is the non-greedy version of the previous qualifier. For example, on the 6-character string 'aaaaaa', a{3,5} will match 5 'a' characters, while a{3,5}? will only match 3 characters.
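The counted qualifiers can be tried on the same inputs the text uses. This is only a sketch; `re.fullmatch` is used for the a{4,}b check so that the whole string has to match.

```python
import re

six_a = 'aaaaaa'

# Greedy form takes as many as allowed (5); non-greedy takes as few as allowed (3).
greedy = re.match(r'a{3,5}', six_a).group()
lazy = re.match(r'a{3,5}?', six_a).group()

# Omitting n leaves the upper bound open: at least four 'a' characters, then 'b'.
four_a_then_b = re.fullmatch(r'a{4,}b', 'aaaab') is not None
three_a_then_b = re.fullmatch(r'a{4,}b', 'aaab') is not None

print(greedy, lazy, four_a_then_b, three_a_then_b)
```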

'\'

Either escapes special characters (permitting you to match characters like '*', '?', and so forth), or signals a special sequence; special sequences are discussed below.

If you’re not using a raw string to express the pattern, remember that Python also uses the backslash as an escape sequence in string literals; if the escape sequence isn’t recognized by Python’s parser, the backslash and subsequent character are included in the resulting string. However, if Python would recognize the resulting sequence, the backslash should be repeated twice. This is complicated and hard to understand, so it’s highly recommended that you use raw strings for all but the simplest expressions.
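To make the backslash problem concrete, the sketch below compares an ordinary literal with a raw string for the '\b' sequence. The sample sentence is arbitrary.

```python
import re

# In an ordinary string literal, '\b' is one character: backspace (ASCII 8).
backspace = '\b'

# Doubling the backslashes and using a raw string produce the same pattern text.
same_pattern = ('\\bthe\\b' == r'\bthe\b')

# Raw string: '\b' reaches the regex parser and means "word boundary".
word = re.search(r'\bthe\b', 'bite the dog')

# Ordinary literal: the pattern is backspace + 'the' + backspace, which is not in the text.
no_match = re.search('\bthe\b', 'bite the dog')

print(ord(backspace), same_pattern, word.group(), no_match)
```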

[]

Used to indicate a set of characters. Characters can be listed individually, or a range of characters can be indicated by giving two characters and separating them by a '-'. Special characters are not active inside sets. For example, [akm$] will match any of the characters 'a', 'k', 'm', or '$'; [a-z] will match any lowercase letter, and [a-zA-Z0-9] matches any letter or digit. Character classes such as \w or \S (defined below) are also acceptable inside a range, although the characters they match depends on whether ASCII or LOCALE mode is in force. If you want to include a ']' or a '-' inside a set, precede it with a backslash, or place it as the first character. The pattern []] will match ']', for example.

You can match the characters not within a range by complementing the set. This is indicated by including a '^' as the first character of the set; '^' elsewhere will simply match the '^' character. For example, [^5] will match any character except '5', and [^^] will match any character except '^'.

Note that inside [] the special forms and special characters lose their meanings and only the syntaxes described here are valid. For example, +, *, (, ), and so on are treated as literals inside [], and backreferences cannot be used inside [].
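A few quick checks of the set rules above. The input strings are invented for the example.

```python
import re

# Characters listed individually; '$' is an ordinary character inside a set.
listed = re.findall(r'[akm$]', 'make $5')

# A leading '^' complements the set.
not_five = re.findall(r'[^5]', '1525')

# ']' placed first in the set is taken literally.
bracket = re.match(r'[]]', ']').group()

# '+', '*', '(' and ')' are plain literals inside a set.
literals = re.findall(r'[+*()]', '(a+b)*c')

print(listed, not_five, bracket, literals)
```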

'|'

A|B, where A and B can be arbitrary REs, creates a regular expression that will match either A or B. An arbitrary number of REs can be separated by the '|' in this way. This can be used inside groups (see below) as well. As the target string is scanned, REs separated by '|' are tried from left to right. When one pattern completely matches, that branch is accepted. This means that once A matches, B will not be tested further, even if it would produce a longer overall match. In other words, the '|' operator is never greedy. To match a literal '|', use \|, or enclose it inside a character class, as in [|].
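The "never greedy" rule for '|' is easy to see by swapping the order of the alternatives. A minimal sketch with made-up inputs:

```python
import re

# The left branch 'a' matches first, so the longer 'ab' is never tried.
first_wins = re.match(r'a|ab', 'ab').group()

# Putting the longer alternative first changes the result.
reordered = re.match(r'ab|a', 'ab').group()

# Two ways to match a literal '|'.
split_escaped = re.split(r'\|', 'a|b|c')
split_in_class = re.split(r'[|]', 'a|b|c')

print(first_wins, reordered, split_escaped, split_in_class)
```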

(...)

Matches whatever regular expression is inside the parentheses, and indicates the start and end of a group; the contents of a group can be retrieved after a match has been performed, and can be matched later in the string with the \number special sequence, described below. To match the literals '(' or ')', use \( or \), or enclose them inside a character class: [(] [)].

(?...)

This is an extension notation (a '?' following a '(' is not meaningful otherwise). The first character after the '?' determines what the meaning and further syntax of the construct is. Extensions usually do not create a new group; (?P<name>...) is the only exception to this rule. Following are the currently supported extensions.

(?aiLmsux)

(One or more letters from the set 'a', 'i', 'L', 'm', 's', 'u', 'x'.) The group matches the empty string; the letters set the corresponding flags: re.A (ASCII-only matching), re.I (ignore case), re.L (locale dependent), re.M (multi-line), re.S (dot matches all), and re.X (verbose), for the entire regular expression. (The flags are described in Module Contents.) This is useful if you wish to include the flags as part of the regular expression, instead of passing a flag argument to the re.compile() function.

Note that the (?x) flag changes how the expression is parsed. It should be used first in the expression string, or after one or more whitespace characters. If there are non-whitespace characters before the flag, the results are undefined.
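Inline flags can be sketched as follows. Each flag group is placed at the very start of its pattern; the patterns and inputs are illustrative only.

```python
import re

# (?i): ignore case for the whole expression.
ignore_case = re.match(r'(?i)dave', 'DAVE').group()

# (?s): '.' also matches a newline.
dot_all = re.match(r'(?s)a.b', 'a\nb') is not None

# (?x): whitespace and '#' comments inside the pattern are ignored.
verbose = re.match(r'''(?x)
    \d{3}   # three digits
    -       # a literal hyphen
    \d{4}   # four digits
''', '555-1234').group()

print(ignore_case, dot_all, verbose)
```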

(?:...)

A non-capturing version of regular parentheses. Matches whatever regular expression is inside the parentheses, but the substring matched by the group cannot be retrieved after performing a match or referenced later in the pattern.

(?P<name>...)

Similar to regular parentheses, but the substring matched by the group is accessible within the rest of the regular expression via the symbolic group name name. Group names must be valid Python identifiers, and each group name must be defined only once within a regular expression. A symbolic group is also a numbered group, just as if the group were not named. So the group named id in the example below can also be referenced as the numbered group 1.

For example, if the pattern is (?P<id>[a-zA-Z_]\w*), the group can be referenced by its name in arguments to methods of match objects, such as m.group('id') or m.end('id'), and also by name in the regular expression itself (using (?P=id)) and replacement text given to .sub() (using \g<id>).

(?P=name)

Matches whatever text was matched by the earlier group named name.
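The three ways of referring to a named group can be tried together. This sketch uses the (?P<id>[a-zA-Z_]\w*) pattern from the text; the input strings are invented.

```python
import re

ident = r'(?P<id>[a-zA-Z_]\w*)'

# By name (or by number) on the match object.
m = re.match(ident, 'spam_1 = 42')
by_name = m.group('id')
by_number = m.group(1)
end_pos = m.end('id')

# By name in the replacement text passed to sub().
wrapped = re.sub(ident, r'<\g<id>>', 'a b')

# By name inside the pattern itself: find a repeated word.
repeated = re.search(ident + r' (?P=id)', 'say the the word').group()

print(by_name, by_number, end_pos, wrapped, repeated)
```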

(?#...)

A comment; the contents of the parentheses are simply ignored.

(?=...)

Matches if ... matches next, but doesn’t consume any of the string. This is called a lookahead assertion. For example, Isaac (?=Asimov) will match 'Isaac ' only if it’s followed by 'Asimov'.

(?!...)

Matches if ... doesn’t match next. This is a negative lookahead assertion. For example, Isaac (?!Asimov) will match 'Isaac ' only if it’s not followed by 'Asimov'.
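Both lookahead forms can be checked with the Isaac Asimov example. Note that the lookahead text is not part of the match; 'Isaac Newton' is an extra input added for the sketch.

```python
import re

# Positive lookahead: 'Isaac ' matches only when 'Asimov' follows, and 'Asimov' is not consumed.
positive = re.search(r'Isaac (?=Asimov)', 'Isaac Asimov').group()

# Negative lookahead: fails when 'Asimov' follows, succeeds otherwise.
negative_blocked = re.search(r'Isaac (?!Asimov)', 'Isaac Asimov')
negative_ok = re.search(r'Isaac (?!Asimov)', 'Isaac Newton').group()

print(repr(positive), negative_blocked, repr(negative_ok))
```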

(?<=...)

Matches if the current position in the string is preceded by a match for ... that ends at the current position. This is called a positive lookbehind assertion. (?<=abc)def will find a match in abcdef, since the lookbehind will back up 3 characters and check if the contained pattern matches. The contained pattern must only match strings of some fixed length, meaning that abc or a|b are allowed, but a* and a{3,4} are not. Note that patterns which start with positive lookbehind assertions will never match at the beginning of the string being searched; you will most likely want to use the search() function rather than the match() function:

 
>>> import re
>>> m = re.search('(?<=abc)def', 'abcdef')
>>> m.group(0)
'def'

This example looks for a word following a hyphen:

 
>>> m = re.search(r'(?<=-)\w+', 'spam-egg')
>>> m.group(0)
'egg'

(?<!...)

Matches if the current position in the string is not preceded by a match for .... This is called a negative lookbehind assertion. Similar to positive lookbehind assertions, the contained pattern must only match strings of some fixed length. Patterns which start with negative lookbehind assertions may match at the beginning of the string being searched.
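A sketch contrasting the two lookbehind forms at the start of a string, plus a negative lookbehind that skips numbers written after a dollar sign. The price example is invented for illustration.

```python
import re

# A pattern that starts with a positive lookbehind cannot match at position 0 with match().
positive_at_start = re.match(r'(?<=-)\w+', 'spam-egg')

# A negative lookbehind can match at the beginning: nothing precedes it, so no '-' precedes it.
negative_at_start = re.match(r'(?<!-)\w+', 'spam').group()

# Numbers not immediately preceded by '$'.
plain_numbers = re.findall(r'(?<!\$)\b\d+', 'cost $30 for 12 items')

print(positive_at_start, negative_at_start, plain_numbers)
```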

(?(id/name)yes-pattern|no-pattern)

Will try to match with yes-pattern if the group with given id or name exists, and with no-pattern if it doesn’t. no-pattern is optional and can be omitted. For example, (<)?(\w+@\w+(?:\.\w+)+)(?(1)>|$) is a poor email matching pattern, which will match with '<user@host.com>' as well as 'user@host.com', but not with '<user@host.com' nor 'user@host.com>'.
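The conditional pattern can be exercised on four inputs, with and without the angle brackets. The address user@host.com is a sample value chosen for this sketch.

```python
import re

# Group 1 is the optional '<'. If it matched, require '>'; otherwise require end of string.
email = re.compile(r'(<)?(\w+@\w+(?:\.\w+)+)(?(1)>|$)')

results = {
    candidate: email.match(candidate) is not None
    for candidate in ('<user@host.com>', 'user@host.com',
                      '<user@host.com', 'user@host.com>')
}

for candidate, matched in results.items():
    print(candidate, matched)
```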

The special sequences consist of '\' and a character from the list below. If the ordinary character is not on the list, then the resulting RE will match the second character. For example, \$ matches the character '$'.

\number

Matches the contents of the group of the same number. Groups are numbered starting from 1. For example, (.+) \1 matches 'the the' or '55 55', but not 'the end' (note the space after the group). This special sequence can only be used to match one of the first 99 groups. If the first digit of number is 0, or number is 3 octal digits long, it will not be interpreted as a group match, but as the character with octal value number. Inside the '[' and ']' of a character class, all numeric escapes are treated as characters.
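The backreference examples from the paragraph above, plus one three-digit octal escape, as a minimal sketch:

```python
import re

double = re.compile(r'(.+) \1')

# \1 must repeat exactly what group 1 captured.
the_the = double.match('the the') is not None
fifty_five = double.match('55 55') is not None
the_end = double.match('the end') is not None

# Three octal digits are read as a character, not as a group number: octal 141 is 'a'.
octal_a = re.match(r'\141', 'a').group()

print(the_the, fifty_five, the_end, octal_a)
```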

\A

Matches only at the start of the string.

\b

Matches the empty string, but only at the beginning or end of a word. A word is defined as a sequence of Unicode alphanumeric or underscore characters, so the end of a word is indicated by whitespace or a non-alphanumeric, non-underscore Unicode character. Note that formally, \b is defined as the boundary between a \w and a \W character (or vice versa). By default Unicode alphanumerics are the ones used, but this can be changed by using the ASCII flag. Inside a character range, \b represents the backspace character, for compatibility with Python’s string literals.

\B

Matches the empty string, but only when it is not at the beginning or end of a word. This is just the opposite of \b, so word characters are Unicode alphanumerics or the underscore, although this can be changed by using the ASCII flag.
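A short sketch of \b and \B on one sentence, recording where each pattern matches. The sentence is invented for the example.

```python
import re

text = 'the other, bathe the cat'

# \bthe\b: 'the' as a whole word only.
whole_word_starts = [m.start() for m in re.finditer(r'\bthe\b', text)]

# \Bthe: 'the' that does not begin at a word boundary (inside 'other' and 'bathe').
inside_word_starts = [m.start() for m in re.finditer(r'\Bthe', text)]

# Inside a character set, \b means the backspace character instead.
backspace_in_set = re.match(r'[\b]', '\x08') is not None

print(whole_word_starts, inside_word_starts, backspace_in_set)
```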

\d

For Unicode (str) patterns: Matches any Unicode decimal digit (that is, any character in Unicode character category [Nd]). This includes [0-9], and also many other digit characters. If the ASCII flag is used only [0-9] is matched (but the flag affects the entire regular expression, so in such cases using an explicit [0-9] may be a better choice).
For 8-bit (bytes) patterns: Matches any decimal digit; this is equivalent to [0-9].
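The difference between str and bytes patterns for \d can be seen with a non-ASCII digit. The sketch uses U+0662 (ARABIC-INDIC DIGIT TWO) as a sample character from category Nd.

```python
import re

text = '1' + '\u0662' + '3'

# str pattern, default: any Unicode decimal digit.
unicode_digits = re.findall(r'\d', text)

# str pattern with the ASCII flag: only [0-9].
ascii_digits = re.findall(r'\d', text, re.ASCII)

# bytes pattern: always equivalent to [0-9].
bytes_digits = re.findall(rb'\d', b'a1b2')

print(len(unicode_digits), ascii_digits, bytes_digits)
```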

\D

Matches any character which is not a Unicode decimal digit. This is the opposite of \d. If the ASCII flag is used this becomes the equivalent of [^0-9] (but the flag affects the entire regular expression, so in such cases using an explicit [^0-9] may be a better choice).

\s

For Unicode (str) patterns: Matches Unicode whitespace characters (which includes [ \t\n\r\f\v], and also many other characters, for example the non-breaking spaces mandated by typography rules in many languages). If the ASCII flag is used, only [ \t\n\r\f\v] is matched (but the flag affects the entire regular expression, so in such cases using an explicit [ \t\n\r\f\v] may be a better choice).
For 8-bit (bytes) patterns: Matches characters considered whitespace in the ASCII character set; this is equivalent to [ \t\n\r\f\v].

\S

Matches any character which is not a Unicode whitespace character. This is the opposite of \s. If the ASCII flag is used this becomes the equivalent of [^ \t\n\r\f\v] (but the flag affects the entire regular expression, so in such cases using an explicit [^ \t\n\r\f\v] may be a better choice).

\w

For Unicode (str) patterns: Matches Unicode word characters; this includes most characters that can be part of a word in any language, as well as numbers and the underscore. If the ASCII flag is used, only [a-zA-Z0-9_] is matched (but the flag affects the entire regular expression, so in such cases using an explicit [a-zA-Z0-9_] may be a better choice).
For 8-bit (bytes) patterns: Matches characters considered alphanumeric in the ASCII character set; this is equivalent to [a-zA-Z0-9_].

\W

Matches any character which is not a Unicode word character. This is the opposite of \w. If the ASCII flag is used this becomes the equivalent of [^a-zA-Z0-9_] (but the flag affects the entire regular expression, so in such cases using an explicit [^a-zA-Z0-9_] may be a better choice).

\Z

Matches only at the end of the string.

Most of the standard escapes supported by Python string literals are also accepted by the regular expression parser:

 
\a      \b      \f      \n
\r      \t      \v      \x
\\

Octal escapes are included in a limited form: If the first digit is a 0, or if there are three octal digits, it is considered an octal escape. Otherwise, it is a group reference. As for string literals, octal escapes are always at most three digits in length.

2. Tests

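# encoding=utf-8

############# Regular expressions ######################
# Regular expressions (REs) provide the foundation for advanced text pattern
# matching and for search-and-replace. An RE is a string made up of ordinary
# characters and special symbols that describes those characters and how they
# repeat, so that one pattern can match a whole set of strings with similar
# features.
# Python supports regular expressions through the re module of the standard library.

##*************** Part 1: Special symbols and characters used in regular expressions **************

## 1.1 Matching more than one RE pattern with the pipe symbol ( | )
# The pipe symbol ( | ), the vertical bar on your keyboard, is an "or" operation:
# it selects one of the regular expressions separated by the pipe.
# David|Dai  -->  David, Dai

## 1.2 Matching any single character ( . )
# The dot, or period ( . ), matches any single character except a newline.
# (Python regular expressions have a compile flag, S or DOTALL, that lifts this
# restriction so that ( . ) also matches newlines.) Letters, digits, whitespace
# other than "\n", printable characters, non-printable characters, symbols:
# the dot matches all of them.
#
# RE pattern    Strings matched
# f.o           Any character between "f" and "o", e.g. fao, f9o, f#o
# ..            Any two characters
# .end          Any single character followed by the string end

## 1.3 Matching at the start or end of a string, or at a word boundary ( ^ / $ / \b / \B )
# Some symbols and special characters anchor the search to the start or the end
# of the string. To match a pattern at the start of a string, use the caret ( ^ )
# or the special character \A (a backslash followed by a capital A). The latter
# is mainly for keyboards that have no caret key, such as some international
# keyboards. Similarly, the dollar sign ( $ ) or \Z matches (with zero width)
# the end of the string.
#
# RE pattern        Strings matched
# ^From             Any string that starts with From
# /bin/tcsh$        Any string that ends with /bin/tcsh
# ^Subject: hi$     Any string consisting solely of Subject: hi
#
# The special characters \b and \B match word boundaries. \b matches a word
# boundary: the pattern attached to it must begin a word, whether that word has
# characters before it (it sits in the middle of the string) or not (it starts
# the line). \B, by contrast, matches only a pattern that appears inside a word
# (that is, not at a word boundary). Some examples:
#
# RE pattern    Strings matched
# the           Any string containing "the"
# \bthe         Any word that starts with "the"
# \bthe\b       Only the word "the"
# \Bthe         Any word that contains "the" but does not start with it

## 1.4 Creating character classes ( [ ] )
# A regular expression in square brackets matches any one of the characters
# inside the brackets. Some examples:
#
# RE pattern          Strings matched
# b[aeiu]t            bat, bet, bit, but
# [cr][23][dp][o2]    A four-character string: "r" or "c", then "2" or "3",
#                     then "d" or "p", and finally "o" or "2",
#                     e.g. c2do, r3p2, r2d2, c3po, and so on

## 1.5 Ranges ( - ) and negation ( ^ )
# Besides single characters, square brackets also support character ranges. A
# hyphen between a pair of symbols inside the brackets denotes a range, e.g.
# A-Z, a-z or 0-9 for uppercase letters, lowercase letters and decimal digits.
# The range follows character order, so it is not limited to letters and digits.
# In addition, if the first character after the left bracket is a caret ( ^ ),
# the set matches any character that is NOT in the given set.
#
# RE pattern          Strings matched
# z.[0-9]             The character "z", then any single character, then a decimal digit
# [r-u][env-y][us]    One of "r", "s", "t" or "u", followed by one of "e", "n", "v", "w", "x" or "y", followed by "u" or "s"
# [^aeiou]            A non-vowel character (exercise: why do we say "non-vowel" rather than "consonant"?)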
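# [^\t\n]             Any single character other than a tab or a newline
# ["-a]               On systems that use ASCII, any character whose ordinal value lies between '"' and "a", i.e. between 34 and 97

## 1.6 Multiple occurrence/repetition with closure operators ( *, +, ?, {} )
# The special symbols "*", "+" and "?" match a pattern that occurs once, many
# times, or not at all.
# The asterisk (star) operator matches zero or more occurrences of the RE to its left.
# The plus operator ( + ) matches one or more occurrences of the RE to its left
# (it is also called the positive closure operator), and
# the question mark operator ( ? ) matches zero or one occurrence.
# There are also the brace operators ( { } ), which take either a single value
# or a comma-separated pair of values. A single value, {N}, matches exactly N
# occurrences; a pair, {M,N}, matches from M to N occurrences.
# Any of these symbols can be escaped with a backslash to remove its special
# meaning, e.g. "\*" matches a literal asterisk.

## 1.7 Special characters that denote character sets
# Some special characters stand for whole character sets. For example, instead
# of the range "0-9" you can write the shorthand "\d" for a decimal digit.
# Another special character, "\w", denotes the entire alphanumeric class, i.e.
# it is shorthand for "A-Za-z0-9_", and "\s" stands for whitespace characters.
# The uppercase forms negate the set: "\D", for instance, matches any character
# that is not a decimal digit (equivalent to "[^0-9]"), and so on.

## 1.8 Grouping with parentheses ( ( ) )
# A pair of parentheses in a regular expression can do either (or both) of the following:
#   group regular expressions
#   match subgroups
#
# Sometimes you need to group regular expressions. One good example is comparing
# a string against two different regular expressions. Another reason is to apply
# a repetition operator to a whole expression (rather than to a single character
# or a single character class).
# A side benefit of parentheses is that the matched substring is saved in a
# subgroup for later use. Subgroups can be referenced again within the same
# match or search, or extracted for further processing.

##*************** Part 2: Regular expressions and Python **************
# Python's default regular expression module is the re module.

## 2.1 The re module: core functions and methods
# Common regular expression functions and methods
#
# Function/method    Description
#
##### re module function only
# compile(pattern, flags=0)             Compile the RE pattern, with optional flags, and return a regex object
#
##### re module functions and regex object methods
# match(pattern, string, flags=0)       Try to match the RE pattern against string, with optional flags; return a match object on success, otherwise None
# search(pattern, string, flags=0)      Look for the first occurrence of the RE pattern in string, with optional flags; return a match object on success, otherwise None
# findall(pattern, string[, flags])     Find all non-overlapping occurrences of pattern in string; return a list of the matched strings (tuples when the pattern has several groups)
# finditer(pattern, string[, flags])    Same as findall(), but return an iterator instead of a list; the iterator yields a match object for each match
# split(pattern, string, maxsplit=0)    Split string into a list at the delimiters matched by pattern and return that list, splitting at most maxsplit times (by default, at every match)
# sub(pattern, repl, string, count=0)   Replace every place in string that matches pattern with repl; if count is not given, all matches are replaced
#
##### Match object methods
# group(num=0)                          Return the entire match (or the subgroup numbered num)
# groups()                              Return a tuple of all matched subgroups (an empty tuple if the pattern has no subgroups)

# Core note: RE compilation (when should you use compile()?)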
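# Python code is eventually compiled to bytecode, which the interpreter then
# executes. Using a precompiled code object is faster than using a string,
# because the interpreter has to compile a string into a code object before it
# can run it.
# The same idea applies to regular expressions: before any matching happens, a
# pattern must be compiled into a regex object. Since a regular expression is
# usually compared many times during execution, we strongly recommend
# precompiling it; and since compilation is required anyway, precompiling to
# improve performance is the sensible choice. re.compile() provides this.
# The module functions do cache compiled objects, so not every search() or
# match() call with the same pattern has to compile it again. Even so, you
# still save the cache lookups and the overhead of repeatedly calling functions
# with the same string.

## 2.2 Compiling regular expressions with compile()
# Most re module functions are also available as methods of regex objects. Note
# that although precompiling is recommended, it is not required. If you compile,
# use the methods; if you do not, use the functions.

## 2.3 Match objects and the group() and groups() methods
# Besides regex objects, there is another object type in regular expression
# handling: the match object. These are the objects returned by a successful
# call to match() or search(). Match objects have two main methods: group() and
# groups().
#
# group() returns either the entire match or, on request, a specific subgroup.
# groups() simply returns a tuple containing the only subgroup or all subgroups.
# If the regular expression has no subgroups, groups() returns an empty tuple
# while group() still returns the entire match.

## 2.4 Matching strings with match()
# match() tries to match the pattern from the start of the string. On success it
# returns a match object; on failure it returns None. The group() method of the
# match object shows the successful match.
#import re
#m = re.match('foo', 'food on the table')  # match succeeds
#print(m.group())
#-->
#foo
# A more concise way to write it:
#print(re.match('foo', 'food on the table').group())

## 2.5 Looking for a pattern within a string with search() (searching vs. matching)
# The pattern you are looking for is more likely to appear somewhere in the
# middle of a string than at its very start. This is where search() comes in
# handy. search() works like match(), except that it checks for a match of the
# pattern at any position in the string. If a match is found, a match object is
# returned; otherwise None is returned.
#import re
#m = re.match('foo', 'seafood')  # no match
#if m is not None:
#    print(m.group())
## Prints nothing: this match fails. match() tries to match the pattern at the
## start of the string, i.e. the "f" of the pattern is tried against the first
## letter "s" of the string, which is bound to fail.
#
## Use search() instead. search() looks for the first occurrence of the pattern
## in the string rather than trying to match (at the start). Strictly speaking,
## search() scans from left to right.
#m = re.search('foo', 'seafood')  # use search() instead
#if m is not None:
#    print(m.group())
#-->foo

## 2.6 Matching more than one string ( | )
#import re
#bt = 'bat|bet|bit'  # RE pattern: bat, bet, bit
#m = re.match(bt, 'bat')  # 'bat' is a match
#if m is not None:
#    print(m.group())
#-->bat

## 2.7 Matching any single character ( . )
# The dot cannot match a newline or a non-character (i.e. the empty string).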
#import re
#anyend = '.end'
#m = re.match(anyend, 'bend')  # dot matches 'b'
#if m is not None: print(m.group())
#
#m = re.match(anyend, 'end')  # no char to match
#if m is not None: print(m.group())
#
#m = re.match(anyend, '\nend')  # any char except \n
#if m is not None: print(m.group())
#
#m = re.search('.end', 'The end.')  # matches ' ' in search
#if m is not None: print(m.group())

# To search for a real dot (decimal point), escape it with a backslash in the
# regular expression so that it loses its special meaning:
#import re
#patt314 = '3.14'      # RE dot
#pi_patt = r'3\.14'    # literal dot (decimal point)
#m = re.match(pi_patt, '3.14')  # exact match
#if m is not None: print(m.group())
#
#m = re.match(patt314, '3014')  # dot matches '0'
#if m is not None: print(m.group())

## 2.8 Creating character classes ( [ ] )
#import re
#m = re.match('[cr][23][dp][o2]', 'c3po')  # matches 'c3po'
#if m is not None: print(m.group())
#
#m = re.match('r2d2|c3po', 'c2do')  # does not match 'c2do'
#if m is not None: print(m.group())
#
#m = re.match('r2d2|c3po', 'r2d2')  # matches 'r2d2'
#if m is not None: print(m.group())

## 2.9 Repetition, special characters and subgroups
# The most common regular expressions combine special characters, repeated
# patterns, and parentheses that group and extract parts of the match.
#import re
#patt = r'\w+@(\w+\.)*\w+\.com'
#print(re.match(patt, 'nobody@www.xxx.yyy.zzz.com').group())
#'nobody@www.xxx.yyy.zzz.com'
#m = re.match(r'(\w\w\w)-(\d\d\d)', 'abc-123')
#print(m.group())   # entire match
#'abc-123'
#print(m.group(1))  # subgroup 1
#'abc'
#print(m.group(2))  # subgroup 2
#'123'
#print(m.groups())  # all subgroups
#('abc', '123')

## 2.10 Matching at the start or end of a string and at word boundaries
# match() always matches from the start of the string.
#import re
#m = re.search('^The', 'The end.')  # match
#if m is not None: print(m.group())
#
#m = re.search('^The', 'end. The')  # not at beginning
#if m is not None: print(m.group())
#
#m = re.search(r'\bthe', 'bite the dog')  # at a boundary
#if m is not None: print(m.group())
#
#m = re.search(r'\bthe', 'bitethe dog')  # no boundary
#if m is not None: print(m.group())
#
#m = re.search(r'\Bthe', 'bitethe dog')  # no boundary
#if m is not None: print(m.group())

## 2.11 Finding every occurrence with findall()
# findall() looks for all non-overlapping occurrences of an RE pattern in a
# string. It resembles search() in that both search the string, but unlike
# match() and search(), findall() always returns a list. If nothing matches,
# the list is empty; otherwise it contains all the matches, in left-to-right order.
#import re
#print(re.findall('car', 'car'))
#['car']
#print(re.findall('car', 'scary'))
#['car']
#print(re.findall('car', 'carry the barcardi to the car'))
#['car', 'car', 'car']

## 2.12 Searching and replacing with sub() [and subn()]
# Two functions/methods perform search and replace: sub() and subn(). They are
# almost identical: both replace every part of a string that matches the RE
# pattern. The replacement is usually a string, but it can also be a function
# that returns the replacement string. subn() works like sub() but additionally
# reports the number of substitutions; the new string and that count are
# returned together as a tuple.
#import re
#print(re.sub('X', 'Dave', 'attn: X\n\nDear X,\n'))
#-->
#attn: Dave
#
#Dear Dave,
#print(re.subn('X', 'Dave', 'attn: X\n\nDear X,\n'))
#-->
#('attn: Dave\n\nDear Dave,\n', 2)
#print(re.sub('[ae]', 'X', 'abcdef'))
#-->
#XbcdXf
#print(re.subn('[ae]', 'X', 'abcdef'))
#-->('XbcdXf', 2)

## 2.13 Splitting (on a delimiting pattern) with split()
# The split() function of the re module and the split() method of regex objects
# resemble the split() method of strings, except that they split on a regular
# expression pattern rather than on a fixed string, which makes splitting far
# more powerful. If you do not want to split at every match, pass a (non-zero)
# value to set the maximum number of splits.
# If the delimiter is not a regular expression that uses special symbols to
# match multiple patterns, re.split() behaves just like string.split().
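#import re
#print(re.split(':', 'str1:str2:str3'))
#-->
#['str1', 'str2', 'str3']

# Core note: using Python raw strings
# Raw strings exist largely because of regular expressions. The reason is the
# conflict between ASCII characters and regular expression special characters.
# For example, the special symbol "\b" stands for the backspace character in
# ASCII, but "\b" is also a regular expression special symbol meaning "match a
# word boundary". For the RE compiler to see the two characters "\b" that you
# intended, rather than a backspace, you have to escape the backslash with
# another backslash and write "\\b". A raw string, r'\b', avoids the doubling.

##*************** Part 3: Regular expression example **************

## Matching a string
import re
data = 'Thu Feb 15 17:46:04 2012::uzifzf@dpyivihw.gov::1171590364-6-8'

# We want to extract the day of the week from data above.

# Method 1:
#patt = '^(Mon|Tue|Wed|Thu|Fri|Sat|Sun)'
#m = re.match(patt, data)
#print(m.group())   # entire match
#print(m.group(1))  # subgroup 1
#-->
#Thu
#Thu

# Method 2:
#patt = r'^(\w){3}'
#m = re.match(patt, data)
#if m is not None:
#    print(m.group())
#    print(m.group(1))
#-->
#Thu
#u
# Subgroup 1 shows only "u" because its contents are overwritten by each
# successive character: m.group(1) is first "T", then "h", and finally "u".
# The pattern is a single-character group applied three times, each repetition
# capturing one alphanumeric character, not one group made of three
# consecutive alphanumeric characters.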
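The repeated-group behaviour noted at the end of the script can be checked directly, and moving the {3} inside the parentheses captures all three characters. A minimal sketch that reuses the same data string; the variable names are illustrative only.

```python
import re

data = 'Thu Feb 15 17:46:04 2012::uzifzf@dpyivihw.gov::1171590364-6-8'

# Group applied three times: only the last repetition is kept in group 1.
repeated_group = re.match(r'^(\w){3}', data)

# Quantifier inside the group: group 1 holds all three characters.
whole_group = re.match(r'^(\w{3})', data)

print(repeated_group.group(), repeated_group.group(1))
print(whole_group.group(), whole_group.group(1))
```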

-------------------------------------------------------------------------------------------------------
Blog： http://blog.csdn.net/tianlesoftware
Weibo: http://weibo.com/tianlesoftware

最近想写一个游戏，然后碰到有关线程的问题，网上查了好多资料都没满足。突然想起了前段时间看的有关线程的视频，于是信手拈来写了一个线程的代码片段。希望帮助刚学java线程的童鞋 package thread; import java.text.SimpleDateFormat; import java.util.Calendar; import java.util.Date
tomcat 容器 BlueSkator tomcat Web servlet
Tomcat的组成部分 1、server A Server element represents the entire Catalina servlet container. (Singleton) 2、service service包括多个connector以及一个engine，其职责为处理由connector获得的客户请求。 3、connector 一个connector
php递归,静态变量,匿名函数使用 dcj3sjt126com PHP 递归函数匿名函数静态变量引用传参
<!doctype html> <html lang="en"> <head> <meta charset="utf-8"> <title>Current To-Do List</title> </head> <body>
属性颜色字体变化周华华 JavaScript
function changSize(className){ var diva=byId("fot") diva.className=className; } </script> <style type="text/css"> .max{ background: #900; color:#039;
将properties内容放置到map中 g21121 properties
代码比较简单： private static Map<Object, Object> map; private static Properties p; static { //读取properties文件 InputStream is = XXX.class.getClassLoader().getResourceAsStream("xxx.properti
[简单]拼接字符串 53873039oycg 字符串
工作中遇到需要从Map里面取值拼接字符串的情况，自己写了个，不是很好，欢迎提出更优雅的写法，代码如下： import java.util.HashMap; import java.uti
Struts2学习云端月影
最近开始关注struts2的新特性，从这个版本开始，Struts开始使用convention-plugin代替codebehind-plugin来实现struts的零配置。配置文件精简了，的确是简便了开发过程，但是，我们熟悉的配置突然disappear了，真是一下很不适应。跟着潮流走吧，看看该怎样来搞定convention-plugin。使用Convention插件，你需要将其JAR文件放
Java新手入门的30个基本概念二 aijuans java 新手 java 入门
基本概念:　　1.OOP中唯一关系的是对象的接口是什么,就像计算机的销售商她不管电源内部结构是怎样的,他只关系能否给你提供电就行了,也就是只要知道can or not而不是how and why.所有的程序是由一定的属性和行为对象组成的,不同的对象的访问通过函数调用来完成,对象间所有的交流都是通过方法调用,通过对封装对象数据,很大限度上提高复用率。　　2.OOP中最重要的思想是类,类是模板是蓝图,
jedis 简单使用 antlove java redis cache command jedis
jedis.RedisOperationCollection.java package jedis; import org.apache.log4j.Logger; import redis.clients.jedis.Jedis; import java.util.List; import java.util.Map; import java.util.Set; pub
PL/SQL的函数和包体的基础百合不是茶 PL/SQL编程函数包体显示包的具体数据包
由于明天举要上课,所以刚刚将代码敲了一遍PL/SQL的函数和包体的实现(单例模式过几天好好的总结下再发出来);以便明天能更好的学习PL/SQL的循环,今天太累了,所以早点睡觉,明天继续PL/SQL总有一天我会将你永远的记载在心里,,, 函数; 函数:PL/SQL中的函数相当于java中的方法;函数有返回值定义函数的 --输入姓名找到该姓名的年薪 create or re
Mockito(二)--实例篇 bijian1013 持续集成 mockito 单元测试
学习了基本知识后，就可以实战了，Mockito的实际使用还是比较麻烦的。因为在实际使用中，最常遇到的就是需要模拟第三方类库的行为。比如现在有一个类FTPFileTransfer，实现了向FTP传输文件的功能。这个类中使用了a
精通Oracle10编程SQL(7)编写控制结构 bijian1013 oracle 数据库 plsql
/* *编写控制结构 */ --条件分支语句 --简单条件判断 DECLARE v_sal NUMBER(6,2); BEGIN select sal into v_sal from emp where lower(ename)=lower('&name'); if v_sal<2000 then update emp set
【Log4j二】Log4j属性文件配置详解 bit1129 log4j
如下是一个log4j.properties的配置 log4j.rootCategory=INFO, stdout , R log4j.appender.stdout=org.apache.log4j.ConsoleAppender log4j.appender.stdout.layout=org.apache.log4j.PatternLayout log4j.appe
java集合排序笔记白糖_ java
public class CollectionDemo implements Serializable,Comparable<CollectionDemo>{ private static final long serialVersionUID = -2958090810811192128L; private int id; private String nam
java导致linux负载过高的定位方法 ronin47
定位java进程ID 可以使用top或ps -ef |grep java ![图片描述][1] 根据进程ID找到最消耗资源的java pid 比如第一步找到的进程ID为5431 执行 top -p 5431 -H ![图片描述][2] 打印java栈信息 $ jstack -l 5431 > 5431.log 在栈信息中定位具体问题将消耗资源的Java PID转
给定能随机生成整数1到5的函数，写出能随机生成整数1到7的函数 bylijinnan 函数
import java.util.ArrayList; import java.util.List; import java.util.Random; public class RandNFromRand5 { /** 题目：给定能随机生成整数1到5的函数，写出能随机生成整数1到7的函数。解法1： f(k) = (x0-1)*5^0+(x1-
PL/SQL Developer保存布局 Kai_Ge
近日由于项目需要，数据库从DB2迁移到ORCAL，因此数据库连接客户端选择了PL/SQL Developer。由于软件运用不熟悉，造成了很多麻烦，最主要的就是进入后，左边列表有很多选项，自己删除了一些选项卡，布局很满意了，下次进入后又恢复了以前的布局，很是苦恼。在众多PL/SQL Developer使用技巧中找到如下这段： &n
[未来战士计划]超能查派[剧透,慎入] comsci 计划
非常好看,超能查派,这部电影......为我们这些热爱人工智能的工程技术人员提供一些参考意见和思想........ 虽然电影里面的人物形象不是非常的可爱....但是非常的贴近现实生活.... &nbs
Google Map API V2 dai_lm google map
以后如果要开发包含google map的程序就更麻烦咯 http://www.cnblogs.com/mengdd/archive/2013/01/01/2841390.html 找到篇不错的文章，大家可以参考一下 http://blog.sina.com.cn/s/blog_c2839d410101jahv.html 1. 创建Android工程由于v2的key需要G
java数据计算层的几种解决方法2 datamachine java sql 集算器
2、SQL SQL/SP/JDBC在这里属于一类，这是老牌的数据计算层，性能和灵活性是它的优势。但随着新情况的不断出现，单纯用SQL已经难以满足需求，比如： JAVA开发规模的扩大，数据量的剧增，复杂计算问题的涌现。虽然SQL得高分的指标不多，但都是权重最高的。成熟度：5星。最成熟的。
Linux下Telnet的安装与运行 dcj3sjt126com linux telnet
Linux下Telnet的安装与运行 linux默认是使用SSH服务的而不安装telnet服务如果要使用telnet 就必须先安装相应的软件包即使安装了软件包默认的设置telnet 服务也是不运行的需要手工进行设置如果是redhat9，则在第三张光盘中找到 telnet-server-0.17-25.i386.rpm
PHP中钩子函数的实现与认识 dcj3sjt126com PHP
假如有这么一段程序： function fun(){ fun1(); fun2(); } 首先程序执行完fun1()之后执行fun2()然后fun()结束。但是，假如我们想对函数做一些变化。比如说，fun是一个解析函数，我们希望后期可以提供丰富的解析函数，而究竟用哪个函数解析，我们希望在配置文件中配置。这个时候就可以发挥钩子的力量了。我们可以在fu
EOS中的WorkSpace密码修改蕃薯耀修改WorkSpace密码
EOS中BPS的WorkSpace密码修改 >>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>> 蕃薯耀 201
SpringMVC4零配置--SpringSecurity相关配置【SpringSecurityConfig】 hanqunfeng SpringSecurity
SpringSecurity的配置相对来说有些复杂，如果是完整的bean配置，则需要配置大量的bean，所以xml配置时使用了命名空间来简化配置，同样，spring为我们提供了一个抽象类WebSecurityConfigurerAdapter和一个注解@EnableWebMvcSecurity，达到同样减少bean配置的目的，如下： applicationContex
ie 9 kendo ui中ajax跨域的问题 jackyrong AJAX跨域
这两天遇到个问题，kendo ui的datagrid，根据json去读取数据，然后前端通过kendo ui的datagrid去渲染，但很奇怪的是，在ie 10,ie 11,chrome,firefox等浏览器中，同样的程序，浏览起来是没问题的，但把应用放到公网上的一台服务器，却发现如下情况： 1） ie 9下，不能出现任何数据，但用IE 9浏览器浏览本机的应用，却没任何问题
不要让别人笑你不能成为程序员 lampcy 编程程序员
在经历六个月的编程集训之后，我刚刚完成了我的第一次一对一的编码评估。但是事情并没有如我所想的那般顺利。说实话，我感觉我的脑细胞像被轰炸过一样。手慢慢地离开键盘，心里很压抑。不禁默默祈祷：一切都会进展顺利的，对吧？至少有些地方我的回答应该是没有遗漏的，是不是？难道我选择编程真的是一个巨大的错误吗——我真的永远也成不了程序员吗？我需要一点点安慰。在自我怀疑，不安全感和脆弱等等像龙卷风一
马皇后的贤德 nannan408
马皇后不怕朱元璋的坏脾气，并敢理直气壮地吹耳边风。众所周知，朱元璋不喜欢女人干政，他认为“后妃虽母仪天下，然不可使干政事”，因为“宠之太过，则骄恣犯分，上下失序”，因此还特地命人纂述《女诫》，以示警诫。但马皇后是个例外。　　有一次，马皇后问朱元璋道：“如今天下老百姓安居乐业了吗？”朱元璋不高兴地回答：“这不是你应该问的。”马皇后振振有词地回敬道：“陛下是天下之父，
选择某个属性值最大的那条记录（不仅仅包含指定属性，而是想要什么属性都可以） Rainbow702 sql group by 最大值 max 最大的那条记录
好久好久不写SQL了，技能退化严重啊！！！直入主题：比如我有一张表，file_info，它有两个属性（但实际不只，我这里只是作说明用）： file_code, file_version 同一个code可能对应多个version 现在，我想针对每一个code，取得它相关的记录中，version 值最大的那条记录， SQL如下： select *
VBScript脚本语言 tntxia VBScript
VBScript 是基于VB的脚本语言。主要用于Asp和Excel的编程。 VB家族语言简介 Visual Basic 6.0 源于BASIC语言。由微软公司开发的包含协助开发环境的事
java中枚举类型的使用 xiao1zhao2 java enum 枚举 1.5新特性
枚举类型是j2se在1.5引入的新的类型,通过关键字enum来定义,常用来存储一些常量. 1.定义一个简单的枚举类型 public enum Sex { MAN, WOMAN } 枚举类型本质是类,编译此段代码会生成.class文件.通过Sex.MAN来访问Sex中的成员,其返回值是Sex类型. 2.常用方法静态的values()方