史上最全的 java正则校验 家庭住址 手机号 姓名等

一下楼主整理了覆盖所有的正则表达式

这里是以xml的形式加载到内存使用的 –


<finders>
  <finder>
    <name>MobilePhonename>
    <pattern>^0?(13[0-9]|15[012356789]|17[013678]|18[0-9]|14[57])[0-9]{8}$
    pattern>
    <enabled>trueenabled>
  finder>
  <finder>
    <class>venus.dbmask.sensitive.pattern.CompositeCreditCardFinder
    class>
    <enabled>trueenabled>
  finder>  
    <finder>
        <name>Emailname>
        <pattern>[\w.]+@[\w.]+
        pattern>
        <enabled>trueenabled>
    finder>
    <finder>
        <name>CJK-Namename>
        <pattern>^(王|李|张|刘|陈|杨|黄|赵|吴|周|徐|孙|马|朱|胡|郭|何|高|林|罗|郑|梁|谢|宋|唐|许|韩|冯|邓|曹|彭|曾|肖|田|董|袁|潘|于|蒋|蔡|余|杜|叶|程|苏|魏|吕|丁|任|沈|姚|卢|姜|崔|钟|谭|陆|汪|范|金|石|廖|贾|夏|韦|傅|方|白|邹|孟|熊|秦|邱|江|尹|薛|闫|段|雷|侯|龙|史|黎|贺|顾|毛|郝|龚|邵|万|钱|覃|武|戴|孔|汤|庞|樊|兰|殷|施|陶|洪|翟|安|颜|倪|严|牛|温|芦|季|俞|章|鲁|葛|伍|申|尤|毕|聂|柴|焦|向|柳|邢|岳|齐|沿|梅|莫|庄|辛|管|祝|左|涂|谷|祁|时|舒|耿|牟|卜|路|詹|关|苗|凌|费|纪|靳|盛|童|欧|甄|项|曲|成|游|阳|裴|席|卫|查|屈|鲍|位|覃|霍|翁|隋|植|甘|景|薄|单|包|司|柏|宁|柯|阮|桂|闵|欧阳|解|强|丛|华|车|冉|房|边|辜|吉|饶|刁|瞿|戚|丘|古|米|池|滕|晋|苑|邬|臧|畅|宫|来|嵺|苟|全|褚|廉|简|娄|盖|符|奚|木|穆|党|燕|郎|邸|冀|谈|姬|屠|连|郜|晏|栾|郁|商|蒙|计|喻|揭|窦|迟|宇|敖|糜|鄢|冷|卓|花|艾|蓝|都|巩|稽|井|练|仲|乐|虞|卞|封|竺|冼|原|官|衣|楚|佟|栗|匡|宗|应|台|巫|鞠|僧|桑|荆|谌|银|扬|明|沙|薄|伏|岑|习|胥|保|和|蔺|水|云|昌|凤|酆|常|皮|康|元|平|萧|湛|禹|无|贝|茅|麻|危|骆|支|咎|经|裘|缪|干|宣|贲|杭|诸|钮|嵇|滑|荣|荀|羊|於|惠|家|芮|羿|储|汲|邴|松|富|乌|巴|弓|牧|隗|山|宓|蓬|郗|班|仰|秋|伊|仇|暴|钭|厉|戎|祖|束|幸|韶|蓟|印|宿|怀|蒲|鄂|索|咸|籍|赖|乔|阴|能|苍|双|闻|莘|贡|逢|扶|堵|宰|郦|雍|却|璩|濮|寿|通|扈|郏|浦|尚|农|别|阎|充|慕|茹|宦|鱼|容|易|慎|戈|庚|终|暨|居|衡|步|满|弘|国|文|寇|广|禄|阙|东|殴|殳|沃|利|蔚|越|夔|隆|师|厍|晃|勾|融|訾|阚|那|空|毋|乜|养|须|丰|巢|蒯|相|后|红|权逯|盖益|桓|公|万俟|司马|上官|夏侯|诸葛|闻人|东方|赫连|皇甫|尉迟|公羊|澹台|公冶|宗政|濮阳|淳于|单于|太叔|申屠|公孙|仲孙|轩辕|令狐|钟离|宇文|长孙|慕容|鲜于|闾丘|司徒|司空|亓官|司寇|仉|督|子车|颛孙|端木|巫马|公西|漆雕|乐正|壤驷|公良|拓跋|夹谷|宰父|谷粱|法|汝|钦|段干|百里|东郭|南门|呼延|归海|羊舌|微生|帅|缑|亢|况|郈|琴|梁丘|左丘|东门|西门|佘|佴|伯|赏|南宫|墨|哈|谯|笪|年|爱|仝|代)\s*[\u4e00-\u9fa5]{1,3}
        pattern>
        <enabled>trueenabled>
    finder>
    <finder>
        <class>io.dataapps.chlorine.pattern.CompositeCreditCardFinder
        class>
        <enabled>trueenabled>
    finder>
    <finder>
        <name>USPhone-Formattedname>
        <pattern>[\()\.]*[0-9]{3}[\)\ \-\.]{1,2}[0-9]{3}[\-\ \.][0-9]{4}
        pattern>
        <enabled>trueenabled>
    finder>
    <finder>
        <name>IPV4name>
        <pattern>(?:[0-9]{1,3}\.){3}[0-9]{1,3}
        pattern>
        <enabled>trueenabled>
    finder>
    <finder>
    <name>CJK-Addressname>
    <pattern>^([\u4E00-\u9FA5A-Za-z0-9_]+(省|市|区|县|道|路|街|号)){2,}$
    pattern>
    <flags>10flags>
    <enabled>trueenabled>
  finder>
    <finder>
        <name>USStreetAddressname>
        <pattern>
            \d+(\s[A-Z0-9\.]+?){1,3}\s(RD|ROAD|DR|DRIVE|AVE|AVENUE|APT|APARTMENT|PL|PLACE|ST|STREET)
        pattern>
        <flags>10flags>
        <enabled>trueenabled>
    finder>
    <finder>
        <name>SSN-spacesname>
        <pattern>[0-9]{3}\s[0-9]{2}\s[0-9]{4}pattern>
        <enabled>trueenabled>
    finder>
    <finder>
        <name>SSN-dashesname>
        <pattern>[0-9]{3}\-[0-9]{2}\-[0-9]{4}pattern>
        <enabled>trueenabled>
    finder>
    <finder>
        <name>IPV6name>
        <pattern>
        pattern>
        <enabled>trueenabled>
    finder>
    <finder>
        <name>URLname>
        <pattern>
        pattern>
        <enabled>falseenabled>
    finder>
    <finder>
        <name>Hostnamename>
        <pattern>\b(([A-Za-z]|[A-Za-z][A-Za-z0-9\-]*[A-Za-z0-9])\.)+([A-Za-z0-9]|[A-Za-z0-9][A-Za-z0-9\-]*[A-Za-z0-9])\b
        pattern>
        <enabled>trueenabled>
    finder>
    <finder>
        <name>ZipCodename>
        <pattern>\b[0-9]{5}(?:-[0-9]{4})?\bpattern>
        <enabled>falseenabled>
    finder>
    <finder>
        <name>UKPhoneNumbername>
        <pattern>^((\(?0\d{4}\)?\s?\d{3}\s?\d{3})|(\(?0\d{3}\)?\s?\d{3}\s?\d{4})|(\(?0\d{2}\)?\s?\d{4}\s?\d{4}))(\s?\#(\d{4}|\d{3}))?$
        pattern>
        <enabled>falseenabled>
    finder>
    <finder>
        <name>18_CERTIDname>
        <pattern>^[1-9]\d{5}(18|19|([23]\d))\d{2}((0[1-9])|(10|11|12))(([0-2][1-9])|10|20|30|31)\d{3}[0-9Xx]$
        pattern>
        <enabled>trueenabled>
    finder>
    <finder>
        <name>15_CERTIDname>
        <pattern>(1[1-5]|2[1-3]|3[1-7]|4[1-6]|5[0-4]|6[1-5])[0-9]{4}[1-9][0-9]((01|03|05|07|08|10|12)(0[1-9]|[12][0-9]|3[01])|(01|04|06|09|11)(0[1-9]|[12][0-9]|30)|(02)(0[1-9]|[12][0-9]))[0-9]{3}
        pattern>
        <enabled>trueenabled>
    finder>
    <finder>
        <name>PhoneNumbername>
        <pattern>(1[34578][0-9]{9}|0[0-9]{2}-[0-9]{8}|0[0-9]{3}-[0-9]{7})
        pattern>
        <enabled>trueenabled>
    finder>
    <finder>
        <name>BankCardname>
        <pattern>\d{16,19}
        pattern>
        <enabled>trueenabled>
    finder>
    <finder>
        <name>BusinessCodename>
        <pattern>^((1[1-5]|2[1-3]|3[1-7]|4[1-6]|5[0-4]|6[1-5])[0-9]{4}|100000)[0-5][1-5][0-2][0-9]{6}$
        pattern>
        <enabled>trueenabled>
    finder>
    <finder>
        <name>OrgCodename>
        <pattern>^[A-Z0-9]{8}-?[0-9X]$
        pattern>
        <enabled>trueenabled>
    finder>
    <finder>
        <name>UnifyUnifyCodename>
        <pattern>^([15][1239]|9[123]|Y1)(1[1-5]|2[1-3]|3[1-7]|4[1-6]|5[0-4]|6[1-5])[0-9]{4}[0-9A-Z]{8}[0-9X][0-9A-HJKLMNPQRTUWXY]$
        pattern>
        <enabled>trueenabled>
    finder>
finders>

你可能感兴趣的:(正则表达式抓取网页)