weixin_34194359

WordNet-Based Similarity Evaluation for English Synonyms and Near-Synonyms, with a Code Implementation

Source code: https://github.com/XBWer/WordSimilarity

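1. Defining the problem and why it matters

When classifying code snippets, we can rely on the fact that programmers tend to follow conventions when naming variables. Within the code for one specific piece of business logic, several variable names are often related or similar (for example, a class that handles "trade" may also contain the synonyms "business", "transaction", and "deal"); in some cases they express the same meaning with different words. To classify code snippets more reliably, we need to recognize such synonyms, which means finding a method that avoids the misclassification they cause. Put plainly, the task is to measure the similarity between words (that is, to decide whether they are near-synonyms) and to find the group of meanings that occurs most often in a code snippet.

2. The desired result

Given a code snippet, the program should detect which words are synonyms of each other and classify the snippet accordingly.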

E.g.

public static void function() {
    String trade = "money";
    int deal = 5;
    long business = 0xfffffff;
    boolean transaction = true;
    ……
}

Output: the synonyms are: trade, deal, business, transaction

This code is very likely related to "trade".
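One way to picture this behaviour is the sketch below: it groups identifiers whose pairwise similarity reaches a threshold and then reports the largest group. The class name `SynonymGrouper`, the greedy grouping rule, the threshold, and the toy similarity function are all assumptions made for illustration; in the real project the similarity would come from a WordNet-based measure.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Collections;
import java.util.HashSet;
import java.util.List;
import java.util.Set;
import java.util.function.ToDoubleBiFunction;

/** Hypothetical helper: groups identifiers whose pairwise similarity passes a threshold. */
final class SynonymGrouper {
    /** Greedy grouping: a word joins the first group that contains a sufficiently similar word. */
    static List<List<String>> group(List<String> words,
                                    ToDoubleBiFunction<String, String> sim,
                                    double threshold) {
        List<List<String>> groups = new ArrayList<>();
        for (String w : words) {
            List<String> home = null;
            for (List<String> g : groups) {
                for (String member : g) {
                    if (sim.applyAsDouble(w, member) >= threshold) { home = g; break; }
                }
                if (home != null) break;
            }
            if (home == null) { home = new ArrayList<>(); groups.add(home); }
            home.add(w);
        }
        return groups;
    }

    /** Returns the largest group (the first one on ties), or an empty list if there are none. */
    static List<String> largest(List<List<String>> groups) {
        List<String> best = Collections.emptyList();
        for (List<String> g : groups) if (g.size() > best.size()) best = g;
        return best;
    }

    public static void main(String[] args) {
        // Toy similarity standing in for a WordNet-based measure (assumption).
        Set<String> tradeLike = new HashSet<>(Arrays.asList("trade", "deal", "business", "transaction"));
        ToDoubleBiFunction<String, String> toySim =
            (a, b) -> (a.equals(b) || (tradeLike.contains(a) && tradeLike.contains(b))) ? 1.0 : 0.0;
        List<String> names = Arrays.asList("trade", "count", "deal", "business", "index", "transaction");
        System.out.println(largest(group(names, toySim, 0.5)));
    }
}
```

With the toy similarity above, the four trade-related names end up in one group, which is also the largest one.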

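3. A first look at WordNet

With the problem defined, a web search turned up two relevant terms: WordNet and word2vec. (In hindsight, that search was itself an exercise in finding near-synonyms.)

3.1 What WordNet is

Let's start with WordNet. Here is an introduction I found:

WordNet is an English dictionary built and maintained by the Cognitive Science Laboratory at Princeton University under the direction of psychology professor George A. Miller. Development began in 1985, and the project has since received more than 3 million US dollars in funding (mainly from government agencies interested in machine translation).

Because it contains semantic information, it differs from a dictionary in the usual sense. WordNet groups entries by meaning; each group of entries with the same meaning is called a synset (synonym set). WordNet provides a short, general definition for each synset and records the semantic relations between synsets.

WordNet was developed with two goals:

To be both a dictionary and a thesaurus, and easier to use than either one alone.

To support automatic text analysis and artificial intelligence applications.

Internal structure of WordNet

In WordNet, nouns, verbs, adjectives, and adverbs are each organized into a network of synsets. Each synset represents one basic semantic concept, and the synsets are linked by various relations. (A polysemous word appears in the synset of each of its meanings.) In the first versions of WordNet (labelled 1.x), the networks for the four parts of speech were not connected to each other. The noun network was the first to be developed.

The backbone of the noun network is the subsumption hierarchy (hypernym/hyponym relations), which accounts for nearly 80% of all relations. At the top of the hierarchy are 11 abstract concepts called unique beginners, for example entity ("a concrete thing, living or non-living") and psychological feature ("a mental feature of a living organism"). The deepest level of the noun hierarchy is 16 nodes.

(Wikipedia)

Put simply, WordNet is a well-structured knowledge base: besides the functions of an ordinary dictionary, it also carries classification information for words. WordNet-based methods are by now relatively mature, for example the path-based method (lch) and the information-theoretic method (res). (See the references for details.)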
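To make the synset idea concrete, here is a minimal in-memory sketch. The `ToySynsets` class and its two entries are illustrative assumptions, loosely modelled on WordNet entries rather than read from the WordNet database. It shows how a polysemous word such as "trade" belongs to several synsets, and how two words count as synonyms when they share one.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Collections;
import java.util.HashMap;
import java.util.LinkedHashSet;
import java.util.List;
import java.util.Map;
import java.util.Set;

/** Toy model of synsets; the data is illustrative and not loaded from WordNet. */
final class ToySynsets {
    /** A synset: a short gloss plus the words that share that meaning. */
    static final class Synset {
        final String gloss;
        final Set<String> words;
        Synset(String gloss, String... words) {
            this.gloss = gloss;
            this.words = new LinkedHashSet<>(Arrays.asList(words));
        }
    }

    /** Builds a word -> synsets index, so a polysemous word maps to several synsets. */
    static Map<String, List<Synset>> index(List<Synset> synsets) {
        Map<String, List<Synset>> byWord = new HashMap<>();
        for (Synset s : synsets) {
            for (String w : s.words) {
                byWord.computeIfAbsent(w, k -> new ArrayList<>()).add(s);
            }
        }
        return byWord;
    }

    /** In this model two words are synonyms if they share at least one synset. */
    static boolean synonyms(Map<String, List<Synset>> byWord, String a, String b) {
        for (Synset s : byWord.getOrDefault(a, Collections.<Synset>emptyList())) {
            if (s.words.contains(b)) return true;
        }
        return false;
    }

    public static void main(String[] args) {
        List<Synset> data = Arrays.asList(
            new Synset("a particular instance of buying or selling", "trade", "deal", "business deal"),
            new Synset("the skilled practice of a practical occupation", "trade", "craft"));
        Map<String, List<Synset>> byWord = index(data);
        System.out.println(byWord.get("trade").size());          // synsets that contain "trade"
        System.out.println(synonyms(byWord, "trade", "deal"));   // share a synset
        System.out.println(synonyms(byWord, "deal", "craft"));   // no shared synset
    }
}
```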

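3.2 Installing and configuring WordNet

With WordNet we have the word base we need. So let's set similarity computation aside for now and download WordNet first.

Following http://hi.baidu.com/buptyoyo/item/f13dfe463c061e3afb896028, I downloaded and installed it and ran the demo without trouble.

Next, let's look at WordNet's file structure:

The bin directory contains the executable WordNet 2.1.exe:

As you can see, WordNet classifies the English words it covers and arranges them into a semantic tree. In this example: entity -> abstract entity -> abstraction -> attribute -> state -> feeling -> emotion -> love;

The path from the leaf node (love) to the root node (entity)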
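The leaf-to-root walk can be sketched with a plain child-to-parent map. The `HypernymPath` class is a hypothetical stand-in that uses strings instead of WordNet synsets; the chain is the one listed above.

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.HashSet;
import java.util.List;
import java.util.Map;
import java.util.Set;

/** Hypothetical sketch: follows hypernym (parent) links from a leaf up to the root. */
final class HypernymPath {
    /** Returns the path leaf first, root last; stops early if a cycle is detected. */
    static List<String> pathToRoot(Map<String, String> parentOf, String leaf) {
        List<String> path = new ArrayList<>();
        Set<String> seen = new HashSet<>();
        for (String node = leaf; node != null && seen.add(node); node = parentOf.get(node)) {
            path.add(node);
        }
        return path;
    }

    public static void main(String[] args) {
        String[] chain = {"entity", "abstract entity", "abstraction", "attribute",
                          "state", "feeling", "emotion", "love"};
        Map<String, String> parentOf = new HashMap<>();
        for (int i = 1; i < chain.length; i++) {
            parentOf.put(chain[i], chain[i - 1]);   // each node points to its hypernym
        }
        System.out.println(String.join(" -> ", pathToRoot(parentOf, "love")));
    }
}
```

Path-based measures such as lch work on exactly this kind of structure: the fewer links separate two nodes through a shared ancestor, the more similar they are taken to be.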

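The 25 basic classes in WordNet's noun classification:

The dict directory holds the actual database; as you can see, it is organized by adjective, adverb, noun, and verb:

doc contains WordNet's user manual files

lib contains the function libraries through which the WordNet software uses Windows resources

src contains the source code

4. The general approach

We take WordNet's lexical-semantic classification as the basis, extract the synonyms from it, and then compute similarity with a vector-space-based method. The workflow is as follows:

5. WordNet-based similarity computation

The following is excerpted from "基于WordNet的英语词语相似度计算" (WordNet-Based Computation of English Word Similarity):

5.1 Feature extraction

5.2 Computing sense similarity and word similarity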
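As a rough illustration of what a vector-space similarity between two senses can look like, the sketch below computes the cosine similarity between two sparse feature-count vectors. The `CosineSketch` class, the feature names, and the choice of cosine are assumptions for illustration, not the cited paper's exact definitions.

```java
import java.util.HashMap;
import java.util.Map;

/** Hypothetical sketch of a vector-space similarity between two feature vectors. */
final class CosineSketch {
    /** Cosine similarity between two sparse feature-count vectors; 0 if either vector is empty. */
    static double cosine(Map<String, Double> a, Map<String, Double> b) {
        double dot = 0, normA = 0, normB = 0;
        for (Map.Entry<String, Double> e : a.entrySet()) {
            normA += e.getValue() * e.getValue();
            Double other = b.get(e.getKey());
            if (other != null) dot += e.getValue() * other;   // only shared features contribute
        }
        for (double v : b.values()) normB += v * v;
        return (normA == 0 || normB == 0) ? 0 : dot / Math.sqrt(normA * normB);
    }

    public static void main(String[] args) {
        // Made-up feature counts for two senses (not taken from WordNet).
        Map<String, Double> sense1 = new HashMap<>();
        sense1.put("commerce", 2.0);
        sense1.put("exchange", 1.0);
        Map<String, Double> sense2 = new HashMap<>();
        sense2.put("commerce", 1.0);
        sense2.put("agreement", 1.0);
        System.out.println(cosine(sense1, sense2));
    }
}
```

Senses that share no features score 0, and a sense compared with itself scores 1.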

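6. Results

Similarity compared with "trade":

Analysis:

First, look at the first pair: trade vs trade

A word is of course 100% similar to itself.

Now the second pair: trade#n#5 vs deal#n#1

The similarity is the same as for the first pair! According to the result, the 5th noun sense of trade and the 1st noun sense of deal mean exactly the same thing. Let's check in the database:

trade#n#5:

deal#n#1:

Now a pair that is harder to understand:

trade#n#7 vs deal#n#2

Their similarity is above 0.14, which is fairly high. Why?

trade#n#7:

sunshine#n#2:

I'm sure you can see why.

Similarity compared with "cat":

7. Code walkthrough

Project structure:

Test.java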

package JWordNetSim.test;

import java.io.FileInputStream;
import java.util.HashMap;
import java.util.Map;

import net.didion.jwnl.JWNL;
import net.didion.jwnl.data.IndexWord;
import net.didion.jwnl.data.POS;
import net.didion.jwnl.dictionary.Dictionary;
import shef.nlp.wordnet.similarity.SimilarityMeasure;

/**
 * A simple test of this WordNet similarity library.
 * @author Mark A. Greenwood
 */
public class Test
{
    public static void main(String[] args) throws Exception
    {
        //WordNet 2.0 must be installed on this machine before running this code;
        //only 2.0 works, installing 2.1 causes errors
        JWNL.initialize(new FileInputStream("D:\\JAVAProjectWorkSpace\\jwnl\\JWordNetSim\\test\\wordnet.xml"));

        //create a map to hold the configuration parameters
        Map<String,String> params = new HashMap<String,String>();

        //the simType parameter is the class name of the measure to use
        params.put("simType","shef.nlp.wordnet.similarity.JCn");

        //this param should be the URL to an infocontent file (if required
        //by the similarity measure being loaded)
        params.put("infocontent","file:D:\\JAVAProjectWorkSpace\\jwnl\\JWordNetSim\\test\\ic-bnc-resnik-add1.dat");

        //this param should be the URL to a mapping file if the
        //user needs to make synset mappings
        params.put("mapping","file:D:\\JAVAProjectWorkSpace\\jwnl\\JWordNetSim\\test\\domain_independent.txt");

        //create the similarity measure
        SimilarityMeasure sim = SimilarityMeasure.newInstance(params);

        //look up the words
//        Dictionary dict = Dictionary.getInstance();
//        IndexWord word1 = dict.getIndexWord(POS.NOUN, "trade");            //trade and dog are both treated strictly as nouns here
//        IndexWord word2 = dict.getIndexWord(POS.NOUN,"dog");
//
//        //and get the similarity between the first senses of each word
//        System.out.println(word1.getLemma()+"#"+word1.getPOS().getKey()+"#1  " + word2.getLemma()+"#"+word2.getPOS().getKey()+"#1  " + sim.getSimilarity(word1.getSense(1), word2.getSense(1)));

//        //get similarity using the string methods (note this also makes use
//        //of the fake root node)
//        System.out.println(sim.getSimilarity("trade#n","deal#n"));

        //get a similarity that involves a mapping
        System.out.println(sim.getSimilarity("trade", "trade"));
        System.out.println(sim.getSimilarity("trade", "deal"));
        System.out.println(sim.getSimilarity("trade", "commerce"));
        System.out.println(sim.getSimilarity("trade", "transaction"));
        System.out.println(sim.getSimilarity("trade", "finance"));
        System.out.println(sim.getSimilarity("trade", "financial"));
        System.out.println(sim.getSimilarity("trade", "business"));
        System.out.println(sim.getSimilarity("trade", "economy"));
        System.out.println(sim.getSimilarity("trade", "school"));
        System.out.println(sim.getSimilarity("trade", "dog"));
        System.out.println(sim.getSimilarity("trade", "cat"));
        System.out.println(sim.getSimilarity("trade", "book"));
        System.out.println(sim.getSimilarity("trade", "sunshine"));
        System.out.println(sim.getSimilarity("trade", "smile"));
        System.out.println(sim.getSimilarity("trade", "nice"));
        System.out.println(sim.getSimilarity("trade", "hardly"));
        System.out.println(sim.getSimilarity("trade", "beautiful"));
    }
}

SimilarityMeasure.java

  1 package shef.nlp.wordnet.similarity;
  2 
  3 import java.io.BufferedReader;
  4 import java.io.InputStreamReader;
  5 import java.net.URL;
  6 import java.util.Arrays;
  7 import java.util.HashMap;
  8 import java.util.HashSet;
  9 import java.util.LinkedHashMap;
 10 import java.util.Map;
 11 import java.util.Set;
 12 
 13 import net.didion.jwnl.JWNLException;
 14 import net.didion.jwnl.data.IndexWord;
 15 import net.didion.jwnl.data.POS;
 16 import net.didion.jwnl.data.Synset;
 17 import net.didion.jwnl.dictionary.Dictionary;
 18 
 19 /**
 20  * An abstract notion of a similarity measure that all provided
 21  * implementations extend.
 22  * @author Mark A. Greenwood
 23  */
 24 public abstract class SimilarityMeasure
 25 {    
 26     /**
 27      * A mapping of terms to specific synsets. Usually used to map domain
 28      * terms to a restricted set of synsets but can also be used to map
 29      * named entity tags to appropriate synsets.
 30      */
 31     private Map<String,Set<Synset>> domainMappings = new HashMap<String,Set<Synset>>();
 32     
 33     /**
 34      * The maximum size the cache can grow to
 35      */
 36     private int cacheSize = 5000;
 37     
 38     /**
 39      * To speed up computation of the similarity between two synsets
 40      * we cache each similarity that is computed so we only have to
 41      * do each one once.
 42      */
 43     private Map<String,Double> cache = new LinkedHashMap<String,Double>(16,0.75f,true)
 44     {
 45         public boolean removeEldestEntry(Map.Entry<String,Double> eldest)
 46         {
 47             //if the size is less than zero then the user is asking us
 48             //not to limit the size of the cache so return false
 49             if (cacheSize < 0) return false;
 50             
 51             //if the cache has grown bigger than its max size return true
 52             return size() > cacheSize;
 53         }
 54     }; 
 55     
 56     /**
 57      * Get a previously computed similarity between two synsets from the cache.
 58      * @param s1 the first synset between which we are looking for the similarity.
 59      * @param s2 the other synset between which we are looking for the similarity.
 60      * @return The similarity between the two sets or null
 61      *         if it is not in the cache.
 62      */
 63     protected final Double getFromCache(Synset s1, Synset s2)
 64     {
 65         return cache.get(s1.getKey()+"-"+s2.getKey());
 66     }
 67     
 68     /**
 69      * Add a computed similarity between two synsets to the cache so that
 70      * we don't have to compute it if it is needed in the future.
 71      * @param s1 one of the synsets between which we are storing a similarity.
 72      * @param s2 the other synset between which we are storing a similarity.
 73      * @param sim the similarity between the two supplied synsets.
 74      * @return the similarity score just added to the cache.
 75      */
 76     protected final double addToCache(Synset s1, Synset s2, double sim)
 77     {
 78         cache.put(s1.getKey()+"-"+s2.getKey(),sim);
 79         
 80         return sim;
 81     }
 82     
 83     /**
 84      * Configures the similarity measure using the supplied parameters.
 85      * @param params a set of key-value pairs that are used to configure
 86      *        the similarity measure. See concrete implementations for details
 87      *        of expected/possible parameters. 
 88      * @throws Exception if an error occurs while configuring the similarity measure.
 89      */
 90     protected abstract void config(Map<String,String> params) throws Exception;
 91     
 92     /**
 93      * Create a new instance of a similarity measure.
 94      * @param confURL the URL of a configuration file. Parameters are specified
 95      *        one per line as key:value pairs.
 96      * @return a new instance of a similarity measure as defined by the
 97      *         supplied configuration URL.
 98      * @throws Exception if an error occurs while creating the similarity measure.
 99      */
100     public static SimilarityMeasure newInstance(URL confURL) throws Exception
101     {
102         //create map to hold the key-value pairs we are going to read from
103         //the configuration file
104         Map<String,String> params = new HashMap<String,String>();
105         
106         //create a reader for the config file
107         BufferedReader in = null;
108         
109         try
110         {
111             //open the config file
112             in = new BufferedReader(new InputStreamReader(confURL.openStream()));
113                     
114             String line = in.readLine();
115             while (line != null)
116             {
117                 line = line.trim();
118                 
119                 if (!line.equals(""))
120                 {
121                     //if the line contains something then
122                     
123                     //split the data so we get the key and value
124                     String[] data = line.split("\\s*:\\s*",2);
125                     
126                     if (data.length == 2)
127                     {
128                         //if the line is valid add the two parts to the map
129                         params.put(data[0], data[1]);
130                     }
131                     else
132                     {
133                         //if the line isn't valid tell the user but continue on
134                         //with the rest of the file
135                         System.out.println("Config Line is Malformed: " + line);
136                     }
137                 }
138                 
139                 //get the next line ready to process
140                 line = in.readLine();
141             }
142         }
143         finally
144         {
145             //close the config file if it got opened
146             if (in != null) in.close();
147         }
148         
149         //create and return a new instance of the similarity measure specified
150         //by the config file
151         return newInstance(params);
152     }
153     
154     /**
155      * Creates a new instance of a similarity measure using the supplied parameters.
156      * @param params a set of key-value pairs which define the similarity measure.
157      * @return the newly created similarity measure.
158      * @throws Exception if an error occurs  while creating the similarity measure.
159      */
160     public static SimilarityMeasure newInstance(Map<String,String> params) throws Exception
161     {
162         //get the class name of the implementation we need to load
163         String name = params.remove("simType");
164         
165         //if the name hasn't been specified then throw an exception
166         if (name == null) throw new Exception("Must specify the similarity measure to use");
167         
168         //Get hold of the class we need to load
169         @SuppressWarnings("unchecked") Class<SimilarityMeasure> c = (Class<SimilarityMeasure>)Class.forName(name);
170         
171         //create a new instance of the similarity measure
172         SimilarityMeasure sim = c.newInstance();
173         
174         //get the cache parameter from the config params
175         String cSize = params.remove("cache");
176         
177         //if a cache size was specified then set it
178         if (cSize != null) sim.cacheSize = Integer.parseInt(cSize);
179         
180         //get the url of the domain mapping file
181         String mapURL = params.remove("mapping");
182         
183         if (mapURL != null)
184         {
185             //if a mapping file has been provided then 
186                         
187             //open a reader over the file
188             BufferedReader in = new BufferedReader(new InputStreamReader((new URL(mapURL)).openStream()));
189             
190             //get the first line ready for processing
191             String line = in.readLine();
192             
193             while (line != null)
194             {
195                 if (!line.startsWith("#"))
196                 {
197                     //if the line isn't a comment (i.e. it doesn't start with #) then...
198                     
199                     //split the line at the white space
200                     String[] data = line.trim().split("\\s+");
201                     
202                     //create a new set to hold the mapped synsets
203                     Set<Synset> mappedTo = new HashSet<Synset>();
204                     
205                     for (int i = 1 ; i < data.length ; ++i)
206                     {
207                         //for each synset mapped to get the actual Synsets
208                         //and store them in the set
209                         mappedTo.addAll(sim.getSynsets(data[i]));
210                     }
211                     
212                     //if we have found some actual synsets then
213                     //store them in the domain mappings
214                     if (mappedTo.size() > 0) sim.domainMappings.put(data[0], mappedTo);
215                 }
216                 
217                 //get the next line from the file
218                 line = in.readLine();
219             }
220             
221             //we have finished with the mappings file so close it
222             in.close();
223         }        
224         
225         //make sure it is configured properly
226         sim.config(params);
227         
228         //then return it
229         return sim;
230     }
231     
232     /**
233      * This is the method responsible for computing the similarity between two
234      * specific synsets. The method is implemented differently for each
235      * similarity measure so see the subclasses for detailed information.
236      * @param s1 one of the synsets between which we want to know the similarity.
237      * @param s2 the other synset between which we want to know the similarity.
238      * @return the similarity between the two synsets.
239      * @throws JWNLException if an error occurs accessing WordNet.
240      */
241     public abstract double getSimilarity(Synset s1, Synset s2) throws JWNLException;
242     
243     /**
244      * Get the similarity between two words. The words can be specified either
245      * as just the word or in an encoded form including the POS tag and possibly
246      * the sense number, i.e. cat#n#1 would specify the 1st sense of the noun cat.
247      * @param w1 one of the words to compute similarity between.
248      * @param w2 the other word to compute similarity between.
249      * @return a SimilarityInfo instance detailing the similarity between the
250      *         two words specified.
251      * @throws JWNLException if an error occurs accessing WordNet.
252      */
253     public final SimilarityInfo getSimilarity(String w1, String w2) throws JWNLException
254     {
255         //Get the (possibly) multiple synsets associated with each word
256         Set<Synset> ss1 = getSynsets(w1);
257         Set<Synset> ss2 = getSynsets(w2);
258                 
259         //assume the words are not at all similar
260         SimilarityInfo sim = null;
261         
262         for (Synset s1 : ss1)
263         {
264             for (Synset s2 : ss2)
265             {
266                 //for each pair of synsets get the similarity
267                 double score = getSimilarity(s1, s2);
268                                 
269                 if (sim == null || score > sim.getSimilarity())
270                 {
271                     //if the similarity is better than we have seen before
272                     //then create and store an info object describing the
273                     //similarity between the two synsets
274                     sim = new SimilarityInfo(w1, s1, w2, s2, score);
275                 }
276             }
277         }
278         
279         //return the maximum similarity we have found
280         return sim;    
281     }
282     
283     /**
284      * Finds all the synsets associated with a specific word.
285      * @param word the word we are interested. Note that this may be encoded
286      *        to include information on POS tag and sense index.
287      * @return a set of synsets that are associated with the supplied word
288      * @throws JWNLException if an error occurs accessing WordNet
289      */
290     private final Set<Synset> getSynsets(String word) throws JWNLException
291     {        
292         //get a handle on the WordNet dictionary
293         Dictionary dict = Dictionary.getInstance();
294         
295         //create an empty set to hold any synsets we find
296         Set<Synset> synsets = new HashSet<Synset>();
297         
298         //split the word on the # characters so we can get at the
299         //up to three components that could be present: word, POS tag, sense index
300         String[] data = word.split("#");
301         
302         //if the word is in the domainMappings then simply return the mappings
303         if (domainMappings.containsKey(data[0])) return domainMappings.get(data[0]);
304         
305         if (data.length == 1)
306         {
307             //if there is just the word
308                 
309             for (IndexWord iw : dict.lookupAllIndexWords(data[0]).getIndexWordArray())
310             {
311                 //for each matching word in WordNet add all its senses to
312                 //the set we are building up
313                 synsets.addAll(Arrays.asList(iw.getSenses()));
314             }
315             
316             //we have finished so return the synsets we found
317             return synsets;
318         }
319     
320         //the calling method specified a POS tag as well so get that
321         POS pos = POS.getPOSForKey(data[1]);
322         
323         //if the POS tag isn't valid throw an exception
324         if (pos == null) throw new JWNLException("Invalid POS Tag: " + data[1]);
325         
326         //get the word with the specified POS tag from WordNet
327         IndexWord iw = dict.getIndexWord(pos, data[0]);
328         
329         if (data.length > 2)
330         {
331             //if the calling method specified a sense index then
332             //add just that synset to the set we are creating
333             synsets.add(iw.getSense(Integer.parseInt(data[2])));
334         }
335         else
336         {
337             //no sense index was specified so add all the senses of
338             //the word to the set we are creating
339             synsets.addAll(Arrays.asList(iw.getSenses()));
340         }
341         
342         //return the set of synsets we found for the specified word
343         return synsets;
344     }
345 }

Every function is commented in detail, so the code should be easy to follow.
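One detail worth isolating is the cache: an access-ordered LinkedHashMap that overrides removeEldestEntry behaves as a small LRU cache. The standalone sketch below shows the same pattern outside the library; the `LruCacheSketch` class and its `lru` helper are hypothetical names, and the string keys only imitate the "key1-key2" format used in the listing.

```java
import java.util.LinkedHashMap;
import java.util.Map;

/** Hypothetical standalone version of the LRU-cache pattern used by SimilarityMeasure. */
final class LruCacheSketch {
    /**
     * Builds an access-ordered map that evicts the least recently used entry
     * once it holds more than maxSize entries; a negative maxSize means no limit.
     */
    static <K, V> Map<K, V> lru(final int maxSize) {
        return new LinkedHashMap<K, V>(16, 0.75f, true) {
            @Override
            protected boolean removeEldestEntry(Map.Entry<K, V> eldest) {
                return maxSize >= 0 && size() > maxSize;
            }
        };
    }

    public static void main(String[] args) {
        Map<String, Double> cache = lru(2);
        cache.put("a-b", 0.1);
        cache.put("a-c", 0.2);
        cache.get("a-b");        // touching a-b makes a-c the least recently used entry
        cache.put("a-d", 0.3);   // exceeds the limit, so a-c is evicted
        System.out.println(cache.keySet());
    }
}
```

The third constructor argument (`true`) is what switches the map from insertion order to access order; without it the eviction would be first-in, first-out rather than least recently used.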

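The loop on lines 262-277 (the nested iteration over synset pairs in getSimilarity(String, String)) proceeds as follows: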
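In code terms, the loop scores every pair of senses and keeps the best-scoring pair. Here is a minimal standalone sketch of the same idea, using plain strings in place of JWNL Synset objects and a made-up score table; the `BestSensePair` class and the numbers are assumptions for illustration only.

```java
import java.util.Arrays;
import java.util.HashMap;
import java.util.List;
import java.util.Map;
import java.util.function.ToDoubleBiFunction;

/** Hypothetical sketch of "take the maximum similarity over all sense pairs". */
final class BestSensePair {
    /** The best-scoring pair of senses and its score. */
    static final class Best {
        final String sense1;
        final String sense2;
        final double score;
        Best(String sense1, String sense2, double score) {
            this.sense1 = sense1;
            this.sense2 = sense2;
            this.score = score;
        }
    }

    /** Scores every (s1, s2) pair and keeps the maximum; returns null if either list is empty. */
    static Best best(List<String> senses1, List<String> senses2,
                     ToDoubleBiFunction<String, String> sim) {
        Best best = null;
        for (String s1 : senses1) {
            for (String s2 : senses2) {
                double score = sim.applyAsDouble(s1, s2);
                if (best == null || score > best.score) {
                    best = new Best(s1, s2, score);
                }
            }
        }
        return best;
    }

    public static void main(String[] args) {
        // Made-up scores, only to exercise the loop.
        Map<String, Double> table = new HashMap<>();
        table.put("a#n#1|b#n#1", 0.05);
        table.put("a#n#1|b#n#2", 0.30);
        table.put("a#n#2|b#n#1", 0.10);
        table.put("a#n#2|b#n#2", 0.20);
        Best b = best(Arrays.asList("a#n#1", "a#n#2"), Arrays.asList("b#n#1", "b#n#2"),
                      (s1, s2) -> table.getOrDefault(s1 + "|" + s2, 0.0));
        System.out.println(b.sense1 + " vs " + b.sense2 + " = " + b.score);
    }
}
```

This is why the results report specific senses such as trade#n#5 vs deal#n#1: the word-level score is the score of whichever sense pair came out highest.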

JCn.java

/************************************************************************
 *         Copyright (C) 2006-2007 The University of Sheffield          *
 *                    Developed by Mark A. Greenwood                    *
 *                                                                      *
 * This program is free software; you can redistribute it and/or modify *
 * it under the terms of the GNU General Public License as published by *
 * the Free Software Foundation; either version 2 of the License, or    *
 * (at your option) any later version.                                  *
 *                                                                      *
 * This program is distributed in the hope that it will be useful,      *
 * but WITHOUT ANY WARRANTY; without even the implied warranty of       *
 * MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE.  See the        *
 * GNU General Public License for more details.                         *
 *                                                                      *
 * You should have received a copy of the GNU General Public License    *
 * along with this program; if not, write to the Free Software          *
 * Foundation, Inc., 675 Mass Ave, Cambridge, MA 02139, USA.            *
 ************************************************************************/

package shef.nlp.wordnet.similarity;

import net.didion.jwnl.JWNLException;
import net.didion.jwnl.data.Synset;

/**
 * An implementation of the WordNet similarity measure developed by Jiang and
 * Conrath. For full details of the measure see:
 * Jiang J. and Conrath D. 1997. Semantic similarity based on corpus
 * statistics and lexical taxonomy. In Proceedings of International
 * Conference on Research in Computational Linguistics, Taiwan.
 * @author Mark A. Greenwood
 */
public class JCn extends ICMeasure
{
    /**
     * Instances of this similarity measure should be generated using the
     * factory methods of {@link SimilarityMeasure}.
     */
    protected JCn()
    {
        //A protected constructor to force the use of the newInstance method
    }

    @Override public double getSimilarity(Synset s1, Synset s2) throws JWNLException
    {
        //if the POS tags are not the same then return 0 as this measure
        //only works with 2 nouns or 2 verbs.
        if (!s1.getPOS().equals(s2.getPOS())) return 0;

        //see if the similarity is already cached and...
        Double cached = getFromCache(s1, s2);

        //if it is then simply return it
        if (cached != null) return cached.doubleValue();

        //Get the Information Content (IC) values for the two supplied synsets
        double ic1 = getIC(s1);
        double ic2 = getIC(s2);

        //if either IC value is zero then cache and return a sim of 0
        if (ic1 == 0 || ic2 == 0) return addToCache(s1,s2,0);

        //Get the Lowest Common Subsumer (LCS) of the two synsets
        Synset lcs = getLCSbyIC(s1,s2);

        //if there isn't an LCS then cache and return a sim of 0
        if (lcs == null) return addToCache(s1,s2,0);

        //get the IC value of the LCS
        double icLCS = getIC(lcs);

        //compute the distance between the two synsets
        //NOTE: This is the original JCN measure
        double distance = ic1 + ic2 - (2 * icLCS);

        //assume the similarity between the synsets is 0
        double sim = 0;

        if (distance == 0)
        {
            //if the distance is 0 (i.e. ic1 + ic2 = 2 * icLCS) then...

            //get the root frequency for this POS tag
            double rootFreq = getFrequency(s1.getPOS());

            if (rootFreq > 0.01)
            {
                //if the root frequency has a value then use it to generate a
                //very large sim value
                sim = 1/-Math.log((rootFreq - 0.01) / rootFreq);
            }
        }
        else
        {
            //this is the normal case so just convert the distance
            //to a similarity by taking the multiplicative inverse
            sim = 1/distance;
        }

        //cache and return the calculated similarity
        return addToCache(s1,s2,sim);
    }
}
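The arithmetic of the JCn measure can be tried out without WordNet. The sketch below repeats the formula from the listing on plain numbers. The `JcnSketch` class and the frequencies are made up, and because getIC is not shown in the listing, the `ic` helper assumes the usual definition of information content, -ln(freq / rootFreq).

```java
/** Hypothetical numeric sketch of the Jiang-Conrath computation shown above. */
final class JcnSketch {
    /** Information content of a concept from its corpus frequency (assumed definition). */
    static double ic(double freq, double rootFreq) {
        return -Math.log(freq / rootFreq);
    }

    /** Mirrors the listing: similarity is 1 / (ic1 + ic2 - 2 * icLcs), with a special case for distance 0. */
    static double jcn(double ic1, double ic2, double icLcs, double rootFreq) {
        if (ic1 == 0 || ic2 == 0) return 0;
        double distance = ic1 + ic2 - 2 * icLcs;
        if (distance == 0) {
            // identical concepts: return a very large value derived from the root frequency
            return rootFreq > 0.01 ? 1 / -Math.log((rootFreq - 0.01) / rootFreq) : 0;
        }
        return 1 / distance;
    }

    public static void main(String[] args) {
        double rootFreq = 1000;              // made-up corpus counts
        double ic1 = ic(10, rootFreq);
        double ic2 = ic(20, rootFreq);
        double icLcs = ic(100, rootFreq);
        System.out.println(jcn(ic1, ic2, icLcs, rootFreq));   // two different senses
        System.out.println(jcn(ic1, ic1, ic1, rootFreq));     // a sense compared with itself
    }
}
```

Note that the result is not bounded by 1. When two senses are the same synset, the distance is 0 and the score depends only on the root frequency, which is consistent with trade#n#5 vs deal#n#1 getting the same score as trade vs trade in section 6.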

Lin.java

package shef.nlp.wordnet.similarity;

import net.didion.jwnl.JWNLException;
import net.didion.jwnl.data.Synset;

/**
 * An implementation of the WordNet similarity measure developed by Lin. For
 * full details of the measure see:
 * Lin D. 1998. An information-theoretic definition of similarity. In
 * Proceedings of the 15th International Conference on Machine
 * Learning, Madison, WI.
 * @author Mark A. Greenwood
 */
public class Lin extends ICMeasure
{
    /**
     * Instances of this similarity measure should be generated using the
     * factory methods of {@link SimilarityMeasure}.
     */
    protected Lin()
    {
        //A protected constructor to force the use of the newInstance method
    }

    @Override public double getSimilarity(Synset s1, Synset s2) throws JWNLException
    {
        //if the POS tags are not the same then return 0 as this measure
        //only works with 2 nouns or 2 verbs.
        if (!s1.getPOS().equals(s2.getPOS())) return 0;

        //see if the similarity is already cached and...
        Double cached = getFromCache(s1, s2);

        //if it is then simply return it
        if (cached != null) return cached.doubleValue();

        //Get the Information Content (IC) values for the two supplied synsets
        double ic1 = getIC(s1);
        double ic2 = getIC(s2);

        //if either IC value is zero then cache and return a sim of 0
        if (ic1 == 0 || ic2 == 0) return addToCache(s1,s2,0);

        //Get the Lowest Common Subsumer (LCS) of the two synsets
        Synset lcs = getLCSbyIC(s1,s2);

        //if there isn't an LCS then cache and return a sim of 0
        if (lcs == null) return addToCache(s1,s2,0);

        //get the IC value of the LCS
        double icLCS = getIC(lcs);

        //calculate the similarity score
        double sim = (2*icLCS)/(ic1+ic2);

        //cache and return the calculated similarity
        return addToCache(s1,s2,sim);
    }
}
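For comparison, the Lin formula can be checked on plain numbers in the same way. The `LinSketch` class and the IC values are made up for illustration; only the formula itself comes from the listing.

```java
/** Hypothetical numeric sketch of the Lin computation shown above. */
final class LinSketch {
    /** Mirrors the listing: similarity is 2 * icLcs / (ic1 + ic2), or 0 if either IC is 0. */
    static double lin(double ic1, double ic2, double icLcs) {
        if (ic1 == 0 || ic2 == 0) return 0;
        return (2 * icLcs) / (ic1 + ic2);
    }

    public static void main(String[] args) {
        System.out.println(lin(3, 5, 2));   // the shared ancestor carries half of the average IC
        System.out.println(lin(4, 4, 4));   // a sense compared with itself
    }
}
```

Because it is a ratio, the Lin score is 1 for identical senses and smaller otherwise, which makes it easier to read as a percentage than the unbounded JCn score.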

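References:

"基于维基百科的语义相似度计算" (Semantic Similarity Computation Based on Wikipedia), 盛志超, 陶晓鹏 (School of Computer Science, Fudan University)

"基于WordNet的英语词语相似度计算" (WordNet-Based Computation of English Word Similarity), 颜伟, 荀恩东 (Institute of Language Information Processing, Beijing Language and Culture University)

Nouns in WordNet: http://ccl.pku.edu.cn/doubtfire/semantics/wordnet/c-wordnet/nouns-in-wordnet.htm

A comparison of MIT's JWI (Java WordNet Interface) and JWNL (Java WordNet Library):

http://jxr19830617.blog.163.com/blog/static/163573067201301985219857/

http://jxr19830617.blog.163.com/blog/static/1635730672013019105255295/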

你可能感兴趣的:(基于WordNet的英文同义词、近义词相似度评估及代码实现)

Redis的ziplist与hashtable性能对比测试无级程序员 java 数据库大数据
测试一下ziplist长度为2048时性能。机器为4C,8G虚拟机另外，记录一下，200个节点的Redis集群要消耗大约200mbps带宽用于节点间通讯。
PGSql性能测试无级程序员数据库大数据
一个40亿的表，分成128个区，16384个slot，每个表分区大约3000W数据，每个slot大约25W数据，虚拟机8C16G，1T空间，测试导出一个slot数据性能，结果如下：select*fromtablewhereslot_id=0;以slot_id为索引：大约100多秒，以slot_id和slice_id为索引时大约2秒，很奇怪的结果。另外，数据增加到60亿，即每个表4500W数据时，一
美逛的邀请码怎么获得_美逛邀请码填什么? 日常购物小技巧
大家好，我是花桃APP推荐官小琪琪今天给大家说说：美逛的邀请码怎么获得_美逛邀请码填什么?一、美逛邀请码填什么填多少1、美逛邀请码填写：520999（这是花桃APP的），这样可以获得高佣金。相信大家的朋友圈最新都被一款叫“美逛”的APP软件刷屏了，那么，美逛是什么？简单说，美逛是一个全领域的、省钱还能赚钱的超级返利创业APP。淘宝只是美逛的商务合作方之一。美逛有京东，拼多多，淘宝，飞猪，还会陆续接
思维导图学生训练营2020寒假第五期圆满结束！曹华_全脑思维
2020年1月13日，盛和教育携手易思图，第五期寒假思维导图训练营圆满结束啦！今天更是混班制，最小的大班，最大的五年级。幼儿园我们本来是不招收的，年龄段是7~12岁，上一年级以后才能报名。这个大班的妹妹是哥哥来学习，妈妈就想妹妹一起，她也有事情的，带着妹妹不方便，我们机构正好有老师，所以想着一位老师可以陪着她一起学习，就让她留下来！没有想到的是，孩子一天的学习，思维完全跟的上，也没有安排专门的老师
C++11与MFC多线程控制：暂停与继续实践征途阿韦
本文还有配套的精品资源，点击获取简介：本项目深入探讨了在C++编程中，特别是在MFC框架下，如何管理和控制线程的暂停、继续和退出。涵盖了C++11标准库中std::thread的使用以及在MFC中CWinThread的继承和Run方法的重写。介绍了使用同步对象如条件变量、事件和信号量等实现线程暂停与继续的策略，并强调了线程退出的正确方式和多线程编程中的挑战，如同步、通信、避免死锁和竞态条件。1.C
打胎这种事~ 月亮上的妖姬
大部分人身边都有这种事男的被说为渣男不负责任女的则跟受了天大的委屈付出了所有得不到回报丢了清白丢了罪我以为……都是成年人对自己负责任一点他不想戴套你不让他睡他敢强奸你吗穿好自己的裤子别拿身体考验男人的心很多人对自己的选择都不负责任出了事都把狗血泼到男人头上如果他说戴套不舒服你为了他呢几秒钟的舒服就毅然决定顺服那就不要事后怨男人你那不是做爱是献身你后来的痛只是为了你曾经的爽
小李的快乐日记等不到晚安啦
总结：今天是6月7号星期一也是新的一周的开始；本周的目标是冲主管连续五天打钟很遗憾今天没能开单难度又增加啦许多剩下的这五天我准备全力以赴的剩下的交给上天啦如果我李华业有一天偷懒我天打雷劈；就算结果不是很好但至少已经全力以赴的去做这件事啦，不留遗憾这也是给自己一个挑战今天很遗憾把单子搞砸啦心态多少有点不好但这并不完全的影响到我；我在这五天之内我要全力以赴的去做业；加油李华业你是最棒最牛的
QCS8550 硬件性能全解析：参数、性能、优化，一篇讲透伊利丹~怒风 Qualcomm 算法 python 人工智能边缘计算无人机机器人
在物联网（IoT）设备向高性能、智能化演进的过程中，处理器作为核心算力单元扮演着关键角色。高通推出的Dragonwing™QCS8550处理器，凭借4nm工艺、异构计算架构、极致边缘AI处理能力及Wi-Fi7连接等特性，成为面向工业无人机、自主移动机器人、边缘AI盒子等高性能IoT场景的旗舰解决方案。本文将从核心参数、性能优势、优化亮点三个维度，全面解析这款处理器的技术实力。一、核心参数：4nm工
深度解析 QCS6490：硬件性能全揭秘