u013250861

知识图谱-KGE(Knowledge Graph Embedding)：kge库【包含：TransE、TransH、ConvE、DistMult、ComplEx、TuckER、SimplE...】

LibKGE is a PyTorch-based library for efficient training, evaluation, and hyperparameter optimization of knowledge graph embeddings (KGE). It is highly configurable, easy to use, and extensible. Other KGE frameworks are listed below.

The key goal of LibKGE is to foster reproducible research into (as well as meaningful comparisons between) KGE models and training methods. As we argue in our ICLR 2020 paper (see video), the choice of training strategy and hyperparameters are very influential on model performance, often more so than the model class itself. LibKGE aims to provide clean implementations of training, hyperparameter optimization, and evaluation strategies that can be used with any model. Every potential knob or heuristic implemented in the framework is exposed explicitly via well-documented configuration files (e.g., see here and here). LibKGE also provides the most common KGE models and new ones can be easily added (contributions welcome!).

For link prediction tasks, rule-based systems such as AnyBURL are a competitive alternative to KGE.

UPDATE: LibKGE now includes GraSH, an efficient multi-fidelity hyperparameter optimization algorithm for large-scale KGE models. See here for an example on how to use it.

Quick start

# retrieve and install project in development mode
git clone https://github.com/uma-pi1/kge.git
cd kge
pip install -e .

# download and preprocess datasets
cd data
sh download_all.sh
cd ..

# train an example model on toy dataset (you can omit '--job.device cpu' when you have a gpu)
kge start examples/toy-complex-train.yaml --job.device cpu

Features
Results and pretrained models
Using LibKGE
Currently supported KGE models
Extending LibKGE
FAQ
Known issues
Changelog
Other KGE frameworks
How to cite

Features

Training
- Training types: negative sampling, 1vsAll, KvsAll
- Losses: binary cross entropy (BCE), Kullback-Leibler divergence (KL),
  margin ranking (MR), squared error (SE)
- All optimizers and learning rate schedulers of PyTorch supported and can be
  chosen individually for different parameters (e.g., different for entity
  and for relation embeddings)
- Learning rate warmup
- Early stopping
- Checkpointing
- Stop (e.g., via Ctrl-C) and resume at any time
- Automatic memory management to support large batch sizes (see config key train.subbatch_auto_tune)
Hyperparameter tuning
- Grid search, manual search, quasi-random search (using
  Ax), Bayesian optimization (using Ax)
- Resource-efficient multi-fidelity search for large graphs (using GraSH)
- Highly parallelizable (multiple CPUs/GPUs on single machine)
- Stop and resume at any time
Evaluation
- Entity ranking metrics: Mean Reciprocal Rank (MRR), HITS@k with/without filtering
- Drill-down by: relation type, relation frequency, head or tail
Extensive logging and tracing
- Detailed progress information about training, hyper-parameter tuning, and evaluation
  is recorded in machine readable formats
- Quick export of all/selected parts of the traced data into CSV or YAML files to
  facilitate analysis
KGE models
- All models can be used with or without reciprocal relations
- RESCAL (code, config)
- TransE (code, config)
- TransH (code, config)
- DistMult (code, config)
- ComplEx (code, config)
- ConvE (code, config)
- RelationalTucker3/TuckER (code, config)
- CP (code, config)
- SimplE (code, config)
- RotatE (code, config)
- Transformer (“No context” model) (code, config)
Embedders
- Lookup embedder (code, config)
- Projection embedder (code, config)

Results and pretrained models

We list some example results (filtered MRR and HITS@k on test data) obtained with
LibKGE below. These results are obtained by running automatic hyperparameter
search as described here.

These results are not necessarily the best results that can be achieved using LibKGE,
but they are comparable in that a common experimental setup (and equal amount of work)
has been used for hyperparameter optimization for each model. Since we use filtered MRR
for model selection, our results may not be indicative of the achievable model performance
for other validation metrics (such as HITS@10, which has been used for model selection
elsewhere).

We report performance numbers on the entire test set, including the
triples that contain entities not seen during training. This is not done
consistently throughout existing KGE implementations: some frameworks remove
unseen entities from the test set, which leads to a perceived increase in
performance (e.g., roughly add +3pp to our WN18RR MRR numbers for this method of
evaluation).

We also provide pretrained models for these results. Each pretrained model is
given in the form of a LibKGE checkpoint, which contains the model as well as
additional information (such as the configuration being used). See the
documentation below on how to use checkpoints.

FB15K-237 (Freebase)

	MRR	Hits@1	Hits@3	Hits@10	Config file	Pretrained model
RESCAL	0.356	0.263	0.393	0.541	config.yaml	1vsAll-kl
TransE	0.313	0.221	0.347	0.497	config.yaml	NegSamp-kl
DistMult	0.343	0.250	0.378	0.531	config.yaml	NegSamp-kl
ComplEx	0.348	0.253	0.384	0.536	config.yaml	NegSamp-kl
ConvE	0.339	0.248	0.369	0.521	config.yaml	1vsAll-kl
RotatE	0.333	0.240	0.368	0.522	config.yaml	NegSamp-bce

WN18RR (Wordnet)

	MRR	Hits@1	Hits@3	Hits@10	Config file	Pretrained model
RESCAL	0.467	0.439	0.480	0.517	config.yaml	KvsAll-kl
TransE	0.228	0.053	0.368	0.520	config.yaml	NegSamp-kl
DistMult	0.452	0.413	0.466	0.530	config.yaml	KvsAll-kl
ComplEx	0.475	0.438	0.490	0.547	config.yaml	1vsAll-kl
ConvE	0.442	0.411	0.451	0.504	config.yaml	KvsAll-kl
RotatE	0.478	0.439	0.494	0.553	config.yaml	NegSamp-bce

FB15K (Freebase)

	MRR	Hits@1	Hits@3	Hits@10	Config file	Pretrained model
RESCAL	0.644	0.544	0.708	0.824	config.yaml	NegSamp-kl
TransE	0.676	0.542	0.787	0.875	config.yaml	NegSamp-bce
DistMult	0.841	0.806	0.863	0.903	config.yaml	1vsAll-kl
ComplEx	0.838	0.807	0.856	0.893	config.yaml	1vsAll-kl
ConvE	0.825	0.781	0.855	0.896	config.yaml	KvsAll-bce
RotatE	0.783	0.727	0.820	0.877	config.yaml	NegSamp-kl

WN18 (Wordnet)

	MRR	Hits@1	Hits@3	Hits@10	Config file	Pretrained model
RESCAL	0.948	0.943	0.951	0.956	config.yaml	1vsAll-kl
TransE	0.553	0.315	0.764	0.924	config.yaml	NegSamp-bce
DistMult	0.941	0.932	0.948	0.954	config.yaml	1vsAll-kl
ComplEx	0.951	0.947	0.953	0.958	config.yaml	KvsAll-kl
ConvE	0.947	0.943	0.949	0.953	config.yaml	1vsAll-kl
RotatE	0.946	0.943	0.948	0.953	config.yaml	NegSamp-kl

Yago3-10 (YAGO)

LibKGE supports large datasets such as Yago3-10 (123k entities) and Wikidata5M (4.8M entities).
The results given below were found by automatic hyperparameter search with a similar search
space as above, but with some values fixed (training with shared negative sampling,
embedding dimension: 128, batch size: 1024, optimizer: Adagrad,
regularization: weighted). The Yago3-10 result was obtained by training 30 pseudo-random configurations for
20 epochs, and then rerunning the configuration that performed best on validation
data for 400 epochs.

	MRR	Hits@1	Hits@3	Hits@10	Config file	Pretrained model
ComplEx	0.551	0.476	0.596	0.682	config.yaml	NegSamp-kl

Wikidata5M (Wikidata)

We report two results for Wikidata5m.
The first result was found by the same automatic hyperparameter search as described for
Yago3-10, but we limited the final training to 200 epochs. The second result was
obtained with significantly less resource consumption by using
the multi-fidelity GraSH search.

	Search + budget	Final training	MRR	Hits@1	Hits@3	Hits@10	Config file	Pretrained model
ComplEx	Random, 600 epochs	200 epochs	0.301	0.245	0.331	0.397	config.yaml	NegSamp-kl
ComplEx	GraSH, 192 epochs	64 epochs	0.300	0.247	0.328	0.390	config.yaml	-

Freebase

GraSH was also applied to Freebase, one of the largest benchmarking datasets containing 86M entities.
The reported results were obtained by combining GraSH with distributed training implemented in
Dist-KGE.
The respective config files can be found in the GraSH repository as their execution is not yet supported in LibKGE.

	MRR	Hits@1	Hits@3	Hits@10
ComplEx	0.594	0.511	0.667	0.726
RotatE	0.613	0.578	0.637	0.669
TransE	0.553	0.520	0.571	0.614

CoDEx

CoDEx is a Wikidata-based KG completion
benchmark. The results here have been obtained using the automatic
hyperparameter search used for the Freebase and WordNet datasets, but with fewer
epochs and Ax trials for CoDEx-M and CoDEx-L. See the CoDEx
paper (EMNLP 2020) for details.

CoDEx-S

	MRR	Hits@1	Hits@3	Hits@10	Config file	Pretrained model
RESCAL	0.404	0.293	0.4494	0.623	config.yaml	1vsAll-kl
TransE	0.354	0.219	0.4218	0.634	config.yaml	NegSamp-kl
ComplEx	0.465	0.372	0.5038	0.646	config.yaml	1vsAll-kl
ConvE	0.444	0.343	0.4926	0.635	config.yaml	1vsAll-kl
TuckER	0.444	0.339	0.4975	0.638	config.yaml	KvsAll-kl

CoDEx-M

	MRR	Hits@1	Hits@3	Hits@10	Config file	Pretrained model
RESCAL	0.317	0.244	0.3477	0.456	config.yaml	1vsAll-kl
TransE	0.303	0.223	0.3363	0.454	config.yaml	NegSamp-kl
ComplEx	0.337	0.262	0.3701	0.476	config.yaml	KvsAll-kl
ConvE	0.318	0.239	0.3551	0.464	config.yaml	NegSamp-kl
TuckER	0.328	0.259	0.3599	0.458	config.yaml	KvsAll-kl

CoDEx-L

	MRR	Hits@1	Hits@3	Hits@10	Config file	Pretrained model
RESCAL	0.304	0.242	0.3313	0.419	config.yaml	1vsAll-kl
TransE	0.187	0.116	0.2188	0.317	config.yaml	NegSamp-kl
ComplEx	0.294	0.237	0.3179	0.400	config.yaml	1vsAll-kl
ConvE	0.303	0.240	0.3298	0.420	config.yaml	1vsAll-kl
TuckER	0.309	0.244	0.3395	0.430	config.yaml	KvsAll-kl

Using LibKGE

LibKGE supports training, evaluation, and hyperparameter tuning of KGE models.
The settings for each task can be specified with a configuration file in YAML
format or on the command line. The default values and usage for available
settings can be found in config-default.yaml as well
as the model- and embedder-specific configuration files (such as
lookup_embedder.yaml).

Train a model

First create a configuration file such as:

job.type: train
dataset.name: fb15k-237

train:
  optimizer: Adagrad
  optimizer_args:
    lr: 0.2

valid:
  every: 5
  metric: mean_reciprocal_rank_filtered

model: complex
lookup_embedder:
  dim: 100
  regularize_weight: 0.8e-7

To begin training, run one of the following:

# Store the file as `config.yaml` in a new folder of your choice. Then initiate or resume
# the training job using:
kge resume <folder>

# Alternatively, store the configuration anywhere and use the start command
# to create a new folder
#   /local/experiments/-
# with that config and start training there.
kge start <config-file>

# In both cases, configuration options can be modified on the command line, too: e.g.,
kge start <config-file> config.yaml --job.device cuda:0 --train.optimizer Adam

Various checkpoints (including model parameters and configuration options) will
be created during training. These checkpoints can be used to resume training (or any other job type such as hyperparameter search jobs).

Resume training

All of LibKGE’s jobs can be interrupted (e.g., via Ctrl-C) and resumed (from one of its checkpoints). To resume a job, use:

kge resume <folder>

# Change the device when resuming
kge resume <folder> --job.device cuda:1

By default, the last checkpoint file is used. The filename of the checkpoint can be overwritten using --checkpoint.

Evaluate a trained model

To evaluate trained model, run the following:

# Evaluate a model on the validation split
kge valid <folder>

# Evaluate a model on the test split
kge test <folder>

By default, the checkpoint file named checkpoint_best.pt (which stores the best validation result so far) is used. The filename of the checkpoint can be overwritten using --checkpoint.

Hyperparameter optimization

LibKGE supports various forms of hyperparameter optimization such as grid search,
random search, Bayesian optimization, or resource-efficient multi-fidelity search.
The search type and search space are specified in the configuration file.

For example, you may use Ax for SOBOL
(pseudo-random) and Bayesian optimization. The following config file defines a
search of 10 SOBOL trials (arms) followed by 20 Bayesian optimization trials:

job.type: search
search.type: ax

dataset.name: wnrr
model: complex
valid.metric: mean_reciprocal_rank_filtered

ax_search:
  num_trials: 30
  num_sobol_trials: 10  # remaining trials are Bayesian
  parameters:
    - name: train.batch_size
      type: choice
      values: [256, 512, 1024]
    - name: train.optimizer_args.lr
      type: range
      bounds: [0.0003, 1.0]
    - name: train.type
      type: fixed
      value: 1vsAll

For large graph datasets such as Wikidata5m, you may use
GraSH, which enables resource-efficient
hyperparameter optimization. A full documentation of the GraSH functionality,
useful search configs, and obtained results can
be found in the accompanying repository.
The following example config defines a
search of 64 randomly generated trials with a search budget equivalent
to only 3 full training runs on the whole dataset:

job.type: search
search.type: grash_search

dataset.name: wikidata5m
model: complex
valid.metric: mean_reciprocal_rank_filtered

grash_search:
  num_trials: 64 # initial number of randomly generated trials
  search_budget: 3 # in terms of full training runs on the whole dataset
  eta: 4 # reduction factor - only keep 1/eta best-performing trials per round
  variant: combined # low-fidelity approximation technique - combined = epoch + graph reduction
  parameters:
    - name: train.batch_size
      type: choice
      values: [256, 512, 1024]
    - name: train.optimizer_args.lr
      type: range
      bounds: [0.0003, 1.0]
    - name: train.type
      type: fixed
      value: 1vsAll

Trials can be run in parallel across several devices:

# Run 4 trials in parallel evenly distributed across two GPUs
kge resume <folder> --search.device_pool cuda:0,cuda:1 --search.num_workers 4

# Run 3 trials in parallel, with per GPUs capacity
kge resume <folder> --search.device_pool cuda:0,cuda:1,cuda:1 --search.num_workers 3

Export and analyze logs and checkpoints

Extensive logs are stored as YAML files (hyperparameter search, training,
validation). LibKGE provides a convenience methods to export the log data to
CSV.

kge dump trace <folder>

The command above yields CSV output such as this output for a training
job or this output for a search
job.
Additional configuration options or metrics can be added to the CSV files as
needed (using a keys
file).

Information about a checkpoint (such as the configuration that was used,
training loss, validation metrics, or explored hyperparameter configurations)
can also be exported from the command line (as YAML):

kge dump checkpoint <checkpoint>

Configuration files can also be dumped in various formats.

# dump just the configuration options that are different from the default values
kge dump config <config-or-folder-or-checkpoint>

# dump the configuration as is
kge dump config <config-or-folder-or-checkpoint> --raw

# dump the expanded config including all configuration keys
kge dump config <config-or-folder-or-checkpoint> --full

Help and other commands

# help on all commands
kge --help

# help on a specific command
kge dump --help

Use a pretrained model in an application

Using a trained model trained with LibKGE is straightforward. In the following
example, we load a checkpoint and predict the most suitable object for a two
subject-relations pairs: (‘Dominican Republic’, ‘has form of government’, ?) and
(‘Mighty Morphin Power Rangers’, ‘is tv show with actor’, ?).

import torch
from kge.model import KgeModel
from kge.util.io import load_checkpoint

# download link for this checkpoint given under results above
checkpoint = load_checkpoint('fb15k-237-rescal.pt')
model = KgeModel.create_from(checkpoint)

s = torch.Tensor([0, 2,]).long()             # subject indexes
p = torch.Tensor([0, 1,]).long()             # relation indexes
scores = model.score_sp(s, p)                # scores of all objects for (s,p,?)
o = torch.argmax(scores, dim=-1)             # index of highest-scoring objects

print(o)
print(model.dataset.entity_strings(s))       # convert indexes to mentions
print(model.dataset.relation_strings(p))
print(model.dataset.entity_strings(o))

# Output (slightly revised for readability):
#
# tensor([8399, 8855])
# ['Dominican Republic'        'Mighty Morphin Power Rangers']
# ['has form of government'    'is tv show with actor']
# ['Republic'                  'Johnny Yong Bosch']

For other scoring functions (score_sp, score_po, score_so, score_spo), see KgeModel.

Use your own dataset

To use your own dataset, create a subfolder mydataset (= dataset name) in the data folder. You can use your dataset later by specifying dataset.name: mydataset in your job’s configuration file.

Each dataset is described by a dataset.yaml file, which needs to be stored in the mydataset folder. After performing the quickstart instructions, have a look at the provided toy example under data/toy/dataset.yaml. The configuration keys and file formats are documented here.

Your data can be automatically preprocessed and converted into the format required by LibKGE. Here is the relevant part for the toy dataset, which see:

# download
curl -O http://web.informatik.uni-mannheim.de/pi1/kge-datasets/toy.tar.gz
tar xvf toy.tar.gz

# preprocess
python preprocess/preprocess_default.py toy

Currently supported KGE models

LibKGE currently implements the KGE models listed in features.

The examples folder contains some configuration files as examples of how to train these models.

We welcome contributions to expand the list of supported models! Please see CONTRIBUTING for details and feel free to initially open an issue.

Extending LibKGE

LibKGE can be extended with new training, evaluation, or search jobs as well as
new models and embedders.

KGE models implement the KgeModel class and generally consist of a
KgeEmbedder to associate each subject, relation and object to an embedding and
a KgeScorer to score triples given their embeddings. All these base classes
are defined in kge_model.py.

KGE jobs perform training, evaluation, and hyper-parameter search. The relevant base classes are Job, TrainingJob, EvaluationJob, and SearchJob.

To add a component, say mycomp (= a model, embedder, or job) with
implementation MyClass, you need to:

Create a configuration file mycomp.yaml. You may store this file directly
in the LibKGE module folders (e.g., /kge/model/) or in your own
module folder. If you plan to contribute your code to LibKGE, we suggest to
directly develop in the LibKGE module folders. If you just want to play
around or publish your code separately from LibKGE, use your own module.
Define all required options for your component, their default values, and
their types in mycomp.yaml. We suggest to follow LibKGE’s core philosophy
and define every option that can influence the outcome of an experiment in
this way. Please pay attention w.r.t. integer (0) vs. float (0.0) values;
e.g., float_option: 0 is incorrect because is interpreted as an integer.
Implement MyClass in a module of your choice. In mycomp.yaml, add key
mycomp.class_name with value MyClass. If you follow LibKGE’s directory
structure (mycomp.yaml for configuration and mycomp.py for
implementation), then ensure that MyClass is imported in __init__.py
(e.g., as done here).
To use your component in an experiment, register your module via the
modules key and its configuration via the import key in the experiment’s
configuration file. See config-default.yaml for a
description of those keys. For example, in myexp_config.yaml, add:
```
modules: [ kge.job, kge.model, kge.model.embedder, mymodule ]
import: [ mycomp ]
```

FAQ

Are the configuration options documented somewhere?

Yes, see config-default.yaml as well as the configuration files for each component listed above.

Are the command line options documented somewhere?

Yes, try kge --help. You may also obtain help for subcommands, e.g., try kge dump --help or kge dump trace --help.

LibKGE runs out of memory. What can I do?

For training, set train.subbatch_auto_tune to true (equivalent result, less memory but slower).
For evaluation, set entity_ranking.chunk_size to, say, 10000 (equivalent result, less memory but slightly slower, the more so the smaller the chunk size).
Change hyperparameters (non-equivalent result): e.g., decrease the batch size, use negative sampling, use less samples).

Known issues

Changelog

See here.

Other KGE frameworks

Other KGE frameworks:

Graphvite
AmpliGraph
OpenKE
PyKEEN
Pykg2vec
Dist-KGE, a parallel variant of LibKGE

KGE projects for publications that also implement a few models:

ConvE
KBC

PRs to this list are welcome.

How to cite

Please cite the following publication to refer to the experimental study about the impact of training methods on KGE performance:

@inproceedings{
  ruffinelli2020you,
  title={You {CAN} Teach an Old Dog New Tricks! On Training Knowledge Graph Embeddings},
  author={Daniel Ruffinelli and Samuel Broscheit and Rainer Gemulla},
  booktitle={International Conference on Learning Representations},
  year={2020},
  url={https://openreview.net/forum?id=BkxSmlBFvr}
}

If you use LibKGE, please cite the following publication:

@inproceedings{
  libkge,
  title="{L}ib{KGE} - {A} Knowledge Graph Embedding Library for Reproducible Research",
  author={Samuel Broscheit and Daniel Ruffinelli and Adrian Kochsiek and Patrick Betz and Rainer Gemulla},
  booktitle={Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing: System Demonstrations},
  year={2020},
  url={https://www.aclweb.org/anthology/2020.emnlp-demos.22},
  pages = "165--174",
}

你可能感兴趣的:(#,kge)

ConvE，知识图谱嵌入（KGE) autodl 服务器运行 zhous_ 服务器 python
参考：博客地址项目地址项目地址主要配置采用autodl服务器环境如下配置路径服务器配置实例创建完成后打开jupyterlab打开终端输入vim~/.bashrc然后enter进入然后英文输入法下按i进入编辑模式底部会变为INSERT然后在最后一行加上source/root/miniconda3/etc/profile.d/conda.sh然后按esc最后英文状态下输入:wq保存退出注意有个英文冒号
力扣剑指 Offer 22. 链表中倒数第k个节点三更鬼双指针链表 leetcode
题目来源：https://leetcode-cn.com/problems/lian-biao-zhong-dao-shu-di-kge-jie-dian-lcof/、大致题意：给一个链表，返回它的倒数第k个节点思路快慢指针初始时，快慢指针都指向头部先让一个快指针先走k步再让两个指针同步走，直至快指针走到尾部，此时慢指针的位置就是倒数第k个节点代码：publicListNodegetKthFrom
LeetCode 剑指 Offer 22. 链表中倒数第k个节点 Neepu_G.job 笔记学习日记链表指针算法 leetcode c语言
输入一个链表，输出该链表中倒数第k个节点。为了符合大多数人的习惯，本题从1开始计数，即链表的尾节点是倒数第1个节点。例如，一个链表有6个节点，从头节点开始，它们的值依次是1、2、3、4、5、6。这个链表的倒数第3个节点是值为4的节点。原题：链表中倒数第k个点或直接转到：https://leetcode-cn.com/problems/lian-biao-zhong-dao-shu-di-kge-j
单链表OJ题：LeetCode--剑指Offer 22.链表中的倒数第k个结点 stackY、链表数据结构 leetcode 算法
朋友们、伙计们，我们又见面了，今天给大家带来的是LeetCode中剑指Offer22.链表中的倒数第k个结点数据结构：数据结构专栏作者：stackY、C语言：C语言专栏LeetCode：LeetCode刷题训练营剑指Offer22.链表中的倒数第k个结点：https://leetcode.cn/problems/lian-biao-zhong-dao-shu-di-kge-jie-dian-lco
论文浅尝 | AutoETER: 用于知识图谱嵌入的自动实体类型表示开放知识图谱算法人工智能机器学习深度学习知识图谱
论文链接：https://arxiv.org/pdf/2009.12030.pdf动机传统的KGE使用附加的类型信息改善实体的表示，但是这些方法完全依赖于显式类型，或者忽略了特定于各种关系的不同类型表示，并且这些方法目前都不能同时推断出对称性、反演和组成的所有关系模式，以及1-N、N-1和N-N关系的复杂性质。所以为了探索任何知识图谱的类型信息，我们提出了通过将每个关系作为具有关系感知投影机制的两
论文浅尝 | Concept2Box：从双视图学习知识图谱的联合几何嵌入模型开放知识图谱学习知识图谱人工智能
笔记整理：张钊源，天津大学硕士，研究方向为知识图谱链接：https://virtual2023.aclweb.org/paper_P4210.html动机知识图嵌入（KGE）已被广泛研究，用于嵌入大规模关系数据以满足许多现实世界的应用。现有方法长期以来忽略了许多知识图谱包含两种根本不同视图的事实：高级本体视图概念和细粒度实例视图实体。它们通常将所有节点作为向量嵌入一个潜在空间。然而，单一的几何表示
燕玲的Scalers Talk第五轮新概念朗读持续力训练Day29 20191116 少女玲奈酱
练习材料：《新概念英语第二册》Lesson37TheOlympicGames任务配置：L0+L4知识笔记：原文：L0:全文音标——ˈlɛsn37ðiəʊˈlɪmpɪkgeɪmzðiəʊˈlɪmpɪkgeɪmzwɪlbiːhɛldɪnˈaʊəˈkʌntriɪnfɔːjɪəztaɪm.æzəgreɪtˈmɛniˈpiːplwɪlbiːˈvɪzɪtɪŋðəˈkʌntri,ðəˈgʌvnməntwɪlbi
K个数、K个点、K个元素，3K堆排序，类比三解题！清风Python
面试题17.14.最小K个数https://leetcode-cn.com/problems/smallest-k-lcci/solution/mian-shi-ti-1714zui-xiao-kge-shu-ji-chu-k9jd8/难度：中等题目：设计一个算法，找出数组中最小的k个数。以任意顺序返回这k个数均可。提示：0hp[0][0]:heapq.heappop(hp)heapq.heapp
评分函数和损失函数是什么（知识图谱嵌入KGE） Rebecca.Yan 自然语言处理+知识图谱知识图谱人工智能深度学习
一、知识图谱中的评分函数和损失函数评分函数：评分函数用于计算给定实体和关系之间的匹配度或相似度得分。它接收实体和关系的嵌入表示作为输入，并输出一个分数，该分数反映了实体和关系之间的相关性。评分函数的目标是衡量实体和关系之间的连接程度或关联强度。常见的评分函数包括内积、点积和基于神经网络的函数等。损失函数：损失函数是用于训练嵌入模型的目标函数，它用于衡量模型预测结果与真实标签之间的差异。损失函数接收
2021年10月28日淘宝x-sign学习笔记 bugtraq2021 flutter
淘宝，taobao，x-sign测试学习淘宝系的请求方式，均有x-sign参数，而这个参数的值，很多方法可以获得，但是生成方法，听说一直在修改，但一直未见过。{x-sign=azYBCM004xAAJc3kgE6mu9CtuRCvlc3lzro6Y9p7Asyc0c3rE9F+KZYa7cIB9PN7A3Jz59bQEpFWX9mhbdSJUWyhaHXt5c3lzdXN5c,x-mini-wua
剑指 Offer 40. 最小的k个数（C+实现） Kk_1025 我的剑指刷题系列算法数据结构 c++
剑指Offer40.最小的k个数https://leetcode.cn/problems/zui-xiao-de-kge-shu-lcof/法1：二叉堆通过最小堆，直接筛选出最小的k个数vectorgetLeastNumbers(vector&arr,intk){priority_queue,greater>minHeap;for(constintnum:arr){minHeap.push(num
知识图谱嵌入(KGE)：方法和应用的综述 zenRRan
来自：专知导读本文主要是参考《KnowledgeGraphEmbedding:ASurveyofApproachesandApplications》和刘知远的《知识表示学习的研究与进展》做的总结，主要介绍了最近关于知识图谱嵌入所涉及到的研究方法，主要从融合事实信息、融合附加信息和KGE下游任务应用三方面展开。由于篇幅较长，下图是本文的结构，可以按照自己的需要有选择性的浏览。介绍近年来，知识图谱(K
leetcode-每日一题2021.9.2 链表中倒数第k个节点还记得樱花正开~ leetcode 链表 leetcode 算法
题目https://leetcode-cn.com/problems/lian-biao-zhong-dao-shu-di-kge-jie-dian-lcof/我的思路先找到链表的总结点数，再从头推到cnt-k个结点。我的代码/***Definitionforsingly-linkedlist.*structListNode{*intval;*ListNode*next;*ListNode(int
论文阅读《2022WWW：Rethinking Graph Convolutional Networks in Knowledge Graph Completion》 Jiawen9 #知识图谱论文阅读知识图谱人工智能 python 算法机器学习
论文链接论文工作简介KCN在建模图结构方面很有效。基于GCN的KGC模型通常使用编码器-解码器框架，GCNs和KGE模型分别充当编码器和解码器。许多基于GCN的KGC模型虽然引入了额外的计算复杂度，但未能超越最先进的KGE模型？作者发现GCNs中的图结构并没有对KGC的性能有显著提升，相反实体表示的转换为性能带来提升。本文提出的LTE-KGE模型带来与KGE模型相似的性能提升同时避免了GCN聚合中
【算法实战】双指针 Sivan_Xin 算法实战专栏算法链表 java
文章目录[剑指Offer18.删除链表的节点-力扣（LeetCode）](https://leetcode.cn/problems/shan-chu-lian-biao-de-jie-dian-lcof/)[剑指Offer22.链表中倒数第k个节点-力扣（LeetCode）](https://leetcode.cn/problems/lian-biao-zhong-dao-shu-di-kge-j
[论文笔记]Rethinking Graph Convolutional Networks in Knowledge Graph Completion Yulki KGC 知识图谱人工智能机器学习论文阅读深度学习
摘要许多基于GCN的KGC模型增加了额外的计算复杂度，但效果并未提高。通过实验观察到，GCN中的图结构建模对KGC模型的性能没有显着影响，这与普遍看法相反。相反，实体表示的转换提供了性能改进。引言在KGC任务中，GCN和KGE模型分别充当encoder和decoder。值得注意的是，与仅聚合来自邻居节点信息的经典GCN不同，KGC中的GCN考虑了知识图中的边（关系）。这些结果表明，对于KGC任务来
最小的k个数 gzhao01
https://leetcode-cn.com/problems/zui-xiao-de-kge-shu-lcof/本题属于topk问题方法一：快速排序classSolution{public:vectorgetLeastNumbers(vector&arr,intk){sort(arr.begin(),arr.end());vectorresult={};for(inti=0;igetLeast
面试题40. 最小的k个数最尾一名
原题https://leetcode-cn.com/problems/zui-xiao-de-kge-shu-lcof/解题思路利用快排的思想，每经过一轮排序，pivot左边的一定是较小的，右边的一定是较大的。每轮比较之后，得到left:Array、pivot:number和right:Array如果left.length对于right进行递归，找最小的k-left.length-1个数如果lef
知识图谱嵌入评价指标之MRR，Hits@n Rebecca_yanhan 知识图谱人工智能
KnowledgeGraphEmbedding，KGE模型性能中最常用的几个指标：MRR,HITS@1,HITS@10。MRR和HITS@10是两个重要指标，不可缺少，MR不被看作是一个好的指标，所以不进行介绍。1、MRRMRR的全称是MeanReciprocalRanking，其中Reciprocal是指“倒数的”的意思。具体的计算方法如下：其中是三元组集合，是三元组集合个数，是指第个三元组的链
最小的K个数剑来___
地址https://leetcode-cn.com/problems/zui-xiao-de-kge-shu-lcof/JS解法：基于快速排序/***@param{number[]}arr*@param{number}k*@return{number[]}*/vargetLeastNumbers=function(arr,k){returnquikSort(arr).splice(0,k)};fu
链表中倒数第k个节点（顺序查找、快慢指针）学海无涯苦作舟呀 #链表 #双指针链表 java
题目链接：https://leetcode-cn.com/problems/lian-biao-zhong-dao-shu-di-kge-jie-dian-lcof/题目：输入一个链表，输出该链表中倒数第k个节点。为了符合大多数人的习惯，本题从1开始计数，即链表的尾节点是倒数第1个节点。例如，一个链表有6个节点，从头节点开始，它们的值依次是1、2、3、4、5、6。这个链表的倒数第3个节点是值为4的
【水文模型】评价指标 WW、forever #水文模型水文模型
水文模型模拟效果评价指标1皮尔逊相关系数（Pearson’scorrelationcoefficient,PCC）2百分比偏差（Percentbias,Pbias）3纳什效率系数（theNash-Sutcliffeefficiencycoefficient,NSE）4克林-古普塔效率系数（Kling-Guptaefficiencycoefficient,KGE）5决定系数R^2(Coefficie
知识图谱嵌入模型 (KGE) 的总结和比较
知识图谱嵌入(KGE)是一种利用监督学习来学习嵌入以及节点和边的向量表示的模型。它们将“知识”投射到一个连续的低维空间，这些低维空间向量一般只有几百个维度（用来表示知识存储的内存效率）。向量空间中，每个点代表一个概念，每个点在空间中的位置具有语义意义，类似于词嵌入。一个好的KGE应该具有足够的表现力来捕获KG属性，这些属性解决了表示关系的独特逻辑模式的能力。并且KG可以根据要求添加或删除一些特定属
KGE性能指标：MRR，MR，HITS@1，HITS@3，HITS@10 飞机火车巴雷特学习记录知识图谱
本文将介绍用于衡量知识图谱嵌入（KnowledgeGraphEmbedding，KGE）模型性能中最常用的几个指标：MRR，MR，HITS@1，HITS@3，HITS@10。一、MRRMRR的全称是MeanReciprocalRanking，其中Reciprocal是指“倒数的”的意思。具体的计算方法如下：其中是三元组集合，是三元组集合个数，是指第个三元组的链接预测排名。该指标越大越好。例如，对于
针对知识图谱嵌入（KGE）的投毒攻击【论文阅读】白白净净吃了没病知识图谱嵌入（KGE）知识图谱知识图谱嵌入（KGE）对抗攻击
目录前言一、知识图谱嵌入对抗攻击研究意义二、关于KGE的对抗攻击的研究难点和突破三、投毒攻击的设计①直接攻击②间接攻击四、实验设置和实验结果总结前言在著名的知识图谱数据集Freebase中，实体的数量超过3000万，而关系类型的数量只有1345。这导致了这样一个事实，即每种关系类型的固有特征远比实体的稳定，并且很难通过少量修改来操纵。因此，论文着重从实体的角度操纵目标事实的合理性。以此进行对抗攻击
知识图谱论文中模型指标MRR，MR，HITS@1，HITS@3，HITS@10的含义 Code_demon Paper Read 知识图谱
知识图谱论文中模型指标MRR，MR，HITS@1，HITS@3，HITS@10的含义本文将介绍用于衡量知识图谱嵌入（KnowledgeGraphEmbedding，KGE）模型性能中最常用的几个指标：MRR，MR，HITS@1，HITS@3，HITS@10。文章目录知识图谱论文中模型指标MRR，MR，HITS@1，HITS@3，HITS@10的含义一、MRR二、MR三、HITS@n四、从论文上发现
知识图谱嵌入的衡量指标：MRR，MR，HITS@n SU_ZCS big data
衡量知识图谱嵌入（KnowledgeGraphEmbedding，KGE）模型性能中最常用的几个指标：MRR，MR，HITS@n。在进行KG嵌入时，首先把实体以及关系随机初始化为一定维度的向量，然后进行训练，目的使（头实体+关系）向量与尾实体向量在空间中的表示尽可能相近。训练完成后，需要衡量嵌入质量。在评估时，对于一个三元组，将尾实体替换成任意一种其他的实体（共n-1个，只改变尾实体，其他不变），
EmbedKGQA论文简要解读 xhsun1997 KGQA 知识图谱自然语言处理 python
KGQA与KGE关于KGQA以及知识图谱嵌入的简单介绍可以看之前的两篇博客：KGQA概览知识图谱嵌入简单介绍这篇论文就是结合知识图谱嵌入（KGE）来进行多跳知识问答EmbedKGQA论文源代码MetaQA数据实验数据集是MetaQA数据集，该数据集是基于电影知识图谱的电影问答。下载链接提供的KG我们需要提供的KG，因为我们要预先训练KG中每一个实体的embedding。KG如下：比如，用Compl
Sequence-to-Sequence Knowledge Graph Completion and QuestionAnswering(2022 ACL) Toady 元气满满 KBQA论文笔记知识图谱人工智能机器学习
论文相关论文：https://aclanthology.org/2022.acl-long.201.pdf源码：GitHub-apoorvumang/kgt5:Sequence-to-SequenceKnowledgeGraphCompletionandQuestionAnswering(KGT5)摘要KGE(knowledgegraphembedding)模型用低维的嵌入向量表示知识图谱中的每个
【NeurIPS&&知识图谱】联邦环境下，基于元学习的图谱知识外推（阿里&浙大&含源码） AINLPer 论文阅读分享自然语言处理国际会议知识图谱学习人工智能
来源:AINLPer微信公众号（每日论文干货分享！！）编辑:ShuYini校稿:ShuYini时间:2022-09-27引言知识图谱（KGs）目前被广泛应用，但不论是传统的KGs和新建的KGs都会存在不完整的问题。虽然知识图谱嵌入(KGE)可以解决该类问题，但是新兴的KG往往伴随着新的关系和实体，在已有KG上训练的KGE模型，是不能应用于在新建KG上去获取这些看不到的实体和关系的。为此本文引入了元
统一思想认识永夜-极光思想
1.统一思想认识的基础,才能有的放矢原因: 总有一种描述事物的方式最贴近本质,最容易让人理解. 如何让教育更轻松,在于找到最适合学生的方式. 难点在于,如何模拟对方的思维基础选择合适的方式. &
Joda Time使用笔记 bylijinnan java joda time
Joda Time的介绍可以参考这篇文章： http://www.ibm.com/developerworks/cn/java/j-jodatime.html 工作中也常常用到Joda Time，为了避免每次使用都查API，记录一下常用的用法： /** * DateTime变化（增减） */ @Tes
FileUtils API eksliang FileUtils FileUtils API
转载请出自出处：http://eksliang.iteye.com/blog/2217374 一、概述这是一个Java操作文件的常用库，是Apache对java的IO包的封装，这里面有两个非常核心的类FilenameUtils跟FileUtils，其中FilenameUtils是对文件名操作的封装;FileUtils是文件封装，开发中对文件的操作，几乎都可以在这个框架里面找到。非常的好用。
各种新兴技术不懂事的小屁孩技术
1:gradle Gradle 是以 Groovy 语言为基础，面向Java应用为主。基于DSL（领域特定语言）语法的自动化构建工具。现在构建系统常用到maven工具，现在有更容易上手的gradle，搭建java环境: http://www.ibm.com/developerworks/cn/opensource/os-cn-gradle/ 搭建android环境： http://m
tomcat6的https双向认证酷的飞上天空 tomcat6
1.生成服务器端证书 keytool -genkey -keyalg RSA -dname "cn=localhost,ou=sango,o=none,l=china,st=beijing,c=cn" -alias server -keypass password -keystore server.jks -storepass password -validity 36
托管虚拟桌面市场势不可挡蓝儿唯美
用户还需要冗余的数据中心，dinCloud的高级副总裁兼首席营销官Ali Din指出。该公司转售一个MSP可以让用户登录并管理和提供服务的用于DaaS的云自动化控制台，提供服务或者MSP也可以自己来控制。在某些情况下，MSP会在dinCloud的云服务上进行服务分层，如监控和补丁管理。 MSP的利润空间将根据其参与的程度而有所不同，Din说。 “我们有一些合作伙伴负责将我们推荐给客户作为个
spring学习——xml文件的配置 a-john spring
在Spring的学习中，对于其xml文件的配置是必不可少的。在Spring的多种装配Bean的方式中，采用XML配置也是最常见的。以下是一个简单的XML配置文件： <?xml version="1.0" encoding="UTF-8"?> <beans xmlns="http://www.springframework.or
HDU 4342 History repeat itself 模拟 aijuans 模拟
来源：http://acm.hdu.edu.cn/showproblem.php?pid=4342 题意：首先让求第几个非平方数，然后求从1到该数之间的每个sqrt(i)的下取整的和。思路：一个简单的模拟题目，但是由于数据范围大，需要用__int64。我们可以首先把平方数筛选出来，假如让求第n个非平方数的话，看n前面有多少个平方数，假设有x个，则第n个非平方数就是n+x。注意两种特殊情况，即
java中最常用jar包的用途 asia007 java
java中最常用jar包的用途 jar包用途axis.jarSOAP引擎包commons-discovery-0.2.jar用来发现、查找和实现可插入式接口，提供一些一般类实例化、单件的生命周期管理的常用方法.jaxrpc.jarAxis运行所需要的组件包saaj.jar创建到端点的点到点连接的方法、创建并处理SOAP消息和附件的方法，以及接收和处理SOAP错误的方法. w
ajax获取Struts框架中的json编码异常和Struts中的主控制器异常的解决办法百合不是茶 js json编码返回异常
一:ajax获取自定义Struts框架中的json编码出现以下问题: 1,强制flush输出 json编码打印在首页 2, 不强制flush js会解析json 打印出来的是错误的jsp页面却没有跳转到错误页面 3, ajax中的dataType的json 改为text 会
JUnit使用的设计模式 bijian1013 java 设计模式 JUnit
JUnit源代码涉及使用了大量设计模式 1、模板方法模式（Template Method）定义一个操作中的算法骨架，而将一些步骤延伸到子类中去，使得子类可以不改变一个算法的结构，即可重新定义该算法的某些特定步骤。这里需要复用的是算法的结构，也就是步骤，而步骤的实现可以在子类中完成。
Linux常用命令（摘录） sunjing crond chkconfig
chkconfig --list 查看linux所有服务 chkconfig --add servicename 添加linux服务 netstat -apn | grep 8080 查看端口占用 env 查看所有环境变量 echo $JAVA_HOME 查看JAVA_HOME环境变量安装编译器 yum install -y gcc
【Hadoop一】Hadoop伪集群环境搭建 bit1129 hadoop
结合网上多份文档，不断反复的修正hadoop启动和运行过程中出现的问题，终于把Hadoop2.5.2伪分布式安装起来，跑通了wordcount例子。Hadoop的安装复杂性的体现之一是，Hadoop的安装文档非常多，但是能一个文档走下来的少之又少，尤其是Hadoop不同版本的配置差异非常的大。Hadoop2.5.2于前两天发布，但是它的配置跟2.5.0，2.5.1没有分别。 &nb
Anychart图表系列五之事件监听白糖_ chart
创建图表事件监听非常简单：首先是通过addEventListener('监听类型',js监听方法)添加事件监听，然后在js监听方法中定义具体监听逻辑。以钻取操作为例，当用户点击图表某一个point的时候弹出point的name和value，代码如下： <script> //创建AnyChart var chart = new AnyChart(); //添加钻取操作&quo
Web前端相关段子 braveCS web前端
Web标准：结构、样式和行为分离使用语义化标签 0）标签的语义：使用有良好语义的标签，能够很好地实现自我解释，方便搜索引擎理解网页结构，抓取重要内容。去样式后也会根据浏览器的默认样式很好的组织网页内容，具有很好的可读性，从而实现对特殊终端的兼容。 1）div和span是没有语义的：只是分别用作块级元素和行内元素的区域分隔符。当页面内标签无法满足设计需求时，才会适当添加div
编程之美-24点游戏 bylijinnan 编程之美
import java.util.ArrayList; import java.util.Arrays; import java.util.HashSet; import java.util.List; import java.util.Random; import java.util.Set; public class PointGame { /**编程之美
主页面子页面传值总结 chengxuyuancsdn 总结
1、showModalDialog returnValue是javascript中html的window对象的属性,目的是返回窗口值,当用window.showModalDialog函数打开一个IE的模式窗口时,用于返回窗口的值主界面 var sonValue=window.showModalDialog("son.jsp"); 子界面 window.retu
[网络与经济]互联网+的含义 comsci 互联网+
互联网+后面是一个人的名字 = 网络控制系统互联网+你的名字 = 网络个人数据库每日提示:如果人觉得不舒服,千万不要外出到处走动,就呆在床上,玩玩手游,更不能够去开车,现在交通状况不
oracle 创建视图 with check option daizj 视图 view oralce
我们来看下面的例子： create or replace view testview as select empno,ename from emp where ename like ‘M%’ with check option; 这里我们创建了一个视图，并使用了with check option来限制了视图。然后我们来看一下视图包含的结果： select * from testv
ToastPlugin插件在cordova3.3下使用 dibov Cordova
自己开发的Todos应用，想实现“ 再按一次返回键退出程序 ”的功能，采用网上的ToastPlugins插件，发现代码或文章基本都是老版本，运行问题比较多。折腾了好久才弄好。下面吧基于cordova3.3下的ToastPlugins相关代码共享。 ToastPlugin.java package&nbs
C语言22个系统函数 dcj3sjt126com c function
C语言系统函数一、数学函数下列函数存放在math.h头文件中Double floor(double num) 求出不大于num的最大数。Double fmod(x, y) 求整数x/y的余数。Double frexp(num, exp); double num; int *exp; 将num分为数字部分（尾数）x和以2位的指数部分n，即num=x*2n，指数n存放在exp指向的变量中，返回x。D
开发一个类的流程 dcj3sjt126com 开发
本人近日根据自己的开发经验总结了一个类的开发流程。这个流程适用于单独开发的构件，并不适用于对一个项目中的系统对象开发。开发出的类可以存入私人类库，供以后复用。以下是开发流程： 1. 明确类的功能，抽象出类的大概结构 2. 初步设想类的接口 3. 类名设计（驼峰式命名） 4. 属性设置(权限设置) 判断某些变量是否有必要作为成员属
java 并发 shuizhaosi888 java 并发
能够写出高伸缩性的并发是一门艺术在JAVA SE5中新增了3个包 java.util.concurrent java.util.concurrent.atomic java.util.concurrent.locks 在java的内存模型中，类的实例字段、静态字段和构成数组的对象元素都会被多个线程所共享，局部变量与方法参数都是线程私有的，不会被共享。
Spring Security（11）——匿名认证 234390216 Spring Security ROLE_ANNOYMOUS 匿名
匿名认证目录 1.1 配置 1.2 AuthenticationTrustResolver 对于匿名访问的用户，Spring Security支持为其建立一个匿名的AnonymousAuthenticat
NODEJS项目实践0.2[ express,ajax通信...] 逐行分析JS源代码 Ajax nodejs express
一、前言通过上节学习，我们已经 ubuntu系统搭建了一个可以访问的nodejs系统，并做了nginx转发。本节原要做web端服务及 mongodb的存取，但写着写着，web端就
在Struts2 的Action中怎样获取表单提交上来的多个checkbox的值 lhbthanks java html struts checkbox
第一种方法：获取结果String类型在 Action 中获得的是一个 String 型数据，每一个被选中的 checkbox 的 value 被拼接在一起，每个值之间以逗号隔开(,)。所以在 Action 中定义一个跟 checkbox 的 name 同名的属性来接收这些被选中的 checkbox 的 value 即可。以下是实现的代码：前台 HTML 代码：
003.Kafka基本概念 nweiren hadoop kafka
Kafka基本概念：Topic、Partition、Message、Producer、Broker、Consumer。 Topic：消息源（Message）的分类。 Partition： Topic物理上的分组，一
Linux环境下安装JDK roadrunners jdk linux
1、准备工作创建JDK的安装目录： mkdir -p /usr/java/ 下载JDK，找到适合自己系统的JDK版本进行下载： http://www.oracle.com/technetwork/java/javase/downloads/index.html 把JDK安装包下载到/usr/java/目录，然后进行解压： tar -zxvf jre-7
Linux忘记root密码的解决思路 tomcat_oracle linux
1：使用同版本的linux启动系统，chroot到忘记密码的根分区passwd改密码　　2：grub启动菜单中加入init=/bin/bash进入系统，不过这时挂载的是只读分区。根据系统的分区情况进一步判断. 　　3: grub启动菜单中加入 single以单用户进入系统. 　　4:用以上方法mount到根分区把/etc/passwd中的root密码去除　　例如: 　　ro
跨浏览器 HTML5 postMessage 方法以及 message 事件模拟实现 xueyou jsonp jquery 框架 UI html5
postMessage 是 HTML5 新方法，它可以实现跨域窗口之间通讯。到目前为止，只有 IE8+, Firefox 3, Opera 9, Chrome 3和 Safari 4 支持，而本篇文章主要讲述 postMessage 方法与 message 事件跨浏览器实现。postMessage 方法 JSONP 技术不一样，前者是前端擅长跨域文档数据即时通讯，后者擅长针对跨域服务端数据通讯，p