This resource collects classic, recent, and must-read papers from the major AI conferences in natural language processing over the past few years. It covers nearly every area of NLP, including BERT models, Transformer models, transfer learning, text summarization, sentiment analysis, question answering, machine translation, text generation, quality evaluation, model modifications (multi-task learning, masking strategies, etc.), probing, multilingual models, domain-specific models, multimodal models, model compression, slot filling, analysis, word segmentation / parsing / NER, pronoun coreference resolution, word sense disambiguation, relation extraction, knowledge bases, text classification, and more.
Compiled from the web; original source: https://github.com/changwookjun/nlp-paper#probe
Download the document with clickable links:
Link: https://pan.baidu.com/s/1gySZ2Yn3IIpMREB17fDKDg
Extraction code: nrp7
Contents
BERT models
Transformer models
Transfer learning
Text summarization
Sentiment analysis
Question answering
Machine translation
Downstream tasks
Dialogue systems
Slot filling
Analysis
Word segmentation, parsing, NER
Pronoun coreference resolution
Word sense disambiguation
Sentiment analysis
Relation extraction
Knowledge bases
Text classification
WSC / WNLI / NLI
Commonsense reasoning
Extractive summarization
Information retrieval (IR)
Text generation
Quality evaluation
Model modifications (multi-task learning, masking strategies, etc.)
Probe
Multilingual
Domain-specific
Multimodal
Model compression
Paper list
BERT-related
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding - NAACL 2019
ERNIE 2.0: A Continual Pre-training Framework for Language Understanding - arXiv 2019
StructBERT: Incorporating Language Structures into Pre-training for Deep Language Understanding - arXiv 2019
RoBERTa: A Robustly Optimized BERT Pretraining Approach - arXiv 2019
ALBERT: A Lite BERT for Self-supervised Learning of Language Representations - arXiv 2019
Multi-Task Deep Neural Networks for Natural Language Understanding - arXiv 2019
What does BERT learn about the structure of language? (ACL2019)
Analyzing Multi-Head Self-Attention: Specialized Heads Do the Heavy Lifting, the Rest Can Be Pruned (ACL2019) [github]
Open Sesame: Getting Inside BERT's Linguistic Knowledge (ACL2019 WS)
Analyzing the Structure of Attention in a Transformer Language Model (ACL2019 WS)
What Does BERT Look At? An Analysis of BERT's Attention (ACL2019 WS)
Do Attention Heads in BERT Track Syntactic Dependencies?
Blackbox meets blackbox: Representational Similarity and Stability Analysis of Neural Language Models and Brains (ACL2019 WS)
Inducing Syntactic Trees from BERT Representations (ACL2019 WS)
A Multiscale Visualization of Attention in the Transformer Model (ACL2019 Demo)
Visualizing and Measuring the Geometry of BERT
How Contextual are Contextualized Word Representations? Comparing the Geometry of BERT, ELMo, and GPT-2 Embeddings (EMNLP2019)
Are Sixteen Heads Really Better than One? (NeurIPS2019)
On the Validity of Self-Attention as Explanation in Transformer Models
Visualizing and Understanding the Effectiveness of BERT (EMNLP2019)
Attention Interpretability Across NLP Tasks
Revealing the Dark Secrets of BERT (EMNLP2019)
Investigating BERT's Knowledge of Language: Five Analysis Methods with NPIs (EMNLP2019)
The Bottom-up Evolution of Representations in the Transformer: A Study with Machine Translation and Language Modeling Objectives (EMNLP2019)
A Primer in BERTology: What we know about how BERT works
Do NLP Models Know Numbers? Probing Numeracy in Embeddings (EMNLP2019)
How Does BERT Answer Questions? A Layer-Wise Analysis of Transformer Representations (CIKM2019)
Whatcha lookin' at? DeepLIFTing BERT's Attention in Question Answering
What does BERT Learn from Multiple-Choice Reading Comprehension Datasets?
Calibration of Pre-trained Transformers
exBERT: A Visual Analysis Tool to Explore Learned Representations in Transformers Models [github]
Transformer Series
Attention Is All You Need - arXiv 2017
Transformer-XL: Attentive Language Models Beyond a Fixed-Length Context - arXiv 2019
Universal Transformers - ICLR 2019
Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer - arXiv 2019
Reformer: The Efficient Transformer - ICLR 2020
Adaptive Attention Span in Transformers (ACL2019)
Transformer-XL: Attentive Language Models Beyond a Fixed-Length Context (ACL2019) [github]
Generating Long Sequences with Sparse Transformers
Adaptively Sparse Transformers (EMNLP2019)
Compressive Transformers for Long-Range Sequence Modelling
The Evolved Transformer (ICML2019)
Reformer: The Efficient Transformer (ICLR2020) [github]
GRET: Global Representation Enhanced Transformer (AAAI2020)
Transformer on a Diet [github]
Efficient Content-Based Sparse Attention with Routing Transformers
BP-Transformer: Modelling Long-Range Context via Binary Partitioning
Recipes for building an open-domain chatbot
Longformer: The Long-Document Transformer
Transfer Learning
Deep contextualized word representations - NAACL 2018
Universal Language Model Fine-tuning for Text Classification - ACL 2018
Improving Language Understanding by Generative Pre-Training - Alec Radford
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding - NAACL 2019
Cloze-driven Pretraining of Self-attention Networks - arXiv 2019
Unified Language Model Pre-training for Natural Language Understanding and Generation - arXiv 2019
MASS: Masked Sequence to Sequence Pre-training for Language Generation - ICML 2019
MPNet: Masked and Permuted Pre-training for Language Understanding [github]
Text Summarization
Positional Encoding to Control Output Sequence Length - Sho Takase(2019)
Fine-tune BERT for Extractive Summarization - Yang Liu(2019)
Language Models are Unsupervised Multitask Learners - Alec Radford(2019)
A Unified Model for Extractive and Abstractive Summarization using Inconsistency Loss - Wan-Ting Hsu(2018)
A Discourse-Aware Attention Model for Abstractive Summarization of Long Documents - Arman Cohan(2018)
Generating Wikipedia by Summarizing Long Sequences - Peter J. Liu(2018)
Get To The Point: Summarization with Pointer-Generator Networks - Abigail See(2017)
A Neural Attention Model for Sentence Summarization - Alexander M. Rush(2015)
Sentiment Analysis
Multi-Task Deep Neural Networks for Natural Language Understanding - Xiaodong Liu(2019)
Aspect-level Sentiment Analysis using AS-Capsules - Yequan Wang(2019)
On the Role of Text Preprocessing in Neural Network Architectures: An Evaluation Study on Text Categorization and Sentiment Analysis - Jose Camacho-Collados(2018)
Learned in Translation: Contextualized Word Vectors - Bryan McCann(2018)
Universal Language Model Fine-tuning for Text Classification - Jeremy Howard(2018)
Convolutional Neural Networks with Recurrent Neural Filters - Yi Yang(2018)
Information Aggregation via Dynamic Routing for Sequence Encoding - Jingjing Gong(2018)
Learning to Generate Reviews and Discovering Sentiment - Alec Radford(2017)
A Structured Self-attentive Sentence Embedding - Zhouhan Lin(2017)
Question Answering
Language Models are Unsupervised Multitask Learners - Alec Radford(2019)
Improving Language Understanding by Generative Pre-Training - Alec Radford(2018)
Bidirectional Attention Flow for Machine Comprehension - Minjoon Seo(2018)
Reinforced Mnemonic Reader for Machine Reading Comprehension - Minghao Hu(2017)
Neural Variational Inference for Text Processing - Yishu Miao(2015)
Machine Translation
The Evolved Transformer - David R. So(2019)
Survey papers
Evolution of transfer learning in natural language processing
Pre-trained Models for Natural Language Processing: A Survey
A Survey on Contextual Embeddings
Downstream tasks
QA MC Dialogue
A BERT Baseline for the Natural Questions
MultiQA: An Empirical Investigation of Generalization and Transfer in Reading Comprehension (ACL2019)
Unsupervised Domain Adaptation on Reading Comprehension
BERTQA -- Attention on Steroids
A Multi-Type Multi-Span Network for Reading Comprehension that Requires Discrete Reasoning (EMNLP2019)
SDNet: Contextualized Attention-based Deep Network for Conversational Question Answering
Multi-hop Question Answering via Reasoning Chains
Select, Answer and Explain: Interpretable Multi-hop Reading Comprehension over Multiple Documents
Multi-step Entity-centric Information Retrieval for Multi-Hop Question Answering (EMNLP2019 WS)
End-to-End Open-Domain Question Answering with BERTserini (NAACL2019)
Latent Retrieval for Weakly Supervised Open Domain Question Answering (ACL2019)
Multi-passage BERT: A Globally Normalized BERT Model for Open-domain Question Answering (EMNLP2019)
Learning to Retrieve Reasoning Paths over Wikipedia Graph for Question Answering (ICLR2020)
Learning to Ask Unanswerable Questions for Machine Reading Comprehension (ACL2019)
Unsupervised Question Answering by Cloze Translation (ACL2019)
Reinforcement Learning Based Graph-to-Sequence Model for Natural Question Generation
A Recurrent BERT-based Model for Question Generation (EMNLP2019 WS)
Learning to Answer by Learning to Ask: Getting the Best of GPT-2 and BERT Worlds
Enhancing Pre-Trained Language Representations with Rich Knowledge for Machine Reading Comprehension (ACL2019)
Incorporating Relation Knowledge into Commonsense Reading Comprehension with Multi-task Learning (CIKM2019)
SG-Net: Syntax-Guided Machine Reading Comprehension
MMM: Multi-stage Multi-task Learning for Multi-choice Reading Comprehension
Cosmos QA: Machine Reading Comprehension with Contextual Commonsense Reasoning (EMNLP2019)
ReClor: A Reading Comprehension Dataset Requiring Logical Reasoning (ICLR2020)
Robust Reading Comprehension with Linguistic Constraints via Posterior Regularization
BAS: An Answer Selection Method Using BERT Language Model
Beat the AI: Investigating Adversarial Human Annotations for Reading Comprehension
A Simple but Effective Method to Incorporate Multi-turn Context with BERT for Conversational Machine Comprehension (ACL2019 WS)
FlowDelta: Modeling Flow Information Gain in Reasoning for Conversational Machine Comprehension (ACL2019 WS)
BERT with History Answer Embedding for Conversational Question Answering (SIGIR2019)
GraphFlow: Exploiting Conversation Flow with Graph Neural Networks for Conversational Machine Comprehension (ICML2019 WS)
Beyond English-only Reading Comprehension: Experiments in Zero-Shot Multilingual Transfer for Bulgarian (RANLP2019)
XQA: A Cross-lingual Open-domain Question Answering Dataset (ACL2019)
Cross-Lingual Machine Reading Comprehension (EMNLP2019)
Zero-shot Reading Comprehension by Cross-lingual Transfer Learning with Multi-lingual Language Representation Model
Multilingual Question Answering from Formatted Text applied to Conversational Agents
BiPaR: A Bilingual Parallel Dataset for Multilingual and Cross-lingual Reading Comprehension on Novels (EMNLP2019)
MLQA: Evaluating Cross-lingual Extractive Question Answering
Investigating Prior Knowledge for Challenging Chinese Machine Reading Comprehension (TACL)
SberQuAD - Russian Reading Comprehension Dataset: Description and Analysis
Giving BERT a Calculator: Finding Operations and Arguments with Reading Comprehension (EMNLP2019)
BERT-DST: Scalable End-to-End Dialogue State Tracking with Bidirectional Encoder Representations from Transformer (Interspeech2019)
Dialog State Tracking: A Neural Reading Comprehension Approach
A Simple but Effective BERT Model for Dialog State Tracking on Resource-Limited Systems (ICASSP2020)
Fine-Tuning BERT for Schema-Guided Zero-Shot Dialogue State Tracking
Goal-Oriented Multi-Task BERT-Based Dialogue State Tracker
Domain Adaptive Training BERT for Response Selection
BERT Goes to Law School: Quantifying the Competitive Advantage of Access to Large Legal Corpora in Contract Understanding
Slot filling
BERT for Joint Intent Classification and Slot Filling
Multi-lingual Intent Detection and Slot Filling in a Joint BERT-based Model
A Comparison of Deep Learning Methods for Language Understanding (Interspeech2019)
Analysis
Fine-grained Information Status Classification Using Discourse Context-Aware Self-Attention
Neural Aspect and Opinion Term Extraction with Mined Rules as Weak Supervision (ACL2019)
BERT-based Lexical Substitution (ACL2019)
Assessing BERT’s Syntactic Abilities
Does BERT agree? Evaluating knowledge of structure dependence through agreement relations
Simple BERT Models for Relation Extraction and Semantic Role Labeling
LIMIT-BERT : Linguistic Informed Multi-Task BERT
A Simple BERT-Based Approach for Lexical Simplification
Multi-headed Architecture Based on BERT for Grammatical Errors Correction (ACL2019 WS)
Towards Minimal Supervision BERT-based Grammar Error Correction
BERT-Based Arabic Social Media Author Profiling
Sentence-Level BERT and Multi-Task Learning of Age and Gender in Social Media
Evaluating the Factual Consistency of Abstractive Text Summarization
NegBERT: A Transfer Learning Approach for Negation Detection and Scope Resolution
xSLUE: A Benchmark and Analysis Platform for Cross-Style Language Understanding and Evaluation
TabFact: A Large-scale Dataset for Table-based Fact Verification
Rapid Adaptation of BERT for Information Extraction on Domain-Specific Business Documents
LAMBERT: Layout-Aware language Modeling using BERT for information extraction
Keyphrase Extraction from Scholarly Articles as Sequence Labeling using Contextualized Embeddings (ECIR2020) [github]
Keyphrase Extraction with Span-based Feature Representations
What do you mean, BERT? Assessing BERT as a Distributional Semantics Model
Word segmentation parsing NER
BERT Meets Chinese Word Segmentation
Toward Fast and Accurate Neural Chinese Word Segmentation with Multi-Criteria Learning
Establishing Strong Baselines for the New Decade: Sequence Tagging, Syntactic and Semantic Parsing with BERT
Evaluating Contextualized Embeddings on 54 Languages in POS Tagging, Lemmatization and Dependency Parsing
NEZHA: Neural Contextualized Representation for Chinese Language Understanding
Deep Contextualized Word Embeddings in Transition-Based and Graph-Based Dependency Parsing -- A Tale of Two Parsers Revisited (EMNLP2019)
Is POS Tagging Necessary or Even Helpful for Neural Dependency Parsing?
Parsing as Pretraining (AAAI2020)
Cross-Lingual BERT Transformation for Zero-Shot Dependency Parsing
Recursive Non-Autoregressive Graph-to-Graph Transformer for Dependency Parsing with Iterative Refinement
Named Entity Recognition -- Is there a glass ceiling? (CoNLL2019)
A Unified MRC Framework for Named Entity Recognition
Training Compact Models for Low Resource Entity Tagging using Pre-trained Language Models
Robust Named Entity Recognition with Truecasing Pretraining (AAAI2020)
LTP: A New Active Learning Strategy for Bert-CRF Based Named Entity Recognition
MT-BioNER: Multi-task Learning for Biomedical Named Entity Recognition using Deep Bidirectional Transformers
Portuguese Named Entity Recognition using BERT-CRF
Towards Lingua Franca Named Entity Recognition with BERT
Pronoun coreference resolution
Resolving Gendered Ambiguous Pronouns with BERT (ACL2019 WS)
Anonymized BERT: An Augmentation Approach to the Gendered Pronoun Resolution Challenge (ACL2019 WS)
Gendered Pronoun Resolution using BERT and an extractive question answering formulation (ACL2019 WS)
MSnet: A BERT-based Network for Gendered Pronoun Resolution (ACL2019 WS)
Fill the GAP: Exploiting BERT for Pronoun Resolution (ACL2019 WS)
On GAP Coreference Resolution Shared Task: Insights from the 3rd Place Solution (ACL2019 WS)
Look Again at the Syntax: Relational Graph Convolutional Network for Gendered Ambiguous Pronoun Resolution (ACL2019 WS)
BERT Masked Language Modeling for Co-reference Resolution (ACL2019 WS)
Coreference Resolution with Entity Equalization (ACL2019)
BERT for Coreference Resolution: Baselines and Analysis (EMNLP2019) [github]
WikiCREM: A Large Unsupervised Corpus for Coreference Resolution (EMNLP2019)
Ellipsis and Coreference Resolution as Question Answering
Coreference Resolution as Query-based Span Prediction
Multi-task Learning Based Neural Bridging Reference Resolution
Word sense disambiguation
GlossBERT: BERT for Word Sense Disambiguation with Gloss Knowledge (EMNLP2019)
Improved Word Sense Disambiguation Using Pre-Trained Contextualized Word Representations (EMNLP2019)
Using BERT for Word Sense Disambiguation
Language Modelling Makes Sense: Propagating Representations through WordNet for Full-Coverage Word Sense Disambiguation (ACL2019)
Does BERT Make Any Sense? Interpretable Word Sense Disambiguation with Contextualized Embeddings (KONVENS2019)
Sentiment analysis
Utilizing BERT for Aspect-Based Sentiment Analysis via Constructing Auxiliary Sentence (NAACL2019)
BERT Post-Training for Review Reading Comprehension and Aspect-based Sentiment Analysis (NAACL2019)
Exploiting BERT for End-to-End Aspect-based Sentiment Analysis (EMNLP2019 WS)
Adapt or Get Left Behind: Domain Adaptation through BERT Language Model Finetuning for Aspect-Target Sentiment Classification
An Investigation of Transfer Learning-Based Sentiment Analysis in Japanese (ACL2019)
"Mask and Infill" : Applying Masked Language Model to Sentiment Transfer
Adversarial Training for Aspect-Based Sentiment Analysis with BERT
Utilizing BERT Intermediate Layers for Aspect Based Sentiment Analysis and Natural Language Inference
Relation extraction
Matching the Blanks: Distributional Similarity for Relation Learning (ACL2019)
BERT-Based Multi-Head Selection for Joint Entity-Relation Extraction (NLPCC2019)
Enriching Pre-trained Language Model with Entity Information for Relation Classification
Span-based Joint Entity and Relation Extraction with Transformer Pre-training
Fine-tune Bert for DocRED with Two-step Process
Entity, Relation, and Event Extraction with Contextualized Span Representations (EMNLP2019)
Knowledge base
KG-BERT: BERT for Knowledge Graph Completion
Language Models as Knowledge Bases? (EMNLP2019) [github]
BERT is Not a Knowledge Base (Yet): Factual Knowledge vs. Name-Based Reasoning in Unsupervised QA
Inducing Relational Knowledge from BERT (AAAI2020)
Latent Relation Language Models (AAAI2020)
Pretrained Encyclopedia: Weakly Supervised Knowledge-Pretrained Language Model (ICLR2020)
Zero-shot Entity Linking with Dense Entity Retrieval
Investigating Entity Knowledge in BERT with Simple Neural End-To-End Entity Linking (CoNLL2019)
Improving Entity Linking by Modeling Latent Entity Type Information (AAAI2020)
PEL-BERT: A Joint Model for Protocol Entity Linking
How Can We Know What Language Models Know?
REALM: Retrieval-Augmented Language Model Pre-Training
Text classification
How to Fine-Tune BERT for Text Classification?
X-BERT: eXtreme Multi-label Text Classification with BERT
DocBERT: BERT for Document Classification
Enriching BERT with Knowledge Graph Embeddings for Document Classification
Classification and Clustering of Arguments with Contextualized Word Embeddings (ACL2019)
BERT for Evidence Retrieval and Claim Verification
Stacked DeBERT: All Attention in Incomplete Data for Text Classification
Cost-Sensitive BERT for Generalisable Sentence Classification with Imbalanced Data
WSC WNLI NLI
Exploring Unsupervised Pretraining and Sentence Structure Modelling for Winograd Schema Challenge
A Surprisingly Robust Trick for the Winograd Schema Challenge
WinoGrande: An Adversarial Winograd Schema Challenge at Scale (AAAI2020)
Improving Natural Language Inference with a Pretrained Parser
Adversarial NLI: A New Benchmark for Natural Language Understanding
Adversarial Analysis of Natural Language Inference Systems (ICSC2020)
HypoNLI: Exploring the Artificial Patterns of Hypothesis-only Bias in Natural Language Inference (LREC2020)
Evaluating BERT for natural language inference: A case study on the CommitmentBank (EMNLP2019)
Commonsense
CommonsenseQA: A Question Answering Challenge Targeting Commonsense Knowledge (NAACL2019)
HellaSwag: Can a Machine Really Finish Your Sentence? (ACL2019) [website]
Story Ending Prediction by Transferable BERT (IJCAI2019)
Explain Yourself! Leveraging Language Models for Commonsense Reasoning (ACL2019)
Align, Mask and Select: A Simple Method for Incorporating Commonsense Knowledge into Language Representation Models
Informing Unsupervised Pretraining with External Linguistic Knowledge
Commonsense Knowledge + BERT for Level 2 Reading Comprehension Ability Test
BIG MOOD: Relating Transformers to Explicit Commonsense Knowledge
Commonsense Knowledge Mining from Pretrained Models (EMNLP2019)
KagNet: Knowledge-Aware Graph Networks for Commonsense Reasoning (EMNLP2019)
Cracking the Contextual Commonsense Code: Understanding Commonsense Reasoning Aptitude of Deep Contextual Representations (EMNLP2019 WS)
Do Massively Pretrained Language Models Make Better Storytellers? (CoNLL2019)
PIQA: Reasoning about Physical Commonsense in Natural Language (AAAI2020)
Evaluating Commonsense in Pre-trained Language Models (AAAI2020)
Why Do Masked Neural Language Models Still Need Common Sense Knowledge?
Do Neural Language Representations Learn Physical Commonsense? (CogSci2019)
Extractive summarization
HIBERT: Document Level Pre-training of Hierarchical Bidirectional Transformers for Document Summarization (ACL2019)
Deleter: Leveraging BERT to Perform Unsupervised Successive Text Compression
Discourse-Aware Neural Extractive Model for Text Summarization
PEGASUS: Pre-training with Extracted Gap-sentences for Abstractive Summarization [github]
IR
Passage Re-ranking with BERT
Investigating the Successes and Failures of BERT for Passage Re-Ranking
Understanding the Behaviors of BERT in Ranking
Document Expansion by Query Prediction
CEDR: Contextualized Embeddings for Document Ranking (SIGIR2019)
Deeper Text Understanding for IR with Contextual Neural Language Modeling (SIGIR2019)
FAQ Retrieval using Query-Question Similarity and BERT-Based Query-Answer Relevance (SIGIR2019)
Multi-Stage Document Ranking with BERT
REALM: Retrieval-Augmented Language Model Pre-Training
Generation
BERT has a Mouth, and It Must Speak: BERT as a Markov Random Field Language Model (NAACL2019 WS)
Pretraining-Based Natural Language Generation for Text Summarization
Text Summarization with Pretrained Encoders (EMNLP2019) [github (original)] [github (huggingface)]
Multi-stage Pretraining for Abstractive Summarization
PEGASUS: Pre-training with Extracted Gap-sentences for Abstractive Summarization
MASS: Masked Sequence to Sequence Pre-training for Language Generation (ICML2019) [github], [github]
Unified Language Model Pre-training for Natural Language Understanding and Generation [github] (NeurIPS2019)
UniLMv2: Pseudo-Masked Language Models for Unified Language Model Pre-Training [github]
ProphetNet: Predicting Future N-gram for Sequence-to-Sequence Pre-training
Towards Making the Most of BERT in Neural Machine Translation
Improving Neural Machine Translation with Pre-trained Representation
On the use of BERT for Neural Machine Translation (EMNLP2019 WS)
Incorporating BERT into Neural Machine Translation (ICLR2020)
Recycling a Pre-trained BERT Encoder for Neural Machine Translation
Leveraging Pre-trained Checkpoints for Sequence Generation Tasks
Mask-Predict: Parallel Decoding of Conditional Masked Language Models (EMNLP2019)
BART: Denoising Sequence-to-Sequence Pre-training for Natural Language Generation, Translation, and Comprehension
ERNIE-GEN: An Enhanced Multi-Flow Pre-training and Fine-tuning Framework for Natural Language Generation
Cross-Lingual Natural Language Generation via Pre-Training (AAAI2020) [github]
Multilingual Denoising Pre-training for Neural Machine Translation
PLATO: Pre-trained Dialogue Generation Model with Discrete Latent Variable
Unsupervised Pre-training for Natural Language Generation: A Literature Review
Quality evaluator
BERTScore: Evaluating Text Generation with BERT (ICLR2020)
Machine Translation Evaluation with BERT Regressor
SumQE: a BERT-based Summary Quality Estimation Model (EMNLP2019)
MoverScore: Text Generation Evaluating with Contextualized Embeddings and Earth Mover Distance (EMNLP2019) [github]
BERT as a Teacher: Contextual Embeddings for Sequence-Level Reward
Modification (multi-task, masking strategy, etc.)
Multi-Task Deep Neural Networks for Natural Language Understanding (ACL2019)
The Microsoft Toolkit of Multi-Task Deep Neural Networks for Natural Language Understanding
BERT and PALs: Projected Attention Layers for Efficient Adaptation in Multi-Task Learning (ICML2019)
Unifying Question Answering and Text Classification via Span Extraction
ERNIE: Enhanced Language Representation with Informative Entities (ACL2019)
ERNIE: Enhanced Representation through Knowledge Integration
ERNIE 2.0: A Continual Pre-training Framework for Language Understanding (AAAI2020)
Pre-Training with Whole Word Masking for Chinese BERT
SpanBERT: Improving Pre-training by Representing and Predicting Spans [github]
Blank Language Models
Efficient Training of BERT by Progressively Stacking (ICML2019) [github]
RoBERTa: A Robustly Optimized BERT Pretraining Approach [github]
ALBERT: A Lite BERT for Self-supervised Learning of Language Representations (ICLR2020)
ELECTRA: Pre-training Text Encoders as Discriminators Rather Than Generators (ICLR2020) [github] [blog]
FreeLB: Enhanced Adversarial Training for Language Understanding (ICLR2020)
KERMIT: Generative Insertion-Based Modeling for Sequences
DisSent: Sentence Representation Learning from Explicit Discourse Relations (ACL2019)
StructBERT: Incorporating Language Structures into Pre-training for Deep Language Understanding (ICLR2020)
Syntax-Infused Transformer and BERT models for Machine Translation and Natural Language Understanding
SenseBERT: Driving Some Sense into BERT
Semantics-aware BERT for Language Understanding (AAAI2020)
K-BERT: Enabling Language Representation with Knowledge Graph
Knowledge Enhanced Contextual Word Representations (EMNLP2019)
KEPLER: A Unified Model for Knowledge Embedding and Pre-trained Language Representation
Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks (EMNLP2019)
SBERT-WK: A Sentence Embedding Method By Dissecting BERT-based Word Models
Universal Text Representation from BERT: An Empirical Study
Symmetric Regularization based BERT for Pair-wise Semantic Reasoning
Transfer Fine-Tuning: A BERT Case Study (EMNLP2019)
Improving Pre-Trained Multilingual Models with Vocabulary Expansion (CoNLL2019)
SesameBERT: Attention for Anywhere
Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer [github]
SMART: Robust and Efficient Fine-Tuning for Pre-trained Natural Language Models through Principled Regularized Optimization
Probe
A Structural Probe for Finding Syntax in Word Representations (NAACL2019)
Linguistic Knowledge and Transferability of Contextual Representations (NAACL2019) [github]
Probing What Different NLP Tasks Teach Machines about Function Word Comprehension (*SEM2019)
BERT Rediscovers the Classical NLP Pipeline (ACL2019)
Probing Neural Network Comprehension of Natural Language Arguments (ACL2019)
Cracking the Contextual Commonsense Code: Understanding Commonsense Reasoning Aptitude of Deep Contextual Representations (EMNLP2019 WS)
What do you mean, BERT? Assessing BERT as a Distributional Semantics Model
Quantity doesn't buy quality syntax with neural language models (EMNLP2019)
Are Pre-trained Language Models Aware of Phrases? Simple but Strong Baselines for Grammar Induction (ICLR2020)
oLMpics -- On what Language Model Pre-training Captures
How Much Knowledge Can You Pack Into the Parameters of a Language Model?
What Does My QA Model Know? Devising Controlled Probes using Expert Knowledge
Multi-lingual
Multilingual Constituency Parsing with Self-Attention and Pre-Training (ACL2019)
Cross-lingual Language Model Pretraining (NeurIPS2019) [github]
75 Languages, 1 Model: Parsing Universal Dependencies Universally (EMNLP2019) [github]
Zero-shot Dependency Parsing with Pre-trained Multilingual Sentence Representations (EMNLP2019 WS)
Beto, Bentz, Becas: The Surprising Cross-Lingual Effectiveness of BERT (EMNLP2019)
How multilingual is Multilingual BERT? (ACL2019)
How Language-Neutral is Multilingual BERT?
Is Multilingual BERT Fluent in Language Generation?
Unicoder: A Universal Language Encoder by Pre-training with Multiple Cross-lingual Tasks (EMNLP2019)
BERT is Not an Interlingua and the Bias of Tokenization (EMNLP2019 WS)
Cross-Lingual Ability of Multilingual BERT: An Empirical Study (ICLR2020)
Multilingual Alignment of Contextual Word Representations (ICLR2020)
On the Cross-lingual Transferability of Monolingual Representations
Unsupervised Cross-lingual Representation Learning at Scale
Emerging Cross-lingual Structure in Pretrained Language Models
Can Monolingual Pretrained Models Help Cross-Lingual Classification?
Fully Unsupervised Crosslingual Semantic Textual Similarity Metric Based on BERT for Identifying Parallel Data (CoNLL2019)
What the [MASK]? Making Sense of Language-Specific BERT Models
XTREME: A Massively Multilingual Multi-task Benchmark for Evaluating Cross-lingual Generalization
Non-English models
CamemBERT: a Tasty French Language Model
FlauBERT: Unsupervised Language Model Pre-training for French
Multilingual is not enough: BERT for Finnish
BERTje: A Dutch BERT Model
RobBERT: a Dutch RoBERTa-based Language Model
Adaptation of Deep Bidirectional Multilingual Transformers for Russian Language
AraBERT: Transformer-based Model for Arabic Language Understanding
PhoBERT: Pre-trained language models for Vietnamese
CLUECorpus2020: A Large-scale Chinese Corpus for Pre-training Language Model
Domain specific
BioBERT: a pre-trained biomedical language representation model for biomedical text mining
Transfer Learning in Biomedical Natural Language Processing: An Evaluation of BERT and ELMo on Ten Benchmarking Datasets (ACL2019 WS)
BERT-based Ranking for Biomedical Entity Normalization
PubMedQA: A Dataset for Biomedical Research Question Answering (EMNLP2019)
Pre-trained Language Model for Biomedical Question Answering
How to Pre-Train Your Model? Comparison of Different Pre-Training Models for Biomedical Question Answering
ClinicalBERT: Modeling Clinical Notes and Predicting Hospital Readmission
Publicly Available Clinical BERT Embeddings (NAACL2019 WS)
Progress Notes Classification and Keyword Extraction using Attention-based Deep Learning Models with BERT
SciBERT: Pretrained Contextualized Embeddings for Scientific Text [github]
PatentBERT: Patent Classification with Fine-Tuning a pre-trained BERT Model
Multi-modal
VideoBERT: A Joint Model for Video and Language Representation Learning (ICCV2019)
ViLBERT: Pretraining Task-Agnostic Visiolinguistic Representations for Vision-and-Language Tasks (NeurIPS2019)
VisualBERT: A Simple and Performant Baseline for Vision and Language
Selfie: Self-supervised Pretraining for Image Embedding
ImageBERT: Cross-modal Pre-training with Large-scale Weak-supervised Image-Text Data
Contrastive Bidirectional Transformer for Temporal Representation Learning
M-BERT: Injecting Multimodal Information in the BERT Structure
LXMERT: Learning Cross-Modality Encoder Representations from Transformers (EMNLP2019)
Fusion of Detected Objects in Text for Visual Question Answering (EMNLP2019)
BERT representations for Video Question Answering (WACV2020)
Unified Vision-Language Pre-Training for Image Captioning and VQA [github]
Large-scale Pretraining for Visual Dialog: A Simple State-of-the-Art Baseline
VL-BERT: Pre-training of Generic Visual-Linguistic Representations (ICLR2020)
Unicoder-VL: A Universal Encoder for Vision and Language by Cross-modal Pre-training
UNITER: Learning UNiversal Image-TExt Representations
Supervised Multimodal Bitransformers for Classifying Images and Text
Weak Supervision helps Emergence of Word-Object Alignment and improves Vision-Language Tasks
BERT Can See Out of the Box: On the Cross-modal Transferability of Text Representations
BERT for Large-scale Video Segment Classification with Test-time Augmentation (ICCV2019WS)
SpeechBERT: Cross-Modal Pre-trained Language Model for End-to-end Spoken Question Answering
vq-wav2vec: Self-Supervised Learning of Discrete Speech Representations
Effectiveness of self-supervised pre-training for speech recognition
Understanding Semantics from Speech Through Pre-training
Towards Transfer Learning for End-to-End Speech Synthesis from Deep Pre-Trained Language Models
Model compression
Distilling Task-Specific Knowledge from BERT into Simple Neural Networks
Patient Knowledge Distillation for BERT Model Compression (EMNLP2019)
Small and Practical BERT Models for Sequence Labeling (EMNLP2019)
Pruning a BERT-based Question Answering Model
TinyBERT: Distilling BERT for Natural Language Understanding [github]
DistilBERT, a distilled version of BERT: smaller, faster, cheaper and lighter (NeurIPS2019 WS) [github]
Knowledge Distillation from Internal Representations (AAAI2020)
PoWER-BERT: Accelerating BERT inference for Classification Tasks
WaLDORf: Wasteless Language-model Distillation On Reading-comprehension
Extreme Language Model Compression with Optimal Subwords and Shared Projections
BERT-of-Theseus: Compressing BERT by Progressive Module Replacing
Compressing BERT: Studying the Effects of Weight Pruning on Transfer Learning
MiniLM: Deep Self-Attention Distillation for Task-Agnostic Compression of Pre-Trained Transformers
Compressing Large-Scale Transformer-Based Models: A Case Study on BERT
Train Large, Then Compress: Rethinking Model Size for Efficient Training and Inference of Transformers
MobileBERT: Task-Agnostic Compression of BERT by Progressive Knowledge Transfer
Q-BERT: Hessian Based Ultra Low Precision Quantization of BERT
Q8BERT: Quantized 8Bit BERT (NeurIPS2019 WS)
Misc
jiant: A Software Toolkit for Research on General-Purpose Text Understanding Models [github]
Cloze-driven Pretraining of Self-attention Networks
Learning and Evaluating General Linguistic Intelligence
To Tune or Not to Tune? Adapting Pretrained Representations to Diverse Tasks (ACL2019 WS)
Learning to Speak and Act in a Fantasy Text Adventure Game (EMNLP2019)
Conditional BERT Contextual Augmentation
Data Augmentation using Pre-trained Transformer Models
Large Batch Optimization for Deep Learning: Training BERT in 76 minutes (ICLR2020)
Mixout: Effective Regularization to Finetune Large-scale Pretrained Language Models (ICLR2020)
A Mutual Information Maximization Perspective of Language Representation Learning (ICLR2020)
Is BERT Really Robust? Natural Language Attack on Text Classification and Entailment (AAAI2020)
Thieves on Sesame Street! Model Extraction of BERT-based APIs (ICLR2020)
Graph-Bert: Only Attention is Needed for Learning Graph Representations
CodeBERT: A Pre-Trained Model for Programming and Natural Languages
Fine-Tuning Pretrained Language Models: Weight Initializations, Data Orders, and Early Stopping
Extending Machine Language Models toward Human-Level Language Understanding
Glyce: Glyph-vectors for Chinese Character Representations
Back to the Future -- Sequential Alignment of Text Representations
Improving Cuneiform Language Identification with BERT (NAACL2019 WS)
BERT has a Moral Compass: Improvements of ethical and moral values of machines
SMILES-BERT: Large Scale Unsupervised Pre-Training for Molecular Property Prediction (ACM-BCB2019)
On the comparability of Pre-trained Language Models
Transformers: State-of-the-art Natural Language Processing
Jukebox: A Generative Model for Music
WT5?! Training Text-to-Text Models to Explain their Predictions