frankliu624

知识蒸馏paper分类整理(2014-2020)

Awesome Knowledge-Distillation

Awesome Knowledge-Distillation
- Different forms of knowledge
  - Knowledge from logits
  - Knowledge from intermediate layers
  - Graph-based
  - Mutual Information
  - Self-KD
  - Structured Knowledge
  - Privileged Information
- KD + GAN
- KD + Meta-learning
- Data-free KD
- KD + AutoML
- KD + RL
- Multi-teacher KD
  - Knowledge Amalgamation（KA) - zju-VIPA
- Cross-modal KD & DA
- Application of KD
  - for NLP
- Model Pruning or Quantization
- Beyond

Different forms of knowledge

Knowledge from logits

Distilling the knowledge in a neural network. Hinton et al. arXiv:1503.02531
Learning from Noisy Labels with Distillation. Li, Yuncheng et al. ICCV 2017
Training Deep Neural Networks in Generations:A More Tolerant Teacher Educates Better Students. arXiv:1805.05551
Knowledge distillation by on-the-fly native ensemble. Lan, Xu et al. NIPS 2018
Learning Metrics from Teachers: Compact Networks for Image Embedding. Yu, Lu et al. CVPR 2019
Relational Knowledge Distillation. Park, Wonpyo et al, CVPR 2019
Like What You Like: Knowledge Distill via Neuron Selectivity Transfer. Huang, Zehao and Wang, Naiyan. 2017
On Knowledge Distillation from Complex Networks for Response Prediction. Arora, Siddhartha et al. NAACL 2019
On the Efficacy of Knowledge Distillation. Cho, Jang Hyun and Hariharan, Bharath. arXiv:1910.01348. ICCV 2019
[noval]Revisit Knowledge Distillation: a Teacher-free Framework. Yuan, Li et al. arXiv:1909.11723
Improved Knowledge Distillation via Teacher Assistant: Bridging the Gap Between Student and Teacher. Mirzadeh et al. arXiv:1902.03393
Ensemble Distribution Distillation. ICLR 2020
Noisy Collaboration in Knowledge Distillation. ICLR 2020
On Compressing U-net Using Knowledge Distillation. arXiv:1812.00249
Distillation-Based Training for Multi-Exit Architectures. Phuong, Mary and Lampert, Christoph H. ICCV 2019
Self-training with Noisy Student improves ImageNet classification. Xie, Qizhe et al.(Google) CVPR 2020
Variational Student: Learning Compact and Sparser Networks in Knowledge Distillation Framework. arXiv:1910.12061
Preparing Lessons: Improve Knowledge Distillation with Better Supervision. arXiv:1911.07471
Adaptive Regularization of Labels. arXiv:1908.05474
Positive-Unlabeled Compression on the Cloud. Xu, Yixing(HUAWEI) et al. NIPS 2019
Snapshot Distillation: Teacher-Student Optimization in One Generation. Yang, Chenglin et al. CVPR 2019
QUEST: Quantized embedding space for transferring knowledge. Jain, Himalaya et al. CVPR 2020(pre)
Conditional teacher-student learning. Z. Meng et al. ICASSP 2019
Subclass Distillation. Müller, Rafael et al. arXiv:2002.03936
MarginDistillation: distillation for margin-based softmax. Svitov, David & Alyamkin, Sergey. arXiv:2003.02586
An Embarrassingly Simple Approach for Knowledge Distillation. Gao, Mengya et al. MLR 2018
Sequence-Level Knowledge Distillation. Kim, Yoon & Rush, Alexander M. arXiv:1606.07947
Boosting Self-Supervised Learning via Knowledge Transfer. Noroozi, Mehdi et al. CVPR 2018
Meta Pseudo Labels. Pham, Hieu et al. ICML 2020

Knowledge from intermediate layers

Fitnets: Hints for thin deep nets. Romero, Adriana et al. arXiv:1412.6550
Paying more attention to attention: Improving the performance of convolutional neural networks via attention transfer. Zagoruyko et al. ICLR 2017
Knowledge Projection for Effective Design of Thinner and Faster Deep Neural Networks. Zhang, Zhi et al. arXiv:1710.09505
A Gift from Knowledge Distillation: Fast Optimization, Network Minimization and Transfer Learning. Yim, Junho et al. CVPR 2017
Paraphrasing complex network: Network compression via factor transfer. Kim, Jangho et al. NIPS 2018
Knowledge transfer with jacobian matching. ICML 2018
Self-supervised knowledge distillation using singular value decomposition. Lee, Seung Hyun et al. ECCV 2018
Variational Information Distillation for Knowledge Transfer. Ahn, Sungsoo et al. CVPR 2019
9
Knowledge Distillation via Instance Relationship Graph. Liu, Yufan et al. CVPR 2019
Knowledge Distillation via Route Constrained Optimization. Jin, Xiao et al. ICCV 2019
Similarity-Preserving Knowledge Distillation. Tung, Frederick, and Mori Greg. ICCV 2019
MEAL: Multi-Model Ensemble via Adversarial Learning. Shen,Zhiqiang, He,Zhankui, and Xue Xiangyang. AAAI 2019
A Comprehensive Overhaul of Feature Distillation. Heo, Byeongho et al. ICCV 2019
Feature-map-level Online Adversarial Knowledge Distillation. ICLR 2020
Distilling Object Detectors with Fine-grained Feature Imitation. ICLR 2020
Knowledge Squeezed Adversarial Network Compression. Changyong, Shu et al. AAAI 2020
Stagewise Knowledge Distillation. Kulkarni, Akshay et al. arXiv: 1911.06786
Knowledge Distillation from Internal Representations. AAAI 2020
Knowledge Flow:Improve Upon Your Teachers. ICLR 2019
LIT: Learned Intermediate Representation Training for Model Compression. ICML 2019
Learning Deep Representations with Probabilistic Knowledge Transfer. Passalis et al. ECCV 2018
Improving the Adversarial Robustness of Transfer Learning via Noisy Feature Distillation. Chin, Ting-wu et al. arXiv:2002.02998
Knapsack Pruning with Inner Distillation. Aflalo, Yonathan et al. arXiv:2002.08258
Residual Knowledge Distillation. Gao, Mengya et al. arXiv:2002.09168
Knowledge distillation via adaptive instance normalization. Yang, Jing et al. arXiv:2003.04289
Bert-of-Theseus: Compressing bert by progressive module replacing. Xu, Canwen et al. arXiv:2002.02925 [code]

Graph-based

Graph-based Knowledge Distillation by Multi-head Attention Network. Lee, Seunghyun and Song, Byung. Cheol arXiv:1907.02226
Graph Representation Learning via Multi-task Knowledge Distillation. arXiv:1911.05700
Deep geometric knowledge distillation with graphs. arXiv:1911.03080
Better and faster: Knowledge transfer from multiple self-supervised learning tasks via graph distillation for video classification. IJCAI 2018
Distillating Knowledge from Graph Convolutional Networks. Yang, Yiding et al. arXiv:2003.10477

Mutual Information

Correlation Congruence for Knowledge Distillation. Peng, Baoyun et al. ICCV 2019
Similarity-Preserving Knowledge Distillation. Tung, Frederick, and Mori Greg. ICCV 2019
Variational Information Distillation for Knowledge Transfer. Ahn, Sungsoo et al. CVPR 2019
Contrastive Representation Distillation. Tian, Yonglong et al. ICLR 2020

Self-KD

Moonshine:Distilling with Cheap Convolutions. Crowley, Elliot J. et al. NIPS 2018
Be Your Own Teacher: Improve the Performance of Convolutional Neural Networks via Self Distillation. Zhang, Linfeng et al. ICCV 2019
Learning Lightweight Lane Detection CNNs by Self Attention Distillation. Hou, Yuenan et al. ICCV 2019
BAM! Born-Again Multi-Task Networks for Natural Language Understanding. Clark, Kevin et al. ACL 2019,short
Self-Knowledge Distillation in Natural Language Processing. Hahn, Sangchul and Choi, Heeyoul. arXiv:1908.01851
Rethinking Data Augmentation: Self-Supervision and Self-Distillation. Lee, Hankook et al. ICLR 2020
Regularizing Predictions via Class wise Self knowledge Distillation. ICLR 2020
MSD: Multi-Self-Distillation Learning via Multi-classifiers within Deep Neural Networks. arXiv:1911.09418
Self-Distillation Amplifies Regularization in Hilbert Space. Mobahi, Hossein et al. arXiv:2002.05715
MINILM: Deep Self-Attention Distillation for Task-Agnostic Compression of Pre-Trained Transformers. Wang, Wenhui et al. arXiv:2002.10957

Structured Knowledge

Paraphrasing Complex Network:Network Compression via Factor Transfer. Kim, Jangho et al. NIPS 2018
Relational Knowledge Distillation. Park, Wonpyo et al. CVPR 2019
Knowledge Distillation via Instance Relationship Graph. Liu, Yufan et al. CVPR 2019
Contrastive Representation Distillation. Tian, Yonglong et al. arXiv: 1910.10699
Teaching To Teach By Structured Dark Knowledge. ICLR 2020

Privileged Information

Learning using privileged information: similarity control and knowledge transfer. Vapnik, Vladimir and Rauf, Izmailov. MLR 2015
Unifying distillation and privileged information. Lopez-Paz, David et al. ICLR 2016
Model compression via distillation and quantization. Polino, Antonio et al. ICLR 2018
KDGAN:Knowledge Distillation with Generative Adversarial Networks. Wang, Xiaojie. NIPS 2018
[noval]Efficient Video Classification Using Fewer Frames. Bhardwaj, Shweta et al. CVPR 2019
Retaining privileged information for multi-task learning. Tang, Fengyi et al. KDD 2019
A Generalized Meta-loss function for regression and classification using privileged information. Asif, Amina et al. arXiv:1811.06885

KD + GAN

Training Shallow and Thin Networks for Acceleration via Knowledge Distillation with Conditional Adversarial Networks. Xu, Zheng et al. arXiv:1709.00513
KTAN: Knowledge Transfer Adversarial Network. Liu, Peiye et al. arXiv:1810.08126
KDGAN:Knowledge Distillation with Generative Adversarial Networks. Wang, Xiaojie. NIPS 2018
Adversarial Learning of Portable Student Networks. Wang, Yunhe et al. AAAI 2018
Adversarial Network Compression. Belagiannis, Vasileios et al. ECCV 2018
Cross-Modality Distillation: A case for Conditional Generative Adversarial Networks. ICASSP 2018
Adversarial Distillation for Efficient Recommendation with External Knowledge. TOIS 2018
Training student networks for acceleration with conditional adversarial networks. Xu, Zheng et al. BMVC 2018
[noval]DAFL:Data-Free Learning of Student Networks. Chen, Hanting et al. ICCV 2019
MEAL: Multi-Model Ensemble via Adversarial Learning. Shen,Zhiqiang, He,Zhankui, and Xue Xiangyang. AAAI 2019
Knowledge Distillation with Adversarial Samples Supporting Decision Boundary. Heo, Byeongho et al. AAAI 2019
Exploiting the Ground-Truth: An Adversarial Imitation Based Knowledge Distillation Approach for Event Detection. Liu, Jian et al. AAAI 2019
Adversarially Robust Distillation. Goldblum, Micah et al. AAAI 2020
GAN-Knowledge Distillation for one-stage Object Detection. Hong, Wei et al. arXiv:1906.08467
Lifelong GAN: Continual Learning for Conditional Image Generation. Kundu et al. arXiv:1908.03884
Compressing GANs using Knowledge Distillation. Aguinaldo, Angeline et al. arXiv:1902.00159
Feature-map-level Online Adversarial Knowledge Distillation. ICLR 2020
MineGAN: effective knowledge transfer from GANs to target domains with few images. Wang, Yaxing et al. arXiv:1912.05270
Distilling portable Generative Adversarial Networks for Image Translation. Chen, Hanting et al. AAAI 2020
GAN Compression: Efficient Architectures for Interactive Conditional GANs. Junyan Zhu et al. CVPR 2020 [code]

KD + Meta-learning

Few Sample Knowledge Distillation for Efficient Network Compression. Li, Tianhong et al. ICLR 2020
Learning What and Where to Transfer. Jang, Yunhun et al, ICML 2019
Transferring Knowledge across Learning Processes. Moreno, Pablo G et al. ICLR 2019
Semantic-Aware Knowledge Preservation for Zero-Shot Sketch-Based Image Retrieval. Liu, Qing et al. ICCV 2019
Diversity with Cooperation: Ensemble Methods for Few-Shot Classification. Dvornik, Nikita et al. ICCV 2019
Knowledge Representing: Efficient, Sparse Representation of Prior Knowledge for Knowledge Distillation. arXiv:1911.05329v1
Progressive Knowledge Distillation For Generative Modeling. ICLR 2020
Few Shot Network Compression via Cross Distillation. AAAI 2020

Data-free KD

Data-Free Knowledge Distillation for Deep Neural Networks. NIPS 2017
Zero-Shot Knowledge Distillation in Deep Networks. ICML 2019
DAFL:Data-Free Learning of Student Networks. ICCV 2019
Zero-shot Knowledge Transfer via Adversarial Belief Matching. Micaelli, Paul and Storkey, Amos. NIPS 2019
Dream Distillation: A Data-Independent Model Compression Framework. Kartikeya et al. ICML 2019
Dreaming to Distill: Data-free Knowledge Transfer via DeepInversion. Yin, Hongxu et al. CVPR 2020
Data-Free Adversarial Distillation. Fang, Gongfan et al. CVPR 2020
The Knowledge Within: Methods for Data-Free Model Compression. Haroush, Matan et al. arXiv:1912.01274
Knowledge Extraction with No Observable Data. Yoo, Jaemin et al. NIPS 2019 [code]
Data-Free Knowledge Amalgamation via Group-Stack Dual-GAN. CVPR 2020

other data-free model compression:

Data-free Parameter Pruning for Deep Neural Networks. Srinivas, Suraj et al. arXiv:1507.06149
Data-Free Quantization Through Weight Equalization and Bias Correction. Nagel, Markus et al. ICCV 2019
ZeroQ: A Novel Zero Shot Quantization Framework. Cai, Yaohui et al. arxiv:2001.00281

KD + AutoML

Improving Neural Architecture Search Image Classifiers via Ensemble Learning. Macko, Vladimir et al. 2019
Blockwisely Supervised Neural Architecture Search with Knowledge Distillation. Li, Changlin et al. arXiv:1911.13053v1
Towards Oracle Knowledge Distillation with Neural Architecture Search. Kang, Minsoo et al. AAAI 2020
Search for Better Students to Learn Distilled Knowledge. Gu, Jindong & Tresp, Volker arXiv:2001.11612
Circumventing Outliers of AutoAugment with Knowledge Distillation. Wei, Longhui et al. arXiv:2003.11342

KD + RL

N2N Learning: Network to Network Compression via Policy Gradient Reinforcement Learning. Ashok, Anubhav et al. ICLR 2018
Knowledge Flow:Improve Upon Your Teachers. Liu, Iou-jen et al. ICLR 2019
Transferring Knowledge across Learning Processes. Moreno, Pablo G et al. ICLR 2019
Exploration by random network distillation. Burda, Yuri et al. ICLR 2019
Periodic Intra-Ensemble Knowledge Distillation for Reinforcement Learning. Hong, Zhang-Wei et al. arXiv:2002.00149
Transfer Heterogeneous Knowledge Among Peer-to-Peer Teammates: A Model Distillation Approach. Xue, Zeyue et al. arXiv:2002.02202

Multi-teacher KD

Learning from Multiple Teacher Networks. You, Shan et al. KDD 2017
Semi-Supervised Knowledge Transfer for Deep Learning from Private Training Data. ICLR 2017
Knowledge Adaptation: Teaching to Adapt. Arxiv:1702.02052
Deep Model Compression: Distilling Knowledge from Noisy Teachers. Sau, Bharat Bhusan et al. arXiv:1610.09650v2
Mean teachers are better role models: Weight-averaged consistency targets improve semi-supervised deep learning results. Tarvainen, Antti and Valpola, Harri. NIPS 2017
Born-Again Neural Networks. Furlanello, Tommaso et al. ICML 2018
Deep Mutual Learning. Zhang, Ying et al. CVPR 2018
Knowledge distillation by on-the-fly native ensemble. Lan, Xu et al. NIPS 2018
Collaborative learning for deep neural networks. Song, Guocong and Chai, Wei. NIPS 2018
Data Distillation: Towards Omni-Supervised Learning. Radosavovic, Ilija et al. CVPR 2018
Multilingual Neural Machine Translation with Knowledge Distillation. ICLR 2019
Unifying Heterogeneous Classifiers with Distillation. Vongkulbhisal et al. CVPR 2019
Distilled Person Re-Identification: Towards a More Scalable System. Wu, Ancong et al. CVPR 2019
Diversity with Cooperation: Ensemble Methods for Few-Shot Classification. Dvornik, Nikita et al. ICCV 2019
Model Compression with Two-stage Multi-teacher Knowledge Distillation for Web Question Answering System. Yang, Ze et al. WSDM 2020
FEED: Feature-level Ensemble for Knowledge Distillation. Park, SeongUk and Kwak, Nojun. arXiv:1909.10754(AAAI20 pre)
Stochasticity and Skip Connection Improve Knowledge Transfer. Lee, Kwangjin et al. ICLR 2020
Online Knowledge Distillation with Diverse Peers. Chen, Defang et al. AAAI 2020
Hydra: Preserving Ensemble Diversity for Model Distillation. Tran, Linh et al. arXiv:2001.04694
Distilled Hierarchical Neural Ensembles with Adaptive Inference Cost. Ruiz, Adria et al. arXv:2003.01474

Knowledge Amalgamation（KA) - zju-VIPA

VIPA - KA

Amalgamating Knowledge towards Comprehensive Classification. Shen, Chengchao et al. AAAI 2019
Amalgamating Filtered Knowledge : Learning Task-customized Student from Multi-task Teachers. Ye, Jingwen et al. IJCAI 2019
Knowledge Amalgamation from Heterogeneous Networks by Common Feature Learning. Luo, Sihui et al. IJCAI 2019
Student Becoming the Master: Knowledge Amalgamation for Joint Scene Parsing, Depth Estimation, and More. Ye, Jingwen et al. CVPR 2019
Customizing Student Networks From Heterogeneous Teachers via Adaptive Knowledge Amalgamation. ICCV 2019
Data-Free Knowledge Amalgamation via Group-Stack Dual-GAN. CVPR 2020

Cross-modal KD & DA

SoundNet: Learning Sound Representations from Unlabeled Video SoundNet Architecture. Aytar, Yusuf et al. ECCV 2016
Cross Modal Distillation for Supervision Transfer. Gupta, Saurabh et al. CVPR 2016
Emotion recognition in speech using cross-modal transfer in the wild. Albanie, Samuel et al. ACM MM 2018
Through-Wall Human Pose Estimation Using Radio Signals. Zhao, Mingmin et al. CVPR 2018
Compact Trilinear Interaction for Visual Question Answering. Do, Tuong et al. ICCV 2019
Cross-Modal Knowledge Distillation for Action Recognition. Thoker, Fida Mohammad and Gall, Juerge. ICIP 2019
Learning to Map Nearly Anything. Salem, Tawfiq et al. arXiv:1909.06928
Semantic-Aware Knowledge Preservation for Zero-Shot Sketch-Based Image Retrieval. Liu, Qing et al. ICCV 2019
UM-Adapt: Unsupervised Multi-Task Adaptation Using Adversarial Cross-Task Distillation. Kundu et al. ICCV 2019
CrDoCo: Pixel-level Domain Transfer with Cross-Domain Consistency. Chen, Yun-Chun et al. CVPR 2019
XD:Cross lingual Knowledge Distillation for Polyglot Sentence Embeddings. ICLR 2020
Effective Domain Knowledge Transfer with Soft Fine-tuning. Zhao, Zhichen et al. arXiv:1909.02236
ASR is all you need: cross-modal distillation for lip reading. Afouras et al. arXiv:1911.12747v1
Knowledge distillation for semi-supervised domain adaptation. arXiv:1908.07355
Domain Adaptation via Teacher-Student Learning for End-to-End Speech Recognition. Meng, Zhong et al. arXiv:2001.01798
Cluster Alignment with a Teacher for Unsupervised Domain Adaptation. ICCV 2019
Attention Bridging Network for Knowledge Transfer. Li, Kunpeng et al. ICCV 2019
Unpaired Multi-modal Segmentation via Knowledge Distillation. Dou, Qi et al. arXiv:2001.03111
Multi-source Distilling Domain Adaptation. Zhao, Sicheng et al. arXiv:1911.11554

Application of KD

Face model compression by distilling knowledge from neurons. Luo, Ping et al. AAAI 2016
Learning efficient object detection models with knowledge distillation. Chen, Guobin et al. NIPS 2017
Apprentice: Using Knowledge Distillation Techniques To Improve Low-Precision Network Accuracy. Mishra, Asit et al. NIPS 2018
Distilled Person Re-identification: Towars a More Scalable System. Wu, Ancong et al. CVPR 2019
[noval]Efficient Video Classification Using Fewer Frames. Bhardwaj, Shweta et al. CVPR 2019
Fast Human Pose Estimation. Zhang, Feng et al. CVPR 2019
Distilling knowledge from a deep pose regressor network. Saputra et al. arXiv:1908.00858 (2019)
Learning Lightweight Lane Detection CNNs by Self Attention Distillation. Hou, Yuenan et al. ICCV 2019
Structured Knowledge Distillation for Semantic Segmentation. Liu, Yifan et al. CVPR 2019
Relation Distillation Networks for Video Object Detection. Deng, Jiajun et al. ICCV 2019
Teacher Supervises Students How to Learn From Partially Labeled Images for Facial Landmark Detection. Dong, Xuanyi and Yang, Yi. ICCV 2019
Progressive Teacher-student Learning for Early Action Prediction. Wang, Xionghui et al. CVPR2019
Lightweight Image Super-Resolution with Information Multi-distillation Network. Hui, Zheng et al. ICCVW 2019
AWSD:Adaptive Weighted Spatiotemporal Distillation for Video Representation. Tavakolian, Mohammad et al. ICCV 2019
Dynamic Kernel Distillation for Efficient Pose Estimation in Videos. Nie, Xuecheng et al. ICCV 2019
Teacher Guided Architecture Search. Bashivan, Pouya and Tensen, Mark. ICCV 2019
Online Model Distillation for Efficient Video Inference. Mullapudi et al. ICCV 2019
Distilling Object Detectors with Fine-grained Feature Imitation. Wang, Tao et al. CVPR2019
Relation Distillation Networks for Video Object Detection. Deng, Jiajun et al. ICCV 2019
Knowledge Distillation for Incremental Learning in Semantic Segmentation. arXiv:1911.03462
MOD: A Deep Mixture Model with Online Knowledge Distillation for Large Scale Video Temporal Concept Localization. arXiv:1910.12295
Teacher-Students Knowledge Distillation for Siamese Trackers. arXiv:1907.10586
LaTeS: Latent Space Distillation for Teacher-Student Driving Policy Learning. Zhao, Albert et al. CVPR 2020(pre)
Knowledge Distillation for Brain Tumor Segmentation. arXiv:2002.03688
ROAD: Reality Oriented Adaptation for Semantic Segmentation of Urban Scenes. Chen, Yuhua et al. CVPR 2018
Next Point-of-Interest Recommendation on Resource-Constrained Mobile Devices. WWW 2020
Multi-Representation Knowledge Distillation For Audio Classification. Gao, Liang et al. arXiv:2002.09607
Collaborative Distillation for Ultra-Resolution Universal Style Transfer. Wang, Huan et al. CVPR 2020
ShadowTutor: Distributed Partial Distillation for Mobile Video DNN Inference. Chung, Jae-Won et al. arXiv:2003.10735
Object Relational Graph with Teacher-Recommended Learning for Video Captioning. Zhang, Ziqi et al. CVPR 2020

for NLP

Patient Knowledge Distillation for BERT Model Compression. Sun, Siqi et al. arXiv:1908.09355
TinyBERT: Distilling BERT for Natural Language Understanding. Jiao, Xiaoqi et al. arXiv:1909.10351
Learning to Specialize with Knowledge Distillation for Visual Question Answering. NIPS 2018
Knowledge Distillation for Bilingual Dictionary Induction. EMNLP 2017
A Teacher-Student Framework for Maintainable Dialog Manager. EMNLP 2018
Understanding Knowledge Distillation in Non-Autoregressive Machine Translation. arxiv 2019
DistilBERT, a distilled version of BERT: smaller, faster, cheaper and lighter. Sanh, Victor et al. arXiv:1910.01108
Well-Read Students Learn Better: On the Importance of Pre-training Compact Models. Turc, Iulia et al. arXiv:1908.08962
On Knowledge distillation from complex networks for response prediction. Arora, Siddhartha et al. NAACL 2019
Distilling the Knowledge of BERT for Text Generation. arXiv:1911.03829v1
Understanding Knowledge Distillation in Non-autoregressive Machine Translation. arXiv:1911.02727
MobileBERT: Task-Agnostic Compression of BERT by Progressive Knowledge Transfer. ICLR 2020
Acquiring Knowledge from Pre-trained Model to Neural Machine Translation. Weng, Rongxiang et al. AAAI 2020
TwinBERT: Distilling Knowledge to Twin-Structured BERT Models for Efficient Retrieval. Lu, Wenhao et al. KDD 2020
Improving BERT Fine-Tuning via Self-Ensemble and Self-Distillation. Xu, Yige et al. arXiv:2002.10345

Model Pruning or Quantization

Accelerating Convolutional Neural Networks with Dominant Convolutional Kernel and Knowledge Pre-regression. ECCV 2016
N2N Learning: Network to Network Compression via Policy Gradient Reinforcement Learning. Ashok, Anubhav et al. ICLR 2018
Slimmable Neural Networks. Yu, Jiahui et al. ICLR 2018
Co-Evolutionary Compression for Unpaired Image Translation. Shu, Han et al. ICCV 2019
MetaPruning: Meta Learning for Automatic Neural Network Channel Pruning. Liu, Zechun et al. ICCV 2019
LightPAFF: A Two-Stage Distillation Framework for Pre-training and Fine-tuning. ICLR 2020
Pruning with hints: an efficient framework for model acceleration. ICLR 2020
Training convolutional neural networks with cheap convolutions and online distillation. arXiv:1909.13063
Cooperative Pruning in Cross-Domain Deep Neural Network Compression. Chen, Shangyu et al. IJCAI 2019
QKD: Quantization-aware Knowledge Distillation. Kim, Jangho et al. arXiv:1911.12491v1

Beyond

Do deep nets really need to be deep?. Ba,Jimmy, and Rich Caruana. NIPS 2014
When Does Label Smoothing Help? Müller, Rafael, Kornblith, and Hinton. NIPS 2019
Towards Understanding Knowledge Distillation. Phuong, Mary and Lampert, Christoph. AAAI 2019
Harnessing deep neural networks with logucal rules. ACL 2016
Adaptive Regularization of Labels. Ding, Qianggang et al. arXiv:1908.05474
Knowledge Isomorphism between Neural Networks. Liang, Ruofan et al. arXiv:1908.01581
Role-Wise Data Augmentation for Knowledge Distillation. ICLR 2020
Neural Network Distiller: A Python Package For DNN Compression Research. arXiv:1910.12232
(survey)Modeling Teacher-Student Techniques in Deep Neural Networks for Knowledge Distillation. arXiv:1912.13179
Understanding and Improving Knowledge Distillation. Tang, Jiaxi et al. arXiv:2002.03532
The State of Knowledge Distillation for Classification. Ruffy, Fabian and Chahal, Karanbir. arXiv:1912.10850 [code]
TextBrewer: An Open-Source Knowledge Distillation Toolkit for Natural Language Processing. HIT and iFLYTEK. arXiv:2002.12620
Explaining Knowledge Distillation by Quantifying the Knowledge. Zhang, Quanshi et al. aiXiv:2003.03622
DeepVID: deep visual interpretation and diagnosis for image classifiers via knowledge distillation. IEEE Trans, 2019.

Note: All papers pdf can be found and downloaded on Bing or Google.

Source: https://github.com/FLHonker/Awesome-Knowledge-Distillation

Contact: Yuang Liu([email protected]), AIDA, ECNU.

《BERT基础教程：Transformer大模型实战》读书笔记 johnny233 读书笔记人工智能
概念BERT，BidirectionalEncoderRepresentationsfromTransformers，多Transformer的双向编码器表示法。RNN，recurrentneuralnetwork，循环神经网络。LSTM，longshort-termmemory，长短期记忆网络。NLI，Naturallanguageinference，自然语言推理。知识蒸馏(knowledged
英伟达如何通过剪枝和蒸馏技术让Llama 3.1模型“瘦身“? 蒜鸭人工智能算法机器学习
英伟达如何通过剪枝和蒸馏技术让Llama3.1模型"瘦身"?大家好，我是蒜鸭。今天我们来聊聊英伟达最近在大语言模型优化方面的一项有趣研究。随着Meta发布Llama3.1系列模型，如何在保持模型性能的同时缩小其体积成为了业界关注的焦点。英伟达研究团队通过结构化权重剪枝和知识蒸馏技术，成功将Llama3.18B模型压缩为4B参数的小型语言模型，并取得了不俗的效果。让我们一起来深入探讨这项技术的原理和
【机器学习】机器学习与大模型在人工智能领域的融合应用与性能优化新探索 E绵绵 Everything 人工智能机器学习大模型 python AIGC 应用科技
文章目录引言机器学习与大模型的基本概念机器学习概述监督学习无监督学习强化学习大模型概述GPT-3BERTResNetTransformer机器学习与大模型的融合应用自然语言处理文本生成文本分类机器翻译图像识别自动驾驶医学影像分析语音识别智能助手语音转文字大模型性能优化的新探索模型压缩权重剪枝量化知识蒸馏分布式训练数据并行模型并行异步训练高效推理模型裁剪缓存机制专用硬件未来展望跨领域应用智能化系统人
Transformer视频理解学习的笔记 LinlyZhai transformer 学习笔记
今天复习了Transformer,ViT,学了SwinTransformer,还有观看了B站视频理解沐神系列串讲视频上（24.2.26未看完,明天接着看）这里面更多论文见：https://github.com/mli/paper-reading/B站视频理解沐神系列串讲视频下（明天接着看）上面这张图中的知识蒸馏，可以回头看一下上面这个github网址论文：VideoTransformers:ASu
大模型量化技术原理-LLM.int8()、GPTQ 吃果冻不吐果冻皮动手学大模型人工智能
近年来，随着Transformer、MOE架构的提出，使得深度学习模型轻松突破上万亿规模参数，从而导致模型变得越来越大，因此，我们需要一些大模型压缩技术来降低模型部署的成本，并提升模型的推理性能。模型压缩主要分为如下几类：剪枝（Pruning）知识蒸馏（KnowledgeDistillation）量化之前也写过一些文章涉及大模型量化相关的内容。基于LLaMA-7B/Bloomz-7B1-mt复现开
知识蒸馏实战代码教学一（原理部分）业余小程序猿深度学习机器学习人工智能知识蒸馏
一、知识蒸馏的来源知识蒸馏（KnowledgeDistillation）源自于一篇由Hinton等人于2015年提出的论文《DistillingtheKnowledgeinaNeuralNetwork》。这个方法旨在将一个大型、复杂的模型的知识（通常称为教师模型）转移到一个小型、简化的模型（通常称为学生模型）中。通过这种方式，学生模型可以获得与教师模型相似的性能，同时具有更小的模型体积和计算资源需
知识蒸馏实战代码教学二（代码实战部分）业余小程序猿深度学习人工智能机器学习知识蒸馏
一、上章原理回顾具体过程：（1）首先我们要先训练出较大模型既teacher模型。（在图中没有出现）（2）再对teacher模型进行蒸馏，此时我们已经有一个训练好的teacher模型，所以我们能很容易知道teacher模型输入特征x之后，预测出来的结果teacher_preds标签。（3）此时，求到老师预测结果之后，我们需要求解学生在训练过程中的每一次结果student_preds标签。（4）先求h
超好用！——知识蒸馏中即插即用的对抗性调度器以及调整向量Vector 时光诺言机器学习人工智能深度学习 python
一.前言本设计思路来源于论文《DynamicData-FreeKnowledgeDistillationbyEasy-to-HardLearningStrategy》。1.1原理总体架构图如下。在常规的知识蒸馏中，一般不会考虑知识的难度先后，按照我们人类的思维，肯定是先学习容易的再学习难一点的知识（总不能小学就学高数吧哈哈）。一个模型的理想状态也应该如此。在本论文的设计图中，可以看到Generat
【论文解读】Document-Level Relation Extraction with Adaptive Focal Loss and Knowledge Distillation Queen_sy 深度学习人工智能
目录1Introduction1Docre任务比句子级任务更具挑战性：2现有的Docre方法：3现有的Docre方法存在三个局限性2Methodology1使用轴向注意力模块作为特征提取器：2第二，提出适应性焦距损失3第三用知识蒸馏相关知识类别不平衡问题长尾类分布交叉熵损失和二元交叉熵损失二元交叉熵损失定义为知识蒸馏全文翻译https://baijiahao.baidu.com/s?id=1737
知识蒸馏之Knowledge Distillation: A Survey Diros1g 知识蒸馏
InternationalJournalofComputerVision2021JianpingGou1·BaoshengYu1·StephenJ.Maybank2·DachengTao11UBTECHSydneyAICentre,SchoolofComputerScience,FacultyofEngineering,TheUniversityofSydney,Darlington,NSW200
知识蒸馏综述---代码整理 qq_41920323 模型部署 python 知识蒸馏
本文尽可能简单解释蒸馏用到的策略，并提供了实现源码。1、KD:KnowledgeDistillation链接：https://arxiv.org/pdf/1503.02531.pd3f发表：NIPS14最经典的，也是明确提出知识蒸馏概念的工作，通过使用带温度的softmax函数来软化教师网络的逻辑层输出作为学生网络的监督信息，使用KLdivergence来衡量学生网络与教师网络的差异，具体流程如下
知识蒸馏（paper翻译）蓝羽飞鸟 DeepLearning 人工智能深度学习
paper：DistillingtheKnowledgeinaNeuralNetwork摘要：提高几乎所有机器学习算法性能的一个非常简单的方法是在相同的数据上训练许多不同的模型，然后对它们的预测进行平均[3]。不幸的是，使用整个模型集合进行预测非常麻烦，并且计算成本可能太高，无法部署到大量用户，尤其是在单个模型是大型神经网络的情况下。Caruana和他的合作者[1]已经证明，可以将集成中的知识压缩
第二十九周：文献阅读笔记（ResMLP）+ pytorch学习（Resnet代码实现） @默然笔记 pytorch 学习人工智能 python 深度学习机器学习
第二十九周：文献阅读笔记（ResMLP）摘要Abstract1.ResMLP1.1文献摘要1.2文献引言1.3ResMLP方法1.3.1整体流程1.3.2残差多感知机层1.4实验1.4.1数据集1.4.2超参数设置1.4.3主要结果1.4.4监督设置1.4.5自监督设置1.4.5知识蒸馏设置1.5ResMLP的创新点2.pytorch学习（ResNet代码实现）2.1数据集2.2文件结构2.3下载
vit细粒度图像分类（三）TRS-DeiT 学习笔记无妄无望学习笔记人工智能深度学习分类神经网络
1.摘要细粒度图像分类任务由于自身存在的细微的类间差别和巨大的类内差别使其极具挑战性，为了更好地学习细粒度图像的潜在特征，该算法将知识蒸馏引入到细粒度图像分类任务中，提出基于知识蒸馏与目标区域选取的细粒度图像分类方法（ＴＲＳ-ＤｅｉＴ），能使其兼具ＣＮＮ模型和Ｔｒａｎｓｆｏｒｍｅｒ模型的各自优点。此外，ＴＲＳ-ＤｅｉＴ的新型目标区域选取模块能够获取最具区分性的区域；为了区分任务中的易混淆类，引入对
【Image captioning】论文阅读七—Efficient Image Captioning for Edge Devices_AAAI2023 安静到无声手把手实现Image captioning 论文阅读
中文标题：面向边缘设备的高效图像描述（EfficientImageCaptioningforEdgeDevices）文章目录1.引言2.相关工作3.方法3.1ModelArchitecture（模型结构）3.2ModelTraining(模型训练)3.3KnowledgeDistillation（知识蒸馏）4.实验4.1数据集和评价指标4.2实施细节：4.3消融实验
和GPT讨论知识蒸馏的基本概念 Mighty_Crane gpt 人工智能
User帮我解释一下知识蒸馏、教师学生网络、逆向蒸馏等概念ChatGPT知识蒸馏（KnowledgeDistillation）是一种模型压缩技术。在这个过程中，一个大型、复杂的模型（通常称为“教师”模型）被用来训练一个更小、更简单的模型（称为“学生”模型）。学生模型不仅学习从数据到输出的映射，还学习模仿教师模型的行为，这通常能提高其性能。逆向蒸馏（ReverseDistillation）是其中的一
Knowledge Distilling，知识蒸馏 FeynmanMa
Distillingtheknowledgeinaneuralnetwork1.Motivationknowledge_distilling_title.jpg论文作者比较大名鼎鼎了。Motivation一部分来自模型压缩[2]，一部分源自作者认为大部分机器学习采用ensemble方法或者学习一个很大的模型来取得比较好的结果，但会给实际应用预测带来很大的压力，而且实际上模型之间也是有信息冗余的。希
AI芯片：神经网络研发加速器、神经网络压缩简化、通用芯片 CPU 加速、专用芯片 GPU 加速 Debroon #深度学习人工智能神经网络深度学习
AI芯片：神经网络研发加速器、神经网络压缩简化、通用芯片CPU加速、专用芯片GPU加速神经网络研发加速器神经网络编译器各自实现的神经网络编译器神经网络加速与压缩（算法层面）知识蒸馏低秩分解轻量化网络剪枝量化通用芯片CPU加速x86加速arm加速卷积优化神经网络加速库专用芯片GPU加速dsp加速faga加速npu加速K210人工智能微控制器神经网络加速库：Vulkan图形计算神经网络研发加速器神经网
《FITNETS: HINTS FOR THIN DEEP NETS》论文整理 LionelZhao 知识蒸馏论文阅读人工智能神经网络深度学习
目录零、前言一、Fitnet的目的及适用范围1、目的：2、适用范围：3、背景及创新点：二、Hint-BasedTraining思想1、hint层与guided层：2、核心思想：三、Fitnet训练过程及效果1、FItnet训练过程可以分为三个阶段：2、需要注意的问题：3、具体流程：4、损失函数：（1）预训练阶段：（2）知识蒸馏阶段：5、训练效果：四、Q&A1、小模型模仿大模型中间层的输出featu
YOLO蒸馏原理篇之---MGD、CWD蒸馏 qq_41920323 模型部署 MGD CWD特征蒸馏
一、MGD蒸馏论文地址：https://arxiv.org/abs/2205.01529论文翻译：https://mp.weixin.qq.com/s/FSvo3ns2maTpiTTWsE91kQ1.1摘要知识蒸馏已成功应用于各种任务。当前的蒸馏算法通常通过模仿教师的输出来提高学生的表现。本文表明，教师还可以通过指导学生的特征恢复来提高学生的表征能力。从这个角度来看，我们提出了掩蔽生成蒸馏(MGD
深度学习模型压缩方法：知识蒸馏方法总结 qq_41920323 模型部署深度学习人工智能
本文将介绍深度学习模型压缩方法中的知识蒸馏，内容从知识蒸馏简介、知识的种类、蒸馏机制、师生网络结构、蒸馏算法以及蒸馏方法等六部部分展开。一、知识蒸馏简介知识蒸馏是指用教师模型来指导学生模型训练，通过蒸馏的方式让学生模型学习到教师模型的知识。在模型压缩中，教师模型是一个提前训练好的复杂模型，而学生模型则是一个规模较小的模型。如下图所示，由训练好的教师模型，在相同的数据下，通过将教师网络对该样本的预测
使用知识蒸馏提升模型推理性能之乎者也· AI(人工智能)内容分享 NLP（自然语言处理）内容分享深度学习人工智能
目录知识蒸馏介绍LogitsTemperature理论介绍实验代码实验结果知识蒸馏介绍首先，我们先简单地了解下知识蒸馏概念[2]。通常，大模型可能是一个复杂的网络或多个网络的组合，表现出优越的效果和泛化能力。而小模型由于其较小的规模，其表达能力可能受到限制。为了提高小模型的效果，我们可以借助大模型所学习到的知识来指导小模型的训练。这样，小模型在参数数量明显减少的情况下，也能够达到与大模型相似的效果
深度学习中的知识蒸馏 Algorithm_Engineer_ 人工智能深度学习人工智能
一.概念知识蒸馏（KnowledgeDistillation）是一种深度学习中的模型压缩技术，旨在通过从一个教师模型（teachermodel）向一个学生模型（studentmodel）传递知识来减小模型的规模，同时保持性能。这个过程涉及到从教师模型的软标签（softlabels）或者特征中提取知识，然后用这些知识来训练一个更小的学生模型。简单了解一些知识蒸馏的一般步骤和关键概念：教师模型（Tea
【多模态】ALBEF 不牌不改【NLP &CV】人工智能计算机视觉深度学习机器学习 python 算法 transformer
ALBEF论文信息标题：AlignbeforeFuse:VisionandLanguageRepresentationLearningwithMomentumDistillation作者：JunnanLi（SalesforceResearch）期刊：NeurIPS2021发布时间与更新时间：2021.07.162021.10.07主题：多模态、预训练、图像、文本、对比学习、知识蒸馏、动量模型arX
【AI】一文读懂大模型套壳——神仙打架？软饭硬吃？ giszz 人工智能随笔人工智能
目录一、套壳的风波此起彼伏二、到底什么是大模型的壳2.1大模型的3部分，壳指的是哪里大模型的内核预训练（Pre-training）调优（Fine-tuning）2.2内核的发展历程和万流归宗2.3套壳不是借壳三、软饭硬吃，套壳真的不行吗四、神仙打架，百姓吃瓜4.1自研的佼佼者4.2模仿也不丢人4.3读书人偷书不算偷模仿学习（ImitationLearning）知识蒸馏（KnowledgeDisti
知识蒸馏 Knowledge Distillation（在tinybert的应用）不当菜鸡的程序媛学习记录人工智能
蒸馏（KnowledgeDistillation）是一种模型压缩技术，通常用于将大型模型的知识转移给小型模型，以便在保持性能的同时减小模型的体积和计算开销。这个过程涉及到使用一个大型、复杂的模型（通常称为教师模型）生成的软标签（概率分布），来训练一个小型模型（通常称为学生模型）。具体而言，对于分类问题，教师模型生成的概率分布可以看作是对每个类别的软标签，而学生模型通过学习这些软标签来进行训练。这种
yolov8知识蒸馏代码详解：支持logit和feature-based蒸馏 @BangBang 模型轻量化 yolov8 代码详解知识蒸馏
文章目录1.知识蒸馏理论2.yolov8蒸馏代码应用2.1环境配置2.2训练模型(1)训练教师模型(2)训练学生模型baseline(3)蒸馏训练3.知识蒸馏代码详解3.1蒸馏参数设置3.2蒸馏损失代码讲解3.2.1Featurebasedloss3.2.1Logitloss3.3获取蒸馏的featuremap及channels
AI的智慧精华：解锁知识蒸馏的秘密散一世繁华，颠半世琉璃人工智能
1.定义化学蒸馏是一种物质分离的方法，通过加热物质混合物，使其其中一种或多种成分的沸点低于其他成分的沸点，从而使其蒸发，然后通过冷凝使其凝结，最终得到纯净的成分。蒸馏通常用于分离液体混合物中的组分。在蒸馏过程中，混合物被加热，使其中沸点较低的成分先蒸发，然后通过冷凝器冷却并凝结为液体。凝结后的液体称为蒸馏液或馏出液。沸点较高的成分则留在容器中，称为残渣。而知识蒸馏就是把一个大的模型，称之为教师模型
Knowledge Distillation from A Stronger Teacher（NeurIPS 2022）论文解读 00000cj 知识蒸馏-分类深度学习人工智能知识蒸馏
paper：KnowledgeDistillationfromAStrongerTeacherofficialimplementation：https://github.com/hunto/dist_kd前言知识蒸馏通过将教师的知识传递给学生来增强学生模型的性能，我们自然会想到，是否教师的性能越强，蒸馏后学生的性能也会进一步提升？为了了解如何成为一个更强的教师模型以及它们对KD的影响，作者系统地研
yolov5知识蒸馏 cv-daily YOLO 深度学习人工智能
参考代码：https://github.com/Adlik/yolov5https://cloud.tencent.com/developer/article/2160509yolov5间的模型蒸馏，相同结构的。配置参数parser.add_argument('--t_weights',type=str,default='./weights/yolov5s.pt',help='initialtea
scala的option和some 矮蛋蛋编程 scala
原文地址： http://blog.sina.com.cn/s/blog_68af3f090100qkt8.html 对于学习 Scala 的 Java™ 开发人员来说，对象是一个比较自然、简单的入口点。在本系列前几期文章中，我介绍了 Scala 中一些面向对象的编程方法，这些方法实际上与 Java 编程的区别不是很大。我还向您展示了 Scala 如何重新应用传统的面向对象概念，找到其缺点
NullPointerException Cb123456 android BaseAdapter
java.lang.NullPointerException: Attempt to invoke virtual method 'int android.view.View.getImportantForAccessibility()' on a null object reference 出现以上异常.然后就在baidu上
PHP使用文件和目录天子之骄 php文件和目录读取和写入 php验证文件 php锁定文件
PHP使用文件和目录 1.使用include()包含文件 (1)：使用include()从一个被包含文档返回一个值 (2)：在控制结构中使用include() include_once()函数需要一个包含文件的路径，此外，第一次调用它的情况和include()一样，如果在脚本执行中再次对同一个文件调用，那么这个文件不会再次包含。在php.ini文件中设置
SQL SELECT DISTINCT 语句何必如此 sql
SELECT DISTINCT 语句用于返回唯一不同的值。 SQL SELECT DISTINCT 语句在表中，一个列可能会包含多个重复值，有时您也许希望仅仅列出不同（distinct）的值。 DISTINCT 关键词用于返回唯一不同的值。 SQL SELECT DISTINCT 语法 SELECT DISTINCT column_name,column_name F
java冒泡排序 3213213333332132 java 冒泡排序
package com.algorithm; /** * @Description 冒泡 * @author FuJianyong * 2015-1-22上午09:58:39 */ public class MaoPao { public static void main(String[] args) { int[] mao = {17,50,26,18,9,10
struts2.18 +json,struts2-json-plugin-2.1.8.1.jar配置及问题！ 7454103 DAO spring Ajax json qq
struts2.18 出来有段时间了！（貌似是稳定版）闲时研究下下！貌似 sruts2 搭配 json 做 ajax 很吃香！实践了下下！不当之处请绕过！呵呵网上一大堆 struts2+json 不过大多的json 插件都是 jsonplugin.34.jar strut
struts2 数据标签说明 darkranger jsp bean struts servlet Scheme
数据标签主要用于提供各种数据访问相关的功能，包括显示一个Action里的属性，以及生成国际化输出等功能数据标签主要包括： action ：该标签用于在JSP页面中直接调用一个Action，通过指定executeResult参数，还可将该Action的处理结果包含到本页面来。 bean ：该标签用于创建一个javabean实例。如果指定了id属性，则可以将创建的javabean实例放入Sta
链表.简单的链表节点构建 aijuans 编程技巧
/*编程环境WIN-TC*/ #include "stdio.h" #include "conio.h" #define NODE(name, key_word, help) \ Node name[1]={{NULL, NULL, NULL, key_word, help}} typedef struct node { &nbs
tomcat下jndi的三种配置方式 avords tomcat
jndi(Java Naming and Directory Interface，Java命名和目录接口)是一组在Java应用中访问命名和目录服务的API。命名服务将名称和对象联系起来，使得我们可以用名称访问对象。目录服务是一种命名服务，在这种服务里，对象不但有名称，还有属性。 tomcat配置
关于敏捷的一些想法 houxinyou 敏捷
从网上看到这样一句话：“敏捷开发的最重要目标就是：满足用户多变的需求，说白了就是最大程度的让客户满意。” 感觉表达的不太清楚。感觉容易被人误解的地方主要在“用户多变的需求”上。第一种多变，实际上就是没有从根本上了解了用户的需求。用户的需求实际是稳定的，只是比较多，也比较混乱，用户一般只能了解自己的那一小部分，所以没有用户能清楚的表达出整体需求。而由于各种条件的，用户表达自己那一部分时也有
富养还是穷养，决定孩子的一生 bijian1013 教育人生
是什么决定孩子未来物质能否丰盛？为什么说寒门很难出贵子，三代才能出贵族？真的是父母必须有钱，才能大概率保证孩子未来富有吗？-----作者：@李雪爱与自由事实并非由物质决定，而是由心灵决定。一朋友富有而且修养气质很好，兄弟姐妹也都如此。她的童年时代，物质上大家都很贫乏，但妈妈总是保持生活中的美感，时不时给孩子们带回一些美好小玩意，从来不对孩子传递生活艰辛、金钱来之不易、要懂得珍惜
oracle 日期时间格式转化征客丶 oracle
oracle 系统时间有 SYSDATE 与 SYSTIMESTAMP； SYSDATE：不支持毫秒，取的是系统时间； SYSTIMESTAMP：支持毫秒，日期，时间是给时区转换的，秒和毫秒是取的系统的。日期转字符窜：一、不取毫秒： TO_CHAR(SYSDATE, 'YYYY-MM-DD HH24:MI:SS') 简要说明， YYYY 年 MM 月
【Scala六】分析Spark源代码总结的Scala语法四 bit1129 scala
1. apply语法 FileShuffleBlockManager中定义的类ShuffleFileGroup，定义： private class ShuffleFileGroup(val shuffleId: Int, val fileId: Int, val files: Array[File]) { ... def apply(bucketId
Erlang中有意思的bug bookjovi erlang
代码中常有一些很搞笑的bug，如下面的一行代码被调用两次（Erlang beam） commit f667e4a47b07b07ed035073b94d699ff5fe0ba9b Author: Jovi Zhang <[email protected]> Date: Fri Dec 2 16:19:22 2011 +0100 erts:
移位打印10进制数转16进制-2008-08-18 ljy325 java 基础
/** * Description 移位打印10进制的16进制形式 * Creation Date 15-08-2008 9:00 * @author 卢俊宇 * @version 1.0 * */ public class PrintHex { // 备选字符 static final char di
读《研磨设计模式》-代码笔记-组合模式 bylijinnan java 设计模式
声明：本文只为方便我个人查阅和理解，详细的分析以及源代码请移步原作者的博客http://chjavach.iteye.com/ import java.util.ArrayList; import java.util.List; abstract class Component { public abstract void printStruct(Str
利用cmd命令将.class文件打包成jar chenyu19891124 cmd jar
cmd命令打jar是如下实现：在运行里输入cmd，利用cmd命令进入到本地的工作盘符。(如我的是D盘下的文件有此路径 D:\workspace\prpall\WEB-INF\classes) 现在是想把D:\workspace\prpall\WEB-INF\classes路径下所有的文件打包成prpall.jar。然后继续如下操作： cd D: 回车 cd workspace/prpal
[原创]JWFD v0.96 工作流系统二次开发包 for Eclipse 简要说明 comsci eclipse 设计模式算法工作 swing
JWFD v0.96 工作流系统二次开发包 for Eclipse 简要说明 &nb
SecureCRT右键粘贴的设置 daizj secureCRT 右键粘贴
一般都习惯鼠标右键自动粘贴的功能，对于SecureCRT6.7.5 ，这个功能也已经是默认配置了。老版本的SecureCRT其实也有这个功能，只是不是默认设置，很多人不知道罢了。菜单： Options->Global Options ...->Terminal 右边有个Mouse的选项块。 Copy on Select Paste on Right/Middle
Linux 软链接和硬链接 dongwei_6688 linux
1.Linux链接概念Linux链接分两种，一种被称为硬链接（Hard Link），另一种被称为符号链接（Symbolic Link）。默认情况下，ln命令产生硬链接。【硬连接】硬连接指通过索引节点来进行连接。在Linux的文件系统中，保存在磁盘分区中的文件不管是什么类型都给它分配一个编号，称为索引节点号(Inode Index)。在Linux中，多个文件名指向同一索引节点是存在的。一般这种连
DIV底部自适应 dcj3sjt126com JavaScript
<!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.0 Transitional//EN" "http://www.w3.org/TR/xhtml1/DTD/xhtml1-transitional.dtd"> <html xmlns="http://www.w3.org/1999/xhtml&q
Centos6.5使用yum安装mysql——快速上手必备 dcj3sjt126com mysql
第1步、yum安装mysql [root@stonex ~]# yum -y install mysql-server 安装结果： Installed: mysql-server.x86_64 0:5.1.73-3.el6_5 &nb
如何调试JDK源码 frank1234 jdk
相信各位小伙伴们跟我一样，想通过JDK源码来学习Java，比如collections包，java.util.concurrent包。可惜的是sun提供的jdk并不能查看运行中的局部变量，需要重新编译一下rt.jar。下面是编译jdk的具体步骤： 1.把C:\java\jdk1.6.0_26\sr
Maximal Rectangle hcx2013 max
Given a 2D binary matrix filled with 0's and 1's, find the largest rectangle containing all ones and return its area. public class Solution { public int maximalRectangle(char[][] matrix)
Spring MVC测试框架详解——服务端测试 jinnianshilongnian spring mvc test
随着RESTful Web Service的流行，测试对外的Service是否满足期望也变的必要的。从Spring 3.2开始Spring了Spring Web测试框架，如果版本低于3.2，请使用spring-test-mvc项目（合并到spring3.2中了）。 Spring MVC测试框架提供了对服务器端和客户端（基于RestTemplate的客户端）提供了支持。 &nbs
Linux64位操作系统（CentOS6.6）上如何编译hadoop2.4.0 liyong0802 hadoop
一、准备编译软件 1.在官网下载jdk1.7、maven3.2.1、ant1.9.4，解压设置好环境变量就可以用。环境变量设置如下：（1）执行vim /etc/profile （2）在文件尾部加入: export JAVA_HOME=/home/spark/jdk1.7 export MAVEN_HOME=/ho
StatusBar 字体白色 pangyulei status
[[UIApplication sharedApplication] setStatusBarStyle:UIStatusBarStyleLightContent]; /*you'll also need to set UIViewControllerBasedStatusBarAppearance to NO in the plist file if you use this method
如何分析Java虚拟机死锁 sesame java thread oracle 虚拟机 jdbc
英文资料： Thread Dump and Concurrency Locks Thread dumps are very useful for diagnosing synchronization related problems such as deadlocks on object monitors. Ctrl-\ on Solaris/Linux or Ctrl-B
位运算简介及实用技巧（一）：基础篇 tw_wangzhengquan 位运算
http://www.matrix67.com/blog/archives/263 去年年底写的关于位运算的日志是这个Blog里少数大受欢迎的文章之一，很多人都希望我能不断完善那篇文章。后来我看到了不少其它的资料，学习到了更多关于位运算的知识，有了重新整理位运算技巧的想法。从今天起我就开始写这一系列位运算讲解文章，与其说是原来那篇文章的follow-up，不如说是一个r
jsearch的索引文件结构 yangshangchuan 搜索引擎 jsearch 全文检索信息检索 word分词
jsearch是一个高性能的全文检索工具包，基于倒排索引，基于java8，类似于lucene，但更轻量级。 jsearch的索引文件结构定义如下： 1、一个词的索引由=分割的三部分组成：第一部分是词第二部分是这个词在多少