微软开源项目NeuronBlocks:像搭积木一样构建自然语言理解深度学习模型

NeuronBlocks - NLP DNN Toolkit https://github.com/Microsoft/NeuronBlocks

Overview

NeuronBlocks is a NLP deep learning modeling toolkit that helps engineers to build end-to-end pipelines for neural network model training for NLP tasks. The main goal of this toolkit is to minimize developing cost for NLP deep neural network model building, including both training and inference stages. For more details, please check our paper: NeuronBlocks – Building Your NLP DNN Models Like Playing Lego at https://arxiv.org/abs/1904.09535.

NeuronBlocks consists of two major components: Block Zoo and Model Zoo.

  • In Block Zoo, we provide commonly used neural network components as building blocks for model architecture design.

  • In Model Zoo, we provide a suite of NLP models for common NLP tasks, in the form of JSON configuration files.

Toolkit Usage

Users can either pick existing models (config files) in Model Zoo to start model training or create new models by leveraging neural network blocks in Block Zoo just like playing with Lego.

NLP Tasks Supported

  • Sentence Classification

  • Question Answering Matching

  • Textual Entailment

  • Slot tagging

  • Machine Reading Comprehension

  • More on-going

Who should consider using NeuronBlocks

Engineers or researchers who face the following challenges when using neural network models to address NLP problems:

  • Many frameworks to choose and high framework studying cost.

  • Heavy coding cost. A lot of details make it hard to debug.

  • Fast Model Architecture Evolution. It is difficult for engineers to understand the mathematical principles behind them.

  • Model Code optimization requires deep expertise.

  • Model Platform Compatibility Requirement. It requires extra coding work for the model to run on different platforms, such as Linux/Windows, GPU/CPU.

The advantages of leveraging NeuronBlocks for NLP neural network model training includes:

  • Model Building: for model building and parameter tuning, users only need to write simple JSON config files, which greatly minimize the effort of implementing new ideas.

  • Model Sharing It is super easy to share models just through JSON files, instead of nasty codes. For different models or tasks, our users only need to maintain one single centralized source code base.

  • Code Reusability: Common blocks can be easily shared across various models or tasks, reducing duplicate coding work.

  • Platform Flexibility: NeuronBlocks can run both on Linux and Windows machines, using both CPU and GPU. It also supports training on GPU platforms like Philly and PAI.

  • Model Visualization: A model visualizer is provided for visualization and configure correctness checking, which helps users to visualize the model architecture easily during debugging.

  • Extensibility: NeuronBlocks is extensible, allowing users to contribute new blocks or contributing novel models (JSON files).

你可能感兴趣的:(NLP)