Pytext支持分布式训练，Facebook AI基于PyTorch的NLP框架,简化部署流程

原创

datayx 2021-10-26 16:42:39 ©著作权

©著作权归作者所有：来自51CTO博客作者datayx的原创作品，请联系作者获取转载授权，否则将追究法律责任

Facebook开源了自家工程师们一直在用的NLP建模框架PyText。这个框架，每天要为Facebook旗下各种应用处理超过10亿次NLP任务，Facebook AI的工业级NLP开源框架。（简化部署流程，大规模应用也OK）

PyText基于PyTorch，能够加速从研究到应用的进度，从模型的研究到完整实施只需要几天时间。框架里还包含了一些预训练模型，可以直接拿来处理文本分类、序列标注等任务。

资料如下：

主页：https://facebook.ai/developers/tools/pytext

论文：https://research.fb.com/publications/pytext-a-seamless-path-from-nlp-research-to-production/

官方文档：https://pytext-pytext.readthedocs-hosted.com/en/latest/index.html#

官方博客：https://code.fb.com/ai-research/pytext-open-source-nlp-framework/

GitHub：https://github.com/facebookresearch/pytext

PyText解决了既要实现快速实验又要部署大规模服务模型的经常相互冲突。它主要通过以下两点来实现上面的需求：

并且，Facebook已经采用了使用PyText快速迭代新的建模思路，然后大规模无缝衔接地发布它们。

Pytext核心功能

Zhang et al. (2016): A Joint Model of Intent Determination and Slot Filling for Spoken Language Understanding
Lample et al. (2016): Neural Architectures for Named Entity Recognition
Yoon Kim (2014): Convolutional Neural Networks for Sentence Classification
Lin et al. (2017): A Structured Self-attentive Sentence Embedding
文本分类器
序列标记
联合意图槽模型
上下文意图 - intent-slot models

可扩展组件，可轻松创建新模型和任务
支持集成训练
支持分布式训练（在PyTorch 1.0中使用新的C10d后端）
参考实现和预训练模型论文：Gupta et al. (2018): Semantic Parsing for Task Oriented Dialog using Hierarchical Representations