fbpx
Accelerate your NLP pipelines using Hugging Face Transformers and ONNX Runtime Accelerate your NLP pipelines using Hugging Face Transformers and ONNX Runtime
This post was written by Morgan Funtowicz from Hugging Face and Tianlei Wu from Microsoft Transformer models have taken the world of natural language processing... Accelerate your NLP pipelines using Hugging Face Transformers and ONNX Runtime

Making NLP more Accessible

Higher performance NLP inference

Get started

python convert_graph_to_onnx.py --framework pt --model bert-base-cased bert-base-cased.onnx
pip install onnxruntime-tools 
python -m onnxruntime_tools.optimizer_cli --input bert-base-cased.onnx --output bert-base-cased.onnx --model_type bert

Resources


Performance Results

ODSC Community

The Open Data Science community is passionate and diverse, and we always welcome contributions from data science professionals! All of the articles under this profile are from our community, with individual authors mentioned in the text itself.

1