News
🤗 Transformers 4.6 adds VISION!
Transformers v4.6 is our first release dedicated to computer vision!
1️⃣ CLIP from OpenAI, for image-text similarity or zero-shot image classification
2️⃣ ViT from GoogleAI
3️⃣ DeiT from FacebookAI
Try SOTA image classification with ViT and DeiT on the Model Hub!
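For instance, here is a minimal sketch of zero-shot image classification with CLIP, using the public openai/clip-vit-base-patch32 checkpoint and a sample COCO image (adapt the candidate labels to your own use case):

```python
import requests
from PIL import Image
from transformers import CLIPModel, CLIPProcessor

model = CLIPModel.from_pretrained("openai/clip-vit-base-patch32")
processor = CLIPProcessor.from_pretrained("openai/clip-vit-base-patch32")

# Any image works; this one is a sample from the COCO dataset
url = "http://images.cocodataset.org/val2017/000000039769.jpg"
image = Image.open(requests.get(url, stream=True).raw)

# Score the image against free-form text labels, no task-specific training needed
inputs = processor(text=["a photo of a cat", "a photo of a dog"],
                   images=image, return_tensors="pt", padding=True)
outputs = model(**inputs)
probs = outputs.logits_per_image.softmax(dim=1)  # label probabilities per image
print(probs)
```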
🤗 Datasets 1.6
🤗 Datasets v1.6 brings you speed, features, and of course, datasets:
- Now blazing fast: ~0.1 ms per query on a 100-billion-row dataset 🤯
- Even faster for small datasets, which are now loaded in memory by default
- Easy dataset concatenation: by row, by column, from memory 🧠 or disk 💽 (see the snippet below)
- 800+ datasets available, now with CUAD, OpenSLR, GEM 1.1 and more
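Here is a minimal sketch of the new concatenation options; the axis argument is the v1.6 addition, and the toy columns are just for illustration:

```python
from datasets import Dataset, concatenate_datasets

# Two small in-memory datasets standing in for real ones
ds_a = Dataset.from_dict({"text": ["hello", "world"]})
ds_b = Dataset.from_dict({"text": ["bonjour", "monde"]})
ds_labels = Dataset.from_dict({"label": [0, 1]})

by_row = concatenate_datasets([ds_a, ds_b])               # stack rows (axis=0, the default)
by_col = concatenate_datasets([ds_a, ds_labels], axis=1)  # append columns side by side
print(by_row.num_rows, by_col.column_names)               # 4 ['text', 'label']
```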
Expert Acceleration Program
This new program offers direct, premium support from the Hugging Face team to accelerate companies in their Transformers journey.
🔮 Which model should I fine-tune, and how?
How do I reduce latency by 10x?
⚙️ How do I optimize my production setup?
How do I leverage Transformers in SageMaker?
How do I mitigate bias in datasets and models?
Contact us to learn more!
💸 Inference API - now from $9/mo!
The Accelerated Inference API is now available through our $9/mo Supporter plan!
It's the easiest way to integrate and serve any of the 13,000+ Hugging Face models - or your own private models - using our accelerated and scalable infrastructure, via simple API calls.
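A minimal sketch of such a call, with a placeholder token and a public sentiment model standing in for your own:

```python
import requests

# Any Hub model id works here; this sentiment model is a public example
API_URL = "https://api-inference.huggingface.co/models/distilbert-base-uncased-finetuned-sst-2-english"
headers = {"Authorization": "Bearer YOUR_API_TOKEN"}  # token from your hf.co account settings

response = requests.post(API_URL, headers=headers,
                         json={"inputs": "I love this newsletter!"})
print(response.json())  # e.g. [[{'label': 'POSITIVE', 'score': 0.99...}, ...]]
```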
🤗 AutoNLP - now with Speech Recognition!
Create and deploy fine-tuned, state-of-the-art models automagically with AutoNLP!
New this month:
Summarization models
🗣 Speech Recognition (ASR) models
Regression models
🇺🇳 New languages: Hindi, Japanese, Chinese and Dutch
Let us know which task or language you'd like us to add next!
Welcome JAX!
GoogleAI's JAX/Flax library can now be used as Transformers' backbone ML library.
JAX/Flax makes distributed training on TPU effortless and highly efficient!
Over 3,000 pretrained model checkpoints have been converted to JAX and can be fine-tuned on Natural Language Understanding downstream tasks.
Google Colab
Runtime evaluation
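A minimal sketch of loading one of the converted checkpoints through the Flax model classes, with bert-base-uncased as the example:

```python
from transformers import BertTokenizerFast, FlaxBertModel

tokenizer = BertTokenizerFast.from_pretrained("bert-base-uncased")
model = FlaxBertModel.from_pretrained("bert-base-uncased")

# Flax models consume NumPy arrays; the same code runs on CPU, GPU, or TPU via XLA
inputs = tokenizer("Hello, JAX!", return_tensors="np")
outputs = model(**inputs)
print(outputs.last_hidden_state.shape)  # (1, sequence_length, hidden_size)
```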
🤗 Accelerate
Want to run your PyTorch training loop on multiple GPUs or TPUs without using an abstract class you can't control or tweak easily? Try out our new open source library, 🤗 Accelerate!
With just five lines of code to add, your script will run locally (for debugging) as well as on any distributed setup!
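Here is a minimal sketch of what those added lines look like, with a toy model and dataset standing in for your own:

```python
import torch
from torch import nn
from torch.utils.data import DataLoader, TensorDataset
from accelerate import Accelerator

# Toy objects standing in for your real model, optimizer, and data
model = nn.Linear(10, 2)
optimizer = torch.optim.SGD(model.parameters(), lr=0.1)
dataloader = DataLoader(TensorDataset(torch.randn(64, 10),
                                      torch.randint(0, 2, (64,))), batch_size=8)
loss_fn = nn.CrossEntropyLoss()

accelerator = Accelerator()  # picks up the launch configuration (CPU, multi-GPU, TPU)
model, optimizer, dataloader = accelerator.prepare(model, optimizer, dataloader)

for inputs, targets in dataloader:
    optimizer.zero_grad()
    loss = loss_fn(model(inputs), targets)
    accelerator.backward(loss)  # replaces loss.backward()
    optimizer.step()
```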
Community
BigScience
Over 500 leading researchers from around the world are contributing to BigScience to create new understanding and shared scientific artifacts on how Large Language Models behave.
We are proud to start this collaboration; read all about it in this excellent Tech Review piece by Karen Hao!
🎙️ Wav2Vec2 Fine-Tuning Week
During the Wav2Vec2 Fine-Tuning Week, 382 community members came together to democratize state-of-the-art speech recognition technology for over 70 languages - thank you!
Participants fine-tuned a pretrained wav2vec2-large-xlsr-53 checkpoint on a language of their choice. Overall, more than 240 fine-tuned model checkpoints were uploaded to the Model Hub, many of them setting a new state of the art for a low-resource language.
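A minimal sketch of running one of these checkpoints for inference; the German XLSR checkpoint below stands in for any of the uploaded models, and the silent audio is just a placeholder for real 16 kHz speech:

```python
import numpy as np
import torch
from transformers import Wav2Vec2ForCTC, Wav2Vec2Processor

# Swap in any fine-tuned checkpoint from the event; this is one example id
model_id = "facebook/wav2vec2-large-xlsr-53-german"
processor = Wav2Vec2Processor.from_pretrained(model_id)
model = Wav2Vec2ForCTC.from_pretrained(model_id)

# One second of silence as a stand-in for real 16 kHz audio samples
speech = np.zeros(16_000, dtype=np.float32)

inputs = processor(speech, sampling_rate=16_000, return_tensors="pt")
with torch.no_grad():
    logits = model(inputs.input_values).logits
pred_ids = torch.argmax(logits, dim=-1)
print(processor.batch_decode(pred_ids))
```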
👨‍👩‍👧‍👦 Member spotlight
This month we tip our hat to Vasudev Gupta, who did an incredible job contributing Google's BigBird to Transformers. On behalf of the Hugging Face Community, thank you Vasu!
Vasu added both the auto-encoding checkpoint, bigbird-roberta-base, and the seq2seq checkpoint, bigbird-pegasus.
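Both load with the standard from_pretrained API; a minimal sketch (the arXiv-summarization variant below is one of the published bigbird-pegasus checkpoints):

```python
from transformers import BigBirdModel, BigBirdPegasusForConditionalGeneration

# The auto-encoding checkpoint
encoder = BigBirdModel.from_pretrained("google/bigbird-roberta-base")
# One of the seq2seq BigBird-Pegasus variants
seq2seq = BigBirdPegasusForConditionalGeneration.from_pretrained(
    "google/bigbird-pegasus-large-arxiv"
)
```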
Tutorials
Distributed Training on SageMaker
Learn how to use the new Hugging Face DLCs and Amazon SageMaker to train a distributed Seq2Seq Transformer model on a summarization task, using the transformers and datasets libraries.
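A minimal sketch of launching such a job with the Hugging Face estimator from the SageMaker Python SDK; the entry point, role, instance settings, and versions below are placeholders to adapt to your account and the DLC versions available:

```python
from sagemaker.huggingface import HuggingFace

# train.py is your own script built on the transformers and datasets libraries
huggingface_estimator = HuggingFace(
    entry_point="train.py",
    source_dir="./scripts",
    role="YOUR_SAGEMAKER_EXECUTION_ROLE",  # placeholder IAM role
    instance_type="ml.p3dn.24xlarge",
    instance_count=2,
    transformers_version="4.6.1",          # pick a supported DLC combination
    pytorch_version="1.7.1",
    py_version="py36",
    # turn on SageMaker's data-parallel library for distributed training
    distribution={"smdistributed": {"dataparallel": {"enabled": True}}},
    hyperparameters={"epochs": 3, "model_name_or_path": "facebook/bart-large-cnn"},
)
huggingface_estimator.fit()  # optionally pass S3 data channels, e.g. {"train": ...}
```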
Scaling BERT Inference on CPU
We partnered with Intel to uncover all the knobs that speed up inference on modern CPUs.
The result: a new library to dial in and measure your inference setup, with an in-depth blog post digging into the details.
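The post walks through many such knobs; as one generic example (plain PyTorch thread settings, not the new library's own API), here is a small sketch of tuning intra-op and inter-op thread counts:

```python
import torch

# These must be set before any parallel work runs
torch.set_num_threads(8)          # intra-op: threads available to each operator
torch.set_num_interop_threads(1)  # inter-op: operators that may run concurrently

from transformers import AutoModel, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("bert-base-uncased")
model = AutoModel.from_pretrained("bert-base-uncased").eval()

inputs = tokenizer("Benchmarking BERT inference on CPU", return_tensors="pt")
with torch.no_grad():
    outputs = model(**inputs)
```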
Events & Talks
📺 May 26th: PyTorch Community Voices
Transformers core maintainers Sylvain Gugger and Lysandre Debut broke down the Hugging Face ecosystem and took live questions from the PyTorch community.