Natural Language Processing (NLP) has significantly advanced in the last few years. It became hard to keep track with all the changes and technologies in the field. These selected articles will help you to keep-current with NLP.
Self-Supervised Learning keynote talk by Yann LeCunn; Perceiver - computation efficient transformer algorithm; Data Visualization also for summaries and more news in this monthly update.
To overcome Transformers' squared complexity (w.r.t input length), the Perceiver article here offers a novel method to learn the QKV matrices. Check it out!
The article suggests a method to lower the Transformer's complexity to a linear order and proves all the arguments also in a rigorous form. The article is not easy to read, but luckily, to understand the main idea, the first 5-6 pages are more than enough.