PAPER: Vector-vector-matrix Architecture: a Novel Hardware-aware Framework for Low-latency Inference in NLP Applications
Release Date: 2021-04-07
Blog