Adaptive Compression of Word Embeddings
Author: Yeachan Kim, Kang-Min Kim, SangKeun Lee
Year of publication: 2020
Subject: Theoretical computer science; artificial neural networks; computer science; memory footprint; word embeddings
Source: ACL
DOI: 10.18653/v1/2020.acl-main.364
Description: Distributed representations of words have been an indispensable component of natural language processing (NLP) tasks. However, the large memory footprint of word embeddings makes it challenging to deploy NLP models on memory-constrained devices (e.g., self-driving cars, mobile devices). In this paper, we propose a novel method to adaptively compress word embeddings. We fundamentally follow a code-book approach that represents words as discrete codes such as (8, 5, 2, 4). However, unlike prior works that assign codes of the same length to all words, we adaptively assign codes of different lengths to each word by learning from downstream tasks. The proposed method works in two steps. First, each word learns to select its code length in an end-to-end manner by applying the Gumbel-softmax trick. After selecting the code length, each word learns its discrete codes through a neural network with a binary constraint. To showcase the general applicability of the proposed method, we evaluate its performance on four different downstream tasks. Comprehensive evaluation results clearly show that our method is effective and yields highly compressed word embeddings without hurting task accuracy. Moreover, we show that our model assigns words to code-books according to their significance to the task. (A rough sketch of this two-step scheme appears after this record.)
Database: OpenAIRE
External link:
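
Below is a minimal sketch of how the adaptive code-book scheme described above could be wired up. It is not the authors' implementation: the module name, the dimensions, and the use of a Gumbel-softmax relaxation for the per-book codeword choice (the paper instead learns discrete codes under a binary constraint) are assumptions made for illustration; only the Gumbel-softmax selection of a per-word code length follows the description directly.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F


class AdaptiveCodebookEmbedding(nn.Module):
    """Hypothetical sketch: each word uses an adaptively chosen number of code-books."""

    def __init__(self, vocab_size, dim, max_books=8, codes_per_book=32, tau=1.0):
        super().__init__()
        self.tau = tau
        # Per-word logits over the possible code lengths 1..max_books.
        self.length_logits = nn.Parameter(torch.zeros(vocab_size, max_books))
        # Per-word logits over the codewords of every code-book.
        self.code_logits = nn.Parameter(torch.zeros(vocab_size, max_books, codes_per_book))
        # Shared codeword vectors: one table of codewords per code-book.
        self.codebooks = nn.Parameter(torch.randn(max_books, codes_per_book, dim) * 0.1)

    def forward(self, word_ids):
        # Step 1: pick a code length per word with the Gumbel-softmax trick.
        # length_choice[b, k] is one-hot, meaning "word b uses books 0..k".
        length_choice = F.gumbel_softmax(self.length_logits[word_ids], tau=self.tau, hard=True)
        # Turn that choice into a mask over books: mask[b, j] = 1 iff j <= chosen k.
        mask = torch.flip(torch.cumsum(torch.flip(length_choice, [-1]), -1), [-1])
        # Step 2: pick one codeword per book (relaxed here; the paper learns binary codes).
        code_choice = F.gumbel_softmax(self.code_logits[word_ids], tau=self.tau, hard=True)
        # Gather the chosen codeword of each book, then sum only the books in use.
        vectors = torch.einsum('bjc,jcd->bjd', code_choice, self.codebooks)
        return (vectors * mask.unsqueeze(-1)).sum(dim=1)


emb = AdaptiveCodebookEmbedding(vocab_size=10000, dim=64)
print(emb(torch.tensor([1, 42, 999])).shape)  # -> torch.Size([3, 64])
```

Because both choices are straight-through Gumbel-softmax samples, the per-word code length and the code assignments can be trained end-to-end against the downstream task loss, which is the property the abstract emphasizes; the storage saving comes from keeping only the small integer codes and the shared code-books at inference time.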