BERT Lightweight: Optimal Parameter Subset Bort at 16% Size
Zheng Jiyang from Aofeisi QbitAI Report | WeChat Official Account QbitAI Recently, the Amazon Alexa team released a research achievement: researchers performed parameter selection on the BERT model, obtaining the optimal parameter subset of BERT—Bort. The research results indicate that Bort is only 16% the size of BERT-large, but its speed on CPU is 7.9 … Read more