Step-by-Step Distillation: New Method for Small Models to Rival Large Models
Machine Heart Report
Editor: Rome

Large language models have remarkable capabilities, but their size makes them very costly to deploy. To address this problem, researchers from the University of Washington, in collaboration with Google Cloud AI Research and Google Research, have proposed the Distilling Step-by-Step paradigm to …