Knowledge distillation Archives

0

New speech recognition experiments demonstrate how machine learning can scale

Customer interactions with Alexa are constantly growing more complex, and on the Alexa science team, we strive to stay ahead of the curve by continuously ...

rockstaryreviews August 4, 2024

READ MORE +

0

More-Efficient Machine Learning Models for On-Device Operation

Neural networks are responsible for most recent advances in artificial intelligence, including many of Alexa’s latest capabilities. But neural networks tend ...

rockstaryreviews August 3, 2024

READ MORE +

0

New sound detection approach improves on state of the art

Sound detection is a popular application of today’s smart speakers. Alexa customers who activate Alexa Guard when they leave the house, for instance, ...

rockstaryreviews July 23, 2024

READ MORE +

0

Ensuring that new language-processing models don’t backslide

The models behind machine learning (ML) services are continuously being updated, and the new models are usually more accurate than the old ones. But an ...

rockstaryreviews July 12, 2024

READ MORE +

0

Knowledge distillation for better convergence in multitask learning

Validation curves in a five-task multitask learning setup, where training minimizes the sum of the task losses. The tasks ...

rockstaryreviews June 26, 2024

READ MORE +

0

Domain data trumps teacher knowledge for distilling NLU models

Knowledge distillation is a popular technique for compressing large machine learning models into manageable sizes, to make them suitable for low-latency ...

rockstaryreviews June 17, 2024

READ MORE +

0

Teaching language models to reason consistently

Teaching large language models (LLMs) to reason is an active topic of research in natural-language processing, and a popular approach to that problem is the ...

rockstaryreviews June 13, 2024

READ MORE +

0

Using teacher knowledge at inference time to enhance student model

Knowledge distillation (KD) is one of the most effective ways to deploy large-scale language models in environments where low latency is essential. KD ...

rockstaryreviews June 5, 2024

READ MORE +

0

Building geospatial foundation models via continual pretraining

Geospatial technologies have rapidly ascended to a position of paramount importance across the globe. By providing a better understanding of Earth's ...

rockstaryreviews May 17, 2024

READ MORE +

0

Knowledge distillation method for better vision-language models

Large machine learning models based on the transformer architecture have recently demonstrated extraordinary results on a range of vision and language ...

rockstaryreviews May 11, 2024

READ MORE +

Shopping cart