Karsten Roth, a PhD student at the University of Tübingen, was awarded the EMVA Young Professional Award for work done during his science internship at Amazon on “Towards Total Recall in Industrial Anomaly Detection”. Roth received the award on May 13 during the 20th EMVA Business Conference in Brussels.
The EMVA Young Professional Award is an annual award “to honor the outstanding and innovative work of a student or a young professional in the field of machine vision or image processing.”
The paper Roth authored, along with applied scientist Latha Pemula, former senior applied scientist Joaquin Zepeda, vice president and distinguished scientist Bernhard Schölkopf, Amazon Scholar Thomas Brox, and senior applied scientist Peter Gehler, who was also Roth’s supervisor, addresses what is known as the cold start problem in anomaly detection.
“The project was for the service that became Amazon Lookout for Vision, which helps businesses, usually manufacturers, spot defects and anomalies,” Gehler explained. Lookout for Vision utilizes few-shot learning, meaning it is possible to assess products for defects based on relatively few examples of defective and normal products. However, a challenge arises when businesses only have nominal, or non-defective, product images available.
“The problem that we looked at is the cold start problem, where you don’t have any negative examples or defects,” Gehler said. The challenge extends to detecting defects that are hard to capture (eg thin scratches or missing parts) as well as spotting unforeseen and unanticipated defects.
“Our research was motivated by a real and practical challenge,” Roth explained. “The solution we developed had to be one of practical nature; driven to optimize performance and usability before caring about the ‘academic wow-factor’”.
The team identified a solution to the challenge by utilizing images of non-defective items. “There were many variants that we studied,” Gehler said. “The best option was based on building an image library of good examples and then testing nearness against this library.” Gehler noted that the question then becomes how ‘nearness’ is defined.
“Being near to an example depends on how you take the image and how you encode the image,” he said. He added that, drawing from principles of deep learning architecture, the solution is to “turn an image into a series of vectors, and then you compare those vectors — and there’s multiple ways of doing this at multiple stages.” By comparing products against non-defective examples and determining how closely or how far apart from that ideal the product is, defects can be flagged — even without actual examples of defects.
The team’s solution, PatchCore, achieved an image-level anomaly detection (AUROC) score of up to 99.6% on the widely used MVTec AD benchmark, “more than halving the error compared to the next best competitor.” Roth presented PatchCore this week at CVPR 2022.
Gehler said Roth’s role was vital. “Karsten produced so many results each week that we didn’t have time to cover all of them in our hour-long weekly meeting.”
“Receiving this year’s EMVA Young Professional Award provides further validation for the relevance and importance of this line of research, and I am delighted to have been awarded such a high-profile award,” Roth said.
He added that he is indebted to the team he interned under at Amazon.
“A somewhat unexpected, but very welcome consequence of conducting research at Amazon were significant improvements in the quality of my code and the overall documentation of my research progress,” he said, “with particular thanks to Peter who made the whole research internship a superb experience.”