TY - GEN
T1 - State-Aware Configuration Detection for Augmented Reality Step-by-Step Tutorials
AU - Stanescu, Ana
AU - Mohr, Peter
AU - Kozinski, Mateusz
AU - Mori, Shohei
AU - Schmalstieg, Dieter
AU - Kalkofen, Denis
PY - 2023/10
Y1 - 2023/10
N2 - Presenting tutorials in augmented reality is a compelling application area, but previous attempts have been limited to objects with only a small numbers of parts. Scaling augmented reality tutorials to complex assemblies of a large number of parts is difficult, because it requires automatically discriminating many similar-looking object configurations, which poses a challenge for current object detection techniques. In this paper, we seek to lift this limitation. Our approach is inspired by the observation that, even though the number of assembly steps may be large, their order is typically highly restricted: Some actions can only be performed after others. To leverage this observation, we enhance a state-of-the-art object detector to predict the current assembly state by conditioning on the previous one, and to learn the constraints on consecutive states. This learned 'consecutive state prior' helps the detector disambiguate configurations that are otherwise too similar in terms of visual appearance to be reliably discriminated. Via the state prior, the detector is also able to improve the estimated probabilities that a state detection is correct. We experimentally demonstrate that our technique enhances the detection accuracy for assembly sequences with a large number of steps and on a variety of use cases, including furniture, Lego and origami. Additionally, we demonstrate the use of our algorithm in an interactive augmented reality application.
AB - Presenting tutorials in augmented reality is a compelling application area, but previous attempts have been limited to objects with only a small numbers of parts. Scaling augmented reality tutorials to complex assemblies of a large number of parts is difficult, because it requires automatically discriminating many similar-looking object configurations, which poses a challenge for current object detection techniques. In this paper, we seek to lift this limitation. Our approach is inspired by the observation that, even though the number of assembly steps may be large, their order is typically highly restricted: Some actions can only be performed after others. To leverage this observation, we enhance a state-of-the-art object detector to predict the current assembly state by conditioning on the previous one, and to learn the constraints on consecutive states. This learned 'consecutive state prior' helps the detector disambiguate configurations that are otherwise too similar in terms of visual appearance to be reliably discriminated. Via the state prior, the detector is also able to improve the estimated probabilities that a state detection is correct. We experimentally demonstrate that our technique enhances the detection accuracy for assembly sequences with a large number of steps and on a variety of use cases, including furniture, Lego and origami. Additionally, we demonstrate the use of our algorithm in an interactive augmented reality application.
KW - Computing methodologies
KW - Human computer interaction (HCI)
KW - Human-centered computing
KW - Interaction paradigms
KW - Learning from demonstrations
KW - Learning settings
KW - Machine learning
KW - Mixed / augmented reality
UR - http://www.scopus.com/inward/record.url?scp=85180375618&partnerID=8YFLogxK
U2 - 10.1109/ISMAR59233.2023.00030
DO - 10.1109/ISMAR59233.2023.00030
M3 - Conference contribution
AN - SCOPUS:85180375618
SN - 979-8-3503-2839-4
T3 - Proceedings - IEEE International Symposium on Mixed and Augmented Reality
SP - 157
EP - 166
BT - Proceedings - 2023 IEEE International Symposium on Mixed and Augmented Reality
A2 - Bruder, Gerd
A2 - Olivier, Anne-Hélène
A2 - Cunningham, Andrew
A2 - Peng, Evan Yifan
A2 - Grubert, Jens
A2 - Williams, Ian
PB - Institute of Electrical and Electronics Engineers
CY - United States of America
T2 - 22nd IEEE International Symposium on Mixed and Augmented Reality
Y2 - 16 October 2023 through 20 October 2023
ER -