TR2026-084

Model Compression of Vision Transformer for Electric Motor Cogging Torque Prediction


    •  Chen, J., Koike-Akino, T., Wang, Y., Yamamoto, T., Sakamoto, Y., Wang, B., "Model Compression of Vision Transformer for Electric Motor Cogging Torque Prediction", IEEE World Congress on Computational Intelligence, June 2026.
      BibTeX TR2026-084 PDF
      • @inproceedings{Chen2026jun,
      • author = {Chen, Jian and Koike-Akino, Toshiaki and Wang, Ye and Yamamoto, Tatsuya and Sakamoto, Yusuke and Wang, Bingnan},
      • title = {{Model Compression of Vision Transformer for Electric Motor Cogging Torque Prediction}},
      • booktitle = {IEEE World Congress on Computational Intelligence},
      • year = 2026,
      • month = jun,
      • url = {https://www.merl.com/publications/TR2026-084}
      • }
  • MERL Contacts:
  • Research Areas:

    Electric Systems, Machine Learning, Multi-Physical Modeling

Abstract:

Vision Transformer (ViT) models have recently demonstrated strong performance in predicting electric motor cogging torque from visual representations of motor geometry. However, the high computational cost and memory footprint of ViT architectures hinder their deployment in resource- constrained environments such as embedded motor drives and real-time digital twins. This paper investigates post-training model compression techniques for a ViT-based cogging torque prediction model. Specifically, we evaluate some activation-aware pruning methods, which leverage activation statistics to identify redundant weights, as well as low-rank factorization applied to attention and feed-forward layers. Experimental results demonstrate that substantial reductions in parameter count and inference cost can be achieved with negligible degradation in prediction performance. These findings highlight the feasibility of deploying compressed ViT models for efficient and scalable electric motor analysis.