TR2025-149

Joint Training of Image Generator and Detector for Road Defect Detection


Abstract:

Road defect detection is important for road authorities to reduce the vehicle damage caused by road defects. Considering the practical scenarios where the defect detectors are typically deployed on edge devices with limited memory and computational resource, we aim at performing road defect detection without using ensemble-based methods or test-time augmentation (TTA). To this end, we propose to Jointly Train the image Generator and Detector for road defect detection (dubbed as JTGD). We design the dual discriminators for the generative model to en- force both the synthesized defect patches and overall images to look plausible. The synthesized image quality is improved by our proposed CLIP-based Frechet Inception Distance loss. The generative model in JTGD is trained jointly with the detector to encourage the generative model to synthesize harder examples for the detector. Since harder synthesized images of better quality caused by the aforesaid design are used in the data augmentation, JTGD outperforms the state-of-the-art method in the RDD2022 road defect detection benchmark across various countries under the condition of no ensemble and TTA. JTGD only uses less than 20% of the number of parameters compared with the competing baseline, which makes it more suitable for deployment on edge devices in practice.

 

  • Related News & Events

    •  NEWS    MERL Papers, Workshops, and Talks at ICCV 2025
      Date: October 19, 2025 - October 23, 2025
      Where: Honolulu, HI, USA
      MERL Contacts: Petros T. Boufounos; Anoop Cherian; Toshiaki Koike-Akino; Hassan Mansour; Tim K. Marks; Pedro Miraldo; Kuan-Chuan Peng; Pu (Perry) Wang
      Research Areas: Artificial Intelligence, Computer Vision, Machine Learning, Signal Processing
      Brief
      • MERL researchers presented 3 conference papers and 3 workshop papers, co-organized 2 workshops, and delivered 2 invited talks at the IEEE International Conference on Computer Vision (ICCV) 2025, which was held in Honolulu, HI, USA from October 19-23, 2025. ICCV is one of the most prestigious and competitive international conferences in the area of computer vision. Details of MERL contributions are provided below:


        Main Conference Papers:

        1. "SAC-GNC: SAmple Consensus for adaptive Graduated Non-Convexity" by V. Piedade, C. Sidhartha, J. Gaspar, V. M. Govindu, and P. Miraldo. (Highlight Paper)
        Paper: https://www.merl.com/publications/TR2025-146

        2. "Toward Long-Tailed Online Anomaly Detection through Class-Agnostic Concepts" by C.-A. Yang, K.-C. Peng, and R. A. Yeh.
        Paper: https://www.merl.com/publications/TR2025-124

        3. "Manual-PA: Learning 3D Part Assembly from Instruction Diagrams" by J. Zhang, A. Cherian, C. Rodriguez-Opazo, W. Deng, and S. Gould.
        Paper: https://www.merl.com/publications/TR2025-139


        MERL Co-Organized Workshops:

        1. "The Workshop on Anomaly Detection with Foundation Models (ADFM)" by K.-C. Peng, Y. Zhao, and A. Aich.
        Workshop link: https://adfmw.github.io/iccv25/

        2. "The 8th International Workshop on Computer Vision for Physiological Measurement (CVPM)" by D. McDuff, W. Wang, S. Stuijk, T. Marks, H. Mansour, V. R. Shenoy.
        Workshop link: https://sstuijk.estue.nl/cvpm/cvpm25/


        MERL Keynote Talks at Workshops:

        1. Tim K. Marks, Keynote Speaker at the Workshop on Computer Vision for Physiological Measurement (CVPM).
        Workshop website: https://vineetrshenoy.github.io/cvpmSeptember2025/

        2. Tim K. Marks, Keynote Speaker at the Workshop on Analysis and Modeling of Faces and Gestures (AMFG).
        Workshop website: https://fulab.sites.northeastern.edu/amfg2025/


        Workshop Papers:

        1. "Joint Training of Image Generator and Detector for Road Defect Detection" by K.-C. Peng.
        paper: https://www.merl.com/publications/TR2025-149

        2. "Radar-Conditioned 3D Bounding Box Diffusion for Indoor Human Perception" by R. Yataka, P. Wang, P.T. Boufounos, and R. Takahashi.
        paper: https://www.merl.com/publications/TR2025-154

        3. "L-GGSC: Learnable Graph-based Gaussian Splatting Compression" by S. Kato, T. Koike-Akino, and T. Fujihashi.
        paper: https://www.merl.com/publications/TR2025-148
    •  
  • Related Video

  • Related Publication

  •  Peng, K.-C., "Joint Training of Image Generator and Detector for Road Defect Detection", arXiv, September 2025.
    BibTeX arXiv
    • @article{Peng2025sep,
    • author = {Peng, Kuan-Chuan},
    • title = {{Joint Training of Image Generator and Detector for Road Defect Detection}},
    • journal = {arXiv},
    • year = 2025,
    • month = sep,
    • url = {https://arxiv.org/abs/2509.03465}
    • }