Defect Spectrum: A Granular Look of Large-Scale Defect Datasets with Rich Semantics

Shuai Yang¹³,¹⁴,
Zhifei Chen¹³,
Pengguang Chen¹⁵,
Xi Fang¹⁵,
Yixun Liang¹³,
Shu Liu¹⁵ &
Yingcong Chen¹³,¹⁴,¹⁶

Part of the book series: Lecture Notes in Computer Science (LNCS, volume 15065)

Included in the following conference series: European Conference on Computer Vision


Abstract

Defect inspection is paramount within the closed-loop manufacturing system. However, existing datasets for defect inspection often lack the precision and semantic granularity required for practical applications. In this paper, we introduce the Defect Spectrum, a comprehensive benchmark that offers precise, semantically rich, and large-scale annotations for a wide range of industrial defects. Building on four key industrial benchmarks, our dataset refines existing annotations and introduces rich semantic details, distinguishing multiple defect types within a single image. With our dataset, we achieved an increase of 10.74% in the Recall rate and a decrease of 33.10% in the False Positive Rate (FPR) in the industrial simulation experiment. Furthermore, we introduce Defect-Gen, a two-stage diffusion-based generator designed to create high-quality and diverse defective images, even when working with limited defective data. The synthetic images generated by Defect-Gen significantly enhance the performance of defect segmentation models, improving mIoU scores by up to 9.85 on Defect-Spectrum subsets. Overall, the Defect Spectrum dataset demonstrates its potential in defect inspection research, offering a solid platform for testing and refining advanced models. Our project page is at https://envision-research.github.io/Defect_Spectrum/.
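
For readers unfamiliar with the three metrics quoted in the abstract (Recall, FPR, and mIoU), the following is a minimal sketch of their standard textbook definitions. It is not the authors' evaluation code: the function names, the image-level binary labelling (1 = defective), and the choice to skip classes absent from both masks are assumptions made for illustration only.

```python
def recall_and_fpr(y_true, y_pred):
    """Recall and false positive rate for binary labels (1 = defective).

    Assumption: one label per inspected item; the paper's own protocol
    may count detections differently.
    """
    tp = sum(1 for t, p in zip(y_true, y_pred) if t and p)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t and not p)
    fp = sum(1 for t, p in zip(y_true, y_pred) if not t and p)
    tn = sum(1 for t, p in zip(y_true, y_pred) if not t and not p)
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    fpr = fp / (fp + tn) if (fp + tn) else 0.0
    return recall, fpr


def mean_iou(true_mask, pred_mask, num_classes):
    """Mean intersection-over-union across classes for 2D label masks.

    Masks are nested lists of integer class ids with identical shapes.
    Classes that appear in neither mask are skipped (an assumption).
    """
    pairs = [
        (t, p)
        for true_row, pred_row in zip(true_mask, pred_mask)
        for t, p in zip(true_row, pred_row)
    ]
    ious = []
    for c in range(num_classes):
        inter = sum(1 for t, p in pairs if t == c and p == c)
        union = sum(1 for t, p in pairs if t == c or p == c)
        if union:
            ious.append(inter / union)
    return sum(ious) / len(ious) if ious else 0.0


if __name__ == "__main__":
    # Toy labels: 4 defective and 4 defect-free items.
    labels = [1, 1, 1, 0, 0, 0, 0, 1]
    preds = [1, 1, 0, 0, 1, 0, 0, 1]
    print(recall_and_fpr(labels, preds))

    # Toy 2x3 masks with background (0) and two defect classes (1, 2).
    gt = [[0, 0, 1], [0, 2, 2]]
    pr = [[0, 1, 1], [0, 2, 0]]
    print(mean_iou(gt, pr, num_classes=3))
```

Multi-class masks matter here because the dataset distinguishes several defect types within a single image, which a binary defect/no-defect mask cannot express.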

S. Yang and Z. Chen—These authors contributed equally to this work.

Author information

Authors and Affiliations

Hong Kong University of Science and Technology, Guangzhou, China
Shuai Yang, Zhifei Chen, Yixun Liang & Yingcong Chen
HKUST(GZ) - SmartMore Joint Lab, Guangzhou, China
Shuai Yang & Yingcong Chen
SmartMore. Corp, Shatin, Hong Kong
Pengguang Chen, Xi Fang & Shu Liu
Hong Kong University of Science and Technology, Kowloon, Hong Kong
Yingcong Chen

Corresponding author

Correspondence to Shuai Yang.

Editor information

Editors and Affiliations

University of Birmingham, Birmingham, UK
Aleš Leonardis
University of Trento, Trento, Italy
Elisa Ricci
Technical University of Darmstadt, Darmstadt, Germany
Stefan Roth
Princeton University, Princeton, NJ, USA
Olga Russakovsky
Czech Technical University in Prague, Prague, Czech Republic
Torsten Sattler
École des Ponts ParisTech, Marne-la-Vallée, France
Gül Varol

1 Electronic supplementary material

Below is the link to the electronic supplementary material.

Supplementary material 1 (pdf 18502 KB)

About this paper

Cite this paper

Yang, S. et al. (2025). Defect Spectrum: A Granular Look of Large-Scale Defect Datasets with Rich Semantics. In: Leonardis, A., Ricci, E., Roth, S., Russakovsky, O., Sattler, T., Varol, G. (eds) Computer Vision – ECCV 2024. ECCV 2024. Lecture Notes in Computer Science, vol 15065. Springer, Cham. https://doi.org/10.1007/978-3-031-72667-5_11

DOI: https://doi.org/10.1007/978-3-031-72667-5_11
Published: 29 September 2024
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-72666-8
Online ISBN: 978-3-031-72667-5
eBook Packages: Computer Science, Computer Science (R0)
