University of Lincoln
Browse

Channel randomisation: Self-supervised representation learning for reliable visual anomaly detection in speciality crops

journal contribution
posted on 2025-01-21, 16:30 authored by Taeyeong Choi, Owen Would, Adrian Salazar Gomez, Xin Liu, Grzegorz CielniakGrzegorz Cielniak

Modern, automated quality control systems for speciality crops utilise computer vision together with a machine learning paradigm exploiting large datasets for learning efficient crop assessment components. To model anomalous visuals, data augmentation methods are often developed as a simple yet powerful tool for manipulating readily available normal samples. State-of-the-art augmentation methods embed arbitrary “structural” peculiarities in normal images to build a classifier of these artefacts (i.e., pretext task), enabling self-supervised representation learning of visual signals for anomaly detection (i.e., downstream task). In this paper, however, we argue that learning such structure-sensitive representations may be suboptimal for agricultural anomalies (e.g., unhealthy crops) that could be better recognised by a different type of visual element like “colour”.


To be specific, we propose Channel Randomisation (CH-Rand)—a novel data augmentation method that forces deep neural networks to learn effective encoding of “colour irregularities” under self-supervision whilst performing a pretext task to discriminate channel-randomised images. Extensive experiments are performed across various types of speciality crops (apples, strawberries, oranges, and bananas) to validate the informativeness of learnt representations in detecting anomalous instances. Our results demonstrate that CH-Rand’s representations are significantly more reliable and robust, outperforming state-of-the-art methods (e.g., CutPaste) that learn structural representations by over 43% in Area Under the Precision–Recall Curve (AUC–PR), particularly for strawberries. Additional experiments suggest that adopting the L∗a∗b∗ colour space and “curriculum” learning in the pretext task — gradually disregarding channel combinations for unrealistic outcomes — further improves downstream-task performance by 16% in AUC–PR. In particular, our experiments employ Riseholme-2021, a novel speciality crop dataset consisting of 3.5K real strawberry images gathered in situ from the real farm, along with the Fresh & Stale public dataset. All our code and datasets are made publicly available online to ensure reproducibility and encourage further research in agricultural technologies.

Funding

Research England Expanding Excellence in England funding for Lincoln Agri-Robotics

USDA National Institute of Food and Agriculture, USA (USDA-020-67021-32855)

National Science Foundation, USA (OIA-2134901)

History

School affiliated with

  • Lincoln Institute for Agri-Food Technology (Research Outputs)

Publication Title

Computers and Electronics in Agriculture

Volume

226

Pages/Article Number

109416

Publisher

Elsevier

ISSN

0168-1699

eISSN

1872-7107

Date Accepted

2024-08-28

Date of First Publication

2024-09-27

Date of Final Publication

2024-11-01

Open Access Status

  • Not Open Access

Date Document First Uploaded

2024-12-20

Will your conference paper be published in proceedings?

  • N/A

Usage metrics

    University of Lincoln (Research Outputs)

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC