Embracing limited and imperfect datasets in plant disease recognition using deep learning.
Currently maintained by Mingle Xu.
We are collecting public plant disease datasets in PPDRD project.
Recent advancements in deep learning have brought significant improvements to plant disease recognition. However, achieving satisfactory performance often requires high-quality training datasets, which are challenging and expensive to collect. Consequently, the practical application of current deep learning-based methods in real-world scenarios is hindered by the scarcity of high-quality datasets. Consequently, we argue that embracing non-high-quality datasets is more convincing and practical. Although this embrace brings opportunities, new challenges exist. A taxonomy of related challenges is, therefore, proposed to enrich our understandings. With this perspective, we do hope that deep learning can be implemented in real-world applications with satisfactory performance.
- Limited data: the training dataset is not in large-scale.
-
- Class-level: consider the variation of different classes within the training dataset.
-
- Dataset-level: consider the diversity between the training and test datasets.
- Imperfect data: the training dataset is annotated in an undesired way.