Research:
My interest lies in practical machine learning methods and their theoretical analysis, spanning areas such as Out-of-Distribution Generalization, Trustworthy Machine Learning, and Generative Models. Specifically, I am interested in developing methods that enhance models' capacity to learn predictive representations. Moreover, I strongly believe that the potential of large deep learning models is yet to be fully explored. My ultimate goal is to create a unified, robust, and generalizable machine learning method that is cost-efficient and accessible to everyone.
In this paper, we focus on a realistic yet challenging task, Single Domain Generalization Object Detection (S-DGOD), where only one source domain's data can be used for training object detectors, which must then generalize to multiple distinct target domains. Due to the task's complexity, S-DGOD demands both high-capacity fitting and strong generalization. Differentiable Neural Architecture Search (NAS) is known for its high capacity for fitting complex data, and we propose to leverage Differentiable NAS to solve S-DGOD. However, it may suffer severe over-fitting due to the feature imbalance phenomenon: parameters optimized by gradient descent are biased toward easy-to-learn features, which are usually non-causal and spuriously correlated with the ground-truth labels, such as background features in object detection data. This leads to serious performance degradation, especially when generalizing to unseen target domains with large domain gaps from the source domain. To address this issue, we propose the Generalizable loss (G-loss), an OoD-aware objective that prevents NAS from over-fitting by using gradient descent to optimize parameters not only on a subset of easy-to-learn features but also on the remaining predictive features; the overall framework is named G-NAS. Experimental results on S-DGOD urban-scene datasets demonstrate that the proposed G-NAS achieves state-of-the-art performance compared to baseline methods.
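The abstract does not give the exact form of G-loss, so the following numpy sketch only illustrates the underlying idea: rebalancing the gradient signal so that optimization does not concentrate on the easy-to-learn features alone. The function name, the `easy_mask` split, and the fixed weighting scheme are assumptions for illustration, not the paper's formulation.

```python
import numpy as np

def balanced_loss_sketch(per_feature_losses, easy_mask, alpha=0.5):
    """Hypothetical sketch of an OoD-aware objective: average the loss over
    easy-to-learn features and over the remaining predictive features
    separately, then mix them so neither group dominates the gradient."""
    easy = per_feature_losses[easy_mask].mean()       # easy-to-learn group
    rest = per_feature_losses[~easy_mask].mean()      # remaining predictive group
    # alpha upweights the remaining features to counter feature imbalance
    return (1.0 - alpha) * easy + alpha * rest
```

With `alpha = 0.5` the two feature groups contribute equally regardless of group size, which is the simplest way to keep gradient descent from being dominated by the (typically more numerous and lower-loss) easy features.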
ICLR
Object Detection with OOD Generalizable Neural Architecture Search
Fan Wu, Kaican Li, Jinling Gao, and
5 more authors
To improve Out-of-Distribution (OOD) generalization in object detection, we present a Neural Architecture Search (NAS) framework guided by feature orthogonalization. We believe that the failure to generalize on OOD data stems from spurious correlations between category-related features and context-related features. Category-related features describe causal information for predicting the target objects, such as “a car with four wheels”, while context-related features describe non-causal information, such as “a car driving at night”. However, due to the distinct data distributions of the training and testing sets, context-related features are often mistaken for causal information. To address this, we aim to automatically discover an optimal architecture that disentangles the category-related and context-related features with a novel weight-based detector head. Both theoretical and experimental results show that the proposed scheme achieves disentanglement and better performance on both IID and OOD data.
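As a rough illustration of feature orthogonalization (not the paper's weight-based detector head, whose details are not given in this abstract), a disentanglement objective can penalize overlap between two sets of projection weights. The penalty below is zero exactly when the row spaces of the two weight matrices are orthogonal; all names here are hypothetical.

```python
import numpy as np

def orthogonality_penalty(W_cat, W_ctx):
    """Hypothetical disentanglement penalty: squared Frobenius norm of the
    cross-Gram matrix W_cat @ W_ctx.T. It vanishes iff every category-related
    projection direction is orthogonal to every context-related one."""
    return float(np.sum((W_cat @ W_ctx.T) ** 2))
```

Adding such a term to the training loss pushes the category-related and context-related heads to read out non-overlapping directions of the shared feature space, which is one common way to encourage the kind of disentanglement described above.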
Journal
AGNet: Automatic generation network for skin imaging reports
Fan Wu, Haiqiong Yang, Linlin Peng, and
5 more authors
Medical imaging has been increasingly adopted in medical diagnosis, especially for skin diseases, where diagnoses based on skin pathology are extremely accurate. The diagnostic reports of skin pathology images have the distinguishing features of extreme repetitiveness and rigid formatting. However, reports written by inexperienced radiologists and pathologists can have a high error rate, and even experienced clinicians find the reporting task tedious and time-consuming. To address this challenge, this paper studies the automatic generation of diagnostic reports from images of skin pathologies. We propose a novel deep learning-based image caption framework named the automatic generation network (AGNet), an effective network for the automatic generation of skin imaging reports. The proposed AGNet consists of four parts: (1) the image model, which extracts features and classifies images; (2) the language model, which encodes data and generates words in comprehensible language; (3) the attention module, which connects the “tail” of the image model to the “head” of the language model and computes the relationship between images and captions; and (4) the embedding and labeling module, which processes the input caption data. In a case study, AGNet is verified on a skin pathological image dataset and compared with several state-of-the-art models. The results show that AGNet achieves the highest image-captioning evaluation scores among all comparison models, demonstrating the promising performance of the proposed method.
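To illustrate the role of the attention module that bridges the image model's “tail” and the language model's “head”, here is a minimal scaled dot-product attention sketch in numpy. AGNet's actual attention mechanism is not specified in this abstract, so every name and shape below is an assumption made for illustration.

```python
import numpy as np

def attention_bridge(image_feats, word_query):
    """Hypothetical sketch: attend over image region features given the
    language model's current query vector, returning a weighted image
    context to condition the next generated word on."""
    d = word_query.shape[0]
    scores = image_feats @ word_query / np.sqrt(d)   # one score per region
    weights = np.exp(scores - scores.max())          # numerically stable softmax
    weights /= weights.sum()
    return weights @ image_feats                     # attended image context
```

At each decoding step the language model would supply a fresh query, so the generated caption can focus on different image regions for different words.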