This week I am working on reimplementing experiments in fine-grained visual classification. The dataset used for this study is CUB-200-2011, a fine-grained bird classification dataset.
Summary
Method | Top-1 Accuracy (%), My Result | Top-1 Accuracy (%), Original Result | Code |
---|---|---|---|
FFVT | 91.62 | 91.6 | link |
ViT-NeT | 91.6 | 91.7 | link |
TransFG | NA | 91.7 | link |
IELT | 91.267 | 91.8 | link |
SAC | NA | 91.8 | link |
HERBS | 93.01 | 93.1 | link |
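For readers unfamiliar with the metric in the table, top-1 accuracy is the percentage of test images whose highest-scoring predicted class matches the ground-truth label. The snippet below is a minimal sketch of that calculation in plain Python. The function name `top1_accuracy` and the list-of-lists input format are assumptions made for illustration; each repository listed above has its own evaluation code.

```python
def top1_accuracy(scores, labels):
    """Return top-1 accuracy in percent.

    scores: one list of per-class scores per image (hypothetical input format)
    labels: ground-truth class index for each image
    """
    if len(scores) != len(labels):
        raise ValueError("scores and labels must have the same length")
    if not labels:
        raise ValueError("need at least one sample")

    correct = 0
    for row, label in zip(scores, labels):
        # Index of the highest score is the top-1 prediction.
        predicted = max(range(len(row)), key=row.__getitem__)
        if predicted == label:
            correct += 1
    return 100.0 * correct / len(labels)
```

For example, `top1_accuracy([[0.1, 0.9], [0.8, 0.2], [0.3, 0.7]], [1, 0, 0])` gets two of three predictions right, so it returns roughly 66.67.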
Details
- FFVT
![](https://blogs.gwu.edu/pless/files/2024/05/Screenshot-2024-05-25-at-3.19.32 PM-1024x87.png)
- IELT
![](https://blogs.gwu.edu/pless/files/2024/05/Screenshot-2024-05-29-at-12.32.30 PM-1024x162.png)
- ViT-NeT
![](https://blogs.gwu.edu/pless/files/2024/05/Screenshot-2024-05-20-at-9.16.03 PM.png)
- HERBS
![](https://blogs.gwu.edu/pless/files/2024/05/image-1.png)
- TransFG: Due to GPU limitations, the training steps were not completed. However, I successfully migrated the workflow to Google Colab and optimized the data loading steps, reducing the time from 40 minutes to 2 minutes.
![](https://blogs.gwu.edu/pless/files/2024/05/Screenshot-2024-05-27-at-12.31.56 PM-1.png)
- SAC: Learned how to set up a virtual environment with TensorFlow 1.45 and Python 3.7, but ran into issues on a MacBook due to hardware limitations.
![](https://blogs.gwu.edu/pless/files/2024/05/Pasted-image-20240523222646-1-1024x231.png)
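The TransFG note above mentions a faster data loading step but does not say how it was achieved. One common way to cut loading time when a dataset consists of many small image files is to read them concurrently instead of one at a time. The sketch below shows that general idea with Python's standard library only. The helper names `read_bytes` and `load_files_parallel` are hypothetical, and this is not necessarily the change made in the Colab workflow.

```python
from concurrent.futures import ThreadPoolExecutor
from pathlib import Path


def read_bytes(path):
    """Read one file from disk and return its raw bytes."""
    return Path(path).read_bytes()


def load_files_parallel(paths, max_workers=8):
    """Read many files concurrently (hypothetical helper).

    Threads help here because file reads are I/O-bound.
    Results are returned in the same order as `paths`.
    """
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        return list(pool.map(read_bytes, paths))
```

The raw bytes would still need to be decoded into images afterwards. How much this helps depends on where the files live, so the speedup should be measured rather than assumed.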