In this post, I will try to summarize what I have learned and applied during the past two weeks. First of all, our team has prepared the initial briefing of our project and presented it to the group. It was exceptionally useful since we received valuable feedback to work on. In the final briefing, we have decided to add more visualizations, possibly a gif of our data augmentation process, and an additional metric to measure our success.
We have prepared initial data augmentation, which you can find in Narmin's blog. I will not repeat it here đ I was responsible for choosing and running one of the existing solutions from Kaggle. My final choice was the TPU notebook, which was the fastest among all. It had an encoder/decoder architecture with attention units. The score was moderate, although it is very interesting what will happen once we add our augmentations.
I have also read a lot about Vision Transformers and find them really exciting! Probably we will try to implement it for our project soon. Looking forward to transforming the theoretical knowledge I got to practice!