There are some questions that I believe can have a positive impact on the successful implementation of the project in the name of time, accuracy, and scope.
- As it is always discussed how much data is important. So, in our case, there are some publicly available data already but to have the model tested in the real environment, the creation of the custom dataset is one of the goals. I would like to hear about the successes and failures you have had in your own projects and what was the reasons, whether it is a common pattern or depending on the scope and content of the project.
- The second question can be about the quality of the data. How to successfully detect the format of the data required to gather such as if video whether it should be mp4, Flv, MOV or totally another one. So, some formats are taking less storage but also carry less information. Is there any pre-defined method to find detect the one without trial/fail period?
- How to find/decide on the correct metrics to evaluate the success rate of the project? Let's say it is up to us and no input has been given by the stakeholders. What strategy do you follow?
- Let's assume the model is built and working fine about some average and there is nothing that seems to be wrong. How to debug the system to see whether there is any mathematical mistake in the computation which just ended up not giving so bad but in the long run it would fail? What methods do you follow to be sure that the model is correct on the mathematical base?