"Monitoring Bird Biodiversity" is the study of utilizing Artificial Intelligence to teach the computer to recognize Bird Calls from sound files. Artificial Intelligence (AI) is an incredibly new and expanding field that has been growing more and more as even more research has been done on the subject. As it pertains to Bird Biodiversity, the goal of our AI is to be able to loop through thousands upon thousands of hours of sound recordings, picking out bird calls and categorizing them by what type of bird they are. This is a very critical component of many important scientific studies that need to know how the populations of birds exist in the wild. Our project completes the first part in which we can accurately recognize most bird calls from a given input sound file.
Now, let's take a step by step approach into our algorithm:
Fig 1: Padded Region-of-Interest Example.
Examples of Successful Bird Calls being recognized: (These are images where we have confirmed there to be at least 1 Bird Call)
Examples of Incorrect Bird Calls being recognized: (These are images where we have confirmed there to be NO Bird Calls in them)
Fig 2: Confusion Matrix
Fig 3: ROC/RPC Curves
We believe our algorithm to be fairly successful, and, if we would be able to take it further, we'd like to integrate some of the further ideas. Such as recognizing what bird is what, training it on a much larger dataset, and possibly even incorporating some form of real-time element. While difficult to understand at first, we rapidly began to comprehend our task and it was a fairly smooth experience. We did need to redo some of our ROI-Padding code, but that was a quick and easy fix.
For more information about YOLO, click here. Of course, special thanks to both Matthew Clark and Shree Baligar for both guiding us and giving us such a great project to work on.