Decoding Insect Song: A Multitask Semisupervised Orthoptera Bioacoustic Classifier
Olga Isupova, Danil Kuzin, Ella Browning, Tom Mills, Steven Reece
- Published
- Jun 11, 2026 — 11:51 UTC
Problem
Existing automated tools for passive acoustic monitoring in ecology are often narrowly trained and lack transferability across species. This paper addresses these limitations by proposing a semi-supervised, multi-task framework specifically for Orthoptera bioacoustics. The authors highlight the need for improved classification methods that can leverage unlabelled data and generalize better across different species, which is particularly relevant given the ecological importance of Orthoptera.
Method
The proposed framework, PULSE, integrates several advanced techniques: weakly-supervised species classification, self-supervised learning on unlabelled field audio, and knowledge distillation from a general-purpose bioacoustic model. The architecture employs a multi-task learning approach, allowing the model to learn from both labelled and unlabelled data effectively. The training process utilizes a combination of labelled datasets for species classification and unlabelled audio for self-supervised learning, enhancing the model’s ability to generalize. The authors do not disclose specific training compute details, but they emphasize the use of active learning to iteratively improve model performance.
Results
PULSE demonstrates significant performance improvements over a state-of-the-art general model across multiple metrics. The macro F1 score achieved by PULSE is 0.21 compared to 0.07 for the baseline, while the Area Under the Curve (AUC) is 0.74 versus 0.45, and Average Precision (AP) is 0.32 compared to 0.19. Furthermore, the implementation of active learning boosts the macro F1 score to 0.34 and AUC to 0.84, indicating a substantial enhancement in classification accuracy. These results suggest that PULSE not only outperforms existing models but also effectively utilizes unlabelled data to improve learning outcomes.
Limitations
The authors acknowledge that while PULSE shows promising results, it is still a semi-supervised approach, which may limit its performance in scenarios with very few labelled examples. Additionally, the reliance on a general-purpose bioacoustic model for knowledge distillation may introduce biases inherent to that model. The paper does not address potential challenges related to the scalability of the framework or its performance across diverse ecological contexts beyond Orthoptera.
Why it matters
The development of PULSE has significant implications for ecological monitoring and biodiversity assessment, as it enables more robust classification of bioacoustic signals, which can lead to better-informed conservation strategies. The learned embeddings not only facilitate species classification but also encode ecologically meaningful structures, which can be explored through an interactive visualization tool. This work contributes to the growing field of bioacoustic monitoring and highlights the potential of semi-supervised learning in ecological applications, as published in arXiv cs.LG.
By Turing Wire editorial staff · Jun 11, 2026 · Editorial standards →
Source: arXiv cs.LG