Abstract:Sex identification of pigeons is a crucial task in the breeding and pairing process. To enable intelligent sex identification of pigeons, a method based on pigeon vocalizations was proposed and the endpoint detection model EGVAN_VAD and the sex classification model EGVAN, were developed both based on the EGLKA module. The method transformed one-dimensional audio signals into two-dimensional Mel spectrograms as input to the EGVAN_VAD model for detecting pigeon vocal segments, followed by noise reduction by using a thresholding method and a Butterworth low-pass filter to eliminate both steady-state and transient environmental noise. The denoised Mel spectrogram was then used as input for the EGVAN model, which was compared with LSTM, GRU, TDNN, and VAN models for recognizing pigeon vocalizations across different age groups (1~3 months, 3~6 months, and more than six months). Experimental results indicated that the EGVAN_VAD model achieved an accuracy of 93.4% and the recall of 94.3%, with processing time of 10ms per 4-second audio segment, outperforming other endpoint detection models in overall performance. The EGVAN model achieved the highest recognition precision for female and male calls, reaching 90.7% and 90.3%, respectively, with processing time of 14.1ms per 3-second audio segment. For pigeons aged 3~6 months, the identification success rate reached 99.5%, while the average success rate across all age groups was 89.0% following systematic testing. These findings demonstrated that the EGVAN model exhibited excellent performance in recognizing both male and female pigeon calls. The research result can provide a technical reference for applying audio-based methods to intelligent and accurate sex identification in other monomorphic bird species.