Looking for reliable species distribution models for low-density cetacean species: Compared effectiveness of SDMs for G. griseus, G. melas, Z. cavirostris in the Mediterranean Sea based on long-term fixed-transect data

Citation
Arcangeli A, Azzolin M, Babey L, et al (2024) Looking for reliable species distribution models for low-density cetacean species: Compared effectiveness of SDMs for G. griseus, G. melas, Z. cavirostris in the Mediterranean Sea based on long-term fixed-transect data. Aquatic Conservation: Marine and Freshwater Ecosystems 34:e4115. https://doi.org/10.1002/aqc.4115
Abstract

The EU regulative framework for the protection of marine biodiversity and habitats requires the assessment of species' conservation status and the identification of core habitats to design adequate conservation and management plans. However, the identification of distribution range and habitat-use of pelagic large-range, migratory species, such as cetaceans, is challenging. Species distribution models (SDMs) are increasingly used in conservation planning to identify species priority areas. However, the quality of SDMs varies widely depending on the representativeness of data and the appropriateness of the modelling techniques. Since 2007, the Fixed Line Transect Mediterranean Monitoring Network (FLT Med Net) has been continuously monitoring cetaceans throughout the year in the Mediterranean basin using passenger ferries as observation platforms that perform repetitive surveys along fixed trans-border transects. With the aim of defining a standard analytical approach, the data collected by the FLT Med Net on rarer Mediterranean cetacean species (i.e., Grampus griseus, Globicephala melas, Ziphius cavirostris) are used here to assess the performance of commonly used SDMs, including GLM, GAM, GAM-Negative Binomial, GAM-tweedy, MaxEnt and Random Forest. Models were built and evaluated using a total of 296 FLT Med Net sighting data and cross-validated using 145 independent data points. Under testing conditions, almost all methods exhibited good performance, with Random Forest being the best model in several cases. However, when evaluated with the independent dataset, many models yielded inconsistent results or notably low performance. Only MaxEnt demonstrated consistent efficiency and reliability in both cases, showing results less affected by unequal sampling or small sample size. Results confirm the importance of robust standard SDM approaches based on representative data such as the FLT Med Net data to identify reliable priority areas for the species and to assess the long-term coherence and effectiveness of spatial protection measures.