Abstract
The global network of protected areas has rapidly expanded in the past decade and is expected to cover at least 30% of land and sea by 2030 to halt biodiversity erosion. Yet, the distribution of protected areas is highly heterogeneous on Earth and the social-environmental preconditions enabling or hindering protected area establishment remain poorly understood. Here, using fourteen socioeconomic and environmental factors, we characterize the multidimensional niche of terrestrial and marine protected areas, which we use to accurately establish, at the global scale, whether a particular location has preconditions favourable for paestablishment. We reveal that protected areas, particularly the most restrictive ones, over-aggregate where human development and the number of non-governmental organizations are high. Based on the spatial distribution of vertebrates and the likelihood to convert non-protected areas into strictly protected areas, we identify ‘potential’ versus ‘unrealistic’ conservation gains on land and sea, which we define as areas of high vertebrate diversity that are, respectively, favourable and unfavourable to protected area establishment. Where protected areas are unrealistic, alternative strategies such as other effective area-based conservation measures or privately protected areas, could deliver conservation outcomes.
Similar content being viewed by others
Introduction
Protected areas (PAs) are a core management response to the increasingly pervasive and multiple impacts of global change on species1,2 and human well-being3,4, although weak regulations4, financial limitations5 and conflicts6 can hinder their conservation benefits. To face the rapid expansion of human activities, even towards the last wilderness areas7, and the unprecedented rate of biodiversity loss8,9, the global coverage of PAs has markedly increased over the last decades, with more than 17% of the world’s terrestrial surface and 8% of the ocean formally designated as protected10. Recently, the 15th Conference of Parties to the UN Convention on Biological Diversity (COP 15) adopted the “Kunming-Montreal Global Biodiversity Framework” (GBF) committed to protect at least 30% of both land and sea before 2030. Yet, the current protection coverage is highly heterogeneous on Earth10, and some advocate that more ambitious and relevant targets are urgently needed to safeguard biodiversity and nature’s contribution to people1,2.
This race to protect more land and sea does not necessarily benefit biodiversity and people equally everywhere11,12, nor are the costs arising from PAs shared equitably around the world13,14 with identified barriers and levers to increase protection coverage15,16. Global assessments of PA coverage10 may thus overshadow socioeconomic, habitat and environmental gaps, which can be detrimental to poverty alleviation17, climate mitigation18 and species conservation19. Indeed, PA designations tend to be biased toward higher-income countries20 and places that are remote or of low value to agriculture21 and fisheries22. Yet, while most research has focused on evaluating the potential benefits of existing and future PA networks for nature and people2,23, less attention has been given to a better understanding of the combination of socioeconomic and environmental enabling conditions to PA establishment. Indeed, whilst the influence of some factors like geopolitical conflicts24, human development20, land use25, cultural context26 and elevation27 on PA establishment have been studied, other potential barriers and levers to increase protection coverage have been overlooked, like the level of dependency on natural resources and the number of environmental Non-Governmental Organisations (NGOs). In other words, identifying the main socioeconomic and environmental preconditions of PA establishment worldwide should provide critical insights into the potential viability of efforts for new PA designations to achieve the ambitious target of at least 30% coverage in the short term.
Here, we quantitatively assess the ‘global niche’ of terrestrial and marine PAs by representing the distribution of all IUCN-recognised PAs in a multidimensional social-environmental space encompassing local and national conditions15. By analogy to species niches that define their suitable habitats, so determine their extinction and colonisation patterns28, PA niches are expected to predict their new establishment patterns but also downgrading, downsizing, and degazettement events29. We then build machine learning models predicting the relative likelihood of PA occurrence and identifying PA-enabling conditions on both land and sea. Finally, model predictions are used to pinpoint non-protected areas that are ‘potential gains’ and ‘unrealistic gains’ for biodiversity conservation globally, i.e., areas that would benefit most to vertebrates and that are respectively favourable and unfavourable to PA establishment. Ultimately, a better understanding of local opportunities, levers and barriers would guide strategic interventions towards realistic and operational long-term positive conservation outcomes.
In this work, we show that PA distribution is highly heterogeneous on Earth, even within a single country, but highly predictable from a set of social-environmental factors. We highlight a few clustered areas meeting favourable PA enabling conditions while being critical for biodiversity conservation, i.e. ‘low-hanging fruit’ areas, but widespread unrealistic high conservation gains, so ‘high-hanging fruit’ areas.
Results and discussion
Underrepresented socioeconomic and environmental conditions
To conduct our assessment, we first defined a global 10 × 10 km grid on both land and sea. We then selected every grid cell that overlapped, even partially, with a PA of the IUCN category from I to VI (see ‘Methods’). We also randomly selected the same number of grid cells outside PAs to obtain a representative set of non-protected areas globally and a balanced design in our statistical analyses (see ‘Methods’). All marine cells were constrained to be included within countries’ exclusive economic zones (EEZs), where 98.8% of marine PAs are located according to the Marine Protection Atlas, and to account for socioeconomic factors at the local or country level. We thus ended up with a total of 517,450 cells on land and 421,746 cells at sea, which are representative of all biomes on Earth.
For each cell, we extracted the value of 14 factors for both land and sea (see ‘Methods’ and Supplementary Data 1) to provide a spatially explicit analytical framework of local social–environmental conditions15. These factors were carefully selected in relation to their potential association with the presence of protected areas (see rationales and hypotheses in Tables 1 and 2) and their relative independence (Pearson correlation between −0.7 and 0.7). Seven factors were related to the socioeconomic context, which can be local (Gross Domestic Product or GDP, Accessibility in terms of travel time to the nearest city and Human footprint) or national (human development index or HDI, number of conflicts, number of Non-Governmental Organisations or NGOs and Dependency on natural resources). Seven other factors were related to the local environmental context (island, atmospheric or sea surface temperature, primary productivity, elevation or depth, distance to the coast, along with distance to seamounts and salinity for marine cells and precipitation and freshwater coverage for terrestrial cells). In the rare cases where values for these factors were not available (i.e., 0.7% of cells on land and 1.2% at sea), we inferred the missing values using a machine learning algorithm (see ‘Methods’).
To visualise the global grid cell distribution according to the level of protection in a socioeconomic and environmental multidimensional space, we first performed an ecological niche factor analysis (ENFA), a linear ordination technique30. Considering all IUCN PA categories, we show that while the social-environmental space of unprotected areas broadly overlaps with that of protected areas for both land and sea, there are consistent gaps (Fig. 1). In other words, the global network of PAs fails to cover all socioeconomic and environmental conditions on Earth. Moreover, the density of PAs along the first ENFA axis is markedly different from that of unprotected areas on both land (Kolmogorov–Smirnov distance = 0.32 vs. 0.07 along the second axis) and sea (Kolmogorov–Smirnov distance = 0.52 vs. 0.24 along the second axis). Terrestrial and marine PAs tend to aggregate on positive values along the first ENFA axis, so where the level of HDI and the number of NGOs are high but where the number of conflicts, human footprint and dependency on natural resources are low (Fig. 1) like the Cap Corse and Agriate marine protected area (France, IUCN category V) located in Corsica, a Mediterranean island (Fig. 2b). Yet, more restrictive protected areas are located in under-represented social-environmental contexts like the Viñales valley (Cuba) which is of IUCN Category II (Fig. 2a).
IUCN category I PAs (i.e., the most restrictive, or most strictly protected) only cover a reduced fraction of global social-environmental conditions, particularly at sea (Fig. 1). These restrictive PAs tend to disproportionately occur on positive coordinates along the first ENFA axis compared to non-protected areas (Kolmogorov–Smirnov distance = 0.50 on land and 0.45 at sea vs. 0.22 on land and 0.29 at sea along the second axis). This clustering of PAs and gaps of unprotected conditions are even more pronounced in the socioeconomic multidimensional space (Supplementary Fig. 1) than in the environmental space (Supplementary Fig. 2).
Our findings echo previous studies suggesting that PA distribution is highly heterogeneous on Earth10, even within a single country31. Indeed, PA design has historically rarely been systematic or strategic, but rather opportunistic or political, leaving many habitats, environments, species and evolutionary lineages underrepresented19,32. For instance, a positive bias towards high elevation areas27, unfertile lands21 and islands33 have been shown for terrestrial PAs. The lack of national or international coordination has also induced a contrast in protection coverage between areas with different economic levels or population densities33,34. Here, we reveal that the global heterogeneous distribution in PAs is even more pronounced across social and economic contexts than across the range of environmental conditions, with many gaps for the most restrictive PAs (IUCN category I). These gaps are highly detrimental for both nature, since most biodiversity hotspots occur in the tropics35,36, and people, given the well-being3,4 and socioeconomic17,37 outcomes provided by PAs which are currently underrepresented in most African countries10 where such management tools and potential benefits are the most needed38.
Prediction of areas under protection
Since we randomly chose unprotected areas, i.e., cells outside the global network of IUCN-recorded PAs, and some PAs are not reported39, we only have presence-background data40. Therefore, we do not predict probabilities of PA occurrences but instead, relative likelihoods of PA occurrences in each grid cell using the same set of 14 social-environmental factors and a machine learning algorithm (random forest or RF) accounting for covariation across factors, potential nonlinearities, and spatial hierarchy of factors (e.g., cells within countries). To avoid underestimating the model predictive error related to the spatial autocorrelation in data41, particularly due to the inherent contiguity of protected cells in large PAs, we performed a 10-fold spatial cross-validation procedure by creating spatially independent training and testing folds based on a minimum distance of 1000 km42 (see ‘Methods’). We also performed sensitivity analyses where we removed the largest PAs but also those belonging to the lowest and highest HDI countries to test the robustness of our findings (see ‘Methods’). We did not detect any significant spatial autocorrelation in the residuals of the models.
The relative likelihood of PA occurrence, irrespective of the IUCN category, is accurately predicted on land (spatially cross-validated accuracy = 0.74 ± SD = 0.02 and true skill statistic TSS = 0.47 ± SD = 0.04), with socioeconomic factors having on average more relative standardised importance (0.63) than environmental factors (0.59) (Fig. 3a, Supplementary Fig. 3). The relative likelihood of marine PA occurrence is even more accurately predicted (spatially cross-validated accuracy = 0.88 ± SD = 0.05 and TSS = 0.76 ± SD = 0.09) with socioeconomic factors being on average even more important (0.66) than environmental factors (0.43) (Fig. 3b, Supplementary Fig. 4). For both land and sea, HDI, accessibility, the number of NGOs, the number of conflicts, dependency on natural resources, primary productivity and annual temperature are the most important factors correlated to relative PA occurrences. On average, the relative standardised importance of national socioeconomic factors is higher than that of local socioeconomic factors on both land (0.81 vs. 0.47) and sea (0.69 vs. 0.56), explaining the high heterogeneity of current protection coverage among countries10.
When applying the same analysis to the most restrictive PAs (IUCN category I), RF models reach an even higher predictive performance both on land (spatially cross-validated accuracy = 0.91 ± SD = 0.02 and TSS = 0.46 ± SD = 0.06) and at sea (spatially cross-validated accuracy = 0.94 ± SD = 0.03 and TSS = 0.64 ± SD = 0.21), with again a dominance of socioeconomic factors over environmental ones and of national factors over local ones (Fig. 3c, d, Supplementary Figs. 3 and 4). These models show that, even in the complete absence of biodiversity or land use information, the presence of highly restrictive protected areas both on land and at sea is highly predictable from a small set of well-selected social-environmental factors. Among these factors, the predominant ones (importance > 0.8) for land are HDI and temperature and at sea are accessibility in terms of travel time and temperature. The sensitivity analyses show that the models are very robust to the removal of the largest PAs and those belonging to the lowest and highest HDI countries (Supplementary Fig. 5), suggesting that the inherent omission of PAs in IUCN-reported lists and the overrepresentation of protected grid cells belonging to the largest PAs have little impact on our results and conclusions.
Partial response plots reveal remarkable consistencies in the factors associated with the relative likelihood of PA occurrence both on land and at sea (Fig. 4, Supplementary Fig. 6). Indeed, the relative likelihood of having protection coverage (whatever the IUCN category) in a given area is positively linked to the level of HDI and the number of NGOs, but negatively to the number of conflicts, the distance to the coast, the dependency on natural resources, human footprint and annual temperature. The positive link with HDI was already demonstrated in coastal areas20 but also holds for terrestrial areas. Yet, this relationship is non-linear, with a positive contribution of HDI on the relative PA occurrence only apparent above a minimum threshold of 0.7, corresponding to the developed countries43. It suggests that, given the small increase in HDI for the poorest countries in recent decades44, development alone cannot offer a short-term solution to increase PA coverage. In contrast, for medium-HDI countries, we can expect a rapid rise in protection coverage even for a small increase in HDI, especially at sea. This projection is only valid under a space-for-time assumption since our models do not include any temporal trend but only spatial information. Still, these results echo the call to engage developing countries in strategies skipping the intermediate stage of overexploitation during the growing phase to rapidly reach the developed state (HDI > 0.7) when the Living Planet Index, reflecting the state of vertebrate species, begins to recover with development43. Below this threshold of 0.7, any increase in HDI may be detrimental to biodiversity since it may increase the overexploitation of natural resources43.
We show that beyond countries’ HDI, the socioeconomic context is a key precondition of PA establishment, with the presence of conflicts and high human footprint both acting as potential barriers to achieving conservation targets. This socioeconomic context is not only relevant at the country level to explain PA establishment, as shown by other studies33, but also at the local scale (10 km resolution) with accessibility being the most important factor in models predicting the relative likelihood of marine PA occurrence (Fig. 3b, d). By contrast, the link with the number of NGOs is positive for both land and sea, suggesting a potential lever to PA establishment, owing to enhanced financial and management capacity15, with a marked threshold of 50 NGOs at the country scale for boosting marine PA establishment. However, NGOs may also be attracted to areas with a high-density of PAs to facilitate fundraising and help reach positive outcomes, so we rather highlight correlation instead of causation here.
There are some marked differences between land and sea when looking at the secondary factors explaining the relative likelihood of PA occurrences. Elevation was positively related to terrestrial PA relative occurrence (e.g., lots of PAs in Europe are located in the mountains)27, while water depth was negatively correlated to the relative occurrence of marine PAs, highlighting a global positive bias of protection coverage on the coast and a pelagic deficit within countries’ EEZs2. Yet, remote areas, characterised by their low accessibility, are more likely to be protected than accessible coastal areas, a known bias towards residual marine PAs45, while accessibility has no visible effect on land.
When focusing on the most restrictive IUCN category I PAs, partial responses show similar relationships (Fig. 4, Supplementary Fig. 6). The number of conflicts, the number of NGOs, and dependence on natural resources are the strongest correlates of PA relative occurrence, albeit to a lower extent than when all IUCN categories are considered. However, HDI is no longer important on sea, but is still a strong factor on land, suggesting an overrepresentation of restrictive terrestrial PAs in developed countries. At sea, accessibility and dependency on natural resources have the strongest negative relation with relative IUCN PA occurrence, illustrating strong barriers to PA establishment along coastlines where human density is high and marine resources are essential for livelihood or food security. In contrast, accessibility and human footprint have no negative relationship with IUCN I PA occurrence on land, suggesting fewer or weaker barriers due to human pressure in restrictive terrestrial PA establishment. As an alternative explanation, old restrictive terrestrial PAs, like US national parks, were initially created in remote areas that have experienced substantial increases in housing density since 194046. Therefore, even in populated areas (e.g., agricultural plains), IUCN category I PAs appear to be viable management options if the socioeconomic context is favourable (high number of NGOs and HDI but also low dependency on natural resources and number of conflicts), suggesting that highly restrictive terrestrial PAs are not fatally constrained to ‘high and far’ areas21. Yet, this expansion may be compromised by projected land use and parochialism25, since large-scale conservation actions can be locally constrained by political and stakeholder units.
Potential and unrealistic conservation gains
Taking advantage of our accurate predictive models, we can assess the relative likelihood that unprotected areas worldwide could become protected based on their social-environmental context. In doing so, we aim to document whether and where the enabling social-environmental conditions pre-exist for PA implementation outside of the existing global PA network. Since national socioeconomic factors explain the relative likelihood that unprotected areas are turned into IUCN I PAs (e.g. HDI), we estimated and mapped this averaged likelihood per country on both land and sea (Fig. 5). We show that only some European countries and New Zealand reach a mean probability of protecting current terrestrial unprotected areas higher than 50% while this value is lower than 10% for most countries except in North America and some other countries like Australia, Botswana, Japan or Mongolia. For marine unprotected areas, this mean probability of establishing IUCN I protection is very low (<10%) in most countries except Chile, Mexico and some others, suggesting that reaching more than 10% coverage by the most restrictive marine protected areas, beyond the current effort, is unrealistic for most countries so the high sea may represent the next big challenge to protect more than 30% of marine areas before 2030 including 10% under full protection2.
To determine whether areas meeting favourable enabling conditions, i.e., ‘low-hanging fruit’ areas, match the most critical areas for biodiversity conservation, we used the global distributions of terrestrial mammals and birds but also marine fishes (see ‘Methods’) to rank the non-protected areas according to their conservation benefits based on the coverage of species range size, if turned into protected areas. We collected, for instance, the global distribution of the Alpine ibex (Capra ibex) and the Galapagos shark (Carcharhinus galapagensis) (Fig. 2c, d). We considered here only PAs of IUCN category I since these terrestrial and marine vertebrate taxa tend to benefit only from highly restrictive protection measures47,48,49. More precisely, we implemented a complementarity-, balanced-, and priority-based ranking of areas with the Zonation software for spatial conservation planning (see Methods)2,50. It selects areas that are complementary and jointly achieve a well-balanced representation across all species. The goal is not to propose a global prioritisation of new PAs but rather to identify where conservation benefits may be ‘easier’ to reach given the social-environmental context (hereafter called ‘potential gains’), when compared to areas where such benefits are likely to be ‘harder’ to be achieved as enabling conditions are not met (hereafter called ‘unrealistic gains’). More precisely, we define as ‘potential gains’ the unprotected areas with both a high relative likelihood of being protected (>90%) and a high conservation prioritisation ranking (top 10%) corresponding to ‘low-hanging fruits’ in conservation51. On the opposite, ‘unrealistic gains’ are the unprotected areas with high conservation priority (top 10%) but with a low relative likelihood of being protected (<10%) given their social-environmental context (Fig. 6a, b), so ‘high-hanging fruits’. Here, the social-environmental factors are considered more like proxies of a wider set of conditions enabling or preventing PA establishment than causal determinants, independent of the biodiversity attributes.
We observe that potential high conservation gains on land are highly clustered in space (Fig. 6c) and are, for different reasons, almost absent in continental Africa, Europe and South Asia. In European countries, most unprotected areas are not top-ranked in terms of bird and mammal species protection, even if the social-environmental context is favourable to the establishment of PAs. These areas are thus potential low gains for the conservation of terrestrial vertebrates. Conversely, many areas of Africa and South Asia can provide high conservation gains if protected, but the socio-environmental context tends to restrict new PAs establishment, so these areas correspond to unrealistic high conservation gains for terrestrial vertebrate biodiversity. Potential high conservation gains are mainly located along the North Pacific coast of the American continent, in Amazonia, in Southern Australia and in Northern Asia due to relatively fewer local social-environmental constraints but important conservation gains for terrestrial vertebrates. For some countries (US and Australia) the enabling conditions mainly stem from socioeconomic factors like HDI while, for others (some parts of Columbia and Philippines), enabling conditions mostly belong to factors like high elevation and low human pressure (Fig. 4). However, despite an overall favourable national context in the US and Australia, we also observe high within-country heterogeneity (Fig. 6c, d) suggesting the importance of local factors, potentially acting as barriers to PA establishment such as accessibility or distance to the coast.
Unrealistic but high conservation gains on land are mainly located in South Asia, Africa and South America. In these areas, despite the presence of many vertebrate species, the socioeconomic context tends to be unfavourable for the establishment of PAs (Fig. 5). In such areas, other effective area-based conservation measures (OECM) than PAs should be favoured since they can still deliver positive ecological outcomes while being more inclusive with locals52. Yet, care should be taken to avoid the risks associated with OECMs53. Alternatively, privately protected areas (PPAs) could also complement or overtake state-government PAs to achieve unrealistic but high conservation gains39. Indeed, our approach identifies areas where long-term benefits for conservation will certainly require more inclusive alternatives to classical highly restrictive IUCN PAs or regional cooperation to help countries with unfavourable socioeconomic conditions to protect their land54 and contribute to reaching the target of 30% protection coverage.
At sea, we identified very few potential high conservation gains, which are mostly located in the Eastern Pacific and North-Eastern Australia, but many unrealistic high gains (Fig. 6d). Unprotected marine areas with a favourable social-environmental context for the establishment of new PAs would not provide important conservation gains for fish biodiversity in return. Conversely, unprotected marine areas where biodiversity gains are expected to be the highest have a low relative likelihood to become protected given the local socioeconomic context (Fig. 6b). These unrealistic high conservation gains are widespread along the coast of many countries (mainly in Africa, Asia and America), except Australia and, to a lesser extent, Northern Europe where local conditions are more favourable to PAs establishment (Fig. 5). Therefore, the short-term protection of marine fishes seems more challenging (few ‘low-hanging fruits’) than the protection of terrestrial vertebrates given the inherent difficulty to establish strictly restrictive PAs (IUCN category I) in marine areas of high conservation gains. Alternative, easier-to-implement solutions are urgently needed in these EEZs to halt biodiversity decline, such as other effective area-based conservation tools and non-spatial sustainable management of marine resources52,55; yet their protection effectiveness remains to be demonstrated53.
More generally, the global conservation target of 30% coverage by protected areas before 2030, including 10% of fully protected areas, seems hardly achievable without strong efforts to overcome socioeconomic barriers or propose alternative and more inclusive solutions (OECMs and PPAs), especially for marine waters.
Methods
Global grid cells of protected and unprotected areas
The global dataset was integrated into a global 10 × 10 km resolution grid covering both land and sea. We intersected this initial grid with the GADM (global administrative areas-GADM version 3.6) for the terrestrial part and the exclusive economic zone boundaries (Marin Regions ‘maritime boundaries’ version 11) for the marine cells. As a result, a cell that intersects with either of these two previous databases is either marine or terrestrial. For the coastal environment we assigned the ecosystem (terrestrial or marine) that overlapped more than 50% of the cell surface.
These land and sea cells were intersected with the Word Database on Protected Areas (April 2020). A cell was considered as having some protection if a PA partly overlapped the cell. This resulted in 258,725 cells on land and 210,873 cells at sea that have some protection coverage. For each protected cell, we reported the IUCN category between I (highly restrictive or no-take) and VI (least restrictive IUCN category). If, within a protected cell, several PAs with different IUCN categories were present, we associated the more restrictive category with the cell. We randomly selected the same number of cells among the remaining unprotected cells on both land and sea to obtain a balanced design.
Socioeconomic and environmental factors
We associated to each grid cell a set of 14 socioeconomic and environmental factors known for their potential to explain or to correlate with the presence of PAs on both land and sea (Tables 1 and 2). We chose these factors to make our analyses spatially explicit with probabilistic models of conservation action that integrate theoretically supported social-environmental correlates of PA establishment15. These factors are detailed in Supplementary Data 1.
Modelling the relative likelihood of protected area occurrence
Using our presence-background data, we modelled relative likelihoods of PA occurrences on both land and sea40. Prior to model fitting, we evaluated the collinearity between factors by computing Pearson correlation coefficients. All pairs of factors were weakly or moderately correlated on both land and sea (−0.7 > correlation > +0.7). Prior to fitting, we also filled all the gaps (0.7% of cells on land and 1.2% at sea) in the factor data using a Random Forest model designed specifically to impute missing values. For this, we used the missForest function implemented in the missForest R package56. Among the many statistical alternatives to impute values that are missing in large datasets using the other factors as predictors, we choose this machine learning technique since it performs well and makes very few assumptions about structural aspects of the data57.
Then, we built Random Forest (RF) models to relate the presence–absence of PA in a given cell (all IUCN categories and IUCN I only) on land and sea to the 7 socioeconomic and 7 environmental factors using the ranger function from the ranger R package, which provides a fast implementation of RF, particularly suited for high dimensional data58. So, RF was used as a classifier with two classes (presence and absence of PA) for each grid cell. We used 500 trees in each RF that make an ensemble of independent decision trees to improve model accuracy59.
We then calibrated RF predictions to obtain a continuous distribution of relative PA occurrences ranging homogeneously between 0 and 1, thus allowing its interpretation in a probabilistic way60. We applied a procedure based on flexible generalised additive binomial models (GAM) to observed data (here, presence-absence of PA) as a function of response-scale model fits60 using the gam function from the gam R package. This procedure avoids overfitting, which potentially leads to the perfect separation of classes in predictions from algorithms like RF. We ultimately obtained a calibrated relative likelihood of being protected for all unprotected grid cells according to 14 social-environmental factors.
Models’ performance and factors’ importance
Model performances were assessed using a 10-fold spatial cross-validation procedure (using 90% of grid cells for the training and 10% for the testing) and two classical metrics: accuracy, which is the proportion of correct predictions, and the true skill statistic (TSS), which is a simple and intuitive measure considering both sensitivity (i.e., the proportion of PA presences accurately predicted) and specificity (i.e., the proportion of PA absences accurately predicted). When accuracy ranges between 0 and 1, TSS ranges between −1 and 1, with 1 indicating perfect agreement and values lower than zero indicating a performance worse than random61. These metrics were calculated from the confusion matrix using the confusionMatrix function from the caret R package.
Since our grid cells, split between our train and test datasets, are not independent with presences tending to aggregate over space due to large PAs, there is the risk of obtaining invalid overoptimistic results, particularly with machine learning algorithms41,42. We thus created spatially independent training and testing blocks on land and sea based on a pre-specified and conservative distance of 1000 km to perform a spatial cross-validation procedure. For this, we used the spatialBlocks function from the blockCV package v2.1.5. Finally, we tested for the presence of spatial autocorrelation in model residuals as a function of distance between grid cells with Moran’s I statistic.
We also calculated factor importances for each cross-validation procedure, using the varImpRanger function from the varImp R package, based on the area under the curve (AUC) loss when a factor is randomly permuted, using the multiclass.AU1P measure dedicated to binary classifiers while considering the a priori distribution of the classes62. We also estimated impurity values as alternative measures of factor importance. We displayed partial response plots to show the effect of each factor while controlling for the others, so we kept constant at their means, using the partial and plotPartial functions from the dpd R package.
Sensitivity analyses
We performed a first sensitivity analysis to test the robustness of our results to the overweight, in terms of grid cell number, of very large PAs. For this, we removed 27 PAs on land and 2 at sea since they cover more than 10% of the total protected area.
Since we certainly missed PAs from low-HDI countries due to a lack of IUCN declaration, we performed a second sensitivity analysis where we removed PAs from countries with HDI in the bottom 10% values, so 393 PAs on land and 108 at sea. On the other hand, high-HDI countries may be overrepresented in IUCN I PAs, so we performed a third sensitivity analysis where we removed PAs from countries with HDI in the top 10% values, totalling 10,254 PAs on land and 1094 at sea.
We conducted the same sensitivity analyses for the model focusing on restrictive PAs (IUCN I). These sensitivity analyses resulted in the removal of 5155 PAs on land and 791 at sea for high HDI values, 10 PAs on land and 2 at sea for lower HDI values and 9 PAs on land and 2 at sea for large PAs.
All analyses can be reproduced through our GitHub repository.
Global biodiversity maps of vertebrates
Terrestrial mammals and birds
We used the IUCN range maps for 5529 terrestrial mammal species and the BirdLife range maps for 10,709 bird species (breeding ranges only)63. Extinct (EX) and Extinct in the Wild (EW) species were removed. For birds, polygons of the Spheniscidae family, which are mostly marine birds, were also removed. Geographic ranges were converted to a 10 km resolution Mollweide equal-area grid projection (i.e., 100 km2 cells). This resolution was previously validated as the finest justifiable for these data globally without incurring false presences which would impose a geographic and ecological bias on spatial prioritisation solutions64.
Marine fishes
We collated fish species data from the Ocean Biogeographic Information System (OBIS). We inventoried 16,238,200 occurrence records from 34,883 entries. We cleaned the data by identifying the synonyms, misspellings and rare species (only one occurrence) and by restraining the dataset to species present in the marine environment according to FishBase. This resulted in a set of 11,503,257 occurrences for 11,345 marine fish species around the world. We reconstructed distribution maps for each species, defined as the convex polygon surrounding the area where each species was observed. We divided the resulting polygons into four parts across the world to integrate possible range discontinuity between the two hemispheres and the Atlantic and Pacific Oceans (e.g., species with antitropical distribution). We refined each species distribution map by removing areas where maximal depths fell outside the minimum or maximum known depth range of the species. As the OBIS database did not well represent the tropical fishes, we merged the database with 6316 fish species censused on reefs65. We finally obtained a world database containing 14,050 fish species that we aggregated on a 1° resolution grid covering all oceans. We transformed individual species shapefiles into equal-area raster grids at a resolution of 1°.
Spatial conservation prioritisation
We used the Zonation software (version 4) for spatial conservation prioritisation based on vertebrate species distributions on land (mammals and birds) and at sea (fishes). This allowed us to rank unprotected cells in their ability to complete the existing PA network in order to provide an optimised global representative coverage for vertebrate conservation. Zonation uses a reverse stepwise heuristic algorithm that removes cells that contribute to the smallest marginal losses in the representation of biodiversity features50. More precisely, we identified the optimal expansion of the existing PA network using the core-area zonation algorithm, which is based on the ranking of the most important occurrences of a feature in a given cell. Thus, Zonation gives the highest priority to locations with high occurrences of rare species, maximising both the representation of all species and their proportional coverage. Zonation resulted in a prioritisation hierarchy; these priority values were then used to identify locations that contribute most to biodiversity representation, so the unprotected cells of high conservation gain.
Reporting summary
Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.
Data availability
Data sources used in the study are listed in Supplementary Data 1, while the compiled and processed datasets can be found on GitHub and at https://doi.org/10.5281/zenodo.11183846. All these data are freely accessible, and we followed the conditions to access them.
Data underlying Figures are provided in this paper and can be found at https://doi.org/10.5281/zenodo.11183846.
Code availability
Codes to draw the Figures are provided in GitHub.
References
-
Zeng, Y., Koh, L. P. & Wilcove, D. S. Gains in biodiversity conservation and ecosystem services from the expansion of the planet’s protected areas. Sci. Adv. 8, eabl9885 (2022).
-
Sala, E. et al. Protecting the global ocean for biodiversity, food and climate. Nature 592, 397–402 (2021).
-
Naidoo, R. et al. Evaluating the impacts of protected areas on human well-being across the developing world. Sci. Adv. 5, eaav3006 (2019).
-
Ban, N. C. et al. Well-being outcomes of marine protected areas. Nat. Sustain. 2, 524–532 (2019).
-
Gill, D. A. et al. Capacity shortfalls hinder the performance of marine protected areas globally. Nature 543, 665–667 (2017).
-
Daskin, J. H. & Pringle, R. M. Warfare and wildlife declines in Africa’s protected areas. Nature 553, 328 (2018).
-
Beyer, H. L., Venter, O., Grantham, H. S. & Watson, J. E. M. Substantial losses in ecoregion intactness highlight urgency of globally coordinated action. Conserv. Lett. 13, e12692 (2020).
-
Yan, H. F. et al. Overfishing and habitat loss drive range contraction of iconic marine fishes to near extinction. Sci. Adv. 7, eabb6026 (2021).
-
Cantera, I. et al. Low level of anthropization linked to harsh vertebrate biodiversity declines in Amazonia. Nat. Commun. 13, 3290 (2022).
-
Maxwell, S. L. et al. Area-based conservation in the twenty-first century. Nature 586, 217–227 (2020).
-
Cinner, J. E. et al. Meeting fisheries, ecosystem function, and biodiversity goals in a human-dominated world. Science 368, 307–311 (2020).
-
Gatiso, T. T. et al. Sustainable protected areas: synergies between biodiversity conservation and socioeconomic development. People Nat. 4, 893–903 (2022).
-
Schleicher J. et al. Protecting half of the planet could directly affect over one billion people. Nat. Sustain. 2, 1094–1096 (2019).
-
Appleton, M. R. et al. Protected area personnel and ranger numbers are insufficient to deliver global expectations. Nat. Sustain. 5, 1100–1110 (2022).
-
Williamson, M. A., Schwartz, M. W. & Lubell, M. N. Spatially explicit analytical models for social–ecological systems. Bioscience 68, 885–895 (2018).
-
Schultz, M. et al. A framework to identify barriers and levers to increase the levels of protection of marine protected areas. One Earth 5, 987–999 (2022).
-
Kandel, P., Pandit, R., White, B. & Polyakov, M. Do protected areas increase household income? Evidence from a Meta-Analysis. World Dev. 159, 106024 (2022).
-
Jankowska, E., Pelc, R., Alvarez, J., Mehra, M. & Frischmann, C. J. Climate benefits from establishing marine protected areas targeted at blue carbon solutions. Proc. Natl Acad. Sci. USA 119, e2121705119 (2022).
-
Hanson, J. O. et al. Global conservation of species’ niches. Nature 580, 232–234 (2020).
-
Fox, H. E. et al. Explaining global patterns and trends in marine protected area (MPA) development. Mar. Policy 36, 1131–1138 (2012).
-
Joppa, L. N. & Pfaff, A. High and far: biases in the location of protected areas. PLoS ONE 4, e8273 (2009).
-
Devillers, R. et al. Reinventing residual reserves in the sea: are we favouring ease of establishment over need for protection? Aquat. Conserv. 25, 480–504 (2015).
-
O’Connor, L. M. J. et al. Balancing conservation priorities for nature and for people in Europe. Science 372, 856 (2021).
-
Hammill, E., Tulloch, A. I. T., Possingham, H. P., Strange, N. & Wilson, K. A. Factoring attitudes towards armed conflict risk into selection of protected areas for conservation. Nat. Commun. 7, 11042 (2016).
-
Montesino Pouzols, F. et al. Global protected area expansion is compromised by projected land-use and parochialism. Nature 516, 383–386 (2014).
-
Masud, M. M., Othman, A., Akhtar, R. & Rana, M. S. The underlying drivers of sustainable management of natural resources: the case of marine protected areas (MPAs). Ocean Coast. Manag. 199, 105405 (2021).
-
Elsen, P. R., Monahan, W. B. & Merenlender, A. M. Global patterns of protection of elevational gradients in mountain ranges. Proc. Natl Acad. Sci. USA 115, 6004 (2018).
-
Broennimann, O. et al. Distance to native climatic niche margins explains establishment success of alien mammals. Nat. Commun. 12, 2353 (2021).
-
Albrecht, R. et al. Protected area downgrading, downsizing, and degazettement (PADDD) in marine protected areas. Mar. Policy 129, 104437 (2021).
-
Hirzel, A. H., Hausser, J., Chessel, D. & Perrin, N. Ecological-niche factor analysis: how to compute habitat-suitability maps without absence data? Ecology 83, 2027–2036 (2002).
-
Claudet, J., Loiseau, C. & Pebayle, A. Critical gaps in the protection of the second largest exclusive economic zone in the world. Mar. Policy 124, 104379 (2021).
-
Mouillot, D. et al. Global marine protected areas do not secure the evolutionary history of tropical corals and fishes. Nat. Commun. 7, 10359 (2016).
-
Mouillot, D. et al. Global correlates of terrestrial and marine coverage by protected areas on islands. Nat. Commun. 11, 4438 (2020).
-
Claudet, J., Loiseau, C., Sostres, M. & Zupan, M. Underprotected marine protected areas in a global biodiversity hotspot. One Earth 2, 380–384 (2020).
-
Robuchon, M. et al. Revisiting species and areas of interest for conserving global mammalian phylogenetic diversity. Nat. Commun. 12, 3694 (2021).
-
Tittensor, D. P. et al. Global patterns and predictors of marine biodiversity across taxa. Nature 466, 1098–U1107 (2010).
-
Marcos, C. et al. Reviewing the ecosystem services, societal goods, and benefits of marine protected areas. Front. Mar. Sci. 8, 504 (2021).
-
Andrello, M. et al. Global mismatch between fishing dependency and larval supply from marine reserves. Nat. Commun. 8, 16039 (2017).
-
Palfrey, R., Oldekop, J. A. & Holmes, G. Privately protected areas increase global protected area coverage and connectivity. Nat. Ecol. Evolut. 6, 730–737 (2022).
-
Guillera-Arroita, G. et al. Is my species distribution model fit for purpose? Matching data and models to applications. Glob. Ecol. Biogeogr. 24, 276–292 (2015).
-
Ploton, P. et al. Spatial validation reveals poor predictive performance of large-scale ecological mapping models. Nat. Commun. 11, 4540 (2020).
-
Roberts, D. R. et al. Cross-validation strategies for data with temporal, spatial, hierarchical, or phylogenetic structure. Ecography 40, 913–929 (2017).
-
Cazzolla Gatti, R. Trends in human development and environmental protection. Int. J. Environ. Stud. 73, 268–276 (2016).
-
Kummu, M., Taka, M. & Guillaume, J. H. A. Gridded global datasets for gross domestic product and human development index over 1990–2015. Sci. Data 5, 180004 (2018).
-
Devillers, R. et al. Residual marine protected areas five years on: are we still favouring ease of establishment over need for protection? Aquat. Conserv. 30, 1758–1764 (2020).
-
Hansen, A. J. et al. Exposure of U.S. National Parks to land use and climate change 1900–2100. Ecol. Appl. 24, 484–502 (2014).
-
Pacifici, M., Di Marco, M. & Watson, J. E. M. Protected areas are now the last strongholds for many imperiled mammal species. Conserv. Lett. 13, e12748 (2020).
-
Boakes, E. H., Fuller, R. A. & McGowan, P. J. K. The extirpation of species outside protected areas. Conserv. Lett. 12, e12608 (2019).
-
Turnbull, J. W., Johnston, E. L. & Clark, G. F. Evaluating the social and ecological effectiveness of partially protected marine areas. Conserv. Biol. 35, 921–932 (2021).
-
Lehtomäki, J. & Moilanen, A. Methods and workflow for spatial conservation prioritization using Zonation. Environ. Model. Softw. 47, 128–137 (2013).
-
Coll, M., Steenbeek, J., Lasram, F. B. R., Mouillot, D. & Cury, P. ‘Low-hanging fruit’ for conservation of marine vertebrate species at risk in the Mediterranean Sea. Glob. Ecol. Biogeogr. 24, 226–239 (2015).
-
Gurney, G. G. et al. Biodiversity needs every tool in the box: use OECMs Comment. Nature 595, 646–649 (2021).
-
Claudet, J. et al. Avoiding the misuse of other effective area-based conservation measures in the wake of the blue economy. One Earth 5, 969–974 (2022).
-
Raymond, C. M. et al. Inclusive conservation and the Post-2020 Global Biodiversity Framework: tensions and prospects. One Earth 5, 252–264 (2022).
-
Reimer, J. M., Devillers, R. & Claudet, J. Benefits and gaps in area-based management tools for the ocean Sustainable Development Goal. Nat. Sustain. 4, 349–357 (2021).
-
Stekhoven, D. J. & Bühlmann, P. MissForest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28, 112–118 (2012).
-
Penone, C. et al. Imputation of missing data in life-history trait datasets: which approach performs the best? Methods Ecol. Evolut. 5, 961–970 (2014).
-
Wright M. N., Ziegler A. ranger: a Fast Implementation of Random Forests for High Dimensional Data in C++ and R. J. Stat. Softw. 77, 1–17 (2017).
-
Breiman, L. Random Forests. Mach. Learn. 45, 5–32 (2001).
-
Dormann, C. F. Calibration of probability predictions from machine-learning and statistical models. Glob. Ecol. Biogeogr. 29, 760–765 (2020).
-
Allouche, O., Tsoar, A. & Kadmon, R. Assessing the accuracy of species distribution models: prevalence, kappa and the true skill statistic (TSS). J. Appl. Ecol. 43, 1223–1232 (2006).
-
Ferri, C., Hernández-Orallo, J. & Modroiu, R. An experimental comparison of performance measures for classification. Pattern Recognit. Lett. 30, 27–38 (2009).
-
Loiseau, N. et al. Global distribution and conservation status of ecologically rare mammal and bird species. Nat. Commun. 11, 5071 (2020).
-
Pollock, L. J., Thuiller, W. & Jetz, W. Large conservation gains possible for global biodiversity facets. Nature 546, 141 (2017).
-
Parravicini, V. et al. Global mismatch between species richness and vulnerability of reef fish assemblages. Ecol. Lett. 17, 1101–1110 (2014).
-
Kinlock Nicole, L. et al. Explaining global variation in the latitudinal diversity gradient: Meta‐analysis confirms known patterns and uncovers new ones. Glob. Ecol. Biogeogr. 27, 125–141 (2017).
-
Barlow, J. et al. The future of hyperdiverse tropical ecosystems. Nature 559, 517–526 (2018).
-
Williams, B. A. et al. Global rarity of intact coastal regions. Conserv. Biol. 36, e13874 (2022).
-
Pontarp, M. et al. The latitudinal diversity gradient: novel understanding through mechanistic eco-evolutionary models. Trends Ecol. Evolut. 34, 211–223 (2019).
-
Pina T. E. N., Carvalho W. D., Rosalino L. M. C., Hilario R. R. Drivers of mammal richness, diversity and occurrence in heterogeneous landscapes composed by plantation forests and natural environments. Forest Ecol. Manag. 449, 117467 (2019).
-
Powers, R. P. & Jetz, W. Global habitat loss and extinction risk of terrestrial vertebrates under future land-use-change scenarios. Nat. Clim. Change 9, 323 (2019).
-
Bohm, M. et al. The conservation status of the world’s reptiles. Biol. Conserv. 157, 372–385 (2013).
-
Costello, M. J. & Chaudhary, C. Marine biodiversity, biogeography, deep-sea gradients, and conservation. Curr. Biol. 27, R511–R527 (2017).
-
Roberts, T. E. et al. Testing biodiversity theory using species richness of reef-building corals across a depth gradient. Biol. Lett. 15, 20190493 (2019).
-
Letessier, T. B. et al. Remote reefs and seamounts are the last refuges for marine predators across the Indo-Pacific. PLOS Biol. 17, e3000366 (2019).
-
Koehler, B., Erlandsson, M., Karlsson, M. & Bergström, L. Species richness and functional attributes of fish assemblages across a large-scale salinity gradient in shallow coastal areas. Biogeosciences 19, 2295–2312 (2022).
-
Sosa-Lopez, A., Mouillot, D., Ramos-Miranda, J., Flores-Hernandez, D. & Chi, T. D. Fish species richness decreases with salinity in tropical coastal lagoons. J. Biogeogr. 34, 52–61 (2007).
-
Auricht, H., Mosley, L., Lewis, M. & Clarke, K. Mapping the long-term influence of river discharge on coastal ocean chlorophyll-a. Remote Sens. Ecol. Conserv. 8, 629–643 (2022).
-
Marinesque, S., Kaplan, D. M. & Rodwell, L. D. Global implementation of marine protected areas: Is the developing world being left behind? Mar. Policy 36, 727–737 (2012).
-
Bernauer, T. & Betzold, C. Civil society in global environmental governance. J. Environ. Dev. 21, 62–66 (2012).
-
Nicolle, S. & Leroy, M. Advocacy coalitions and protected areas creation process: case study in the Amazon. J. Environ. Manag. 198, 99–109 (2017).
-
Loucks, C., Ricketts, T. H., Naidoo, R., Lamoreux, J. & Hoekstra, J. Explaining the global pattern of protected area coverage: relative importance of vertebrate biodiversity, human activities and agricultural suitability. J. Biogeogr. 35, 1337–1348 (2008).
-
Sanderson, E. W. et al. The human footprint and the last of the wild: the human footprint is a global map of human influence on the land surface, which suggests that human beings are stewards of nature, whether we like it or not. Bioscience 52, 891–904 (2002).
Acknowledgements
This study benefited from the PELAGIC group funded by the Centre for the Synthesis and Analysis of Biodiversity (CESAB) of the Foundation for Research on Biodiversity (FRB) and by the Bertarelli Foundation. It was also supported by the MPA-POVERTY project (ANR-19-CE03-0005) funded by the Agence Nationale de la Recherche (ANR). This project was conducted as part of the Belmont Forum PARSEC project, funded under the Collaborative Research Action (CRA) on Science-Driven e-Infrastructures Innovation (SEI), with funding from FAPESP, the ANR, JST and the NSF, with collaborators from Australia, and support from the synthesis centre CESAB of the French Foundation for Research on Biodiversity.
Author information
Authors and Affiliations
Contributions
D.M., L.V., J.C., R.D., T.L., N.L., S.M., L.M., J.M., N.M., A.N., V.P., M.T. & W.T. conceived the study. D.M., L.V., C.A., N.C., V.D., N.L., A.N., R.S., L.O. & J.R. carried out data collection and analyses. L.V., N.C., N.L., L.M. & N.M. draw the figures. All authors wrote and corrected the paper.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing interests.
Peer review
Peer review information
Nature Communications thanks the anonymous reviewer(s) for their contribution to the peer review of this work. A peer review file is available.
Additional information
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.
About this article
Cite this article
Mouillot, D., Velez, L., Albouy, C. et al. The socioeconomic and environmental niche of protected areas reveals global conservation gaps and opportunities.
Nat Commun 15, 9007 (2024). https://doi.org/10.1038/s41467-024-53241-1
-
Received: 07 September 2023
-
Accepted: 11 July 2024
-
Published: 18 October 2024
-
DOI: https://doi.org/10.1038/s41467-024-53241-1