Maxent model for Notropis_amoenus


This page contains some analysis of the Maxent model for Notropis_amoenus, created Sun Mar 14 12:55:50 EDT 2021 using Maxent version 3.4.1. If you would like to do further analyses, the raw data used here is linked to at the end of this page.


Analysis of omission/commission

The following picture shows the omission rate and predicted area as a function of the cumulative threshold. The omission rate is is calculated both on the training presence records, and (if test data are used) on the test records. The omission rate should be close to the predicted omission, because of the definition of the cumulative threshold.


The next picture is the receiver operating characteristic (ROC) curve for the same data. Note that the specificity is defined using predicted area, rather than true commission (see the paper by Phillips, Anderson and Schapire cited on the help page for discussion of what this means). This implies that the maximum achievable AUC is less than 1. If test data is drawn from the Maxent distribution itself, then the maximum possible test AUC would be 0.821 rather than 1; in practice the test AUC may exceed this bound.



Some common thresholds and corresponding omission rates are as follows. If test data are available, binomial probabilities are calculated exactly if the number of test samples is at most 25, otherwise using a normal approximation to the binomial. These are 1-sided p-values for the null hypothesis that test points are predicted no better than by a random prediction with the same fractional predicted area. The "Balance" threshold minimizes 6 * training omission rate + .04 * cumulative threshold + 1.6 * fractional predicted area.

Cumulative thresholdCloglog thresholdDescriptionFractional predicted areaTraining omission rate
1.0000.049Fixed cumulative value 10.7830.011
5.0000.135Fixed cumulative value 50.5870.033
10.0000.211Fixed cumulative value 100.4640.082
0.5350.034Minimum training presence0.8340.000
11.4270.23010 percentile training presence0.4380.098
29.2670.441Equal training sensitivity and specificity0.2320.230
23.0560.373Maximum training sensitivity plus specificity0.2870.164
2.4970.088Balance training omission, predicted area and threshold value0.6850.016
10.1830.213Equate entropy of thresholded and original distributions0.4610.082


Click here to interactively explore this prediction using the Explain tool. If clicking from your browser does not succeed in starting the tool, try running the script in C:\Users\Dan McGarvey\Documents\LargeWorkingFiles\VA_ScenicRivers_Project_2020\VA_ScenicRivers_ModelingData_Loop100\Notropis_amoenus\ResultsFinal\Notropis_amoenus_explain.bat directly. This tool requires the environmental grids to be small enough that they all fit in memory.



Response curves


These curves show how each environmental variable affects the Maxent prediction. The (raw) Maxent model has the form exp(...)/constant, and the curves show how the exponent changes as each environmental variable is varied, keeping all other environmental variables at their average sample value. Click on a response curve to see a larger version. Note that the curves can be hard to interpret if you have strongly correlated variables, as the model may depend on the correlations in ways that are not evident in the curves. In other words, the curves show the marginal effect of changing exactly one variable, whereas the model may take advantage of sets of variables changing together.



In contrast to the above marginal response curves, each of the following curves represents a different model, namely, a Maxent model created using only the corresponding variable. These plots reflect the dependence of predicted suitability both on the selected variable and on dependencies induced by correlations between the selected variable and other variables. They may be easier to interpret if there are strong correlations between variables.




Analysis of variable contributions


The following table gives estimates of relative contributions of the environmental variables to the Maxent model. To determine the first estimate, in each iteration of the training algorithm, the increase in regularized gain is added to the contribution of the corresponding variable, or subtracted from it if the change to the absolute value of lambda is negative. For the second estimate, for each environmental variable in turn, the values of that variable on training presence and background data are randomly permuted. The model is reevaluated on the permuted data, and the resulting drop in training AUC is shown in the table, normalized to percentages. As with the variable jackknife, variable contributions should be interpreted with caution when the predictor variables are correlated.

VariablePercent contributionPermutation importance
ArbolateSum40.730
PctImp2001Cat13.815.3
MSST_200810.48.5
PctDecid2001Cat10.112.2
PctCrop2001Cat8.69.4
BFICat5.910.6
WtDepCat4.25.7
RunoffCat3.43
PctWdWet2001Cat2.13.3
HydrlCondCat0.82




Raw data outputs and control parameters


The data used in the above analysis is contained in the next links. Please see the Help button for more information on these.
The model applied to the training environmental layers
The model applied to the environmental layers in Background
The coefficients of the model
The omission and predicted area for varying cumulative and raw thresholds
The prediction strength at the training and (optionally) test presence sites
Results for all species modeled in the same Maxent run, with summary statistics and (optionally) jackknife results


Regularized training gain is 0.775, training AUC is 0.851, unregularized training gain is 0.959.
Algorithm converged after 660 iterations (20 seconds).

The follow settings were used during the run:
183 presence records used for training.
12735 points used to determine the Maxent distribution (background points and presence points).
Environmental layers used (all continuous): ArbolateSum BFICat HydrlCondCat MSST_2008 PctCrop2001Cat PctDecid2001Cat PctImp2001Cat PctWdWet2001Cat RunoffCat WtDepCat
Regularization values: linear/quadratic/product: 0.050, categorical: 0.250, threshold: 1.000, hinge: 0.500
Feature types used: hinge
responsecurves: true
outputformat: raw
outputdirectory: ResultsFinal
projectionlayers: Background
samplesfile: FocalFishData_Thin
environmentallayers: BackgroundSample
warnings: false
askoverwrite: false
maximumbackground: 150000
writebackgroundpredictions: true
responsecurvesexponent: true
linear: false
quadratic: false
product: false
autorun: true
writeplotdata: true
autofeature: false
maximumiterations: 5000
threads: 4
allowpartialdata: true
Command line used: samplesfile=FocalFishData_Thin environmentallayers=BackgroundSample projectionlayers=Background togglelayertype=StreamOrder outputdirectory=ResultsFinal linear=false quadratic=false product=false threshold=false hinge=true autofeature=false responsecurves=true jackknife=false outputformat=raw plots=true warnings=false maximumbackground=150000 addsamplestobackground=true writeplotdata=true writebackgroundpredictions=true extrapolate=true maximumiterations=5000 responsecurvesexponent=true allowpartialdata=true nodata=-9999 threads=4 askoverwrite=false autorun

Command line to repeat this species model: java density.MaxEnt nowarnings noprefixes -E "" -E Notropis_amoenus responsecurves outputformat=raw outputdirectory=ResultsFinal projectionlayers=Background samplesfile=FocalFishData_Thin environmentallayers=BackgroundSample nowarnings noaskoverwrite maximumbackground=150000 writebackgroundpredictions responsecurvesexponent nolinear noquadratic noproduct autorun writeplotdata noautofeature maximumiterations=5000 threads=4 allowpartialdata