7  Split

Split data into training (to fit) and test (to evaluate prediction)

Data is often split so that ~20% of the observations (presence and absence) are set aside from the model fitting to be used for model evaluation.

The k-fold function is often used to split the data into k groups, and then the model is fit k times, each time using a different group as the test data and the remaining groups as the training data.