Neural Kriging
==============

.. image:: ../../images/neural_kriging/image_intro.png
    :align: center
    :width: 80%


**Neural Kriging** is an interpolation method that estimates values at unknown locations from surrounding data points. Inspired by traditional geostatistical methods such as inverse distance weighting (IDW) and kriging, it adds a trainable neural network to improve predictions.

A key feature of Neural Kriging is that users do not need to define interpolation parameters. The model automatically **learns** the optimal distance weighting, rotation, and influence of secondary variables based on the dataset - reducing bias and improving accuracy. This makes it an adaptable tool for geologists looking to model spatial data without extensive parameter tuning. This automated learning approach is particularly useful in geosciences, where geological and geochemical patterns are often nonlinear and influenced by multiple factors.

Interface
---------

The application only requires the data to interpolate and the object where the interpolation will be saved. The interface is presented in the :ref:`figure below <nk_ui_general>`.

.. image:: ../../images/neural_kriging/ui_general.png
    :name: nk_ui_general
    :align: center
    :width: 60%

* **Source**: The object containing the data to use for the interpolation.

* **Pretrained model file** (*optional*): If users have a pretrained model, they can load it here. The model must be associated with the source object, and the data it was trained on must be present in that object.

* **Data**: The primary data to interpolate.

* **Client**: The object on which the data will be interpolated.

Secondary Data (*optional*)
^^^^^^^^^^^^^^^^^^^^^^^^^^^
If activated, users can include secondary data for the interpolation. The secondary data **must be named the same** in both the source and client objects. The following options are available:

* **Source Data**: The secondary data from the source object.
* **Client Data**: The secondary data from the client object.

Distance Threshold (*optional*)
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^

* **Distance Threshold**: If activated, only points within the specified distance to source points will be interpolated.

Output Names
^^^^^^^^^^^^

* **Output name** (*optional*): The name of the output variable. If not specified, the default name is the *source data name* + "_NK_interpolated".

* **Save model** (*optional*): If activated, users must define the name under which the model is saved in the source object.

Optional Parameters
-------------------

The application provides several optional parameters to customize the interpolation process. The :ref:`figure below <nk_ui_optional>` shows the available options. The default values should work well for most cases.

.. image:: ../../images/neural_kriging/ui_optional.png
    :name: nk_ui_optional
    :align: center
    :width: 60%

* **Number of Nearest Neighbors**: Specifies the number of closest data points used for interpolation. A higher number of neighbors smooths predictions but increases computation time, while a lower number preserves local variations but may introduce noise.

* **Learning Rate**: Controls how quickly the model updates during training. A higher learning rate speeds up convergence but risks instability, whereas a lower learning rate ensures more gradual, stable learning at the cost of longer training times. If the interpolation converges to a non-optimal solution, users can try reducing the learning rate.

* **Batch Size**: Defines the number of data points processed at once during training. Larger batch sizes improve computational efficiency but may generalize poorly, while smaller batch sizes capture finer details but require more iterations.

* **Max Training Epochs**: The maximum number of times the model processes the dataset. More epochs allow for better learning but increase runtime. Early stopping mechanisms prevent unnecessary computations when the model stops improving.

* **Percentage of Points per Epoch**: Specifies the fraction of available data used in each training epoch. A higher percentage stabilizes training but increases computation time, while a lower percentage introduces more randomness, potentially enhancing generalization.

* **Early Stopping Patience**: Defines how many epochs the model will continue training without improvement before stopping. A higher patience prevents premature termination but may lead to overfitting.

* **Scheduler Reduction Factor**: Determines the step size for reducing the learning rate when training stagnates. A lower factor results in smaller adjustments, while a higher factor decreases learning rate more aggressively.

* **Scheduler Patience**: Specifies the number of epochs without improvement before reducing the learning rate. A higher patience delays learning rate adjustments, while a lower patience makes adjustments more frequent.

* **Minimum Learning Rate**: Defines the lowest value the learning rate can reach during training, ensuring that training does not completely stop even after multiple reductions.

* **L1 Regularization Weight**: Encourages sparsity in the model by penalizing large weights. A higher L1 weight reduces complexity but may remove useful patterns, while a lower weight allows for more flexibility.

* **L2 Regularization Weight**: Prevents large weight values, improving generalization. A higher L2 weight enhances stability but may underfit data, while a lower weight provides more adaptability.

* **Enable Multiprocessing**: Allows parallel processing to speed up training. Enabling this feature reduces computation time but requires more system resources. Multiprocessing runs on half of the available CPU cores.
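
To illustrate how **Max Training Epochs** and **Early Stopping Patience** interact, the minimal sketch below stops a loop once the monitored loss has not improved for ``patience`` consecutive epochs. The function name ``stopping_epoch`` and the loss values are hypothetical; this is not the application's internal code, only one common way such a rule is written.

.. code-block:: python

    def stopping_epoch(losses, patience, max_epochs):
        """Return how many epochs run before training stops.

        Hypothetical helper: `losses` holds one monitored loss per epoch.
        """
        best = float("inf")
        epochs_without_improvement = 0
        epochs_run = 0
        for loss in losses[:max_epochs]:
            epochs_run += 1
            if loss < best:
                # Improvement: remember the best loss and reset the counter.
                best = loss
                epochs_without_improvement = 0
            else:
                epochs_without_improvement += 1
                if epochs_without_improvement >= patience:
                    break
        return epochs_run


    # Made-up loss curve with a plateau followed by a late improvement.
    losses = [1.0, 0.8, 0.7, 0.71, 0.72, 0.73, 0.6]
    short = stopping_epoch(losses, patience=2, max_epochs=100)
    long = stopping_epoch(losses, patience=5, max_epochs=100)

With these made-up values, a patience of 2 stops during the plateau and never reaches the late improvement, while a patience of 5 runs through all seven epochs.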


Methodology
-----------

As presented in the :ref:`figure below <weights_interpolation>`, this application aims to find the optimal weights of surrounding points to predict the value at a target location. This section explains how those weights are calculated.

.. image:: ../../images/neural_kriging/weights.png
    :align: center
    :name: weights_interpolation
    :width: 50%

Neural Kriging follows a step-by-step process to calculate interpolated values:

1. Compute the **distance to neighboring points** in 3D space :math:`(X, Y, Z)`.

    .. math::

        \delta_{\text{distance}} = (X_{\text{point}} - X_{\text{neighbor}}, Y_{\text{point}} - Y_{\text{neighbor}}, Z_{\text{point}} - Z_{\text{neighbor}})


2. Apply a **trainable rotation matrix** to adjust for anisotropy.

    .. math::

        \delta_{\text{rotated}} = \mathbf{R} \cdot \delta_{\text{distance}}

3. Compute interpolation **weights** using the transformed distances and trainable **range** and **nugget** parameters for each X, Y, Z axis:

   .. math::

      \text{weights} = \frac{1}{\textit{nugget} +  \|\delta_{\text{rotated}}\|^{\textit{range}}}

4. Modify the weights using a **trainable neural network** (one layer only) that incorporates secondary variables:

   .. math::

      \text{weights} = \text{weights} \times \textit{NeuralNetwork}(\delta_{\text{secondary variable}})

5. Compute the final predicted value using a weighted sum:

   .. math::

      \hat{V} = \frac{\sum (\text{weights} \times \text{values})}{\sum (\text{weights})}
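
The five steps above can be sketched with NumPy for a single target point. This is an illustration under stated assumptions, not the application's implementation: the parameters are fixed by hand rather than trained, the nugget and range are scalars for brevity, and the single-layer network is assumed to be one linear layer followed by a sigmoid. The names ``rotation_z`` and ``predict`` are hypothetical.

.. code-block:: python

    import numpy as np


    def rotation_z(angle_deg):
        """Rotation matrix about the Z axis (stand-in for the trained matrix R)."""
        a = np.radians(angle_deg)
        return np.array([
            [np.cos(a), -np.sin(a), 0.0],
            [np.sin(a), np.cos(a), 0.0],
            [0.0, 0.0, 1.0],
        ])


    def predict(target, neighbors, values, rotation, nugget, range_exp,
                delta_secondary=None, nn_weight=None, nn_bias=0.0):
        """Estimate the value at `target` from its neighbors (steps 1 to 5)."""
        # Step 1: coordinate differences between the target and each neighbor.
        delta = target - neighbors                  # shape (k, 3)
        # Step 2: apply the rotation matrix to every difference vector.
        delta_rot = delta @ rotation.T
        # Step 3: weights from the nugget and the distance raised to the range.
        dist = np.linalg.norm(delta_rot, axis=1)
        weights = 1.0 / (nugget + dist ** range_exp)
        # Step 4 (optional): scale the weights with a single-layer network.
        # Assumed form: linear layer + sigmoid on secondary-variable differences.
        if delta_secondary is not None:
            z = delta_secondary @ nn_weight + nn_bias
            weights = weights * (1.0 / (1.0 + np.exp(-z)))
        # Step 5: weighted sum divided by the sum of the weights.
        return np.sum(weights * values) / np.sum(weights)


    target = np.array([0.0, 0.0, 0.0])
    neighbors = np.array([[1.0, 0.0, 0.0], [0.0, 2.0, 0.0], [0.0, 0.0, 4.0]])
    values = np.array([10.0, 20.0, 30.0])

    estimate = predict(target, neighbors, values,
                       rotation=rotation_z(30.0), nugget=1.0, range_exp=2.0)

Because the result is a weighted average with positive weights, the estimate always lies between the smallest and largest neighbor values, and closer neighbors pull it more strongly.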

Pre-processing
^^^^^^^^^^^^^^

The application performs the following pre-processing steps before the interpolation:

* **Filtering Points**:
   * Only points containing valid values are kept for interpolation.
   * In the case of secondary variables, all points containing "*no-data*" values will be excluded.

* **Secondary Variable Normalization**:
   * Secondary variables are normalized by subtracting their mean and dividing by their standard deviation.

* **Normalization of the Primary Data**:
   * The data to be interpolated is normalized by subtracting its mean and dividing by its standard deviation.
   * After interpolation, the results are transformed back to the original scale by multiplying by the standard deviation and adding the mean.

* **Distance-Based Filtering** (optional):
   * If activated, client points farther than the specified distance from the source points are not interpolated.
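
The pre-processing steps can be sketched with NumPy as follows. This is a minimal illustration, not the application's code: it assumes that no-data values are stored as NaN, and the helper names ``normalize``, ``denormalize`` and ``within_threshold`` are hypothetical.

.. code-block:: python

    import numpy as np


    def normalize(values):
        """Standardize values; also return the mean and standard deviation."""
        mean = np.mean(values)
        std = np.std(values)
        return (values - mean) / std, mean, std


    def denormalize(normalized, mean, std):
        """Invert `normalize`: multiply by the standard deviation, add the mean."""
        return normalized * std + mean


    def within_threshold(client_points, source_points, threshold):
        """Boolean mask of client points within `threshold` of any source point."""
        # Pairwise distances, shape (n_client, n_source); fine for small arrays.
        diff = client_points[:, None, :] - source_points[None, :, :]
        nearest = np.linalg.norm(diff, axis=2).min(axis=1)
        return nearest <= threshold


    # Filtering: keep only valid values (assumption: no-data is NaN).
    raw = np.array([1.0, np.nan, 3.0, 5.0])
    valid = raw[~np.isnan(raw)]

    # Normalization and back-transformation round trip.
    scaled, mean, std = normalize(valid)
    restored = denormalize(scaled, mean, std)

    # Distance-based filtering with a threshold of 3 units.
    source_points = np.array([[0.0, 0.0, 0.0], [10.0, 0.0, 0.0]])
    client_points = np.array([[1.0, 0.0, 0.0], [5.0, 0.0, 0.0], [10.0, 2.0, 0.0]])
    mask = within_threshold(client_points, source_points, threshold=3.0)

In this example the second client point lies 5 units from the nearest source point, so it falls outside the threshold and would not receive an interpolated value.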


Tutorial
--------

The :ref:`following video <nk_tutorial>` shows how to use the Neural Kriging Interpolation application. The steps are:

* Select the object containing the data to interpolate.
* Select the data to be interpolated.
* Select the object to project the interpolated data onto.
* (Optional) Define the name of the output variable.
* (Optional) Select the secondary data for the source object.
* (Optional) Select the secondary data for the client object.
* (Optional) Define a distance threshold to limit the interpolation range.
* Run the application.
* Inspect the results.

.. figure:: ../../images/neural_kriging/neural_kriging.gif
    :name: nk_tutorial
    :align: center