This repository contains Python scripts and Jupyter notebooks that implement spatial multivariate imputation using machine learning and the lambda distribution.
Using the Anaconda Python distribution is recommended; create an environment from the provided environment.yml file.
The notebooks rely on the Python scripts located in the Tools folder.
- original.dat: the input data file.
- 01-DataInventory.ipynb: the first notebook, which handles data processing and exports the data for imputation.
- 02-LambdaDistributionMlModel.ipynb: fits the lambda distribution; its results are used by the downstream notebooks.
- 03-MlForConditionalDistributionTemplate.ipynb: a template notebook that implements training of the neural networks. You do not need to run it directly, because the next notebook executes it to perform training.
- 04-DataImputationUsingMLP.ipynb: the main notebook, which implements all the steps of imputation and cross validation. It uses the papermill Python package to orchestrate training of the neural networks.
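
The papermill orchestration mentioned for 04-DataImputationUsingMLP.ipynb can be sketched roughly as follows. This is a minimal illustration and not the repository's actual code: the parameter name `target_variable`, the `runs` output folder, and the helper names `build_runs` and `execute_runs` are all assumptions made for the example. Only `papermill.execute_notebook` is a real papermill call.

```python
from pathlib import Path

# Template notebook named in this README.
TEMPLATE = "03-MlForConditionalDistributionTemplate.ipynb"


def build_runs(variables, out_dir="runs"):
    """Plan one template execution per variable.

    Returns a list of (output_notebook_path, parameters) pairs.
    The parameter name 'target_variable' is hypothetical; the real
    template may expect different parameters.
    """
    stem = Path(TEMPLATE).stem
    runs = []
    for name in variables:
        output_path = Path(out_dir) / f"{stem}-{name}.ipynb"
        runs.append((str(output_path), {"target_variable": name}))
    return runs


def execute_runs(runs, template=TEMPLATE):
    """Execute the template once per planned run using papermill."""
    # Imported here so that planning works even without papermill installed.
    import papermill as pm

    for output_path, parameters in runs:
        Path(output_path).parent.mkdir(parents=True, exist_ok=True)
        # Injects 'parameters' into the notebook and saves the executed copy.
        pm.execute_notebook(template, output_path, parameters=parameters)
```

Calling `execute_runs(build_runs(["A", "B"]))` would run the template twice and save each executed copy under `runs/`, where `"A"` and `"B"` stand in for whatever variables are being imputed.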