In this task, you should carry out the main types of analysis of nonparametric data on different datasets, including:
- Cluster analysis (different models, including MCMC)
- Regression analysis (from simple paired variants to GAM and Neighbourhood)
- Loglinear models
Both the code and an interpretation of the results are required.
This task requires two datasets (preferably from Kaggle) that meet the following requirements:
1. At least one dataset must contain at least 10 variables of any scale type, preferably including at least one or two numeric/continuous variables.
2. At least one dataset must contain binary variables, preferably ones well suited to the BayesBinMix MCMC cluster algorithm.
3. For loglinear modelling, at least 4 variables should be treated as binary or nominal categorical, so that the model structures can be analysed more fully.
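
Before committing to a dataset, the three requirements above can be checked with a quick screening pass. The following is only a minimal sketch in plain Python, not a prescribed method. The function names (`classify_column`, `screen_dataset`), the toy columns, and the cut-off of 10 distinct values separating "continuous" from "nominal" are all assumptions made for illustration. A real Kaggle file would first need to be loaded into a mapping from column name to values.

```python
from typing import Dict, Sequence


def classify_column(values: Sequence[object], max_levels: int = 10) -> str:
    """Label a column 'binary', 'continuous' or 'nominal' with a crude heuristic.

    max_levels is an assumed cut-off: a numeric column with more distinct
    values than this is treated as continuous.
    """
    present = [v for v in values if v is not None]  # ignore missing values
    distinct = set(present)
    if len(distinct) == 2:
        return "binary"
    all_numeric = all(
        isinstance(v, (int, float)) and not isinstance(v, bool) for v in present
    )
    if all_numeric and len(distinct) > max_levels:
        return "continuous"
    return "nominal"


def screen_dataset(columns: Dict[str, Sequence[object]]) -> Dict[str, object]:
    """Check one dataset (column name -> values) against the three requirements."""
    kinds = {name: classify_column(vals) for name, vals in columns.items()}
    n_binary = sum(k == "binary" for k in kinds.values())
    n_continuous = sum(k == "continuous" for k in kinds.values())
    n_categorical = sum(k in ("binary", "nominal") for k in kinds.values())
    return {
        "kinds": kinds,
        "req1_ten_variables": len(columns) >= 10,
        "req1_has_continuous": n_continuous >= 1,
        "req2_has_binary": n_binary >= 1,
        "req3_four_categorical": n_categorical >= 4,
    }


# Hypothetical toy data, 12 rows; column names are invented for the example.
demo = {
    "smoker": [0, 1] * 6,
    "sex": ["f", "m"] * 6,
    "region": ["n", "s", "e", "w"] * 3,
    "employed": [True, False] * 6,
    "income": [20.0 + 1.5 * i for i in range(12)],
}

report = screen_dataset(demo)
for key, value in report.items():
    print(key, value)
```

The heuristic is deliberately simple. An ordinal scale coded as integers, for example, would be labelled nominal, so the reported types should be confirmed by hand against the dataset's documentation.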