
Feature Clustering: A Simple Solution to Many Machine Learning Problems
- Vincent Granville
- March 12, 2023
Feature clustering is an unsupervised machine learning technique that separates the features of a dataset into homogeneous groups. In short, it is a clustering procedure performed on the features rather than on the observations. Such techniques often rely on a similarity metric that measures how close two features are to each other. In this article, […]
Data Synthetization: enhanced GANs vs Copulas
- Vincent Granville
- March 8, 2023
Using case studies, I compare generative adversarial networks (GANs) with copulas to synthesize tabular data. I discuss back-end and front-end improvements to help GANs better replicate the correlation structure present in the real data. Likewise, I discuss methods to further improve copulas, including transforms, the use of separate copulas for each population segment, and parametric […]
New Book on Synthetic Data: Version 3.0 Just Released
- Vincent Granville
- February 3, 2023
Update on March 3, 2023: Version 4.0 has been released and now replaces version 3.0 on the e-Store. It contains a new full chapter on enhanced generative adversarial networks (GANs), with a comparison to copula-based methods for data synthetization and illustrations on real-life datasets. The book has grown considerably since version 1.0. It started with synthetic […]
New Interpolation Methods for Data Synthetization and Prediction
- Vincent Granville
- January 14, 2023
Entitled “New Interpolation Methods for Synthetization and Prediction”, the full version in PDF format is accessible in the “Free Books and Articles” section, here. This article is an extract from my book “Synthetic Data and Generative AI”, available here. I describe little-known original interpolation methods with applications to real-life datasets. These simple techniques are easy to implement and can […]
Synthetizing the Insurance Dataset Using Copulas: Towards Better Synthetization
- Vincent Granville
- December 17, 2022
This article is an extract from my book “Synthetic Data and Generative AI”, available here. In the context of synthetic data generation, I’ve been asked a few times to provide a case study focusing on real-life tabular data used in the finance or health industry. Here we go: this article fills this gap. The purpose is […]
Military-grade Fast Random Number Generator Based on Quadratic Irrationals
- Vincent Granville
- December 13, 2022
This article is an extract from my book “Synthetic Data and Generative AI”, available here. Very few serious articles in the literature use the digits of irrational numbers to build a pseudo-random number generator (PRNG). It seems that this idea was abandoned long ago due to the computational complexity and the misconception that such […]
Spectacular Videos: Synthetic Universes, with Star Collision Graph
- Vincent Granville
- November 19, 2022
Entitled “Spectacular Videos: Synthetic Universes, with Star Collision Graph”, the full version in PDF format is accessible in the “Free Books and Articles” section, here. Also discussed in detail, with Python code, in my book “Synthetic Data”, available here. This project started as an attempt to generate simulations for the three-body problem in astronomy: studying the orbits of three […]
New Book: Intuitive Machine Learning and Explainable AI
- Vincent Granville
- September 23, 2022
Intuitive Machine Learning with a focus on explainable AI, human-friendly intelligence, powerful visualizations and applications. By Vincent Granville, Ph.D., published in September 2022. PDF format, 156 pages. Version 1.0 with Python code. The book is available here. For my upcoming course based on this book, see here. This book covers the foundations of machine learning, with modern […]
Advanced Machine Learning with Basic Excel: Simple Alternative to XGBoost
- Vincent Granville
- September 11, 2022
Entitled “Advanced Machine Learning with Basic Excel”, the full version in PDF format is accessible in the “Free Books and Articles” section, here. Also discussed in detail, with Python code, in chapter 2 of my book “Intuitive Machine Learning and Explainable AI”, available here. I discuss ensemble methods combining many mini decision trees, blended with regression, explained in […]
The Sound that Data Makes
- Vincent Granville
- August 29, 2022
Featured in chapter 11 of my book “Intuitive Machine Learning and Explainable AI”, available here. It is common these days to read stories about the sound of black holes, deep space or the abyss. But what if you could turn your data into music? There are a few reasons one might want to do this. First, […]