Data Sets

Data Sets Featured Posts Machine Learning Synthetic Data Time Series Visualization

New Interpolation Methods for Data Synthetization and Prediction

Entitled “New Interpolation Methods for Synthetization and Prediction”, the full version in PDF format is accessible in the “Free Books and Articles” section, here. This article is an extract from my book “Synthetic Data and Generative AI”, available here. I describe little-known original interpolation methods with applications to real-life datasets. These simple techniques are easy to implement and can […]

Read More
Data Sets Featured Posts Machine Learning Synthetic Data

Synthetizing the Insurance Dataset Using Copulas: Towards Better Synthetization

This article is an extract from my book “Synthetic Data and Generative AI”, available here. In the context of synthetic data generation, I’ve been asked a few times to provide a case study focusing on real-life tabular data used in the finance or health industry. Here we go: this article fills this gap. The purpose is […]

Read More
Data Sets Featured Posts Machine Learning Statistical Science Synthetic Data

Military-grade Fast Random Number Generator Based on Quadratic Irrationals

This article is an extract from my book “Synthetic Data and Generative AI”, available here. There are very few serious articles in the literature dealing with digits of irrational numbers to build a pseudo-random number generator (PRNG). It seems that this idea was abandoned long ago due to the computational complexity and the misconception that such […]

Read More
Computer Vision Data Sets Explainable AI Featured Posts Machine Learning Synthetic Data Visualization

Spectacular Videos: Synthetic Universes, with Star Collision Graph

Entitled “Spectacular Videos: Synthetic Universes, with Star Collision Graph”, the full version in PDF format is accessible in the “Free Books and Articles” section, here. Also discussed in details with Python code in my book “Synthetic Data”, available here. This project started as an attempt to generate simulations for the three-body problem in astronomy: studying the orbits of three […]

Read More
Books Data Sets Deep Learning Explainable AI Featured Posts Machine Learning Synthetic Data Visualization

New Book: Intuitive Machine Learning and Explainable AI

Intuitive Machine Learning with focus on explainable AI, human-friendly intelligence, powerful visualizations and applications. By Vincent Granville Ph.D, published in September 2022. PDF format, 156 pages. Version 1.0 with Python code. The book is available here. For my upcoming course based on this book, see here. This book covers the foundations of machine learning, with modern […]

Read More
Data Sets Explainable AI Featured Posts ML with Excel Natural Language Processing

Advanced Machine Learning with Basic Excel: Simple Alternative to XGBoost

Entitled “Advanced Machine Learning with Basic Excel”, the full version in PDF format is accessible in the “Free Books and Articles” section, here. Also discussed in details with Python code in chapter 2 in my book “Intuitive Machine Learning and Explainable AI”, available here. I discuss ensemble methods combining many mini decision trees, blended with regression, explained in […]

Read More
Data Sets Featured Posts Machine Learning Time Series Visualization

The Sound that Data Makes

Featured in chapter 11 in my book “Intuitive Machine Learning and Explainable AI”, available here. It is common these days to read stories about the sound of black holes, deep space or the abyss. But what if you could turn your data into music? There are a few reasons one might want to do this. First, […]

Read More
Data Sets Featured Posts Machine Learning Podcasts Statistical Science Synthetic Data

Synthetic Data in Machine Learning: What, Why, How?

In this episode, Nicolai Baldin (CEO) and Simon Swan (Machine Learning Lead) of Synthesized are welcoming the founder of Data Science Central and MLTechniques.com Vincent Granville to discuss synthetic data generation, share secrets about Machine Learning on synthetic data, key challenges with synthetic data, and using generative models to solve issues related to fairness and […]

Read More
Data Sets Featured Posts Machine Learning Visualization

The Art of Visualizing High Dimensional Data

Entitled “The Art of Visualizing High Dimensional Data”, the full version in PDF format is accessible in the “Free Books and Articles” section, here. Also discussed in details with Python code in chapter 4 in my book “Intuitive Machine Learning and Explainable AI”, available here. This article discusses enriched visualizations, with a focus on animated gifs and videos […]

Read More
Data Sets Explainable AI Featured Posts Machine Learning ML with Excel Statistical Science Synthetic Data

Little Known Secrets about Interpretable Machine Learning on Synthetic Data

Entitled “Little Known Secrets about Interpretable Machine Learning on Synthetic Data”, the full version in PDF format is accessible in the “Free Books and Articles” section, here. This first article in a new series on synthetic data and explainable AI, focuses on making linear regression more meaningful and controllable. Includes synthetic data, advanced machine learning with Excel, […]

Read More