
A Synthetic Stock Exchange Played with Real Money
- Vincent Granville
- May 15, 2023
Not only that, but you can predict — more precisely compute with absolute certainty — what the value of any stock will be tomorrow. Transaction fees are well below 0.05% and the market, at least in the version presented here, is fair: in other words, a zero-sum game if you play by luck. If instead […]
Read More
Autonomous Driving: Boosting Optical Flow with Synthetic Data
- Michael Galarnyk
- April 27, 2023
- computer vision
- optical flow
- synthetic data
Optical flow is defined as the task of estimating per-pixel motion between video frames. Optical flow models take two sequential frames as input and return as output a flow vector that predicts where each pixel in the first frame will be in the second frame. Optical flow is an important task for autonomous driving, but […]
Read More
Generating and Videolizing Agglomerative Processes
- Vincent Granville
- April 17, 2023
This short article explains how to efficiently simulate the evolution of agglomerative processes, and visualize their behavior with data animations. I use a generic, simple model for illustration purposes: atoms, initially consisting of one electron, collide and merge over time, with a pre-specified maximum number of electrons per atom: the maximum limit. Given enough time, […]
Read More
Smart Grid Search for Faster Hyperparameter Tuning
- Vincent Granville
- March 30, 2023
The objective is two-fold. First, I introduce a 2-parameter generalization of the discrete geometric and zeta distributions. Indeed, a combination of both. It allows you to simultaneously match the variance and mean in observed data, thanks to the two parameters p and α. To the contrary, each distribution taken separately only has one parameter, and […]
Read More
Data Synthetization: enhanced GANs vs Copulas
- Vincent Granville
- March 8, 2023
Using case studies, I compare generative adversarial networks (GANs) with copulas to synthesize tabular data. I discuss back-end and front-end improvements to help GANs better replicate the correlation structure present in the real data. Likewise, I discuss methods to further improve copulas, including transforms, the use of separate copulas for each population segment, and parametric […]
Read More
Data Synthetization Explained in One Picture
- Vincent Granville
- February 23, 2023
The diagram is organized as follows. Dashed blue lines are associated to GANs (generative adversarial networks), where the goal is to produce a sequence of synthetic datasets that get better and better at mimicking the structure present in the real data, over successive iterations. The diagram features 5 such iterations, with the synthetized datasets denoted […]
Read More
New Book on Synthetic Data: Version 3.0 Just Released
- Vincent Granville
- February 3, 2023
Update on March 3, 2023: Version 4.0 has been released and now replaces version 3.0 on the e-Store. It contains a new full chapter on enhanced generative adversarial networks (GANs) with comparison to copula-based methods for data synthetization, with illustrations on real-life datasets. The book has considerably grown since version 1.0. It started with synthetic […]
Read More
New Interpolation Methods for Data Synthetization and Prediction
- Vincent Granville
- January 14, 2023
Entitled “New Interpolation Methods for Synthetization and Prediction”, the full version in PDF format is accessible in the “Free Books and Articles” section, here. This article is an extract from my book “Synthetic Data and Generative AI”, available here. I describe little-known original interpolation methods with applications to real-life datasets. These simple techniques are easy to implement and can […]
Read More
Synthetizing the Insurance Dataset Using Copulas: Towards Better Synthetization
- Vincent Granville
- December 17, 2022
This article is an extract from my book “Synthetic Data and Generative AI”, available here. In the context of synthetic data generation, I’ve been asked a few times to provide a case study focusing on real-life tabular data used in the finance or health industry. Here we go: this article fills this gap. The purpose is […]
Read More
Military-grade Fast Random Number Generator Based on Quadratic Irrationals
- Vincent Granville
- December 13, 2022
This article is an extract from my book “Synthetic Data and Generative AI”, available here. There are very few serious articles in the literature dealing with digits of irrational numbers to build a pseudo-random number generator (PRNG). It seems that this idea was abandoned long ago due to the computational complexity and the misconception that such […]
Read More
You must be logged in to post a comment.