AI + Network Papers15 min read11 citations

Diffusion-Based Generative Models for Synthetic Network Traffic Generation

Dr. Guillaume Chevalier, Prof. Jiayu Zhou

Michigan State University

Jan 19, 2026View on arXiv

Abstract

We develop a denoising diffusion probabilistic model (DDPM) for generating realistic synthetic network traffic data. The model captures complex temporal correlations, long-range dependencies, and multi-variate relationships in network KPIs. Synthetic data generated by our model passes 95% of statistical fidelity tests and, when used for training, improves downstream ML model performance by 18% in data-scarce scenarios. This enables operators to develop and test AI solutions without exposing proprietary network data.

AI Summary

AI-Generated Summary
  • Diffusion model generating realistic synthetic network traffic data.
  • Passes 95% of statistical fidelity tests for realism.
  • 18% improvement in downstream ML when used for training augmentation.
  • Enables AI development without exposing proprietary network data.

Key Findings

  • 1Diffusion models capture multi-variate network KPI relationships better than GANs.
  • 2Conditional generation enables creating data for specific network conditions.
  • 3Privacy analysis confirms synthetic data does not memorize individual records.

Industry Implications

Solves the data scarcity problem for telecom AI development.

Enables secure AI model benchmarking and sharing between organizations.

Could create a marketplace for synthetic network data.

Diffusion ModelSynthetic DataNetwork TrafficPrivacy

Read the Original Paper

Access the full paper on arXiv for complete methodology, results, and references.

Open on arXiv

Related Papers