ML Dataset Generation

This guide is coming soon. Below is an outline of what it will cover.

Planned Content

  • Motivation — why ML-ready power system datasets are needed
  • Dataset Formats — graph-structured (PyG/DGL) and tabular formats
  • Cascade Datasets — generating failure propagation datasets
  • Power Flow Datasets — generating steady-state datasets
  • Data Augmentation — topology perturbation, load scaling
  • Integration with PyG/DGL — loading datasets into PyTorch Geometric and DGL
  • Benchmarking — standard splits and evaluation protocols