DataFlood Suite Documentation

Welcome to the comprehensive documentation for the DataFlood Suite - a powerful platform for generating realistic synthetic data based on DataFlood's human-editable models.

Quick Start

New to DataFlood? Start here:

  1. Core Concepts - Understand the fundamentals
  2. DataFlood CLI Getting Started - Command-line quick start
  3. FloodGate API Quick Start - API quick start

Components

DataFlood (CLI)

Command-line tool for schema generation and document creation.

DataFloodEditor (GUI)

Visual editor for creating schemas and designing sequences.

FloodGate (API)

RESTful API service for programmatic access.

Common Topics

Guides & Tutorials

Documentation by Task

I want to...

Generate synthetic data from existing samples
  1. Use DataFlood CLI to analyze your data
  2. Or import into DataFloodEditor
Create a schema from scratch
  1. Use DataFloodEditor Model Editor
  2. Define properties and constraints
  3. Configure statistical behavior for each data element
  4. Test generation
Design time-based sequences
  1. Open Tides Editor
  2. Add document generation steps
  3. Configure parent-child relationships
  4. Set timing patterns
Merge multiple schemas
  1. Use Model Merge Tool
  2. Drag and drop properties
  3. Resolve conflicts
  4. Save merged DataFlood model
Integrate with my application
  1. Start FloodGate API
  2. Use REST endpoints
Generate large datasets
  1. Use DataFlood CLI for batch generation
  2. Or FloodGate API serving mode
  3. Follow performance tips
  4. Use streaming options

Import data from CSV or JSON

  1. Use DataFloodEditor Import
  2. Configure import settings
  3. Review generated schema
  4. Enhance by editing the DataFlood model
Troubleshoot issues
  1. Check Troubleshooting Guide
  2. Review Best Practices
  3. Validate DataFlood models
  4. Test with small batches

Architecture Overview

┌─────────────────────────────────────────────────────┐
│                    User Interfaces                  │
├──────────────┬──────────────┬──────────────────────┤
│  DataFlood   │DataFloodEditor│    FloodGate API    │
│     CLI      │     GUI       │    REST Service     │
├──────────────┴──────────────┴──────────────────────┤
│              DataFlood Core Library                 │
│  • Schema Generation  • Document Generation         │
│  • Statistical Models • Sequence Execution          │
└─────────────────────────────────────────────────────┘

Component Relationships

  • DataFlood CLI: Command-line interface for DataFlood model generation and document creation
  • DataFloodEditor: Desktop application for visual editing and project management
  • FloodGate API: HTTP service for programmatic access and integration

Key Features

Statistical Modeling

  • String Models: Pattern recognition, n-grams, vocabularies
  • Histograms: Numeric distributions
  • Format Detection: Automatic recognition of emails, URLs, dates
  • Entropy Control: Fine-tune randomness

Data Generation

  • Realistic Data: Maintains statistical properties
  • Multiple Formats: JSON, CSV, JSONL output
  • Reproducible: Seed-based generation

Tides Sequence Design

  • Time-Based: Generate documents over time
  • Relationships: Parent-child document links
  • Transactions: Triggered generation
  • Orchestration: Multiple models working together

Integration

  • REST API: Full HTTP interface
  • Swagger: Interactive documentation
  • Docker: Container deployment
  • Batch Operations: Large-scale generation

Sample Use Cases

E-commerce Testing

Generate realistic product catalogs, customer profiles, orders, and transactions for testing e-commerce systems. See complete example.

IoT Simulation

Create sensor data streams with realistic patterns, anomalies, and time-based variations. See IoT example.

Financial Data

Generate banking transactions, account records, and settlement data with proper relationships. See banking example.

Healthcare Records

Create synthetic patient records, appointments, and medical data for system testing. See healthcare example.

Log Generation

Produce application logs, security events, and audit trails for testing log analysis systems. See logging example.

Getting Help

Documentation

Interactive Help

  • FloodGate Swagger UI at http://localhost:5000/swagger
  • DataFloodEditor tooltips and help menu
  • DataFlood CLI --help flag

Troubleshooting

Version Information

  • Current Version: 1.0.0
  • Documentation Updated: August 2025
  • .NET Version: 9.0
  • License: Commercial (contact for licensing)

Quick Links

Getting Started

References

Key Guides

v1.0, all documentation copyright SmallMinds 2025