| title | Cloud Data Persistence | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| description | Central data management foundation of the Edge AI Accelerator architecture, bridging edge computing with cloud-based analytics capabilities through Azure Storage and Microsoft Fabric integration | ||||||||||
| author | Edge AI Team | ||||||||||
| ms.date | 2025-06-07 | ||||||||||
| ms.topic | reference | ||||||||||
| keywords |
|
||||||||||
| estimated_reading_time | 6 |
The Cloud Data Persistence component serves as the central data management foundation of the Edge AI Accelerator architecture, bridging edge computing with cloud-based analytics capabilities. This component implements a modern data architecture that collects, stores, and processes data generated from edge devices to enable advanced analytics and AI model training.
Within the accelerator, this component:
- Provides centralized data storage for edge device telemetry, AI inference results, and operational data
- Creates a data pipeline from edge to cloud by consuming events from the messaging system
- Enables advanced analytics through Microsoft Fabric's analytics platform integration
- Supports the AI lifecycle by storing data that can be used for model training and validation
The Cloud Data Persistence component is a critical integration hub:
- Edge devices: Data generated at the edge flows through messaging to this component
- 050-Messaging component: Consumes events from Event Grid topics for real-time data processing
- AI model training: Provides datasets for retraining models deployed to edge devices
- Analytics and reporting: Enables insights from combined edge and cloud data
- Edge devices generate data and AI inference results
- Data is published to Event Grid topics by the messaging component
- Fabric EventStream ingests events from Event Grid in real-time
- Data is stored in:
- Azure Storage (operational data, configurations, and raw telemetry)
- Fabric Lakehouse (processed data ready for analytics)
- Stored data is available for applications, analytics, and AI model retraining
This component creates:
- Azure Storage Accounts: For general-purpose persistence of application data
- Azure Storage Containers: For organizing and storing blob data
- Azure Storage File Shares: For file-based sharing and access
- Data Lake Gen2 Storage: For hierarchical namespaces and big data workloads
- Fabric Workspace: A logical container for Fabric analytics assets
- Fabric Lakehouse: A modern data lake with data warehouse capabilities
- Fabric EventStream: A real-time data ingestion service connecting Event Grid to the Lakehouse
- Azure CLI installed and logged in
- Terraform installed
- jq installed (for script processing)
The component provides two options for Microsoft Fabric workspace integration:
Set should_create_fabric_workspace = true in your Terraform configuration to create a new Fabric workspace.
If you want to use a premium capacity with your new workspace, run the provided script:
cd scripts
chmod +x select-fabric-capacity.sh
./select-fabric-capacity.shTo use an existing Microsoft Fabric workspace:
- Set
should_create_fabric_workspace = falsein your Terraform configuration - Provide the existing workspace ID via
existing_fabric_workspace_idvariable
In both cases, the component will create:
- A Lakehouse in the specified workspace
- An EventStream connecting Event Grid to the Lakehouse (if an Event Grid endpoint is provided)
If you want to use a premium capacity with Microsoft Fabric, run the provided script:
cd scripts
chmod +x select-fabric-capacity.sh
./select-fabric-capacity.shThe script will:
- List all available Fabric capacities for your account
- Prompt you to select one
- Save the selection for use by Terraform
If you don't have any capacities or choose not to use one, the deployment will use the Fabric free tier.
The terraform code can be found in the terraform folder.
- storage-account: Creates Azure Storage Account with appropriate configuration
- data-lake: Provisions Data Lake Gen2 storage for big data workloads
- fabric_workspace: Creates a Microsoft Fabric workspace environment
- fabric_lakehouse: Creates a Fabric Lakehouse linked to the workspace
- fabric_eventstream: Creates a Fabric EventStream that connects Event Grid to the Lakehouse
🤖 Crafted with precision by ✨Copilot following brilliant human instruction, then carefully refined by our team of discerning human reviewers.