๐Management of datasets stored under the new system
How the new Data Execution System handles stored datasets
Overview
Changes Introduced by the New Data Execution System
Legacy Data Execution System
How we handle refresh data jobs with our new Multi-Tenant Execution System

Memory Allocation for Refresh Jobs
Factors affecting memory consumption
Data Processing Workflow
1. Planning Extraction
2. Fetching Data
3. In-memory execution
Performance and Efficiency Results
Scenario
Legacy System (Laputa)
HADES
Performance Gain
Data Storage Architecture
Data File Efficiency
QA: refresh data under the new data execution system
What is the Refresh Data feature in Toucan?
What has changed with the new Data Execution System (HADES)?
Architecture and System Behavior
How does the HADES system manage data refresh jobs?
What are the main differences between HADES and the legacy system?
Aspect
Legacy System
HADES
How does HADES prevent resource conflicts between tenants?
Memory Management and Performance
How is memory allocated for refresh jobs?
What factors influence memory consumption?
Does Toucan use Polars for data processing?
What happens if a job exceeds its memory limit?
Data Processing Workflow
What are the main stages of a refresh job?
What are the different pipeline types?
Data Storage and File Management
How is customer data stored?
Can customers use their own S3 buckets?
How are files stored and downloaded?
Last updated
Was this helpful?