-
Notifications
You must be signed in to change notification settings - Fork 5
Open
Labels
Description
Summary
Add support for major cloud data warehouse dialects to expand market reach.
Current State
5 dialects supported: PostgreSQL, MySQL, SQL Server, Oracle, SQLite
Target Dialects (Phased)
Phase 1: Snowflake (Q2 2025)
- Cloud data warehouse leader
- VARIANT data type
- Semi-structured data (JSON, AVRO, Parquet)
- Time travel queries
- Snowflake-specific functions
Phase 2: BigQuery (Q3 2025)
- Google Cloud platform
- Standard SQL mode
- Arrays and STRUCT types
- Nested queries
- BigQuery-specific functions
Phase 3: Redshift (Q3 2025)
- AWS data warehouse
- Distribution styles (KEY, EVEN, ALL)
- Sort keys
- Compression encoding
- Redshift-specific syntax
Phase 4: Hive/Spark SQL (Q4 2025)
- Big data ecosystem
- SerDe properties
- Partition specifications
- Hive UDFs
Phase 5: Plugin System
- Community dialect support
- Custom dialect registration
- Dialect detection
Acceptance Criteria (per dialect)
- ≥ 80% dialect coverage
- Comprehensive test suite (100+ queries)
- Documentation with examples
- Keyword recognition
- Syntax parsing
Technical Details
Priority: Medium
Effort: Large (240h / 2-3 months per major dialect)
Phase: Phase 7 - Advanced Features
Dependencies: None
Market Impact
- Snowflake: 10,000+ customers
- BigQuery: Massive GCP user base
- Redshift: Dominant AWS data warehouse
- Combined: 3x market expansion