FEAT-009: Additional SQL Dialects (Snowflake, BigQuery, Redshift) #80

@ajitpratap0

Summary

Add support for major cloud data warehouse dialects to expand market reach.

Current State

5 dialects supported: PostgreSQL, MySQL, SQL Server, Oracle, SQLite

Target Dialects (Phased)

Phase 1: Snowflake (Q2 2025)

  • Cloud data warehouse leader
  • VARIANT data type
  • Semi-structured data (JSON, Avro, Parquet)
  • Time travel queries
  • Snowflake-specific functions
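
To make the Snowflake scope concrete, here is a rough sketch of how one sample statement per feature could be kept as test fixtures. The `fixture` type and the table and column names (`raw_events`, `src`) are hypothetical and not taken from the project. The SQL strings follow Snowflake's documented syntax for VARIANT columns, path access, time travel and FLATTEN.

```go
package main

import "fmt"

// fixture pairs a feature from the Phase 1 list with one sample statement.
// The type and all table/column names are illustrative assumptions,
// not part of any existing project API.
type fixture struct {
	Feature string
	SQL     string
}

// snowflakeFixtures returns one sample statement per listed feature.
func snowflakeFixtures() []fixture {
	return []fixture{
		{Feature: "VARIANT data type", SQL: `CREATE TABLE raw_events (src VARIANT)`},
		{Feature: "Semi-structured path access", SQL: `SELECT src:customer.name::STRING FROM raw_events`},
		{Feature: "Time travel", SQL: `SELECT * FROM raw_events AT(OFFSET => -60*5)`},
		{Feature: "Snowflake-specific function", SQL: `SELECT f.value FROM raw_events, LATERAL FLATTEN(input => src:items) f`},
	}
}

func main() {
	// Print each feature next to the statement a parser would need to accept.
	for _, fx := range snowflakeFixtures() {
		fmt.Printf("%-30s %s\n", fx.Feature, fx.SQL)
	}
}
```

A table like this could later feed the per-dialect test suite, with each row asserting that the statement parses without error.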

Phase 2: BigQuery (Q3 2025)

  • Google Cloud platform
  • Standard SQL mode
  • Arrays and STRUCT types
  • Nested queries
  • BigQuery-specific functions

Phase 3: Redshift (Q3 2025)

  • AWS data warehouse
  • Distribution styles (KEY, EVEN, ALL)
  • Sort keys
  • Compression encoding
  • Redshift-specific syntax
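
As an illustration of the Redshift-specific table attributes, the sketch below shows a sample DDL statement and a small helper that maps the keyword after DISTSTYLE to a value. `DistStyle` and `parseDistStyle` are hypothetical names, and the helper covers only the three styles listed above (Redshift also accepts DISTSTYLE AUTO, which is left out here).

```go
package main

import (
	"fmt"
	"strings"
)

// DistStyle is a hypothetical enum for the distribution styles in the list.
type DistStyle int

const (
	DistUnknown DistStyle = iota // zero value: not a recognized style
	DistKey
	DistEven
	DistAll
)

// parseDistStyle maps the keyword that follows DISTSTYLE to a DistStyle.
// Matching is case-insensitive; anything else is reported as an error.
func parseDistStyle(word string) (DistStyle, error) {
	switch strings.ToUpper(strings.TrimSpace(word)) {
	case "KEY":
		return DistKey, nil
	case "EVEN":
		return DistEven, nil
	case "ALL":
		return DistAll, nil
	default:
		return DistUnknown, fmt.Errorf("unknown distribution style %q", word)
	}
}

func main() {
	// Sample DDL combining column encodings, a distribution style and a sort key.
	// Table and column names are made up for the example.
	ddl := `CREATE TABLE sales (
  id     BIGINT      ENCODE az64,
  region VARCHAR(16) ENCODE lzo
) DISTSTYLE KEY DISTKEY (id) SORTKEY (region)`
	fmt.Println(ddl)

	for _, w := range []string{"key", "EVEN", "All", "ROUND_ROBIN"} {
		_, err := parseDistStyle(w)
		fmt.Printf("%-12s valid=%t\n", w, err == nil)
	}
}
```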

Phase 4: Hive/Spark SQL (Q4 2025)

  • Big data ecosystem
  • SerDe properties
  • Partition specifications
  • Hive UDFs

Phase 5: Plugin System

  • Community dialect support
  • Custom dialect registration
  • Dialect detection
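
One possible shape for the plugin system is sketched below: a registry that accepts custom dialects and a naive detector that picks the dialect whose markers match most often. Everything here (`Dialect`, `Registry`, `markerDialect`, the marker strings) is an assumption made for illustration. Substring matching is a deliberate simplification; a real detector would likely work on tokens.

```go
package main

import (
	"fmt"
	"sort"
	"strings"
)

// Dialect is a hypothetical plugin interface; a real API may look different.
type Dialect interface {
	Name() string
	// Score reports how many dialect-specific markers occur in sql.
	Score(sql string) int
}

// markerDialect scores a statement by counting upper-case substring markers.
type markerDialect struct {
	name    string
	markers []string
}

func (d markerDialect) Name() string { return d.name }

func (d markerDialect) Score(sql string) int {
	upper := strings.ToUpper(sql)
	n := 0
	for _, m := range d.markers {
		if strings.Contains(upper, m) {
			n++
		}
	}
	return n
}

// Registry stores dialects keyed by lower-cased name.
type Registry struct {
	dialects map[string]Dialect
}

func NewRegistry() *Registry {
	return &Registry{dialects: map[string]Dialect{}}
}

// Register adds a dialect and rejects duplicate names.
func (r *Registry) Register(d Dialect) error {
	key := strings.ToLower(d.Name())
	if _, exists := r.dialects[key]; exists {
		return fmt.Errorf("dialect %q already registered", d.Name())
	}
	r.dialects[key] = d
	return nil
}

// Detect returns the name of the highest-scoring dialect, or "" when no
// marker matches. Names are visited in sorted order so ties are deterministic.
func (r *Registry) Detect(sql string) string {
	names := make([]string, 0, len(r.dialects))
	for k := range r.dialects {
		names = append(names, k)
	}
	sort.Strings(names)

	best, bestScore := "", 0
	for _, k := range names {
		if s := r.dialects[k].Score(sql); s > bestScore {
			best, bestScore = k, s
		}
	}
	return best
}

func main() {
	r := NewRegistry()
	_ = r.Register(markerDialect{name: "snowflake", markers: []string{"FLATTEN(", "::VARIANT", "AT(OFFSET"}})
	_ = r.Register(markerDialect{name: "bigquery", markers: []string{"UNNEST(", "STRUCT(", "SAFE_CAST("}})
	_ = r.Register(markerDialect{name: "redshift", markers: []string{"DISTSTYLE", "DISTKEY", "SORTKEY"}})

	fmt.Println(r.Detect("SELECT x FROM UNNEST([1, 2, 3]) AS x"))   // prints "bigquery"
	fmt.Println(r.Detect("CREATE TABLE t (id INT) DISTSTYLE EVEN")) // prints "redshift"
}
```

Keeping detection behind the same interface as registration would let community dialects take part in detection without changes to the core.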

Acceptance Criteria (per dialect)

  • ≥ 80% dialect coverage
  • Comprehensive test suite (100+ queries)
  • Documentation with examples
  • Keyword recognition
  • Syntax parsing
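
The criteria above do not say how coverage is measured. As a minimal sketch, assuming coverage means the share of test-suite queries that parse successfully, the two numeric thresholds could be checked like this. The `result` type and both function names are hypothetical.

```go
package main

import "fmt"

// result records whether one test-suite query parsed successfully.
// This is an illustrative type, not an existing project structure.
type result struct {
	Query  string
	Parsed bool
}

// parsedCount returns how many results parsed successfully.
func parsedCount(results []result) int {
	n := 0
	for _, r := range results {
		if r.Parsed {
			n++
		}
	}
	return n
}

// meetsCriteria applies the two numeric thresholds from the list:
// at least 100 queries in the suite and at least 80% of them parsing.
// Integer arithmetic avoids floating-point rounding at the boundary.
func meetsCriteria(results []result) bool {
	total := len(results)
	return total >= 100 && parsedCount(results)*100 >= total*80
}

func main() {
	// Build a synthetic suite: 120 queries, the first 100 marked as parsed.
	suite := make([]result, 120)
	for i := range suite {
		suite[i] = result{Query: fmt.Sprintf("query %d", i+1), Parsed: i < 100}
	}
	fmt.Printf("parsed %d of %d, criteria met: %t\n",
		parsedCount(suite), len(suite), meetsCriteria(suite))
}
```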

Technical Details

Priority: Medium
Effort: Large (240h / 2-3 months per major dialect)
Phase: Phase 7 - Advanced Features
Dependencies: None

Market Impact

  • Snowflake: 10,000+ customers
  • BigQuery: Massive GCP user base
  • Redshift: Dominant AWS data warehouse
  • Combined: 3x market expansion
