diff --git a/CLAUDE.md b/CLAUDE.md
new file mode 100644
index 0000000..0360e24
--- /dev/null
+++ b/CLAUDE.md
@@ -0,0 +1,284 @@
+# CLAUDE.md
+
+This file provides guidance to Claude Code (claude.ai/code) when working with code in this repository.
+
+## Development Commands
+
+### Essential Commands
+```bash
+# Install with development dependencies
+uv pip install -e .[dev]
+
+# Run all tests
+uv run pytest
+
+# Run specific test file
+uv run pytest tests/unit/core/test_config.py
+
+# Run single test
+uv run pytest tests/unit/core/test_config.py::TestPydanticConfig::test_config_update
+
+# Linting and formatting
+uv run ruff check           # Lint check
+uv run ruff check --fix     # Auto-fix linting issues
+uv run black chuck_data tests  # Format code
+uv run pyright             # Type checking
+
+# Run application locally
+python -m chuck_data        # Or: uv run python -m chuck_data
+chuck-data --no-color      # Disable colors for testing
+```
+
+### Test Categories
+Tests are organized with pytest markers:
+- Default: Unit tests only (fast)
+- `pytest -m integration`: Integration tests (requires Databricks access)
+- `pytest -m data_test`: Tests that create Databricks resources
+- `pytest -m e2e`: End-to-end tests (slow, comprehensive)
+
+### Test Structure (Recently Reorganized)
+```
+tests/
+├── unit/
+│   ├── commands/     # Command handler tests
+│   ├── clients/      # API client tests  
+│   ├── ui/          # TUI/display tests
+│   └── core/        # Core functionality tests
+├── integration/     # Integration tests
+└── fixtures/        # Test stubs and fixtures
+```
+
+## Architecture Overview
+
+### Command Processing Flow
+1. **TUI** (`ui/tui.py`) receives user input
+2. **Command Registry** (`command_registry.py`) maps commands to handlers
+3. **Service Layer** (`service.py`) orchestrates business logic
+4. **Command Handlers** (`commands/`) execute specific operations
+5. **API Clients** (`clients/`) interact with external services
+
+### Key Components
+
+**ChuckService** - Main service facade that:
+- Initializes Databricks API client from config
+- Routes commands through the command registry
+- Handles error reporting and metrics collection
+- Acts as bridge between TUI and business logic
+
+**Command Registry** - Unified registry where each command is defined with:
+- Handler function, parameters, and validation rules
+- Visibility flags (user vs agent accessible)
+- Display preferences (condensed vs full output)
+- Interactive input support flags
+
+**Configuration System** - Pydantic-based config that:
+- Supports both file storage (~/.chuck_config.json) and environment variables
+- Environment variables use CHUCK_ prefix (e.g., CHUCK_WORKSPACE_URL)
+- Handles workspace URLs, tokens, active catalog/schema/model settings
+- Includes usage tracking consent management
+
+**Agent System** - AI-powered assistant that:
+- Uses LLM clients (OpenAI-compatible) with configurable models
+- Has specialized modes: general queries, PII detection, bulk PII scanning, Stitch setup
+- Executes commands through the same registry as TUI
+- Maintains conversation history and context
+
+**Interactive Context** - Session state management for:
+- Multi-step command workflows (like setup wizards)
+- Command-specific context data
+- Cross-command state sharing
+
+### External Integrations
+
+**Databricks Integration** - Primary platform integration:
+- Unity Catalog operations (catalogs, schemas, tables, volumes)
+- SQL Warehouse management and query execution
+- Model serving endpoints for LLM access
+- Job management and cluster operations
+- Authentication via personal access tokens
+
+**Amperity Integration** - Data platform operations:
+- Authentication flow with browser-based OAuth
+- Bug reporting and metrics submission
+- Stitch integration for data pipeline setup
+
+### Test Mocking Guidelines
+Core Principle
+
+Mock external boundaries only. Use real objects for all internal business logic to catch integration bugs.
+
+✅ ALWAYS Mock These (External Boundaries)
+
+HTTP/Network Calls
+
+# Databricks SDK and API calls
+@patch('databricks.sdk.WorkspaceClient')
+@patch('requests.get')
+@patch('requests.post')
+
+# OpenAI/LLM API calls
+@patch('openai.OpenAI')
+# OR use LLMClientStub fixture
+
+File System Operations
+
+# Only when testing file I/O behavior
+@patch('builtins.open')
+@patch('os.path.exists')
+@patch('os.makedirs')
+@patch('tempfile.TemporaryDirectory')
+
+# Log file operations
+@patch('chuck_data.logger.setup_file_logging')
+
+System/Environment
+
+# Environment variables (when testing env behavior)
+@patch.dict('os.environ', {'CHUCK_TOKEN': 'test'})
+
+# System calls
+@patch('subprocess.run')
+@patch('datetime.datetime.now')  # for deterministic timestamps
+
+User Input/Terminal
+
+# Interactive prompts
+@patch('prompt_toolkit.prompt')
+@patch('readchar.readkey')
+@patch('sys.stdout.write')  # when testing specific output
+
+❌ NEVER Mock These (Internal Logic)
+
+Configuration Objects
+
+# ❌ DON'T DO THIS:
+@patch('chuck_data.config.ConfigManager')
+
+# ✅ DO THIS:
+config_manager = ConfigManager('/tmp/test_config.json')
+
+Business Logic Classes
+
+# ❌ DON'T DO THIS:
+@patch('chuck_data.service.ChuckService')
+
+# ✅ DO THIS:
+service = ChuckService(client=mocked_databricks_client)
+
+Data Objects
+
+# ❌ DON'T DO THIS:
+@patch('chuck_data.commands.base.CommandResult')
+
+# ✅ DO THIS:
+result = CommandResult(success=True, data="test")
+
+Utility Functions
+
+# ❌ DON'T DO THIS:
+@patch('chuck_data.utils.normalize_workspace_url')
+
+# ✅ DO THIS:
+from chuck_data.utils import normalize_workspace_url
+normalized = normalize_workspace_url("https://test.databricks.com")
+
+Command Registry/Routing
+
+# ❌ DON'T DO THIS:
+@patch('chuck_data.command_registry.get_command')
+
+# ✅ DO THIS:
+from chuck_data.command_registry import get_command
+command_def = get_command('/status')  # Test real routing
+
+Amperity Client
+
+# ❌ DON'T DO THIS:
+@patch('chuck_data.clients.amperity.AmperityClient')
+
+# ✅ DO THIS:
+Use the fixture `AmperityClientStub` to stub only the external API calls, while using the real command logic.
+
+Databricks Client
+
+# ❌ DON'T DO THIS:
+@patch('chuck_data.clients.databricks.DatabricksClient')
+
+# ✅ DO THIS:
+Use the fixture `Da:tabricksClientStub` to stub only the external API calls, while using the real command logic.
+
+LLM Client
+
+# ❌ DON'T DO THIS:
+@patch('chuck_data.clients.llm.LLMClient')
+
+# ✅ DO THIS:
+Use the fixture `LLMClientStub` to stub only the external API calls, while using the real command logic.
+
+
+🎯 Approved Test Patterns
+
+Pattern 1: External Client + Real Internal Logic
+
+def test_list_catalogs_command():
+  # Mock external boundary
+  mock_client = DatabricksClientStub()
+  mock_client.add_catalog("test_catalog")
+
+  # Use real service
+  service = ChuckService(client=mock_client)
+
+  # Test real command execution
+  result = service.execute_command("/list_catalogs")
+
+  assert result.success
+  assert "test_catalog" in result.data
+
+Pattern 2: Real Config with Temporary Files
+
+def test_config_update():
+  with tempfile.NamedTemporaryFile() as tmp:
+      # Use real config manager
+      config_manager = ConfigManager(tmp.name)
+
+      # Test real config logic
+      config_manager.update(workspace_url="https://test.databricks.com")
+
+      # Verify real file operations
+      reloaded = ConfigManager(tmp.name)
+      assert reloaded.get_config().workspace_url == "https://test.databricks.com"
+
+Pattern 3: Stub Only External APIs
+
+def test_auth_flow():
+  # Stub external API
+  amperity_stub = AmperityClientStub()
+  amperity_stub.set_auth_completion_failure(True)
+
+  # Use real command logic
+  result = handle_amperity_login(amperity_stub)
+
+  # Test real error handling
+  assert not result.success
+  assert "Authentication failed" in result.message
+
+🚫 Red Flags (Stop and Reconsider)
+
+- @patch('chuck_data.config.*')
+- @patch('chuck_data.commands.*.handle_*')
+- @patch('chuck_data.service.*')
+- @patch('chuck_data.utils.*')
+- @patch('chuck_data.models.*')
+- Any patch of internal business logic functions
+
+✅ Quick Decision Tree
+
+Before mocking anything, ask:
+
+1. Does this cross a process boundary? (network, file, subprocess) → Mock it
+2. Is this user input or system interaction? → Mock it
+3. Is this internal business logic? → Use real object
+4. Is this a data transformation? → Use real function
+5. When in doubt → Use real object
+
+Exception: Only mock internal logic when testing error conditions that are impossible to trigger naturally.
diff --git a/chuck_data/agent/manager.py b/chuck_data/agent/manager.py
index 6f18399..3650ab7 100644
--- a/chuck_data/agent/manager.py
+++ b/chuck_data/agent/manager.py
@@ -19,9 +19,9 @@
 
 
 class AgentManager:
-    def __init__(self, client, model=None, tool_output_callback=None):
+    def __init__(self, client, model=None, tool_output_callback=None, llm_client=None):
         self.api_client = client
-        self.llm_client = LLMClient()
+        self.llm_client = llm_client or LLMClient()
         self.model = model
         self.tool_output_callback = tool_output_callback
         self.conversation_history = [
diff --git a/chuck_data/commands/agent.py b/chuck_data/commands/agent.py
index 8bb8f38..684853a 100644
--- a/chuck_data/commands/agent.py
+++ b/chuck_data/commands/agent.py
@@ -15,13 +15,14 @@
 
 
 def handle_command(
-    client: Optional[DatabricksAPIClient], **kwargs: Any
+    client: Optional[DatabricksAPIClient], llm_client=None, **kwargs: Any
 ) -> CommandResult:
     """
     Process a natural language query using the LLM agent.
 
     Args:
         client: DatabricksAPIClient instance for API calls (optional)
+        llm_client: LLMClient instance for AI calls (optional, creates default if None)
         **kwargs: Command parameters
             - query: The natural language query from the user
             - mode: Optional agent mode (general, pii, bulk_pii, stitch)
@@ -56,14 +57,17 @@ def handle_command(
     if isinstance(query, str):
         query = query.strip()
 
+    # Get the mode early to check if query is required
+    mode = kwargs.get("mode", "general").lower()
+
     # Now, check if the (potentially stripped) query is truly empty or None.
-    if not query:
+    # Some modes (bulk_pii, stitch) don't require a query
+    if not query and mode not in ["bulk_pii", "stitch"]:
         return CommandResult(
             False, message="Please provide a query. Usage: /ask Your question here"
         )
 
     # Get optional parameters
-    mode = kwargs.get("mode", "general").lower()
     catalog_name = kwargs.get("catalog_name")
     schema_name = kwargs.get("schema_name")
     tool_output_callback = kwargs.get("tool_output_callback")
@@ -75,8 +79,10 @@ def handle_command(
         # Get metrics collector
         metrics_collector = get_metrics_collector()
 
-        # Create agent manager with the API client and tool output callback
-        agent = AgentManager(client, tool_output_callback=tool_output_callback)
+        # Create agent manager with the API client, tool output callback, and optional LLM client
+        agent = AgentManager(
+            client, tool_output_callback=tool_output_callback, llm_client=llm_client
+        )
 
         # Load conversation history
         try:
@@ -90,9 +96,7 @@ def handle_command(
         # Process the query based on the selected mode
         if mode == "pii":
             # PII detection mode for a single table
-            response = agent.process_pii_detection(
-                table_name=query, catalog_name=catalog_name, schema_name=schema_name
-            )
+            response = agent.process_pii_detection(table_name=query)
         elif mode == "bulk_pii":
             # Bulk PII scanning mode for a schema
             response = agent.process_bulk_pii_scan(
diff --git a/chuck_data/commands/wizard/validator.py b/chuck_data/commands/wizard/validator.py
index 8aed8a9..d72ca6b 100644
--- a/chuck_data/commands/wizard/validator.py
+++ b/chuck_data/commands/wizard/validator.py
@@ -27,6 +27,16 @@ class ValidationResult:
 class InputValidator:
     """Handles validation of user inputs for wizard steps."""
 
+    def __init__(self, databricks_client_factory=None):
+        """Initialize validator with optional client factory for dependency injection.
+
+        Args:
+            databricks_client_factory: Optional factory function that takes (workspace_url, token)
+                                     and returns a Databricks client instance. If None, creates
+                                     real DatabricksAPIClient instances.
+        """
+        self.databricks_client_factory = databricks_client_factory
+
     def validate_workspace_url(self, url_input: str) -> ValidationResult:
         """Validate and process workspace URL input."""
         if not url_input or not url_input.strip():
@@ -73,10 +83,15 @@ def validate_token(self, token: str, workspace_url: str) -> ValidationResult:
         token = token.strip()
 
         try:
-            # Validate token with Databricks API using the provided workspace URL
-            from chuck_data.clients.databricks import DatabricksAPIClient
+            # Create client using factory if provided, otherwise use real client
+            if self.databricks_client_factory:
+                client = self.databricks_client_factory(workspace_url, token)
+            else:
+                # Validate token with Databricks API using the provided workspace URL
+                from chuck_data.clients.databricks import DatabricksAPIClient
+
+                client = DatabricksAPIClient(workspace_url, token)
 
-            client = DatabricksAPIClient(workspace_url, token)
             is_valid = client.validate_token()
 
             if not is_valid:
diff --git a/pytest.ini b/pytest.ini
index 27115c9..eea2c18 100644
--- a/pytest.ini
+++ b/pytest.ini
@@ -1,6 +1 @@
 [pytest]
-markers =
-    integration: Integration tests (requires Databricks access)
-    data_test: Data tests that create resources in Databricks
-    e2e: End-to-end tests that will run on Databricks and take a long time
-addopts = -m "not integration and not data_test and not e2e"
diff --git a/tests/commands/test_add_stitch_report.py b/tests/commands/test_add_stitch_report.py
deleted file mode 100644
index 5668a76..0000000
--- a/tests/commands/test_add_stitch_report.py
+++ /dev/null
@@ -1,145 +0,0 @@
-"""
-Tests for add_stitch_report command handler.
-
-This module contains tests for the add_stitch_report command handler.
-"""
-
-import unittest
-from unittest.mock import patch
-
-from chuck_data.commands.add_stitch_report import handle_command
-from tests.fixtures import DatabricksClientStub, MetricsCollectorStub
-
-
-class TestAddStitchReport(unittest.TestCase):
-    """Tests for add_stitch_report command handler."""
-
-    def setUp(self):
-        """Set up common test fixtures."""
-        self.client = DatabricksClientStub()
-        # Client stub has create_stitch_notebook method by default
-
-    def test_missing_client(self):
-        """Test handling when client is not provided."""
-        result = handle_command(None, table_path="catalog.schema.table")
-        self.assertFalse(result.success)
-        self.assertIn("Client is required", result.message)
-
-    def test_missing_table_path(self):
-        """Test handling when table_path is missing."""
-        result = handle_command(self.client)
-        self.assertFalse(result.success)
-        self.assertIn("Table path must be provided", result.message)
-
-    def test_invalid_table_path_format(self):
-        """Test handling when table_path format is invalid."""
-        result = handle_command(self.client, table_path="invalid_format")
-        self.assertFalse(result.success)
-        self.assertIn("must be in the format", result.message)
-
-    @patch("chuck_data.commands.add_stitch_report.get_metrics_collector")
-    def test_successful_report_creation(self, mock_get_metrics_collector):
-        """Test successful stitch report notebook creation."""
-        # Setup mocks
-        metrics_collector_stub = MetricsCollectorStub()
-        mock_get_metrics_collector.return_value = metrics_collector_stub
-
-        self.client.set_create_stitch_notebook_result(
-            {
-                "path": "/Workspace/Users/user@example.com/Stitch Results",
-                "status": "success",
-            }
-        )
-
-        # Call function
-        result = handle_command(self.client, table_path="catalog.schema.table")
-
-        # Verify results
-        self.assertTrue(result.success)
-        self.assertIn("Successfully created", result.message)
-        # Verify the call was made with correct arguments
-        self.assertEqual(len(self.client.create_stitch_notebook_calls), 1)
-        args, kwargs = self.client.create_stitch_notebook_calls[0]
-        self.assertEqual(args, ("catalog.schema.table", None))
-
-        # Verify metrics collection
-        self.assertEqual(len(metrics_collector_stub.track_event_calls), 1)
-        call = metrics_collector_stub.track_event_calls[0]
-        self.assertEqual(call["prompt"], "add-stitch-report command")
-        self.assertEqual(call["additional_data"]["status"], "success")
-
-    @patch("chuck_data.commands.add_stitch_report.get_metrics_collector")
-    def test_report_creation_with_custom_name(self, mock_get_metrics_collector):
-        """Test stitch report creation with custom notebook name."""
-        # Setup mocks
-        metrics_collector_stub = MetricsCollectorStub()
-        mock_get_metrics_collector.return_value = metrics_collector_stub
-
-        self.client.set_create_stitch_notebook_result(
-            {
-                "path": "/Workspace/Users/user@example.com/My Custom Report",
-                "status": "success",
-            }
-        )
-
-        # Call function
-        result = handle_command(
-            self.client, table_path="catalog.schema.table", name="My Custom Report"
-        )
-
-        # Verify results
-        self.assertTrue(result.success)
-        self.assertIn("Successfully created", result.message)
-        # Verify the call was made with correct arguments
-        self.assertEqual(len(self.client.create_stitch_notebook_calls), 1)
-        args, kwargs = self.client.create_stitch_notebook_calls[0]
-        self.assertEqual(args, ("catalog.schema.table", "My Custom Report"))
-
-    @patch("chuck_data.commands.add_stitch_report.get_metrics_collector")
-    def test_report_creation_with_rest_args(self, mock_get_metrics_collector):
-        """Test stitch report creation with rest arguments as notebook name."""
-        # Setup mocks
-        metrics_collector_stub = MetricsCollectorStub()
-        mock_get_metrics_collector.return_value = metrics_collector_stub
-
-        self.client.set_create_stitch_notebook_result(
-            {
-                "path": "/Workspace/Users/user@example.com/Multi Word Name",
-                "status": "success",
-            }
-        )
-
-        # Call function with rest parameter
-        result = handle_command(
-            self.client, table_path="catalog.schema.table", rest="Multi Word Name"
-        )
-
-        # Verify results
-        self.assertTrue(result.success)
-        self.assertIn("Successfully created", result.message)
-        # Verify the call was made with correct arguments
-        self.assertEqual(len(self.client.create_stitch_notebook_calls), 1)
-        args, kwargs = self.client.create_stitch_notebook_calls[0]
-        self.assertEqual(args, ("catalog.schema.table", "Multi Word Name"))
-
-    @patch("chuck_data.commands.add_stitch_report.get_metrics_collector")
-    def test_report_creation_api_error(self, mock_get_metrics_collector):
-        """Test handling when API call to create notebook fails."""
-        # Setup mocks
-        metrics_collector_stub = MetricsCollectorStub()
-        mock_get_metrics_collector.return_value = metrics_collector_stub
-
-        self.client.set_create_stitch_notebook_error(ValueError("API Error"))
-
-        # Call function
-        result = handle_command(self.client, table_path="catalog.schema.table")
-
-        # Verify results
-        self.assertFalse(result.success)
-        self.assertIn("Error creating Stitch report", result.message)
-
-        # Verify metrics collection for error
-        self.assertEqual(len(metrics_collector_stub.track_event_calls), 1)
-        call = metrics_collector_stub.track_event_calls[0]
-        self.assertEqual(call["prompt"], "add-stitch-report command")
-        self.assertEqual(call["error"], "API Error")
diff --git a/tests/commands/test_agent.py b/tests/commands/test_agent.py
deleted file mode 100644
index ecddc2a..0000000
--- a/tests/commands/test_agent.py
+++ /dev/null
@@ -1,249 +0,0 @@
-"""
-Tests for agent command handler.
-
-This module contains tests for the agent command handler.
-"""
-
-import unittest
-from unittest.mock import patch, MagicMock
-
-
-# Create mocks at module level to avoid importing problematic classes
-class MockAgentManagerClass:
-    def __init__(self, *args, **kwargs):
-        self.api_client = args[0] if args else None
-        self.tool_output_callback = kwargs.get("tool_output_callback")
-        self.conversation_history = [
-            {"role": "user", "content": "Test question"},
-            {"role": "assistant", "content": "Test response"},
-        ]
-
-    def process_query(self, query):
-        return f"Processed query: {query}"
-
-    def process_pii_detection(self, table_name, catalog_name=None, schema_name=None):
-        return f"PII detection for {table_name}"
-
-    def process_bulk_pii_scan(self, catalog_name=None, schema_name=None):
-        return f"Bulk PII scan for {catalog_name}.{schema_name}"
-
-    def process_setup_stitch(self, catalog_name=None, schema_name=None):
-        return f"Stitch setup for {catalog_name}.{schema_name}"
-
-
-# Directly apply the mock to avoid importing the actual class
-with patch("chuck_data.agent.manager.AgentManager", MockAgentManagerClass):
-    from chuck_data.commands.agent import handle_command
-
-
-class TestAgentCommand(unittest.TestCase):
-    """Tests for agent command handler."""
-
-    def test_missing_query(self):
-        """Test handling when query parameter is not provided."""
-        result = handle_command(None)
-        self.assertFalse(result.success)
-        self.assertIn("Please provide a query", result.message)
-
-    @patch("chuck_data.agent.AgentManager", MockAgentManagerClass)
-    @patch("chuck_data.config.get_agent_history", return_value=[])
-    @patch("chuck_data.config.set_agent_history")
-    @patch("chuck_data.commands.agent.get_metrics_collector")
-    def test_general_query_mode(
-        self, mock_get_metrics_collector, mock_set_history, mock_get_history
-    ):
-        """Test processing a general query."""
-        mock_client = MagicMock()
-        mock_metrics_collector = MagicMock()
-        mock_get_metrics_collector.return_value = mock_metrics_collector
-
-        # Call function
-        result = handle_command(mock_client, query="What tables are available?")
-
-        # Verify results
-        self.assertTrue(result.success)
-        self.assertEqual(
-            result.data["response"], "Processed query: What tables are available?"
-        )
-        mock_set_history.assert_called_once()
-
-        # Verify metrics collection
-        mock_metrics_collector.track_event.assert_called_once()
-        # Check that the right parameters were passed
-        call_args = mock_metrics_collector.track_event.call_args[1]
-        self.assertEqual(call_args["prompt"], "What tables are available?")
-        self.assertEqual(
-            call_args["tools"],
-            [
-                {
-                    "name": "general_query",
-                    "arguments": {"query": "What tables are available?"},
-                }
-            ],
-        )
-        self.assertIn(
-            {"role": "assistant", "content": "Test response"},
-            call_args["conversation_history"],
-        )
-        self.assertEqual(
-            call_args["additional_data"],
-            {"event_context": "agent_interaction", "agent_mode": "general"},
-        )
-
-    @patch("chuck_data.agent.AgentManager", MockAgentManagerClass)
-    @patch("chuck_data.config.get_agent_history", return_value=[])
-    @patch("chuck_data.config.set_agent_history")
-    @patch("chuck_data.commands.agent.get_metrics_collector")
-    def test_pii_detection_mode(
-        self, mock_get_metrics_collector, mock_set_history, mock_get_history
-    ):
-        """Test processing a PII detection query."""
-        mock_client = MagicMock()
-        mock_metrics_collector = MagicMock()
-        mock_get_metrics_collector.return_value = mock_metrics_collector
-
-        # Call function
-        result = handle_command(
-            mock_client,
-            query="customers",
-            mode="pii",
-            catalog_name="test_catalog",
-            schema_name="test_schema",
-        )
-
-        # Verify results
-        self.assertTrue(result.success)
-        self.assertEqual(result.data["response"], "PII detection for customers")
-        mock_set_history.assert_called_once()
-
-        # Verify metrics collection
-        mock_metrics_collector.track_event.assert_called_once()
-        # Check that the right parameters were passed
-        call_args = mock_metrics_collector.track_event.call_args[1]
-        self.assertEqual(call_args["prompt"], "customers")
-        self.assertEqual(
-            call_args["tools"],
-            [{"name": "pii_detection", "arguments": {"table": "customers"}}],
-        )
-        self.assertIn(
-            {"role": "assistant", "content": "Test response"},
-            call_args["conversation_history"],
-        )
-        self.assertEqual(
-            call_args["additional_data"],
-            {"event_context": "agent_interaction", "agent_mode": "pii"},
-        )
-
-    @patch("chuck_data.agent.AgentManager", MockAgentManagerClass)
-    @patch("chuck_data.config.get_agent_history", return_value=[])
-    @patch("chuck_data.config.set_agent_history")
-    def test_bulk_pii_scan_mode(self, mock_set_history, mock_get_history):
-        """Test processing a bulk PII scan."""
-        mock_client = MagicMock()
-
-        # Call function
-        result = handle_command(
-            mock_client,
-            query="Scan all tables",
-            mode="bulk_pii",
-            catalog_name="test_catalog",
-            schema_name="test_schema",
-        )
-
-        # Verify results
-        self.assertTrue(result.success)
-        self.assertEqual(
-            result.data["response"], "Bulk PII scan for test_catalog.test_schema"
-        )
-        mock_set_history.assert_called_once()
-
-    @patch("chuck_data.agent.AgentManager", MockAgentManagerClass)
-    @patch("chuck_data.config.get_agent_history", return_value=[])
-    @patch("chuck_data.config.set_agent_history")
-    def test_stitch_setup_mode(self, mock_set_history, mock_get_history):
-        """Test processing a stitch setup request."""
-        mock_client = MagicMock()
-
-        # Call function
-        result = handle_command(
-            mock_client,
-            query="Set up stitch",
-            mode="stitch",
-            catalog_name="test_catalog",
-            schema_name="test_schema",
-        )
-
-        # Verify results
-        self.assertTrue(result.success)
-        self.assertEqual(
-            result.data["response"], "Stitch setup for test_catalog.test_schema"
-        )
-        mock_set_history.assert_called_once()
-
-    @patch("chuck_data.agent.AgentManager", side_effect=Exception("Agent error"))
-    def test_agent_exception(self, mock_agent_manager):
-        """Test agent with unexpected exception."""
-        # Call function
-        result = handle_command(None, query="This will fail")
-
-        # Verify results
-        self.assertFalse(result.success)
-        self.assertIn("Failed to process query", result.message)
-        self.assertEqual(str(result.error), "Agent error")
-
-    @patch("chuck_data.agent.AgentManager", MockAgentManagerClass)
-    @patch("chuck_data.config.get_agent_history", return_value=[])
-    @patch("chuck_data.config.set_agent_history")
-    def test_query_from_rest_parameter(self, mock_set_history, mock_get_history):
-        """Test processing a query from the rest parameter."""
-        mock_client = MagicMock()
-
-        # Call function with rest parameter instead of query
-        result = handle_command(mock_client, rest="What tables are available?")
-
-        # Verify results
-        self.assertTrue(result.success)
-        self.assertEqual(
-            result.data["response"], "Processed query: What tables are available?"
-        )
-        mock_set_history.assert_called_once()
-
-    @patch("chuck_data.agent.AgentManager", MockAgentManagerClass)
-    @patch("chuck_data.config.get_agent_history", return_value=[])
-    @patch("chuck_data.config.set_agent_history")
-    def test_query_from_raw_args_parameter(self, mock_set_history, mock_get_history):
-        """Test processing a query from the raw_args parameter."""
-        mock_client = MagicMock()
-
-        # Call function with raw_args parameter
-        raw_args = ["What", "tables", "are", "available?"]
-        result = handle_command(mock_client, raw_args=raw_args)
-
-        # Verify results
-        self.assertTrue(result.success)
-        self.assertEqual(
-            result.data["response"], "Processed query: What tables are available?"
-        )
-        mock_set_history.assert_called_once()
-
-    @patch("chuck_data.agent.AgentManager", MockAgentManagerClass)
-    @patch("chuck_data.config.get_agent_history", return_value=[])
-    @patch("chuck_data.config.set_agent_history")
-    def test_callback_parameter_passed(self, mock_set_history, mock_get_history):
-        """Test that tool_output_callback is properly passed to AgentManager."""
-        mock_client = MagicMock()
-        mock_callback = MagicMock()
-
-        # Call function with callback
-        result = handle_command(
-            mock_client,
-            query="What tables are available?",
-            tool_output_callback=mock_callback,
-        )
-
-        # Verify results
-        self.assertTrue(result.success)
-        self.assertEqual(
-            result.data["response"], "Processed query: What tables are available?"
-        )
-        mock_set_history.assert_called_once()
diff --git a/tests/commands/test_auth.py b/tests/commands/test_auth.py
deleted file mode 100644
index 3b25b62..0000000
--- a/tests/commands/test_auth.py
+++ /dev/null
@@ -1,146 +0,0 @@
-"""Unit tests for the auth commands module."""
-
-import unittest
-from unittest.mock import patch
-
-from chuck_data.commands.auth import (
-    handle_amperity_login,
-    handle_databricks_login,
-    handle_logout,
-)
-from tests.fixtures import AmperityClientStub
-
-
-class TestAuthCommands(unittest.TestCase):
-    """Test cases for authentication commands."""
-
-    @patch("chuck_data.commands.auth.AmperityAPIClient")
-    def test_amperity_login_success(self, mock_auth_client_class):
-        """Test successful Amperity login flow."""
-        # Use AmperityClientStub instead of MagicMock
-        client_stub = AmperityClientStub()
-        mock_auth_client_class.return_value = client_stub
-
-        # Execute
-        result = handle_amperity_login(None)
-
-        # Verify
-        self.assertTrue(result.success)
-        self.assertEqual(result.message, "Authentication completed successfully.")
-
-    @patch("chuck_data.commands.auth.AmperityAPIClient")
-    def test_amperity_login_start_failure(self, mock_auth_client_class):
-        """Test failure during start of Amperity login flow."""
-        # Use AmperityClientStub configured to fail at start
-        client_stub = AmperityClientStub()
-        client_stub.set_auth_start_failure(True)
-        mock_auth_client_class.return_value = client_stub
-
-        # Execute
-        result = handle_amperity_login(None)
-
-        # Verify
-        self.assertFalse(result.success)
-        self.assertEqual(
-            result.message, "Login failed: Failed to start auth: 500 - Server Error"
-        )
-
-    @patch("chuck_data.commands.auth.AmperityAPIClient")
-    def test_amperity_login_completion_failure(self, mock_auth_client_class):
-        """Test failure during completion of Amperity login flow."""
-        # Use AmperityClientStub configured to fail at completion
-        client_stub = AmperityClientStub()
-        client_stub.set_auth_completion_failure(True)
-        mock_auth_client_class.return_value = client_stub
-
-        # Execute
-        result = handle_amperity_login(None)
-
-        # Verify
-        self.assertFalse(result.success)
-        self.assertEqual(result.message, "Login failed: Authentication failed: error")
-
-    @patch("chuck_data.commands.auth.set_databricks_token")
-    def test_databricks_login_success(self, mock_set_token):
-        """Test setting the Databricks token."""
-        # Setup
-        mock_set_token.return_value = True
-        test_token = "test-token-123"
-
-        # Execute
-        result = handle_databricks_login(None, token=test_token)
-
-        # Verify
-        self.assertTrue(result.success)
-        self.assertEqual(result.message, "Databricks token set successfully")
-        mock_set_token.assert_called_with(test_token)
-
-    def test_databricks_login_missing_token(self):
-        """Test error when token is missing."""
-        # Execute
-        result = handle_databricks_login(None)
-
-        # Verify
-        self.assertFalse(result.success)
-        self.assertEqual(result.message, "Token parameter is required")
-
-    @patch("chuck_data.commands.auth.set_databricks_token")
-    def test_logout_databricks(self, mock_set_db_token):
-        """Test logout from Databricks."""
-        # Setup
-        mock_set_db_token.return_value = True
-
-        # Execute
-        result = handle_logout(None, service="databricks")
-
-        # Verify
-        self.assertTrue(result.success)
-        self.assertEqual(result.message, "Successfully logged out from databricks")
-        mock_set_db_token.assert_called_with("")
-
-    @patch("chuck_data.config.set_amperity_token")
-    def test_logout_amperity(self, mock_set_amp_token):
-        """Test logout from Amperity."""
-        # Setup
-        mock_set_amp_token.return_value = True
-
-        # Execute
-        result = handle_logout(None, service="amperity")
-
-        # Verify
-        self.assertTrue(result.success)
-        self.assertEqual(result.message, "Successfully logged out from amperity")
-        mock_set_amp_token.assert_called_with("")
-
-    @patch("chuck_data.config.set_amperity_token")
-    @patch("chuck_data.commands.auth.set_databricks_token")
-    def test_logout_default(self, mock_set_db_token, mock_set_amp_token):
-        """Test default logout behavior (only Amperity)."""
-        # Setup
-        mock_set_amp_token.return_value = True
-
-        # Execute
-        result = handle_logout(None)  # No service specified
-
-        # Verify
-        self.assertTrue(result.success)
-        self.assertEqual(result.message, "Successfully logged out from amperity")
-        mock_set_amp_token.assert_called_with("")
-        mock_set_db_token.assert_not_called()
-
-    @patch("chuck_data.commands.auth.set_databricks_token")
-    @patch("chuck_data.config.set_amperity_token")
-    def test_logout_all(self, mock_set_amp_token, mock_set_db_token):
-        """Test logout from all services."""
-        # Setup
-        mock_set_db_token.return_value = True
-        mock_set_amp_token.return_value = True
-
-        # Execute
-        result = handle_logout(None, service="all")
-
-        # Verify
-        self.assertTrue(result.success)
-        self.assertEqual(result.message, "Successfully logged out from all")
-        mock_set_db_token.assert_called_with("")
-        mock_set_amp_token.assert_called_with("")
diff --git a/tests/commands/test_base.py b/tests/commands/test_base.py
deleted file mode 100644
index 426f870..0000000
--- a/tests/commands/test_base.py
+++ /dev/null
@@ -1,35 +0,0 @@
-"""
-Tests for the base module in the commands package.
-"""
-
-import unittest
-from chuck_data.commands.base import CommandResult
-
-
-class TestCommandResult(unittest.TestCase):
-    """Test cases for the CommandResult class."""
-
-    def test_command_result_success(self):
-        """Test creating a successful CommandResult."""
-        result = CommandResult(True, data="test data", message="test message")
-        self.assertTrue(result.success)
-        self.assertEqual(result.data, "test data")
-        self.assertEqual(result.message, "test message")
-        self.assertIsNone(result.error)
-
-    def test_command_result_failure(self):
-        """Test creating a failure CommandResult."""
-        error = ValueError("test error")
-        result = CommandResult(False, error=error, message="test error message")
-        self.assertFalse(result.success)
-        self.assertIsNone(result.data)
-        self.assertEqual(result.message, "test error message")
-        self.assertEqual(result.error, error)
-
-    def test_command_result_defaults(self):
-        """Test CommandResult with default values."""
-        result = CommandResult(True)
-        self.assertTrue(result.success)
-        self.assertIsNone(result.data)
-        self.assertIsNone(result.message)
-        self.assertIsNone(result.error)
diff --git a/tests/commands/test_catalog_selection.py b/tests/commands/test_catalog_selection.py
deleted file mode 100644
index f996e09..0000000
--- a/tests/commands/test_catalog_selection.py
+++ /dev/null
@@ -1,124 +0,0 @@
-"""
-Tests for catalog_selection command handler.
-
-This module contains tests for the catalog selection command handler.
-"""
-
-import unittest
-import os
-import tempfile
-from unittest.mock import patch
-
-from chuck_data.commands.catalog_selection import handle_command
-from chuck_data.config import ConfigManager, get_active_catalog
-from tests.fixtures import DatabricksClientStub
-
-
-class TestCatalogSelection(unittest.TestCase):
-    """Tests for catalog selection command handler."""
-
-    def setUp(self):
-        """Set up test fixtures."""
-        self.client_stub = DatabricksClientStub()
-
-        # Set up config management
-        self.temp_dir = tempfile.TemporaryDirectory()
-        self.config_path = os.path.join(self.temp_dir.name, "test_config.json")
-        self.config_manager = ConfigManager(self.config_path)
-        self.patcher = patch("chuck_data.config._config_manager", self.config_manager)
-        self.patcher.start()
-
-    def tearDown(self):
-        self.patcher.stop()
-        self.temp_dir.cleanup()
-
-    def test_missing_catalog_name(self):
-        """Test handling when catalog parameter is not provided."""
-        result = handle_command(self.client_stub)
-        self.assertFalse(result.success)
-        self.assertIn("catalog parameter is required", result.message)
-
-    def test_successful_catalog_selection(self):
-        """Test successful catalog selection."""
-        # Set up catalog in stub
-        self.client_stub.add_catalog("test_catalog", catalog_type="MANAGED")
-
-        # Call function
-        result = handle_command(self.client_stub, catalog="test_catalog")
-
-        # Verify results
-        self.assertTrue(result.success)
-        self.assertIn("Active catalog is now set to 'test_catalog'", result.message)
-        self.assertIn("Type: MANAGED", result.message)
-        self.assertEqual(result.data["catalog_name"], "test_catalog")
-        self.assertEqual(result.data["catalog_type"], "MANAGED")
-
-        # Verify config was updated
-        self.assertEqual(get_active_catalog(), "test_catalog")
-
-    def test_catalog_selection_with_verification_failure(self):
-        """Test catalog selection when verification fails."""
-        # Add some catalogs but not the one we're looking for (make sure names are very different)
-        self.client_stub.add_catalog("xyz", catalog_type="MANAGED")
-
-        # Call function with nonexistent catalog that won't fuzzy match
-        result = handle_command(self.client_stub, catalog="completely_different_name")
-
-        # Verify results - should fail since catalog doesn't exist and no fuzzy match
-        self.assertFalse(result.success)
-        self.assertIn(
-            "No catalog found matching 'completely_different_name'", result.message
-        )
-        self.assertIn("Available catalogs: xyz", result.message)
-
-    def test_catalog_selection_exception(self):
-        """Test catalog selection with unexpected exception."""
-        # Create a stub that raises an exception during config setting
-        # We'll simulate this by using an invalid config path
-        self.patcher.stop()  # Stop the existing patcher
-        self.temp_dir.cleanup()  # Clean up temp directory
-
-        # Try to use an invalid config path that will cause an exception
-        invalid_config_manager = ConfigManager("/invalid/path/config.json")
-        with patch("chuck_data.config._config_manager", invalid_config_manager):
-            result = handle_command(self.client_stub, catalog_name="test_catalog")
-
-        # This might succeed despite the invalid path, so let's test a different exception scenario
-        # Instead, let's create a custom stub that fails on get_catalog
-        class FailingStub(DatabricksClientStub):
-            def get_catalog(self, catalog_name):
-                raise Exception("Failed to set catalog")
-
-        failing_stub = FailingStub()
-        # Set up a new temp directory and config for this test
-        temp_dir = tempfile.TemporaryDirectory()
-        config_path = os.path.join(temp_dir.name, "test_config.json")
-        config_manager = ConfigManager(config_path)
-
-        with patch("chuck_data.config._config_manager", config_manager):
-            # This should trigger the exception in the catalog verification
-            result = handle_command(failing_stub, catalog="test_catalog")
-
-            # Should fail since get_catalog fails and no catalogs in list
-            self.assertFalse(result.success)
-            self.assertIn("No catalogs found in workspace", result.message)
-
-        temp_dir.cleanup()
-
-    def test_select_catalog_by_name(self):
-        """Test catalog selection by name."""
-        self.client_stub.add_catalog("Test Catalog", catalog_type="MANAGED")
-
-        result = handle_command(self.client_stub, catalog="Test Catalog")
-
-        self.assertTrue(result.success)
-        self.assertIn("Active catalog is now set to 'Test Catalog'", result.message)
-
-    def test_select_catalog_fuzzy_matching(self):
-        """Test catalog selection with fuzzy matching."""
-        self.client_stub.add_catalog("Test Catalog Long Name", catalog_type="MANAGED")
-
-        result = handle_command(self.client_stub, catalog="Test")
-
-        self.assertTrue(result.success)
-        self.assertIn("Test Catalog Long Name", result.message)
diff --git a/tests/commands/test_help.py b/tests/commands/test_help.py
deleted file mode 100644
index e46cf18..0000000
--- a/tests/commands/test_help.py
+++ /dev/null
@@ -1,46 +0,0 @@
-"""
-Tests for help command handler.
-
-This module contains tests for the help command handler.
-"""
-
-import unittest
-from unittest.mock import patch, MagicMock
-
-from chuck_data.commands.help import handle_command
-
-
-class TestHelp(unittest.TestCase):
-    """Tests for help command handler."""
-
-    @patch("chuck_data.commands.help.get_user_commands")
-    @patch("chuck_data.ui.help_formatter.format_help_text")
-    def test_help_command_success(self, mock_format_help_text, mock_get_user_commands):
-        """Test successful help command execution."""
-        # Setup mocks
-        mock_user_commands = {"command1": MagicMock(), "command2": MagicMock()}
-        mock_get_user_commands.return_value = mock_user_commands
-        mock_format_help_text.return_value = "Formatted help text"
-
-        # Call function
-        result = handle_command(None)
-
-        # Verify results
-        self.assertTrue(result.success)
-        self.assertEqual(result.data["help_text"], "Formatted help text")
-        mock_get_user_commands.assert_called_once()
-        mock_format_help_text.assert_called_once()
-
-    @patch("chuck_data.commands.help.get_user_commands")
-    def test_help_command_exception(self, mock_get_user_commands):
-        """Test help command with exception."""
-        # Setup mock
-        mock_get_user_commands.side_effect = Exception("Test error")
-
-        # Call function
-        result = handle_command(None)
-
-        # Verify results
-        self.assertFalse(result.success)
-        self.assertIn("Error generating help text", result.message)
-        self.assertEqual(str(result.error), "Test error")
diff --git a/tests/commands/test_jobs.py b/tests/commands/test_jobs.py
deleted file mode 100644
index 5e145ef..0000000
--- a/tests/commands/test_jobs.py
+++ /dev/null
@@ -1,140 +0,0 @@
-import unittest
-import os
-import tempfile
-from unittest.mock import patch
-
-from chuck_data.commands.jobs import handle_launch_job, handle_job_status
-from chuck_data.commands.base import CommandResult
-from chuck_data.config import ConfigManager
-from tests.fixtures import DatabricksClientStub
-
-
-class TestJobs(unittest.TestCase):
-    """Tests for job handling commands."""
-
-    def setUp(self):
-        """Set up test fixtures."""
-        self.client_stub = DatabricksClientStub()
-
-        # Set up config management
-        self.temp_dir = tempfile.TemporaryDirectory()
-        self.config_path = os.path.join(self.temp_dir.name, "test_config.json")
-        self.config_manager = ConfigManager(self.config_path)
-        self.patcher = patch("chuck_data.config._config_manager", self.config_manager)
-        self.patcher.start()
-
-    def tearDown(self):
-        self.patcher.stop()
-        self.temp_dir.cleanup()
-
-    def test_handle_launch_job_success(self):
-        """Test launching a job with all required parameters."""
-        # Use kwargs format instead of positional arguments
-        result: CommandResult = handle_launch_job(
-            self.client_stub,
-            config_path="/Volumes/test/config.json",
-            init_script_path="/init/script.sh",
-            run_name="MyTestJob",
-        )
-        assert result.success is True
-        assert "123456" in result.message
-        assert result.data["run_id"] == "123456"
-
-    def test_handle_launch_job_no_run_id(self):
-        """Test launching a job where response doesn't include run_id."""
-
-        # Create a stub that returns response without run_id
-        class NoRunIdStub(DatabricksClientStub):
-            def submit_job_run(self, config_path, init_script_path, run_name=None):
-                return {}  # No run_id in response
-
-        no_run_id_client = NoRunIdStub()
-
-        # Use kwargs format
-        result = handle_launch_job(
-            no_run_id_client,
-            config_path="/Volumes/test/config.json",
-            init_script_path="/init/script.sh",
-            run_name="NoRunId",
-        )
-        self.assertFalse(result.success)
-        # Now we're looking for more generic failed/failure message
-        self.assertTrue("Failed" in result.message or "No run_id" in result.message)
-
-    def test_handle_launch_job_http_error(self):
-        """Test launching a job with HTTP error response."""
-
-        # Create a stub that raises an HTTP error
-        class FailingJobStub(DatabricksClientStub):
-            def submit_job_run(self, config_path, init_script_path, run_name=None):
-                raise Exception("Bad Request")
-
-        failing_client = FailingJobStub()
-
-        # Use kwargs format
-        result = handle_launch_job(
-            failing_client,
-            config_path="/Volumes/test/config.json",
-            init_script_path="/init/script.sh",
-        )
-        self.assertFalse(result.success)
-        self.assertIn("Bad Request", result.message)
-
-    def test_handle_launch_job_missing_token(self):
-        """Test launching a job with missing API token."""
-        # Use kwargs format
-        result = handle_launch_job(
-            None,
-            config_path="/Volumes/test/config.json",
-            init_script_path="/init/script.sh",
-        )
-        self.assertFalse(result.success)
-        self.assertIn("Client required", result.message)
-
-    def test_handle_launch_job_missing_url(self):
-        """Test launching a job with missing workspace URL."""
-        # Use kwargs format
-        result = handle_launch_job(
-            None,
-            config_path="/Volumes/test/config.json",
-            init_script_path="/init/script.sh",
-        )
-        self.assertFalse(result.success)
-        self.assertIn("Client required", result.message)
-
-    def test_handle_job_status_basic_success(self):
-        """Test getting job status with successful response."""
-        # Use kwargs format
-        result = handle_job_status(self.client_stub, run_id="123456")
-        self.assertTrue(result.success)
-        self.assertEqual(result.data["state"]["life_cycle_state"], "RUNNING")
-        self.assertEqual(result.data["run_id"], 123456)
-
-    def test_handle_job_status_http_error(self):
-        """Test getting job status with HTTP error response."""
-
-        # Create a stub that raises an HTTP error
-        class FailingStatusStub(DatabricksClientStub):
-            def get_job_run_status(self, run_id):
-                raise Exception("Not Found")
-
-        failing_client = FailingStatusStub()
-
-        # Use kwargs format
-        result = handle_job_status(failing_client, run_id="999999")
-        self.assertFalse(result.success)
-        self.assertIn("Not Found", result.message)
-
-    def test_handle_job_status_missing_token(self):
-        """Test getting job status with missing API token."""
-        # Use kwargs format
-        result = handle_job_status(None, run_id="123456")
-        self.assertFalse(result.success)
-        self.assertIn("Client required", result.message)
-
-    def test_handle_job_status_missing_url(self):
-        """Test getting job status with missing workspace URL."""
-        # Use kwargs format
-        result = handle_job_status(None, run_id="123456")
-        self.assertFalse(result.success)
-        self.assertIn("Client required", result.message)
diff --git a/tests/commands/test_list_catalogs.py b/tests/commands/test_list_catalogs.py
deleted file mode 100644
index 02aa18c..0000000
--- a/tests/commands/test_list_catalogs.py
+++ /dev/null
@@ -1,158 +0,0 @@
-"""
-Tests for list_catalogs command handler.
-
-This module contains tests for the list_catalogs command handler.
-"""
-
-import unittest
-import os
-import tempfile
-from unittest.mock import patch
-
-from chuck_data.commands.list_catalogs import handle_command
-from chuck_data.config import ConfigManager
-from tests.fixtures import DatabricksClientStub
-
-
-class TestListCatalogs(unittest.TestCase):
-    """Tests for list_catalogs command handler."""
-
-    def setUp(self):
-        """Set up test fixtures."""
-        self.client_stub = DatabricksClientStub()
-
-        # Set up config management
-        self.temp_dir = tempfile.TemporaryDirectory()
-        self.config_path = os.path.join(self.temp_dir.name, "test_config.json")
-        self.config_manager = ConfigManager(self.config_path)
-        self.patcher = patch("chuck_data.config._config_manager", self.config_manager)
-        self.patcher.start()
-
-    def tearDown(self):
-        self.patcher.stop()
-        self.temp_dir.cleanup()
-
-    def test_no_client(self):
-        """Test handling when no client is provided."""
-        result = handle_command(None)
-        self.assertFalse(result.success)
-        self.assertIn("No Databricks client available", result.message)
-
-    def test_successful_list_catalogs(self):
-        """Test successful list catalogs."""
-        # Set up test data using stub
-        self.client_stub.add_catalog(
-            "catalog1",
-            catalog_type="MANAGED",
-            comment="Test catalog 1",
-            provider={"name": "provider1"},
-            created_at="2023-01-01",
-        )
-        self.client_stub.add_catalog(
-            "catalog2",
-            catalog_type="EXTERNAL",
-            comment="Test catalog 2",
-            provider={"name": "provider2"},
-            created_at="2023-01-02",
-        )
-
-        # Call function with parameters
-        result = handle_command(self.client_stub, include_browse=True, max_results=50)
-
-        # Verify results
-        self.assertTrue(result.success)
-        self.assertEqual(len(result.data["catalogs"]), 2)
-        self.assertEqual(result.data["total_count"], 2)
-        self.assertIn("Found 2 catalog(s).", result.message)
-        self.assertFalse(result.data.get("display", True))  # Should default to False
-        self.assertIn("current_catalog", result.data)
-
-        # Verify catalog data
-        catalog_names = [c["name"] for c in result.data["catalogs"]]
-        self.assertIn("catalog1", catalog_names)
-        self.assertIn("catalog2", catalog_names)
-
-    def test_successful_list_catalogs_with_pagination(self):
-        """Test successful list catalogs with pagination."""
-        # Set up test data
-        self.client_stub.add_catalog("catalog1", catalog_type="MANAGED")
-        self.client_stub.add_catalog("catalog2", catalog_type="EXTERNAL")
-
-        # For pagination testing, we need to modify the stub to return pagination token
-        class PaginatingClientStub(DatabricksClientStub):
-            def list_catalogs(
-                self, include_browse=False, max_results=None, page_token=None
-            ):
-                result = super().list_catalogs(include_browse, max_results, page_token)
-                # Add pagination token if page_token was provided
-                if page_token:
-                    result["next_page_token"] = "abc123"
-                return result
-
-        paginating_stub = PaginatingClientStub()
-        paginating_stub.add_catalog("catalog1", catalog_type="MANAGED")
-        paginating_stub.add_catalog("catalog2", catalog_type="EXTERNAL")
-
-        # Call function with page token
-        result = handle_command(paginating_stub, page_token="xyz789")
-
-        # Verify results
-        self.assertTrue(result.success)
-        self.assertEqual(result.data["next_page_token"], "abc123")
-        self.assertIn("More catalogs available with page token: abc123", result.message)
-
-    def test_empty_catalog_list(self):
-        """Test handling when no catalogs are found."""
-        # Don't add any catalogs to stub
-
-        # Call function
-        result = handle_command(self.client_stub)
-
-        # Verify results
-        self.assertTrue(result.success)
-        self.assertIn("No catalogs found in this workspace.", result.message)
-        self.assertEqual(result.data["total_count"], 0)
-        self.assertFalse(result.data.get("display", True))
-        self.assertIn("current_catalog", result.data)
-
-    def test_list_catalogs_exception(self):
-        """Test list_catalogs with unexpected exception."""
-
-        # Create a stub that raises an exception for list_catalogs
-        class FailingClientStub(DatabricksClientStub):
-            def list_catalogs(
-                self, include_browse=False, max_results=None, page_token=None
-            ):
-                raise Exception("API error")
-
-        failing_client = FailingClientStub()
-
-        # Call function
-        result = handle_command(failing_client)
-
-        # Verify results
-        self.assertFalse(result.success)
-        self.assertIn("Failed to list catalogs", result.message)
-        self.assertEqual(str(result.error), "API error")
-
-    def test_list_catalogs_with_display_true(self):
-        """Test list catalogs with display=true shows table."""
-        # Set up test data
-        self.client_stub.add_catalog("Test Catalog", catalog_type="MANAGED")
-
-        result = handle_command(self.client_stub, display=True)
-
-        self.assertTrue(result.success)
-        self.assertTrue(result.data.get("display"))
-        self.assertEqual(len(result.data.get("catalogs", [])), 1)
-
-    def test_list_catalogs_with_display_false(self):
-        """Test list catalogs with display=false returns data without display."""
-        # Set up test data
-        self.client_stub.add_catalog("Test Catalog", catalog_type="MANAGED")
-
-        result = handle_command(self.client_stub, display=False)
-
-        self.assertTrue(result.success)
-        self.assertFalse(result.data.get("display"))
-        self.assertEqual(len(result.data.get("catalogs", [])), 1)
diff --git a/tests/commands/test_list_models.py b/tests/commands/test_list_models.py
deleted file mode 100644
index ee0fee2..0000000
--- a/tests/commands/test_list_models.py
+++ /dev/null
@@ -1,121 +0,0 @@
-"""
-Tests for list_models command handler.
-
-This module contains tests for the list_models command handler.
-"""
-
-import unittest
-import os
-import tempfile
-from unittest.mock import patch
-
-from chuck_data.commands.list_models import handle_command
-from chuck_data.config import ConfigManager, set_active_model
-from tests.fixtures import DatabricksClientStub
-
-
-class TestListModels(unittest.TestCase):
-    """Tests for list_models command handler."""
-
-    def setUp(self):
-        """Set up test fixtures."""
-        self.client_stub = DatabricksClientStub()
-
-        # Set up config management
-        self.temp_dir = tempfile.TemporaryDirectory()
-        self.config_path = os.path.join(self.temp_dir.name, "test_config.json")
-        self.config_manager = ConfigManager(self.config_path)
-        self.patcher = patch("chuck_data.config._config_manager", self.config_manager)
-        self.patcher.start()
-
-    def tearDown(self):
-        self.patcher.stop()
-        self.temp_dir.cleanup()
-
-    def test_basic_list_models(self):
-        """Test listing models without detailed information."""
-        # Set up test data using stub
-        self.client_stub.add_model("model1", created_timestamp=123456789)
-        self.client_stub.add_model("model2", created_timestamp=987654321)
-        set_active_model("model1")
-
-        # Call function
-        result = handle_command(self.client_stub)
-
-        # Verify results
-        self.assertTrue(result.success)
-        self.assertEqual(len(result.data["models"]), 2)
-        self.assertEqual(result.data["active_model"], "model1")
-        self.assertFalse(result.data["detailed"])
-        self.assertIsNone(result.data["filter"])
-        self.assertIsNone(result.message)
-
-    def test_detailed_list_models(self):
-        """Test listing models with detailed information."""
-        # Set up test data using stub
-        self.client_stub.add_model(
-            "model1", created_timestamp=123456789, details="model1 details"
-        )
-        self.client_stub.add_model(
-            "model2", created_timestamp=987654321, details="model2 details"
-        )
-        set_active_model("model1")
-
-        # Call function
-        result = handle_command(self.client_stub, detailed=True)
-
-        # Verify results
-        self.assertTrue(result.success)
-        self.assertEqual(len(result.data["models"]), 2)
-        self.assertTrue(result.data["detailed"])
-        self.assertEqual(result.data["models"][0]["details"]["name"], "model1")
-        self.assertEqual(result.data["models"][1]["details"]["name"], "model2")
-
-    def test_filtered_list_models(self):
-        """Test listing models with filtering."""
-        # Set up test data using stub
-        self.client_stub.add_model("claude-v1", created_timestamp=123456789)
-        self.client_stub.add_model("gpt-4", created_timestamp=987654321)
-        self.client_stub.add_model("claude-instant", created_timestamp=456789123)
-        set_active_model("claude-v1")
-
-        # Call function
-        result = handle_command(self.client_stub, filter="claude")
-
-        # Verify results
-        self.assertTrue(result.success)
-        self.assertEqual(len(result.data["models"]), 2)
-        self.assertEqual(result.data["models"][0]["name"], "claude-v1")
-        self.assertEqual(result.data["models"][1]["name"], "claude-instant")
-        self.assertEqual(result.data["filter"], "claude")
-
-    def test_empty_list_models(self):
-        """Test listing models when no models are found."""
-        # Don't add any models to stub
-        # Don't set active model
-
-        # Call function
-        result = handle_command(self.client_stub)
-
-        # Verify results
-        self.assertTrue(result.success)
-        self.assertEqual(len(result.data["models"]), 0)
-        self.assertIsNotNone(result.message)
-        self.assertIn("No models found", result.message)
-
-    def test_list_models_exception(self):
-        """Test listing models with exception."""
-
-        # Create a stub that raises an exception for list_models
-        class FailingClientStub(DatabricksClientStub):
-            def list_models(self, **kwargs):
-                raise Exception("API error")
-
-        failing_client = FailingClientStub()
-
-        # Call function
-        result = handle_command(failing_client)
-
-        # Verify results
-        self.assertFalse(result.success)
-        self.assertEqual(str(result.error), "API error")
diff --git a/tests/commands/test_list_schemas.py b/tests/commands/test_list_schemas.py
deleted file mode 100644
index c8d6aef..0000000
--- a/tests/commands/test_list_schemas.py
+++ /dev/null
@@ -1,152 +0,0 @@
-"""
-Tests for schema commands including list-schemas and select-schema.
-"""
-
-import unittest
-import os
-import tempfile
-from unittest.mock import patch
-
-from chuck_data.commands.list_schemas import handle_command as list_schemas_handler
-from chuck_data.commands.schema_selection import handle_command as select_schema_handler
-from chuck_data.config import ConfigManager, get_active_schema, set_active_catalog
-from tests.fixtures import DatabricksClientStub
-
-
-class TestSchemaCommands(unittest.TestCase):
-    """Tests for schema-related commands."""
-
-    def setUp(self):
-        """Set up test fixtures."""
-        self.client_stub = DatabricksClientStub()
-
-        # Set up config management
-        self.temp_dir = tempfile.TemporaryDirectory()
-        self.config_path = os.path.join(self.temp_dir.name, "test_config.json")
-        self.config_manager = ConfigManager(self.config_path)
-        self.patcher = patch("chuck_data.config._config_manager", self.config_manager)
-        self.patcher.start()
-
-    def tearDown(self):
-        self.patcher.stop()
-        self.temp_dir.cleanup()
-
-    # Tests for list-schemas command
-    def test_list_schemas_with_display_true(self):
-        """Test list schemas with display=true shows table."""
-        # Set up test data
-        set_active_catalog("test_catalog")
-        self.client_stub.add_catalog("test_catalog")
-        self.client_stub.add_schema("test_catalog", "test_schema")
-
-        result = list_schemas_handler(self.client_stub, display=True)
-
-        self.assertTrue(result.success)
-        self.assertTrue(result.data.get("display"))
-        self.assertEqual(len(result.data.get("schemas", [])), 1)
-        self.assertEqual(result.data["schemas"][0]["name"], "test_schema")
-
-    def test_list_schemas_with_display_false(self):
-        """Test list schemas with display=false returns data without display."""
-        # Set up test data
-        set_active_catalog("test_catalog")
-        self.client_stub.add_catalog("test_catalog")
-        self.client_stub.add_schema("test_catalog", "test_schema")
-
-        result = list_schemas_handler(self.client_stub, display=False)
-
-        self.assertTrue(result.success)
-        self.assertFalse(result.data.get("display"))
-        self.assertEqual(len(result.data.get("schemas", [])), 1)
-
-    def test_list_schemas_no_active_catalog(self):
-        """Test list schemas when no active catalog is set."""
-        result = list_schemas_handler(self.client_stub)
-
-        self.assertFalse(result.success)
-        self.assertIn(
-            "No catalog specified and no active catalog selected", result.message
-        )
-
-    def test_list_schemas_empty_catalog(self):
-        """Test list schemas with empty catalog."""
-        set_active_catalog("empty_catalog")
-        self.client_stub.add_catalog("empty_catalog")
-
-        result = list_schemas_handler(self.client_stub, display=True)
-
-        self.assertTrue(result.success)
-        self.assertEqual(len(result.data.get("schemas", [])), 0)
-        self.assertTrue(result.data.get("display"))
-
-    # Tests for select-schema command
-    def test_select_schema_by_name(self):
-        """Test schema selection by name."""
-        set_active_catalog("test_catalog")
-        self.client_stub.add_catalog("test_catalog")
-        self.client_stub.add_schema("test_catalog", "test_schema")
-
-        result = select_schema_handler(self.client_stub, schema="test_schema")
-
-        self.assertTrue(result.success)
-        self.assertIn("Active schema is now set to 'test_schema'", result.message)
-        self.assertEqual(get_active_schema(), "test_schema")
-
-    def test_select_schema_fuzzy_matching(self):
-        """Test schema selection with fuzzy matching."""
-        set_active_catalog("test_catalog")
-        self.client_stub.add_catalog("test_catalog")
-        self.client_stub.add_schema("test_catalog", "test_schema_long_name")
-
-        result = select_schema_handler(self.client_stub, schema="test")
-
-        self.assertTrue(result.success)
-        self.assertIn("test_schema_long_name", result.message)
-        self.assertEqual(get_active_schema(), "test_schema_long_name")
-
-    def test_select_schema_no_match(self):
-        """Test schema selection with no matching schema."""
-        set_active_catalog("test_catalog")
-        self.client_stub.add_catalog("test_catalog")
-        self.client_stub.add_schema("test_catalog", "different_schema")
-
-        result = select_schema_handler(self.client_stub, schema="nonexistent")
-
-        self.assertFalse(result.success)
-        self.assertIn("No schema found matching 'nonexistent'", result.message)
-        self.assertIn("Available schemas:", result.message)
-
-    def test_select_schema_missing_parameter(self):
-        """Test schema selection with missing schema parameter."""
-        result = select_schema_handler(self.client_stub)
-
-        self.assertFalse(result.success)
-        self.assertIn("schema parameter is required", result.message)
-
-    def test_select_schema_no_active_catalog(self):
-        """Test schema selection with no active catalog."""
-        result = select_schema_handler(self.client_stub, schema="test_schema")
-
-        self.assertFalse(result.success)
-        self.assertIn("No active catalog selected", result.message)
-
-    def test_select_schema_tool_output_callback(self):
-        """Test schema selection with tool output callback."""
-        set_active_catalog("test_catalog")
-        self.client_stub.add_catalog("test_catalog")
-        self.client_stub.add_schema("test_catalog", "test_schema_with_callback")
-
-        # Mock callback to capture output
-        callback_calls = []
-
-        def mock_callback(tool_name, data):
-            callback_calls.append((tool_name, data))
-
-        result = select_schema_handler(
-            self.client_stub, schema="callback", tool_output_callback=mock_callback
-        )
-
-        self.assertTrue(result.success)
-        # Should have called the callback with step information
-        self.assertTrue(len(callback_calls) > 0)
-        self.assertEqual(callback_calls[0][0], "select-schema")
diff --git a/tests/commands/test_list_tables.py b/tests/commands/test_list_tables.py
deleted file mode 100644
index 98b431d..0000000
--- a/tests/commands/test_list_tables.py
+++ /dev/null
@@ -1,203 +0,0 @@
-"""
-Tests for list_tables command handler.
-
-This module contains tests for the list_tables command handler.
-"""
-
-import unittest
-import os
-import tempfile
-from unittest.mock import patch
-
-from chuck_data.commands.list_tables import handle_command
-from chuck_data.config import ConfigManager
-from tests.fixtures import DatabricksClientStub
-
-
-class TestListTables(unittest.TestCase):
-    """Tests for list_tables command handler."""
-
-    def setUp(self):
-        """Set up test fixtures."""
-        self.client_stub = DatabricksClientStub()
-
-        # Set up config management
-        self.temp_dir = tempfile.TemporaryDirectory()
-        self.config_path = os.path.join(self.temp_dir.name, "test_config.json")
-        self.config_manager = ConfigManager(self.config_path)
-        self.patcher = patch("chuck_data.config._config_manager", self.config_manager)
-        self.patcher.start()
-
-    def tearDown(self):
-        self.patcher.stop()
-        self.temp_dir.cleanup()
-
-    def test_no_client(self):
-        """Test handling when no client is provided."""
-        result = handle_command(None)
-        self.assertFalse(result.success)
-        self.assertIn("No Databricks client available", result.message)
-
-    def test_no_active_catalog(self):
-        """Test handling when no catalog is provided and no active catalog is set."""
-        # Don't set any active catalog in config
-
-        result = handle_command(self.client_stub)
-        self.assertFalse(result.success)
-        self.assertIn(
-            "No catalog specified and no active catalog selected", result.message
-        )
-
-    def test_no_active_schema(self):
-        """Test handling when no schema is provided and no active schema is set."""
-        # Set active catalog but not schema
-        from chuck_data.config import set_active_catalog
-
-        set_active_catalog("test_catalog")
-
-        result = handle_command(self.client_stub)
-        self.assertFalse(result.success)
-        self.assertIn(
-            "No schema specified and no active schema selected", result.message
-        )
-
-    def test_successful_list_tables_with_parameters(self):
-        """Test successful list tables with all parameters specified."""
-        # Set up test data using stub
-        self.client_stub.add_catalog("test_catalog")
-        self.client_stub.add_schema("test_catalog", "test_schema")
-        self.client_stub.add_table(
-            "test_catalog",
-            "test_schema",
-            "table1",
-            table_type="MANAGED",
-            comment="Test table 1",
-            created_at="2023-01-01",
-        )
-        self.client_stub.add_table(
-            "test_catalog",
-            "test_schema",
-            "table2",
-            table_type="VIEW",
-            comment="Test table 2",
-            created_at="2023-01-02",
-        )
-
-        # Call function
-        result = handle_command(
-            self.client_stub,
-            catalog_name="test_catalog",
-            schema_name="test_schema",
-            include_delta_metadata=True,
-            omit_columns=False,
-        )
-
-        # Verify results
-        self.assertTrue(result.success)
-        self.assertEqual(len(result.data["tables"]), 2)
-        self.assertEqual(result.data["total_count"], 2)
-        self.assertEqual(result.data["catalog_name"], "test_catalog")
-        self.assertEqual(result.data["schema_name"], "test_schema")
-        self.assertIn("Found 2 table(s) in 'test_catalog.test_schema'", result.message)
-
-        # Verify table data
-        table_names = [t["name"] for t in result.data["tables"]]
-        self.assertIn("table1", table_names)
-        self.assertIn("table2", table_names)
-
-    def test_successful_list_tables_with_defaults(self):
-        """Test successful list tables using default active catalog and schema."""
-        # Set up active catalog and schema
-        from chuck_data.config import set_active_catalog, set_active_schema
-
-        set_active_catalog("active_catalog")
-        set_active_schema("active_schema")
-
-        # Set up test data
-        self.client_stub.add_catalog("active_catalog")
-        self.client_stub.add_schema("active_catalog", "active_schema")
-        self.client_stub.add_table("active_catalog", "active_schema", "table1")
-
-        # Call function with no catalog or schema parameters
-        result = handle_command(self.client_stub)
-
-        # Verify results
-        self.assertTrue(result.success)
-        self.assertEqual(len(result.data["tables"]), 1)
-        self.assertEqual(result.data["catalog_name"], "active_catalog")
-        self.assertEqual(result.data["schema_name"], "active_schema")
-        self.assertEqual(result.data["tables"][0]["name"], "table1")
-
-    def test_empty_table_list(self):
-        """Test handling when no tables are found."""
-        # Set up catalog and schema but no tables
-        self.client_stub.add_catalog("test_catalog")
-        self.client_stub.add_schema("test_catalog", "test_schema")
-        # Don't add any tables
-
-        # Call function
-        result = handle_command(
-            self.client_stub, catalog_name="test_catalog", schema_name="test_schema"
-        )
-
-        # Verify results
-        self.assertTrue(result.success)
-        self.assertIn(
-            "No tables found in schema 'test_catalog.test_schema'", result.message
-        )
-
-    def test_list_tables_exception(self):
-        """Test list_tables with unexpected exception."""
-
-        # Create a stub that raises an exception for list_tables
-        class FailingClientStub(DatabricksClientStub):
-            def list_tables(self, *args, **kwargs):
-                raise Exception("API error")
-
-        failing_client = FailingClientStub()
-
-        # Call function
-        result = handle_command(
-            failing_client, catalog_name="test_catalog", schema_name="test_schema"
-        )
-
-        # Verify results
-        self.assertFalse(result.success)
-        self.assertIn("Failed to list tables", result.message)
-        self.assertEqual(str(result.error), "API error")
-
-    def test_list_tables_with_display_true(self):
-        """Test list tables with display=true shows table."""
-        # Set up test data
-        self.client_stub.add_catalog("test_catalog")
-        self.client_stub.add_schema("test_catalog", "test_schema")
-        self.client_stub.add_table("test_catalog", "test_schema", "test_table")
-
-        result = handle_command(
-            self.client_stub,
-            catalog_name="test_catalog",
-            schema_name="test_schema",
-            display=True,
-        )
-
-        self.assertTrue(result.success)
-        self.assertTrue(result.data.get("display"))
-        self.assertEqual(len(result.data.get("tables", [])), 1)
-
-    def test_list_tables_with_display_false(self):
-        """Test list tables with display=false returns data without display."""
-        # Set up test data
-        self.client_stub.add_catalog("test_catalog")
-        self.client_stub.add_schema("test_catalog", "test_schema")
-        self.client_stub.add_table("test_catalog", "test_schema", "test_table")
-
-        result = handle_command(
-            self.client_stub,
-            catalog_name="test_catalog",
-            schema_name="test_schema",
-            display=False,
-        )
-
-        self.assertTrue(result.success)
-        self.assertFalse(result.data.get("display"))
-        self.assertEqual(len(result.data.get("tables", [])), 1)
diff --git a/tests/commands/test_list_warehouses.py b/tests/commands/test_list_warehouses.py
deleted file mode 100644
index d5ef36f..0000000
--- a/tests/commands/test_list_warehouses.py
+++ /dev/null
@@ -1,331 +0,0 @@
-"""
-Tests for list_warehouses command handler.
-
-This module contains tests for the list_warehouses command handler.
-"""
-
-import unittest
-
-from chuck_data.commands.list_warehouses import handle_command
-from tests.fixtures import DatabricksClientStub
-
-
-class TestListWarehouses(unittest.TestCase):
-    """Tests for list_warehouses command handler."""
-
-    def setUp(self):
-        """Set up test fixtures."""
-        self.client_stub = DatabricksClientStub()
-
-    def test_no_client(self):
-        """Test handling when no client is provided."""
-        result = handle_command(None)
-        self.assertFalse(result.success)
-        self.assertIn("No Databricks client available", result.message)
-
-    def test_successful_list_warehouses(self):
-        """Test successful warehouse listing with various warehouse types."""
-        # Add test warehouses with different configurations
-        self.client_stub.add_warehouse(
-            warehouse_id="warehouse-123",
-            name="Test Serverless Warehouse",
-            size="XLARGE",
-            state="STOPPED",
-            enable_serverless_compute=True,
-            warehouse_type="PRO",
-            creator_name="test.user@example.com",
-            auto_stop_mins=10,
-        )
-        self.client_stub.add_warehouse(
-            warehouse_id="warehouse-456",
-            name="Test Regular Warehouse",
-            size="SMALL",
-            state="RUNNING",
-            enable_serverless_compute=False,
-            warehouse_type="CLASSIC",
-            creator_name="another.user@example.com",
-            auto_stop_mins=60,
-        )
-        self.client_stub.add_warehouse(
-            warehouse_id="warehouse-789",
-            name="Test XXSMALL Warehouse",
-            size="XXSMALL",
-            state="STARTING",
-            enable_serverless_compute=True,
-            warehouse_type="PRO",
-            creator_name="third.user@example.com",
-            auto_stop_mins=15,
-        )
-
-        # Call the function
-        result = handle_command(self.client_stub)
-
-        # Verify results
-        self.assertTrue(result.success)
-        self.assertEqual(len(result.data["warehouses"]), 3)
-        self.assertEqual(result.data["total_count"], 3)
-        self.assertIn("Found 3 SQL warehouse(s)", result.message)
-
-        # Verify warehouse data structure and content
-        warehouses = result.data["warehouses"]
-        warehouse_names = [w["name"] for w in warehouses]
-        self.assertIn("Test Serverless Warehouse", warehouse_names)
-        self.assertIn("Test Regular Warehouse", warehouse_names)
-        self.assertIn("Test XXSMALL Warehouse", warehouse_names)
-
-        # Verify specific warehouse details
-        serverless_warehouse = next(
-            w for w in warehouses if w["name"] == "Test Serverless Warehouse"
-        )
-        self.assertEqual(serverless_warehouse["id"], "warehouse-123")
-        self.assertEqual(serverless_warehouse["size"], "XLARGE")
-        self.assertEqual(serverless_warehouse["state"], "STOPPED")
-        self.assertEqual(serverless_warehouse["enable_serverless_compute"], True)
-        self.assertEqual(serverless_warehouse["warehouse_type"], "PRO")
-        self.assertEqual(serverless_warehouse["creator_name"], "test.user@example.com")
-        self.assertEqual(serverless_warehouse["auto_stop_mins"], 10)
-
-        regular_warehouse = next(
-            w for w in warehouses if w["name"] == "Test Regular Warehouse"
-        )
-        self.assertEqual(regular_warehouse["id"], "warehouse-456")
-        self.assertEqual(regular_warehouse["size"], "SMALL")
-        self.assertEqual(regular_warehouse["state"], "RUNNING")
-        self.assertEqual(regular_warehouse["enable_serverless_compute"], False)
-        self.assertEqual(regular_warehouse["warehouse_type"], "CLASSIC")
-        self.assertEqual(regular_warehouse["creator_name"], "another.user@example.com")
-        self.assertEqual(regular_warehouse["auto_stop_mins"], 60)
-
-    def test_empty_warehouse_list(self):
-        """Test handling when no warehouses are found."""
-        # Don't add any warehouses to the stub
-
-        # Call the function
-        result = handle_command(self.client_stub)
-
-        # Verify results
-        self.assertTrue(result.success)
-        self.assertIn("No SQL warehouses found", result.message)
-
-    def test_list_warehouses_exception(self):
-        """Test list_warehouses with unexpected exception."""
-
-        # Create a stub that raises an exception for list_warehouses
-        class FailingClientStub(DatabricksClientStub):
-            def list_warehouses(self, **kwargs):
-                raise Exception("API connection error")
-
-        failing_client = FailingClientStub()
-
-        # Call the function
-        result = handle_command(failing_client)
-
-        # Verify results
-        self.assertFalse(result.success)
-        self.assertIn("Failed to fetch warehouses", result.message)
-        self.assertEqual(str(result.error), "API connection error")
-
-    def test_warehouse_data_integrity(self):
-        """Test that all required warehouse fields are preserved."""
-        # Add a warehouse with all possible fields
-        self.client_stub.add_warehouse(
-            warehouse_id="warehouse-complete",
-            name="Complete Test Warehouse",
-            size="MEDIUM",
-            state="STOPPED",
-            enable_serverless_compute=True,
-            creator_name="complete.user@example.com",
-            auto_stop_mins=30,
-            # Additional fields that might be present
-            cluster_size="Medium",
-            min_num_clusters=1,
-            max_num_clusters=5,
-            warehouse_type="PRO",
-            enable_photon=True,
-        )
-
-        # Call the function
-        result = handle_command(self.client_stub)
-
-        # Verify results
-        self.assertTrue(result.success)
-        warehouses = result.data["warehouses"]
-        self.assertEqual(len(warehouses), 1)
-
-        warehouse = warehouses[0]
-        # Verify all required fields are present
-        required_fields = [
-            "id",
-            "name",
-            "size",
-            "state",
-            "creator_name",
-            "auto_stop_mins",
-            "enable_serverless_compute",
-        ]
-        for field in required_fields:
-            self.assertIn(
-                field,
-                warehouse,
-                f"Required field '{field}' missing from warehouse data",
-            )
-
-        # Verify field values
-        self.assertEqual(warehouse["id"], "warehouse-complete")
-        self.assertEqual(warehouse["name"], "Complete Test Warehouse")
-        self.assertEqual(warehouse["size"], "MEDIUM")
-        self.assertEqual(warehouse["state"], "STOPPED")
-        self.assertEqual(warehouse["enable_serverless_compute"], True)
-        self.assertEqual(warehouse["creator_name"], "complete.user@example.com")
-        self.assertEqual(warehouse["auto_stop_mins"], 30)
-
-    def test_various_warehouse_sizes(self):
-        """Test that different warehouse sizes are handled correctly."""
-        sizes = [
-            "XXSMALL",
-            "XSMALL",
-            "SMALL",
-            "MEDIUM",
-            "LARGE",
-            "XLARGE",
-            "2XLARGE",
-            "3XLARGE",
-            "4XLARGE",
-        ]
-
-        # Add warehouses with different sizes
-        for i, size in enumerate(sizes):
-            self.client_stub.add_warehouse(
-                warehouse_id=f"warehouse-{i}",
-                name=f"Test {size} Warehouse",
-                size=size,
-                state="STOPPED",
-                enable_serverless_compute=True,
-            )
-
-        # Call the function
-        result = handle_command(self.client_stub)
-
-        # Verify results
-        self.assertTrue(result.success)
-        self.assertEqual(len(result.data["warehouses"]), len(sizes))
-
-        # Verify all sizes are preserved correctly
-        warehouses = result.data["warehouses"]
-        returned_sizes = [w["size"] for w in warehouses]
-        for size in sizes:
-            self.assertIn(
-                size, returned_sizes, f"Size {size} not found in returned warehouses"
-            )
-
-    def test_various_warehouse_states(self):
-        """Test that different warehouse states are handled correctly."""
-        states = ["RUNNING", "STOPPED", "STARTING", "STOPPING", "DELETING", "DELETED"]
-
-        # Add warehouses with different states
-        for i, state in enumerate(states):
-            self.client_stub.add_warehouse(
-                warehouse_id=f"warehouse-{i}",
-                name=f"Test {state} Warehouse",
-                size="SMALL",
-                state=state,
-                enable_serverless_compute=False,
-            )
-
-        # Call the function
-        result = handle_command(self.client_stub)
-
-        # Verify results
-        self.assertTrue(result.success)
-        self.assertEqual(len(result.data["warehouses"]), len(states))
-
-        # Verify all states are preserved correctly
-        warehouses = result.data["warehouses"]
-        returned_states = [w["state"] for w in warehouses]
-        for state in states:
-            self.assertIn(
-                state,
-                returned_states,
-                f"State {state} not found in returned warehouses",
-            )
-
-    def test_serverless_compute_boolean_handling(self):
-        """Test that serverless compute boolean values are handled correctly."""
-        # Add warehouses with different serverless settings
-        self.client_stub.add_warehouse(
-            warehouse_id="warehouse-serverless-true",
-            name="Serverless True Warehouse",
-            size="SMALL",
-            state="STOPPED",
-            enable_serverless_compute=True,
-        )
-        self.client_stub.add_warehouse(
-            warehouse_id="warehouse-serverless-false",
-            name="Serverless False Warehouse",
-            size="SMALL",
-            state="STOPPED",
-            enable_serverless_compute=False,
-        )
-
-        # Call the function
-        result = handle_command(self.client_stub)
-
-        # Verify results
-        self.assertTrue(result.success)
-        warehouses = result.data["warehouses"]
-        self.assertEqual(len(warehouses), 2)
-
-        # Find warehouses by name and verify serverless settings
-        serverless_true = next(
-            w for w in warehouses if w["name"] == "Serverless True Warehouse"
-        )
-        serverless_false = next(
-            w for w in warehouses if w["name"] == "Serverless False Warehouse"
-        )
-
-        self.assertTrue(serverless_true["enable_serverless_compute"])
-        self.assertFalse(serverless_false["enable_serverless_compute"])
-
-        # Ensure they're proper boolean values, not strings
-        self.assertIsInstance(serverless_true["enable_serverless_compute"], bool)
-        self.assertIsInstance(serverless_false["enable_serverless_compute"], bool)
-
-    def test_display_parameter_false(self):
-        """Test that display=False parameter works correctly."""
-        # Add test warehouse
-        self.client_stub.add_warehouse(
-            warehouse_id="warehouse-test",
-            name="Test Warehouse",
-            size="SMALL",
-            state="RUNNING",
-        )
-
-        # Call function with display=False
-        result = handle_command(self.client_stub, display=False)
-
-        # Verify results
-        self.assertTrue(result.success)
-        self.assertEqual(len(result.data["warehouses"]), 1)
-        # Should still include current_warehouse_id for highlighting
-        self.assertIn("current_warehouse_id", result.data)
-
-    def test_display_parameter_false_default(self):
-        """Test that display parameter defaults to False."""
-        # Add test warehouse
-        self.client_stub.add_warehouse(
-            warehouse_id="warehouse-test",
-            name="Test Warehouse",
-            size="SMALL",
-            state="RUNNING",
-        )
-
-        # Call function without display parameter
-        result = handle_command(self.client_stub)
-
-        # Verify results
-        self.assertTrue(result.success)
-        self.assertEqual(len(result.data["warehouses"]), 1)
-        # Should include current_warehouse_id for highlighting
-        self.assertIn("current_warehouse_id", result.data)
-        # Should default to display=False
-        self.assertEqual(result.data["display"], False)
diff --git a/tests/commands/test_model_selection.py b/tests/commands/test_model_selection.py
deleted file mode 100644
index 4d138c4..0000000
--- a/tests/commands/test_model_selection.py
+++ /dev/null
@@ -1,88 +0,0 @@
-"""
-Tests for model_selection command handler.
-
-This module contains tests for the model_selection command handler.
-"""
-
-import unittest
-import os
-import tempfile
-from unittest.mock import patch
-
-from chuck_data.commands.model_selection import handle_command
-from chuck_data.config import ConfigManager, get_active_model
-from tests.fixtures import DatabricksClientStub
-
-
-class TestModelSelection(unittest.TestCase):
-    """Tests for model selection command handler."""
-
-    def setUp(self):
-        """Set up test fixtures."""
-        self.client_stub = DatabricksClientStub()
-
-        # Set up config management
-        self.temp_dir = tempfile.TemporaryDirectory()
-        self.config_path = os.path.join(self.temp_dir.name, "test_config.json")
-        self.config_manager = ConfigManager(self.config_path)
-        self.patcher = patch("chuck_data.config._config_manager", self.config_manager)
-        self.patcher.start()
-
-    def tearDown(self):
-        self.patcher.stop()
-        self.temp_dir.cleanup()
-
-    def test_missing_model_name(self):
-        """Test handling when model_name is not provided."""
-        result = handle_command(self.client_stub)
-        self.assertFalse(result.success)
-        self.assertIn("model_name parameter is required", result.message)
-
-    def test_successful_model_selection(self):
-        """Test successful model selection."""
-        # Set up test data using stub
-        self.client_stub.add_model("claude-v1", created_timestamp=123456789)
-        self.client_stub.add_model("gpt-4", created_timestamp=987654321)
-
-        # Call function
-        result = handle_command(self.client_stub, model_name="claude-v1")
-
-        # Verify results
-        self.assertTrue(result.success)
-        self.assertIn("Active model is now set to 'claude-v1'", result.message)
-
-        # Verify config was updated
-        self.assertEqual(get_active_model(), "claude-v1")
-
-    def test_model_not_found(self):
-        """Test model selection when model is not found."""
-        # Set up test data using stub - but don't include the requested model
-        self.client_stub.add_model("claude-v1", created_timestamp=123456789)
-        self.client_stub.add_model("gpt-4", created_timestamp=987654321)
-
-        # Call function with nonexistent model
-        result = handle_command(self.client_stub, model_name="nonexistent-model")
-
-        # Verify results
-        self.assertFalse(result.success)
-        self.assertIn("Model 'nonexistent-model' not found", result.message)
-
-        # Verify config was not updated
-        self.assertIsNone(get_active_model())
-
-    def test_model_selection_api_exception(self):
-        """Test model selection when API call throws an exception."""
-
-        # Create a stub that raises an exception for list_models
-        class FailingClientStub(DatabricksClientStub):
-            def list_models(self, **kwargs):
-                raise Exception("API error")
-
-        failing_client = FailingClientStub()
-
-        # Call function
-        result = handle_command(failing_client, model_name="claude-v1")
-
-        # Verify results
-        self.assertFalse(result.success)
-        self.assertEqual(str(result.error), "API error")
diff --git a/tests/commands/test_models.py b/tests/commands/test_models.py
deleted file mode 100644
index f41853f..0000000
--- a/tests/commands/test_models.py
+++ /dev/null
@@ -1,134 +0,0 @@
-"""
-Tests for the model-related command modules.
-"""
-
-import unittest
-import os
-import tempfile
-from unittest.mock import patch
-
-from chuck_data.config import ConfigManager, set_active_model, get_active_model
-from chuck_data.commands.models import handle_command as handle_models
-from chuck_data.commands.list_models import handle_command as handle_list_models
-from chuck_data.commands.model_selection import handle_command as handle_model_selection
-
-
-class StubClient:
-    """Simple client stub for model commands."""
-
-    def __init__(self, models=None, active_model=None):
-        self.models = models or []
-        self.active_model = active_model
-
-    def list_models(self):
-        return self.models
-
-    def get_active_model(self):
-        return self.active_model
-
-
-class TestModelsCommands(unittest.TestCase):
-    """Test cases for the model-related command handlers."""
-
-    def setUp(self):
-        """Set up common test fixtures."""
-        self.temp_dir = tempfile.TemporaryDirectory()
-        self.config_path = os.path.join(self.temp_dir.name, "test_config.json")
-        self.config_manager = ConfigManager(self.config_path)
-        self.patcher = patch("chuck_data.config._config_manager", self.config_manager)
-        self.patcher.start()
-        self.client = None
-
-    def tearDown(self):
-        self.patcher.stop()
-        self.temp_dir.cleanup()
-
-    def test_handle_models_with_models(self):
-        """Test handling models command with available models."""
-        self.client = StubClient(
-            models=[
-                {"name": "model1", "status": "READY"},
-                {"name": "model2", "status": "READY"},
-            ]
-        )
-
-        result = handle_models(self.client)
-
-        self.assertTrue(result.success)
-        self.assertEqual(result.data, self.client.list_models())
-
-    def test_handle_models_empty(self):
-        """Test handling models command with no available models."""
-        self.client = StubClient(models=[])
-
-        result = handle_models(self.client)
-
-        self.assertTrue(result.success)
-        self.assertEqual(result.data, [])
-        self.assertIn("No models found", result.message)
-
-    def test_handle_list_models_basic(self):
-        """Test list models command (basic)."""
-        self.client = StubClient(
-            models=[
-                {"name": "model1", "status": "READY"},
-                {"name": "model2", "status": "READY"},
-            ],
-            active_model="model1",
-        )
-        set_active_model(self.client.active_model)
-
-        result = handle_list_models(self.client)
-
-        self.assertTrue(result.success)
-        self.assertEqual(result.data["models"], self.client.list_models())
-        self.assertEqual(result.data["active_model"], self.client.active_model)
-        self.assertFalse(result.data["detailed"])
-        self.assertIsNone(result.data["filter"])
-
-    def test_handle_list_models_filter(self):
-        """Test list models command with filter."""
-        self.client = StubClient(
-            models=[
-                {"name": "model1", "status": "READY"},
-                {"name": "model2", "status": "READY"},
-            ],
-            active_model="model1",
-        )
-        set_active_model(self.client.active_model)
-
-        result = handle_list_models(self.client, filter="model1")
-
-        self.assertTrue(result.success)
-        self.assertEqual(len(result.data["models"]), 1)
-        self.assertEqual(result.data["models"][0]["name"], "model1")
-        self.assertEqual(result.data["filter"], "model1")
-
-    def test_handle_model_selection_success(self):
-        """Test successful model selection."""
-        self.client = StubClient(models=[{"name": "model1"}, {"name": "valid-model"}])
-
-        result = handle_model_selection(self.client, model_name="valid-model")
-
-        self.assertTrue(result.success)
-        self.assertEqual(get_active_model(), "valid-model")
-        self.assertIn("Active model is now set to 'valid-model'", result.message)
-
-    def test_handle_model_selection_invalid(self):
-        """Test selecting an invalid model."""
-        self.client = StubClient(models=[{"name": "model1"}, {"name": "model2"}])
-
-        result = handle_model_selection(self.client, model_name="nonexistent-model")
-
-        self.assertFalse(result.success)
-        self.assertIn("not found", result.message)
-
-    def test_handle_model_selection_no_name(self):
-        """Test model selection with no model name provided."""
-        self.client = StubClient(models=[])  # models unused
-
-        result = handle_model_selection(self.client)
-
-        # Verify the result
-        self.assertFalse(result.success)
-        self.assertIn("model_name parameter is required", result.message)
diff --git a/tests/commands/test_pii_tools.py b/tests/commands/test_pii_tools.py
deleted file mode 100644
index 420ebc9..0000000
--- a/tests/commands/test_pii_tools.py
+++ /dev/null
@@ -1,125 +0,0 @@
-"""
-Tests for the PII tools helper module.
-"""
-
-import unittest
-import os
-import tempfile
-from unittest.mock import patch, MagicMock
-
-from chuck_data.commands.pii_tools import (
-    _helper_tag_pii_columns_logic,
-    _helper_scan_schema_for_pii_logic,
-)
-from chuck_data.config import ConfigManager
-from tests.fixtures import DatabricksClientStub, LLMClientStub
-
-
-class TestPIITools(unittest.TestCase):
-    """Test cases for the PII tools helper functions."""
-
-    def setUp(self):
-        """Set up common test fixtures."""
-        self.client_stub = DatabricksClientStub()
-        self.llm_client = LLMClientStub()
-
-        # Set up config management
-        self.temp_dir = tempfile.TemporaryDirectory()
-        self.config_path = os.path.join(self.temp_dir.name, "test_config.json")
-        self.config_manager = ConfigManager(self.config_path)
-        self.patcher = patch("chuck_data.config._config_manager", self.config_manager)
-        self.patcher.start()
-
-        # Mock columns from database
-        self.mock_columns = [
-            {"name": "first_name", "type_name": "string"},
-            {"name": "email", "type_name": "string"},
-            {"name": "signup_date", "type_name": "date"},
-        ]
-
-        # Configure LLM client stub for PII detection response
-        pii_response_content = '[{"name":"first_name","semantic":"given-name"},{"name":"email","semantic":"email"},{"name":"signup_date","semantic":null}]'
-        self.llm_client.set_response_content(pii_response_content)
-
-    def tearDown(self):
-        self.patcher.stop()
-        self.temp_dir.cleanup()
-
-    @patch("chuck_data.commands.pii_tools.json.loads")
-    def test_tag_pii_columns_logic_success(self, mock_json_loads):
-        """Test successful tagging of PII columns."""
-        # Set up test data using stub
-        self.client_stub.add_catalog("mycat")
-        self.client_stub.add_schema("mycat", "myschema")
-        self.client_stub.add_table(
-            "mycat", "myschema", "users", columns=self.mock_columns
-        )
-
-        # Mock the JSON parsing instead of relying on actual JSON parsing
-        mock_json_loads.return_value = [
-            {"name": "first_name", "semantic": "given-name"},
-            {"name": "email", "semantic": "email"},
-            {"name": "signup_date", "semantic": None},
-        ]
-
-        # Call the function
-        result = _helper_tag_pii_columns_logic(
-            self.client_stub,
-            self.llm_client,
-            "users",
-            catalog_name_context="mycat",
-            schema_name_context="myschema",
-        )
-
-        # Verify the result
-        self.assertEqual(result["full_name"], "mycat.myschema.users")
-        self.assertEqual(result["table_name"], "users")
-        self.assertEqual(result["column_count"], 3)
-        self.assertEqual(result["pii_column_count"], 2)
-        self.assertTrue(result["has_pii"])
-        self.assertFalse(result["skipped"])
-        self.assertEqual(result["columns"][0]["semantic"], "given-name")
-        self.assertEqual(result["columns"][1]["semantic"], "email")
-        self.assertIsNone(result["columns"][2]["semantic"])
-
-    @patch("concurrent.futures.ThreadPoolExecutor")
-    def test_scan_schema_for_pii_logic(self, mock_executor):
-        """Test scanning a schema for PII."""
-        # Set up test data using stub
-        self.client_stub.add_catalog("test_cat")
-        self.client_stub.add_schema("test_cat", "test_schema")
-        self.client_stub.add_table("test_cat", "test_schema", "users")
-        self.client_stub.add_table("test_cat", "test_schema", "orders")
-        self.client_stub.add_table("test_cat", "test_schema", "_stitch_temp")
-
-        # Mock the ThreadPoolExecutor
-        mock_future = MagicMock()
-        mock_future.result.return_value = {
-            "table_name": "users",
-            "full_name": "test_cat.test_schema.users",
-            "pii_column_count": 2,
-            "has_pii": True,
-            "skipped": False,
-        }
-
-        mock_context = MagicMock()
-        mock_context.__enter__.return_value = mock_context
-        mock_context.submit.return_value = mock_future
-        mock_executor.return_value = mock_context
-
-        # Mock concurrent.futures.as_completed to return mock_future
-        with patch("concurrent.futures.as_completed", return_value=[mock_future]):
-            # Call the function
-            result = _helper_scan_schema_for_pii_logic(
-                self.client_stub, self.llm_client, "test_cat", "test_schema"
-            )
-
-            # Verify the result
-            self.assertEqual(result["catalog"], "test_cat")
-            self.assertEqual(result["schema"], "test_schema")
-            self.assertEqual(
-                result["tables_scanned_attempted"], 2
-            )  # Excluding _stitch_temp
-            self.assertEqual(result["tables_successfully_processed"], 1)
-            self.assertEqual(result["tables_with_pii"], 1)
-            self.assertEqual(result["total_pii_columns"], 2)
diff --git a/tests/commands/test_scan_pii.py b/tests/commands/test_scan_pii.py
deleted file mode 100644
index 8eca72f..0000000
--- a/tests/commands/test_scan_pii.py
+++ /dev/null
@@ -1,150 +0,0 @@
-"""
-Tests for scan_pii command handler.
-
-This module contains tests for the scan_pii command handler.
-"""
-
-import unittest
-from unittest.mock import patch, MagicMock
-
-from chuck_data.commands.scan_pii import handle_command
-from tests.fixtures import LLMClientStub
-
-
-class TestScanPII(unittest.TestCase):
-    """Tests for scan_pii command handler."""
-
-    def setUp(self):
-        """Set up common test fixtures."""
-        self.client = MagicMock()
-
-    def test_missing_client(self):
-        """Test handling when client is not provided."""
-        result = handle_command(None)
-        self.assertFalse(result.success)
-        self.assertIn("Client is required", result.message)
-
-    @patch("chuck_data.commands.scan_pii.get_active_catalog")
-    @patch("chuck_data.commands.scan_pii.get_active_schema")
-    def test_missing_context(self, mock_get_active_schema, mock_get_active_catalog):
-        """Test handling when catalog or schema is missing."""
-        # Setup mocks
-        mock_get_active_catalog.return_value = None
-        mock_get_active_schema.return_value = None
-
-        # Call function
-        result = handle_command(self.client)
-
-        # Verify results
-        self.assertFalse(result.success)
-        self.assertIn("Catalog and schema must be specified", result.message)
-
-    @patch("chuck_data.commands.scan_pii.LLMClient")
-    @patch("chuck_data.commands.scan_pii._helper_scan_schema_for_pii_logic")
-    def test_successful_scan(self, mock_helper_scan, mock_llm_client):
-        """Test successful schema scan for PII."""
-        # Setup mocks
-        llm_client_stub = LLMClientStub()
-        mock_llm_client.return_value = llm_client_stub
-
-        mock_helper_scan.return_value = {
-            "tables_successfully_processed": 5,
-            "tables_scanned_attempted": 6,
-            "tables_with_pii": 3,
-            "total_pii_columns": 8,
-            "catalog": "test_catalog",
-            "schema": "test_schema",
-            "results_detail": [
-                {"full_name": "test_catalog.test_schema.table1", "has_pii": True},
-                {"full_name": "test_catalog.test_schema.table2", "has_pii": True},
-                {"full_name": "test_catalog.test_schema.table3", "has_pii": True},
-                {"full_name": "test_catalog.test_schema.table4", "has_pii": False},
-                {"full_name": "test_catalog.test_schema.table5", "has_pii": False},
-            ],
-        }
-
-        # Call function
-        result = handle_command(
-            self.client, catalog_name="test_catalog", schema_name="test_schema"
-        )
-
-        # Verify results
-        self.assertTrue(result.success)
-        self.assertEqual(result.data["tables_successfully_processed"], 5)
-        self.assertEqual(result.data["tables_with_pii"], 3)
-        self.assertEqual(result.data["total_pii_columns"], 8)
-        self.assertIn("Scanned 5/6 tables", result.message)
-        self.assertIn("Found 3 tables with 8 PII columns", result.message)
-        mock_helper_scan.assert_called_once_with(
-            self.client, llm_client_stub, "test_catalog", "test_schema"
-        )
-
-    @patch("chuck_data.commands.scan_pii.get_active_catalog")
-    @patch("chuck_data.commands.scan_pii.get_active_schema")
-    @patch("chuck_data.commands.scan_pii.LLMClient")
-    @patch("chuck_data.commands.scan_pii._helper_scan_schema_for_pii_logic")
-    def test_scan_with_active_context(
-        self,
-        mock_helper_scan,
-        mock_llm_client,
-        mock_get_active_schema,
-        mock_get_active_catalog,
-    ):
-        """Test schema scan using active catalog and schema."""
-        # Setup mocks
-        mock_get_active_catalog.return_value = "active_catalog"
-        mock_get_active_schema.return_value = "active_schema"
-
-        llm_client_stub = LLMClientStub()
-        mock_llm_client.return_value = llm_client_stub
-
-        mock_helper_scan.return_value = {
-            "tables_successfully_processed": 3,
-            "tables_scanned_attempted": 3,
-            "tables_with_pii": 1,
-            "total_pii_columns": 2,
-        }
-
-        # Call function without catalog/schema args
-        result = handle_command(self.client)
-
-        # Verify results
-        self.assertTrue(result.success)
-        mock_helper_scan.assert_called_once_with(
-            self.client, llm_client_stub, "active_catalog", "active_schema"
-        )
-
-    @patch("chuck_data.commands.scan_pii.LLMClient")
-    @patch("chuck_data.commands.scan_pii._helper_scan_schema_for_pii_logic")
-    def test_scan_with_helper_error(self, mock_helper_scan, mock_llm_client):
-        """Test handling when helper returns an error."""
-        # Setup mocks
-        llm_client_stub = LLMClientStub()
-        mock_llm_client.return_value = llm_client_stub
-
-        mock_helper_scan.return_value = {"error": "Failed to list tables"}
-
-        # Call function
-        result = handle_command(
-            self.client, catalog_name="test_catalog", schema_name="test_schema"
-        )
-
-        # Verify results
-        self.assertFalse(result.success)
-        self.assertEqual(result.message, "Failed to list tables")
-
-    @patch("chuck_data.commands.scan_pii.LLMClient")
-    def test_scan_with_exception(self, mock_llm_client):
-        """Test handling when an exception occurs."""
-        # Setup mocks
-        mock_llm_client.side_effect = Exception("LLM client error")
-
-        # Call function
-        result = handle_command(
-            self.client, catalog_name="test_catalog", schema_name="test_schema"
-        )
-
-        # Verify results
-        self.assertFalse(result.success)
-        self.assertIn("Error during bulk PII scan", result.message)
-        self.assertEqual(str(result.error), "LLM client error")
diff --git a/tests/commands/test_schema_selection.py b/tests/commands/test_schema_selection.py
deleted file mode 100644
index 3c52b52..0000000
--- a/tests/commands/test_schema_selection.py
+++ /dev/null
@@ -1,112 +0,0 @@
-"""
-Tests for schema_selection command handler.
-
-This module contains tests for the schema selection command handler.
-"""
-
-import unittest
-import os
-import tempfile
-from unittest.mock import patch
-
-from chuck_data.commands.schema_selection import handle_command
-from chuck_data.config import ConfigManager, get_active_schema, set_active_catalog
-from tests.fixtures import DatabricksClientStub
-
-
-class TestSchemaSelection(unittest.TestCase):
-    """Tests for schema selection command handler."""
-
-    def setUp(self):
-        """Set up test fixtures."""
-        self.client_stub = DatabricksClientStub()
-
-        # Set up config management
-        self.temp_dir = tempfile.TemporaryDirectory()
-        self.config_path = os.path.join(self.temp_dir.name, "test_config.json")
-        self.config_manager = ConfigManager(self.config_path)
-        self.patcher = patch("chuck_data.config._config_manager", self.config_manager)
-        self.patcher.start()
-
-    def tearDown(self):
-        self.patcher.stop()
-        self.temp_dir.cleanup()
-
-    def test_missing_schema_name(self):
-        """Test handling when schema parameter is not provided."""
-        result = handle_command(self.client_stub)
-        self.assertFalse(result.success)
-        self.assertIn("schema parameter is required", result.message)
-
-    def test_no_active_catalog(self):
-        """Test handling when no active catalog is selected."""
-        # Don't set any active catalog in config
-
-        # Call function
-        result = handle_command(self.client_stub, schema="test_schema")
-
-        # Verify results
-        self.assertFalse(result.success)
-        self.assertIn("No active catalog selected", result.message)
-
-    def test_successful_schema_selection(self):
-        """Test successful schema selection."""
-        # Set up active catalog and test data
-        set_active_catalog("test_catalog")
-        self.client_stub.add_catalog("test_catalog")
-        self.client_stub.add_schema("test_catalog", "test_schema")
-
-        # Call function
-        result = handle_command(self.client_stub, schema="test_schema")
-
-        # Verify results
-        self.assertTrue(result.success)
-        self.assertIn("Active schema is now set to 'test_schema'", result.message)
-        self.assertIn("in catalog 'test_catalog'", result.message)
-        self.assertEqual(result.data["schema_name"], "test_schema")
-        self.assertEqual(result.data["catalog_name"], "test_catalog")
-
-        # Verify config was updated
-        self.assertEqual(get_active_schema(), "test_schema")
-
-    def test_schema_selection_with_verification_failure(self):
-        """Test schema selection when no matching schema exists."""
-        # Set up active catalog but don't add the schema to stub
-        set_active_catalog("test_catalog")
-        self.client_stub.add_catalog("test_catalog")
-        self.client_stub.add_schema("test_catalog", "completely_different_schema_name")
-
-        # Call function with non-existent schema that won't match via fuzzy matching
-        result = handle_command(self.client_stub, schema="xyz_nonexistent_abc")
-
-        # Verify results - should fail cleanly
-        self.assertFalse(result.success)
-        self.assertIn("No schema found matching 'xyz_nonexistent_abc'", result.message)
-        self.assertIn("Available schemas:", result.message)
-
-    def test_schema_selection_exception(self):
-        """Test schema selection with list_schemas exception."""
-        # Set up active catalog
-        set_active_catalog("test_catalog")
-
-        # Create a stub that raises an exception during list_schemas
-        class FailingStub(DatabricksClientStub):
-            def list_schemas(
-                self,
-                catalog_name,
-                include_browse=False,
-                max_results=None,
-                page_token=None,
-                **kwargs,
-            ):
-                raise Exception("Failed to list schemas")
-
-        failing_stub = FailingStub()
-        failing_stub.add_catalog("test_catalog")
-
-        # Call function
-        result = handle_command(failing_stub, schema="test_schema")
-
-        # Should fail due to the exception
-        self.assertFalse(result.success)
-        self.assertIn("Failed to list schemas", result.message)
diff --git a/tests/commands/test_setup_stitch.py b/tests/commands/test_setup_stitch.py
deleted file mode 100644
index e9df860..0000000
--- a/tests/commands/test_setup_stitch.py
+++ /dev/null
@@ -1,226 +0,0 @@
-"""
-Tests for setup_stitch command handler.
-
-This module contains tests for the setup_stitch command handler.
-"""
-
-import unittest
-from unittest.mock import patch, MagicMock
-
-from chuck_data.commands.setup_stitch import handle_command
-from tests.fixtures import LLMClientStub
-
-
-class TestSetupStitch(unittest.TestCase):
-    """Tests for setup_stitch command handler."""
-
-    def setUp(self):
-        """Set up common test fixtures."""
-        self.client = MagicMock()
-
-    def test_missing_client(self):
-        """Test handling when client is not provided."""
-        result = handle_command(None)
-        self.assertFalse(result.success)
-        self.assertIn("Client is required", result.message)
-
-    @patch("chuck_data.commands.setup_stitch.get_active_catalog")
-    @patch("chuck_data.commands.setup_stitch.get_active_schema")
-    def test_missing_context(self, mock_get_active_schema, mock_get_active_catalog):
-        """Test handling when catalog or schema is missing."""
-        # Setup mocks
-        mock_get_active_catalog.return_value = None
-        mock_get_active_schema.return_value = None
-
-        # Call function
-        result = handle_command(self.client)
-
-        # Verify results
-        self.assertFalse(result.success)
-        self.assertIn("Target catalog and schema must be specified", result.message)
-
-    @patch("chuck_data.commands.setup_stitch._helper_launch_stitch_job")
-    @patch("chuck_data.commands.setup_stitch.LLMClient")
-    @patch("chuck_data.commands.setup_stitch._helper_setup_stitch_logic")
-    @patch("chuck_data.commands.setup_stitch.get_metrics_collector")
-    def test_successful_setup(
-        self,
-        mock_get_metrics_collector,
-        mock_helper_setup,
-        mock_llm_client,
-        mock_launch_job,
-    ):
-        """Test successful Stitch setup."""
-        # Setup mocks
-        llm_client_stub = LLMClientStub()
-        mock_llm_client.return_value = llm_client_stub
-        mock_metrics_collector = MagicMock()
-        mock_get_metrics_collector.return_value = mock_metrics_collector
-
-        mock_helper_setup.return_value = {
-            "stitch_config": {},
-            "metadata": {
-                "target_catalog": "test_catalog",
-                "target_schema": "test_schema",
-            },
-        }
-        mock_launch_job.return_value = {
-            "message": "Stitch setup completed successfully.",
-            "tables_processed": 5,
-            "pii_columns_tagged": 8,
-            "config_created": True,
-            "config_path": "/Volumes/test_catalog/test_schema/_stitch/config.json",
-        }
-
-        # Call function with auto_confirm to use legacy behavior
-        result = handle_command(
-            self.client,
-            **{
-                "catalog_name": "test_catalog",
-                "schema_name": "test_schema",
-                "auto_confirm": True,
-            },
-        )
-
-        # Verify results
-        self.assertTrue(result.success)
-        self.assertEqual(result.message, "Stitch setup completed successfully.")
-        self.assertEqual(result.data["tables_processed"], 5)
-        self.assertEqual(result.data["pii_columns_tagged"], 8)
-        self.assertTrue(result.data["config_created"])
-        mock_helper_setup.assert_called_once_with(
-            self.client, llm_client_stub, "test_catalog", "test_schema"
-        )
-        mock_launch_job.assert_called_once_with(
-            self.client,
-            {},
-            {"target_catalog": "test_catalog", "target_schema": "test_schema"},
-        )
-
-        # Verify metrics collection
-        mock_metrics_collector.track_event.assert_called_once_with(
-            prompt="setup-stitch command",
-            tools=[
-                {
-                    "name": "setup_stitch",
-                    "arguments": {"catalog": "test_catalog", "schema": "test_schema"},
-                }
-            ],
-            additional_data={
-                "event_context": "direct_stitch_command",
-                "status": "success",
-                "tables_processed": 5,
-                "pii_columns_tagged": 8,
-                "config_created": True,
-                "config_path": "/Volumes/test_catalog/test_schema/_stitch/config.json",
-            },
-        )
-
-    @patch("chuck_data.commands.setup_stitch._helper_launch_stitch_job")
-    @patch("chuck_data.commands.setup_stitch.get_active_catalog")
-    @patch("chuck_data.commands.setup_stitch.get_active_schema")
-    @patch("chuck_data.commands.setup_stitch.LLMClient")
-    @patch("chuck_data.commands.setup_stitch._helper_setup_stitch_logic")
-    def test_setup_with_active_context(
-        self,
-        mock_helper_setup,
-        mock_llm_client,
-        mock_get_active_schema,
-        mock_get_active_catalog,
-        mock_launch_job,
-    ):
-        """Test Stitch setup using active catalog and schema."""
-        # Setup mocks
-        mock_get_active_catalog.return_value = "active_catalog"
-        mock_get_active_schema.return_value = "active_schema"
-
-        llm_client_stub = LLMClientStub()
-        mock_llm_client.return_value = llm_client_stub
-
-        mock_helper_setup.return_value = {
-            "stitch_config": {},
-            "metadata": {
-                "target_catalog": "active_catalog",
-                "target_schema": "active_schema",
-            },
-        }
-        mock_launch_job.return_value = {
-            "message": "Stitch setup completed.",
-            "tables_processed": 3,
-            "config_created": True,
-        }
-
-        # Call function without catalog/schema args, with auto_confirm
-        result = handle_command(self.client, **{"auto_confirm": True})
-
-        # Verify results
-        self.assertTrue(result.success)
-        mock_helper_setup.assert_called_once_with(
-            self.client, llm_client_stub, "active_catalog", "active_schema"
-        )
-        mock_launch_job.assert_called_once_with(
-            self.client,
-            {},
-            {"target_catalog": "active_catalog", "target_schema": "active_schema"},
-        )
-
-    @patch("chuck_data.commands.setup_stitch.LLMClient")
-    @patch("chuck_data.commands.setup_stitch._helper_setup_stitch_logic")
-    @patch("chuck_data.commands.setup_stitch.get_metrics_collector")
-    def test_setup_with_helper_error(
-        self, mock_get_metrics_collector, mock_helper_setup, mock_llm_client
-    ):
-        """Test handling when helper returns an error."""
-        # Setup mocks
-        llm_client_stub = LLMClientStub()
-        mock_llm_client.return_value = llm_client_stub
-        mock_metrics_collector = MagicMock()
-        mock_get_metrics_collector.return_value = mock_metrics_collector
-
-        mock_helper_setup.return_value = {"error": "Failed to scan tables for PII"}
-
-        # Call function with auto_confirm
-        result = handle_command(
-            self.client,
-            **{
-                "catalog_name": "test_catalog",
-                "schema_name": "test_schema",
-                "auto_confirm": True,
-            },
-        )
-
-        # Verify results
-        self.assertFalse(result.success)
-        self.assertEqual(result.message, "Failed to scan tables for PII")
-
-        # Verify metrics collection for error
-        mock_metrics_collector.track_event.assert_called_once_with(
-            prompt="setup-stitch command",
-            tools=[
-                {
-                    "name": "setup_stitch",
-                    "arguments": {"catalog": "test_catalog", "schema": "test_schema"},
-                }
-            ],
-            error="Failed to scan tables for PII",
-            additional_data={
-                "event_context": "direct_stitch_command",
-                "status": "error",
-            },
-        )
-
-    @patch("chuck_data.commands.setup_stitch.LLMClient")
-    def test_setup_with_exception(self, mock_llm_client):
-        """Test handling when an exception occurs."""
-        # Setup mocks
-        mock_llm_client.side_effect = Exception("LLM client error")
-
-        # Call function
-        result = handle_command(
-            self.client, catalog_name="test_catalog", schema_name="test_schema"
-        )
-
-        # Verify results
-        self.assertFalse(result.success)
-        self.assertIn("Error setting up Stitch", result.message)
-        self.assertEqual(str(result.error), "LLM client error")
diff --git a/tests/commands/test_status.py b/tests/commands/test_status.py
deleted file mode 100644
index 23e5b79..0000000
--- a/tests/commands/test_status.py
+++ /dev/null
@@ -1,106 +0,0 @@
-"""
-Tests for the status command module.
-"""
-
-import unittest
-from unittest.mock import patch, MagicMock
-
-from chuck_data.commands.status import handle_command
-
-
-class TestStatusCommand(unittest.TestCase):
-    """Test cases for the status command handler."""
-
-    def setUp(self):
-        """Set up common test fixtures."""
-        self.client = MagicMock()
-
-    @patch("chuck_data.commands.status.get_workspace_url")
-    @patch("chuck_data.commands.status.get_active_catalog")
-    @patch("chuck_data.commands.status.get_active_schema")
-    @patch("chuck_data.commands.status.get_active_model")
-    @patch("chuck_data.commands.status.validate_all_permissions")
-    def test_handle_status_with_valid_connection(
-        self,
-        mock_permissions,
-        mock_get_model,
-        mock_get_schema,
-        mock_get_catalog,
-        mock_get_url,
-    ):
-        """Test status command with valid connection."""
-        # Setup mocks
-        mock_get_url.return_value = "test-workspace"
-        mock_get_catalog.return_value = "test-catalog"
-        mock_get_schema.return_value = "test-schema"
-        mock_get_model.return_value = "test-model"
-        mock_permissions.return_value = {"test_resource": {"authorized": True}}
-
-        # Call function
-        result = handle_command(self.client)
-
-        # Verify result
-        self.assertTrue(result.success)
-        self.assertEqual(result.data["workspace_url"], "test-workspace")
-        self.assertEqual(result.data["active_catalog"], "test-catalog")
-        self.assertEqual(result.data["active_schema"], "test-schema")
-        self.assertEqual(result.data["active_model"], "test-model")
-        self.assertEqual(
-            result.data["connection_status"], "Connected (client present)."
-        )
-        self.assertEqual(result.data["permissions"], mock_permissions.return_value)
-
-    @patch("chuck_data.commands.status.get_workspace_url")
-    @patch("chuck_data.commands.status.get_active_catalog")
-    @patch("chuck_data.commands.status.get_active_schema")
-    @patch("chuck_data.commands.status.get_active_model")
-    def test_handle_status_with_no_client(
-        self, mock_get_model, mock_get_schema, mock_get_catalog, mock_get_url
-    ):
-        """Test status command with no client provided."""
-        # Setup mocks
-        mock_get_url.return_value = "test-workspace"
-        mock_get_catalog.return_value = "test-catalog"
-        mock_get_schema.return_value = "test-schema"
-        mock_get_model.return_value = "test-model"
-
-        # Call function with no client
-        result = handle_command(None)
-
-        # Verify result
-        self.assertTrue(result.success)
-        self.assertEqual(result.data["workspace_url"], "test-workspace")
-        self.assertEqual(result.data["active_catalog"], "test-catalog")
-        self.assertEqual(result.data["active_schema"], "test-schema")
-        self.assertEqual(result.data["active_model"], "test-model")
-        self.assertEqual(
-            result.data["connection_status"],
-            "Client not available or not initialized.",
-        )
-
-    @patch("chuck_data.commands.status.get_workspace_url")
-    @patch("chuck_data.commands.status.get_active_catalog")
-    @patch("chuck_data.commands.status.get_active_schema")
-    @patch("chuck_data.commands.status.get_active_model")
-    @patch("chuck_data.commands.status.validate_all_permissions")
-    @patch("logging.error")
-    def test_handle_status_with_exception(
-        self,
-        mock_log,
-        mock_permissions,
-        mock_get_model,
-        mock_get_schema,
-        mock_get_catalog,
-        mock_get_url,
-    ):
-        """Test status command when an exception occurs."""
-        # Setup mock to raise exception
-        mock_get_url.side_effect = ValueError("Config error")
-
-        # Call function
-        result = handle_command(self.client)
-
-        # Verify result
-        self.assertFalse(result.success)
-        self.assertIsNotNone(result.error)
-        mock_log.assert_called_once()
diff --git a/tests/commands/test_stitch_tools.py b/tests/commands/test_stitch_tools.py
deleted file mode 100644
index 6b47e1f..0000000
--- a/tests/commands/test_stitch_tools.py
+++ /dev/null
@@ -1,458 +0,0 @@
-"""
-Tests for stitch_tools command handler utilities.
-
-This module contains tests for the Stitch integration utilities.
-"""
-
-import unittest
-from unittest.mock import patch, MagicMock
-
-from chuck_data.commands.stitch_tools import _helper_setup_stitch_logic
-from tests.fixtures import LLMClientStub
-
-
-class TestStitchTools(unittest.TestCase):
-    """Tests for Stitch tool utility functions."""
-
-    def setUp(self):
-        """Set up common test fixtures."""
-        self.client = MagicMock()
-        self.llm_client = LLMClientStub()
-
-        # Mock a successful PII scan result
-        self.mock_pii_scan_results = {
-            "tables_successfully_processed": 5,
-            "tables_with_pii": 3,
-            "total_pii_columns": 8,
-            "results_detail": [
-                {
-                    "full_name": "test_catalog.test_schema.customers",
-                    "has_pii": True,
-                    "skipped": False,
-                    "columns": [
-                        {"name": "id", "type": "int", "semantic": None},
-                        {"name": "name", "type": "string", "semantic": "full-name"},
-                        {"name": "email", "type": "string", "semantic": "email"},
-                    ],
-                },
-                {
-                    "full_name": "test_catalog.test_schema.orders",
-                    "has_pii": True,
-                    "skipped": False,
-                    "columns": [
-                        {"name": "id", "type": "int", "semantic": None},
-                        {"name": "customer_id", "type": "int", "semantic": None},
-                        {
-                            "name": "shipping_address",
-                            "type": "string",
-                            "semantic": "address",
-                        },
-                    ],
-                },
-                {
-                    "full_name": "test_catalog.test_schema.metrics",
-                    "has_pii": False,
-                    "skipped": False,
-                    "columns": [
-                        {"name": "id", "type": "int", "semantic": None},
-                        {"name": "date", "type": "date", "semantic": None},
-                    ],
-                },
-            ],
-        }
-
-        # Mock PII scan results with unsupported types
-        self.mock_pii_scan_results_with_unsupported = {
-            "tables_successfully_processed": 2,
-            "tables_with_pii": 2,
-            "total_pii_columns": 4,
-            "results_detail": [
-                {
-                    "full_name": "test_catalog.test_schema.customers",
-                    "has_pii": True,
-                    "skipped": False,
-                    "columns": [
-                        {"name": "id", "type": "int", "semantic": None},
-                        {"name": "name", "type": "string", "semantic": "full-name"},
-                        {
-                            "name": "metadata",
-                            "type": "STRUCT",
-                            "semantic": None,
-                        },  # Unsupported
-                        {
-                            "name": "tags",
-                            "type": "ARRAY",
-                            "semantic": None,
-                        },  # Unsupported
-                    ],
-                },
-                {
-                    "full_name": "test_catalog.test_schema.geo_data",
-                    "has_pii": True,
-                    "skipped": False,
-                    "columns": [
-                        {
-                            "name": "location",
-                            "type": "GEOGRAPHY",
-                            "semantic": "address",
-                        },  # Unsupported
-                        {
-                            "name": "geometry",
-                            "type": "GEOMETRY",
-                            "semantic": None,
-                        },  # Unsupported
-                        {
-                            "name": "properties",
-                            "type": "MAP",
-                            "semantic": None,
-                        },  # Unsupported
-                        {
-                            "name": "description",
-                            "type": "string",
-                            "semantic": "full-name",
-                        },
-                    ],
-                },
-            ],
-        }
-
-    def test_missing_params(self):
-        """Test handling when parameters are missing."""
-        result = _helper_setup_stitch_logic(
-            self.client, self.llm_client, "", "test_schema"
-        )
-        self.assertIn("error", result)
-        self.assertIn("Target catalog and schema are required", result["error"])
-
-    @patch("chuck_data.commands.stitch_tools._helper_scan_schema_for_pii_logic")
-    def test_pii_scan_error(self, mock_scan_pii):
-        """Test handling when PII scan returns an error."""
-        # Setup mock
-        mock_scan_pii.return_value = {"error": "Failed to access tables"}
-
-        # Call function
-        result = _helper_setup_stitch_logic(
-            self.client, self.llm_client, "test_catalog", "test_schema"
-        )
-
-        # Verify results
-        self.assertIn("error", result)
-        self.assertIn("PII Scan failed during Stitch setup", result["error"])
-
-    @patch("chuck_data.commands.stitch_tools._helper_scan_schema_for_pii_logic")
-    def test_volume_list_error(self, mock_scan_pii):
-        """Test handling when listing volumes fails."""
-        # Setup mocks
-        mock_scan_pii.return_value = self.mock_pii_scan_results
-        self.client.list_volumes.side_effect = Exception("API Error")
-
-        # Call function
-        result = _helper_setup_stitch_logic(
-            self.client, self.llm_client, "test_catalog", "test_schema"
-        )
-
-        # Verify results
-        self.assertIn("error", result)
-        self.assertIn("Failed to list volumes", result["error"])
-
-    @patch("chuck_data.commands.stitch_tools._helper_scan_schema_for_pii_logic")
-    def test_volume_create_error(self, mock_scan_pii):
-        """Test handling when creating volume fails."""
-        # Setup mocks
-        mock_scan_pii.return_value = self.mock_pii_scan_results
-        self.client.list_volumes.return_value = {
-            "volumes": []
-        }  # Empty list, volume doesn't exist
-        self.client.create_volume.return_value = None  # Creation failed
-
-        # Call function
-        result = _helper_setup_stitch_logic(
-            self.client, self.llm_client, "test_catalog", "test_schema"
-        )
-
-        # Verify results
-        self.assertIn("error", result)
-        self.assertIn("Failed to create volume 'chuck'", result["error"])
-
-    @patch("chuck_data.commands.stitch_tools._helper_scan_schema_for_pii_logic")
-    def test_no_tables_with_pii(self, mock_scan_pii):
-        """Test handling when no tables with PII are found."""
-        # Setup mocks
-        no_pii_results = self.mock_pii_scan_results.copy()
-        # Override results_detail with no tables that have PII
-        no_pii_results["results_detail"] = [
-            {
-                "full_name": "test_catalog.test_schema.metrics",
-                "has_pii": False,
-                "skipped": False,
-                "columns": [{"name": "id", "type": "int", "semantic": None}],
-            }
-        ]
-        mock_scan_pii.return_value = no_pii_results
-        self.client.list_volumes.return_value = {
-            "volumes": [{"name": "chuck"}]
-        }  # Volume exists
-
-        # Call function
-        result = _helper_setup_stitch_logic(
-            self.client, self.llm_client, "test_catalog", "test_schema"
-        )
-
-        # Verify results
-        self.assertIn("error", result)
-        self.assertIn("No tables with PII found", result["error"])
-
-    @patch("chuck_data.commands.stitch_tools._helper_scan_schema_for_pii_logic")
-    @patch("chuck_data.commands.stitch_tools.get_amperity_token")
-    def test_missing_amperity_token(self, mock_get_amperity_token, mock_scan_pii):
-        """Test handling when Amperity token is missing."""
-        # Setup mocks
-        mock_scan_pii.return_value = self.mock_pii_scan_results
-        self.client.list_volumes.return_value = {
-            "volumes": [{"name": "chuck"}]
-        }  # Volume exists
-        self.client.upload_file.return_value = True  # Config file upload successful
-        mock_get_amperity_token.return_value = None  # No token
-
-        # Call function
-        result = _helper_setup_stitch_logic(
-            self.client, self.llm_client, "test_catalog", "test_schema"
-        )
-
-        # Verify results
-        self.assertIn("error", result)
-        self.assertIn("Amperity token not found", result["error"])
-
-    @patch("chuck_data.commands.stitch_tools._helper_scan_schema_for_pii_logic")
-    @patch("chuck_data.commands.stitch_tools.get_amperity_token")
-    def test_amperity_init_script_error(self, mock_get_amperity_token, mock_scan_pii):
-        """Test handling when fetching Amperity init script fails."""
-        # Setup mocks
-        mock_scan_pii.return_value = self.mock_pii_scan_results
-        self.client.list_volumes.return_value = {
-            "volumes": [{"name": "chuck"}]
-        }  # Volume exists
-        self.client.upload_file.return_value = True  # Config file upload successful
-        mock_get_amperity_token.return_value = "fake_token"
-        self.client.fetch_amperity_job_init.side_effect = Exception("API Error")
-
-        # Call function
-        result = _helper_setup_stitch_logic(
-            self.client, self.llm_client, "test_catalog", "test_schema"
-        )
-
-        # Verify results
-        self.assertIn("error", result)
-        self.assertIn("Error fetching Amperity init script", result["error"])
-
-    @patch("chuck_data.commands.stitch_tools._helper_scan_schema_for_pii_logic")
-    @patch("chuck_data.commands.stitch_tools.get_amperity_token")
-    @patch("chuck_data.commands.stitch_tools._helper_upload_cluster_init_logic")
-    def test_versioned_init_script_upload_error(
-        self, mock_upload_init, mock_get_amperity_token, mock_scan_pii
-    ):
-        """Test handling when versioned init script upload fails."""
-        # Setup mocks
-        mock_scan_pii.return_value = self.mock_pii_scan_results
-        self.client.list_volumes.return_value = {
-            "volumes": [{"name": "chuck"}]
-        }  # Volume exists
-        mock_get_amperity_token.return_value = "fake_token"
-        self.client.fetch_amperity_job_init.return_value = {
-            "cluster-init": "echo 'init script'"
-        }
-        # Mock versioned init script upload failure
-        mock_upload_init.return_value = {
-            "error": "Failed to upload versioned init script"
-        }
-
-        # Call function
-        result = _helper_setup_stitch_logic(
-            self.client, self.llm_client, "test_catalog", "test_schema"
-        )
-
-        # Verify results
-        self.assertIn("error", result)
-        self.assertEqual(result["error"], "Failed to upload versioned init script")
-
-    @patch("chuck_data.commands.stitch_tools._helper_scan_schema_for_pii_logic")
-    @patch("chuck_data.commands.stitch_tools.get_amperity_token")
-    @patch("chuck_data.commands.stitch_tools._helper_upload_cluster_init_logic")
-    def test_successful_setup(
-        self, mock_upload_init, mock_get_amperity_token, mock_scan_pii
-    ):
-        """Test successful Stitch integration setup with versioned init script."""
-        # Setup mocks
-        mock_scan_pii.return_value = self.mock_pii_scan_results
-        self.client.list_volumes.return_value = {
-            "volumes": [{"name": "chuck"}]
-        }  # Volume exists
-        self.client.upload_file.return_value = True  # File uploads successful
-        mock_get_amperity_token.return_value = "fake_token"
-        self.client.fetch_amperity_job_init.return_value = {
-            "cluster-init": "echo 'init script'"
-        }
-        # Mock versioned init script upload
-        mock_upload_init.return_value = {
-            "success": True,
-            "volume_path": "/Volumes/test_catalog/test_schema/chuck/cluster_init-2025-06-02_14-30.sh",
-            "filename": "cluster_init-2025-06-02_14-30.sh",
-            "timestamp": "2025-06-02_14-30",
-        }
-        self.client.submit_job_run.return_value = {"run_id": "12345"}
-
-        # Call function
-        result = _helper_setup_stitch_logic(
-            self.client, self.llm_client, "test_catalog", "test_schema"
-        )
-
-        # Verify results
-        self.assertTrue(result.get("success"))
-        self.assertIn("stitch_config", result)
-        self.assertIn("metadata", result)
-        metadata = result["metadata"]
-        self.assertIn("config_file_path", metadata)
-        self.assertIn("init_script_path", metadata)
-        self.assertEqual(
-            metadata["init_script_path"],
-            "/Volumes/test_catalog/test_schema/chuck/cluster_init-2025-06-02_14-30.sh",
-        )
-
-        # Verify versioned init script upload was called
-        mock_upload_init.assert_called_once_with(
-            client=self.client,
-            target_catalog="test_catalog",
-            target_schema="test_schema",
-            init_script_content="echo 'init script'",
-        )
-
-        # Verify no unsupported columns warning when all columns are supported
-        self.assertIn("unsupported_columns", metadata)
-        self.assertEqual(len(metadata["unsupported_columns"]), 0)
-        self.assertNotIn("Note: Some columns were excluded", result.get("message", ""))
-
-    @patch("chuck_data.commands.stitch_tools._helper_scan_schema_for_pii_logic")
-    @patch("chuck_data.commands.stitch_tools.get_amperity_token")
-    @patch("chuck_data.commands.stitch_tools._helper_upload_cluster_init_logic")
-    def test_unsupported_types_filtered(
-        self, mock_upload_init, mock_get_amperity_token, mock_scan_pii
-    ):
-        """Test that unsupported column types are filtered out from Stitch config."""
-        # Setup mocks
-        mock_scan_pii.return_value = self.mock_pii_scan_results_with_unsupported
-        self.client.list_volumes.return_value = {
-            "volumes": [{"name": "chuck"}]
-        }  # Volume exists
-        self.client.upload_file.return_value = True  # File uploads successful
-        mock_get_amperity_token.return_value = "fake_token"
-        self.client.fetch_amperity_job_init.return_value = {
-            "cluster-init": "echo 'init script'"
-        }
-        # Mock versioned init script upload
-        mock_upload_init.return_value = {
-            "success": True,
-            "volume_path": "/Volumes/test_catalog/test_schema/chuck/cluster_init-2025-06-02_14-30.sh",
-            "filename": "cluster_init-2025-06-02_14-30.sh",
-            "timestamp": "2025-06-02_14-30",
-        }
-        self.client.submit_job_run.return_value = {"run_id": "12345"}
-
-        # Call function
-        result = _helper_setup_stitch_logic(
-            self.client, self.llm_client, "test_catalog", "test_schema"
-        )
-
-        # Verify results
-        self.assertTrue(result.get("success"))
-
-        # Get the generated config content
-        import json
-
-        config_content = json.dumps(result["stitch_config"])
-
-        # Verify unsupported types are not in the config
-        unsupported_types = ["STRUCT", "ARRAY", "GEOGRAPHY", "GEOMETRY", "MAP"]
-        for unsupported_type in unsupported_types:
-            self.assertNotIn(
-                unsupported_type,
-                config_content,
-                f"Config should not contain unsupported type: {unsupported_type}",
-            )
-
-        # Verify supported types are still included
-        self.assertIn(
-            "int", config_content, "Config should contain supported type: int"
-        )
-        self.assertIn(
-            "string", config_content, "Config should contain supported type: string"
-        )
-
-        # Verify unsupported columns are reported to user
-        self.assertIn("metadata", result)
-        metadata = result["metadata"]
-        self.assertIn("unsupported_columns", metadata)
-        unsupported_info = metadata["unsupported_columns"]
-        self.assertEqual(
-            len(unsupported_info), 2
-        )  # Two tables have unsupported columns
-
-        # Check first table (customers)
-        customers_unsupported = next(
-            t for t in unsupported_info if "customers" in t["table"]
-        )
-        self.assertEqual(len(customers_unsupported["columns"]), 2)  # metadata and tags
-        column_types = [col["type"] for col in customers_unsupported["columns"]]
-        self.assertIn("STRUCT", column_types)
-        self.assertIn("ARRAY", column_types)
-
-        # Check second table (geo_data)
-        geo_unsupported = next(t for t in unsupported_info if "geo_data" in t["table"])
-        self.assertEqual(
-            len(geo_unsupported["columns"]), 3
-        )  # location, geometry, properties
-        geo_column_types = [col["type"] for col in geo_unsupported["columns"]]
-        self.assertIn("GEOGRAPHY", geo_column_types)
-        self.assertIn("GEOMETRY", geo_column_types)
-        self.assertIn("MAP", geo_column_types)
-
-        # Verify warning message includes unsupported columns info in metadata
-        self.assertIn("unsupported_columns", metadata)
-
-    @patch("chuck_data.commands.stitch_tools._helper_scan_schema_for_pii_logic")
-    @patch("chuck_data.commands.stitch_tools.get_amperity_token")
-    def test_all_columns_unsupported_types(
-        self, mock_get_amperity_token, mock_scan_pii
-    ):
-        """Test handling when all columns have unsupported types."""
-        # Setup mocks with all unsupported types
-        all_unsupported_results = {
-            "tables_successfully_processed": 1,
-            "tables_with_pii": 1,
-            "total_pii_columns": 2,
-            "results_detail": [
-                {
-                    "full_name": "test_catalog.test_schema.complex_data",
-                    "has_pii": True,
-                    "skipped": False,
-                    "columns": [
-                        {"name": "metadata", "type": "STRUCT", "semantic": "full-name"},
-                        {"name": "tags", "type": "ARRAY", "semantic": "address"},
-                        {"name": "location", "type": "GEOGRAPHY", "semantic": None},
-                    ],
-                },
-            ],
-        }
-        mock_scan_pii.return_value = all_unsupported_results
-        self.client.list_volumes.return_value = {
-            "volumes": [{"name": "chuck"}]
-        }  # Volume exists
-        mock_get_amperity_token.return_value = "fake_token"  # Add token mock
-
-        # Call function
-        result = _helper_setup_stitch_logic(
-            self.client, self.llm_client, "test_catalog", "test_schema"
-        )
-
-        # Verify results - should fail because no supported columns remain
-        self.assertIn("error", result)
-        self.assertIn("No tables with PII found", result["error"])
diff --git a/tests/commands/test_tag_pii.py b/tests/commands/test_tag_pii.py
deleted file mode 100644
index 2ffc8f3..0000000
--- a/tests/commands/test_tag_pii.py
+++ /dev/null
@@ -1,199 +0,0 @@
-"""Unit tests for tag_pii command."""
-
-import os
-import tempfile
-from unittest.mock import MagicMock, patch
-
-from chuck_data.commands.tag_pii import handle_command, apply_semantic_tags
-from chuck_data.commands.base import CommandResult
-from chuck_data.config import (
-    ConfigManager,
-    set_warehouse_id,
-    set_active_catalog,
-    set_active_schema,
-)
-from tests.fixtures import DatabricksClientStub
-
-
-class TestTagPiiCommand:
-    """Test cases for the tag_pii command handler."""
-
-    def setup_method(self):
-        """Set up test fixtures."""
-        self.client_stub = DatabricksClientStub()
-
-        # Set up config management
-        self.temp_dir = tempfile.TemporaryDirectory()
-        self.config_path = os.path.join(self.temp_dir.name, "test_config.json")
-        self.config_manager = ConfigManager(self.config_path)
-        self.patcher = patch("chuck_data.config._config_manager", self.config_manager)
-        self.patcher.start()
-
-    def teardown_method(self):
-        self.patcher.stop()
-        self.temp_dir.cleanup()
-
-    def test_missing_table_name(self):
-        """Test that missing table_name parameter is handled correctly."""
-        result = handle_command(
-            None, pii_columns=[{"name": "test", "semantic": "email"}]
-        )
-
-        assert isinstance(result, CommandResult)
-        assert not result.success
-        assert "table_name parameter is required" in result.message
-
-    def test_missing_pii_columns(self):
-        """Test that missing pii_columns parameter is handled correctly."""
-        result = handle_command(None, table_name="test_table")
-
-        assert isinstance(result, CommandResult)
-        assert not result.success
-        assert "pii_columns parameter is required" in result.message
-
-    def test_empty_pii_columns(self):
-        """Test that empty pii_columns list is handled correctly."""
-        result = handle_command(None, table_name="test_table", pii_columns=[])
-
-        assert isinstance(result, CommandResult)
-        assert not result.success
-        assert "pii_columns parameter is required" in result.message
-
-    def test_missing_client(self):
-        """Test that missing client is handled correctly."""
-        result = handle_command(
-            None,
-            table_name="test_table",
-            pii_columns=[{"name": "test", "semantic": "email"}],
-        )
-
-        assert isinstance(result, CommandResult)
-        assert not result.success
-        assert "Client is required for PII tagging" in result.message
-
-    def test_missing_warehouse_id(self):
-        """Test that missing warehouse ID is handled correctly."""
-        # Don't set warehouse ID in config
-
-        result = handle_command(
-            self.client_stub,
-            table_name="test_table",
-            pii_columns=[{"name": "test", "semantic": "email"}],
-        )
-
-        assert isinstance(result, CommandResult)
-        assert not result.success
-        assert "No warehouse ID configured" in result.message
-
-    def test_missing_catalog_schema_for_simple_table_name(self):
-        """Test that missing catalog/schema for simple table name is handled."""
-        set_warehouse_id("warehouse123")
-        # Don't set active catalog/schema
-
-        result = handle_command(
-            self.client_stub,
-            table_name="simple_table",  # No dots, so needs catalog/schema
-            pii_columns=[{"name": "test", "semantic": "email"}],
-        )
-
-        assert isinstance(result, CommandResult)
-        assert not result.success
-        assert "No active catalog and schema selected" in result.message
-
-    def test_table_not_found(self):
-        """Test that table not found is handled correctly."""
-        set_warehouse_id("warehouse123")
-        set_active_catalog("test_catalog")
-        set_active_schema("test_schema")
-
-        # Don't add the table to stub - will cause table not found
-
-        result = handle_command(
-            self.client_stub,
-            table_name="nonexistent_table",
-            pii_columns=[{"name": "test", "semantic": "email"}],
-        )
-
-        assert isinstance(result, CommandResult)
-        assert not result.success
-        assert (
-            "Table test_catalog.test_schema.nonexistent_table not found"
-            in result.message
-        )
-
-    def test_apply_semantic_tags_success(self):
-        """Test successful application of semantic tags."""
-        pii_columns = [
-            {"name": "email_col", "semantic": "email"},
-            {"name": "name_col", "semantic": "given-name"},
-        ]
-
-        results = apply_semantic_tags(
-            self.client_stub, "catalog.schema.table", pii_columns, "warehouse123"
-        )
-
-        assert len(results) == 2
-        assert all(r["success"] for r in results)
-        assert results[0]["column"] == "email_col"
-        assert results[0]["semantic_type"] == "email"
-        assert results[1]["column"] == "name_col"
-        assert results[1]["semantic_type"] == "given-name"
-
-    def test_apply_semantic_tags_missing_data(self):
-        """Test handling of missing column data in apply_semantic_tags."""
-        pii_columns = [
-            {"name": "email_col"},  # Missing semantic type
-            {"semantic": "email"},  # Missing column name
-            {"name": "good_col", "semantic": "phone"},  # Good data
-        ]
-
-        results = apply_semantic_tags(
-            self.client_stub, "catalog.schema.table", pii_columns, "warehouse123"
-        )
-
-        assert len(results) == 3
-        assert not results[0]["success"]  # Missing semantic type
-        assert not results[1]["success"]  # Missing column name
-        assert results[2]["success"]  # Good data
-
-        assert "Missing column name or semantic type" in results[0]["error"]
-        assert "Missing column name or semantic type" in results[1]["error"]
-
-    def test_apply_semantic_tags_sql_failure(self):
-        """Test handling of SQL execution failures."""
-
-        # Create a stub that returns SQL failure
-        class FailingSQLStub(DatabricksClientStub):
-            def submit_sql_statement(self, sql_text=None, sql=None, **kwargs):
-                return {
-                    "status": {
-                        "state": "FAILED",
-                        "error": {"message": "SQL execution failed"},
-                    }
-                }
-
-        failing_client = FailingSQLStub()
-        pii_columns = [{"name": "email_col", "semantic": "email"}]
-
-        results = apply_semantic_tags(
-            failing_client, "catalog.schema.table", pii_columns, "warehouse123"
-        )
-
-        assert len(results) == 1
-        assert not results[0]["success"]
-        assert "SQL execution failed" in results[0]["error"]
-
-    def test_apply_semantic_tags_exception(self):
-        """Test handling of exceptions during SQL execution."""
-        mock_client = MagicMock()
-        mock_client.submit_sql_statement.side_effect = Exception("Connection error")
-
-        pii_columns = [{"name": "email_col", "semantic": "email"}]
-
-        results = apply_semantic_tags(
-            mock_client, "catalog.schema.table", pii_columns, "warehouse123"
-        )
-
-        assert len(results) == 1
-        assert not results[0]["success"]
-        assert "Connection error" in results[0]["error"]
diff --git a/tests/commands/test_warehouse_selection.py b/tests/commands/test_warehouse_selection.py
deleted file mode 100644
index 9c8fa96..0000000
--- a/tests/commands/test_warehouse_selection.py
+++ /dev/null
@@ -1,153 +0,0 @@
-"""
-Tests for warehouse_selection command handler.
-
-This module contains tests for the warehouse selection command handler.
-"""
-
-import unittest
-import os
-import tempfile
-from unittest.mock import patch
-
-from chuck_data.commands.warehouse_selection import handle_command
-from chuck_data.config import ConfigManager, get_warehouse_id
-from tests.fixtures import DatabricksClientStub
-
-
-class TestWarehouseSelection(unittest.TestCase):
-    """Tests for warehouse selection command handler."""
-
-    def setUp(self):
-        """Set up test fixtures."""
-        self.client_stub = DatabricksClientStub()
-
-        # Set up config management
-        self.temp_dir = tempfile.TemporaryDirectory()
-        self.config_path = os.path.join(self.temp_dir.name, "test_config.json")
-        self.config_manager = ConfigManager(self.config_path)
-        self.patcher = patch("chuck_data.config._config_manager", self.config_manager)
-        self.patcher.start()
-
-    def tearDown(self):
-        self.patcher.stop()
-        self.temp_dir.cleanup()
-
-    def test_missing_warehouse_parameter(self):
-        """Test handling when warehouse parameter is not provided."""
-        result = handle_command(self.client_stub)
-        self.assertFalse(result.success)
-        self.assertIn(
-            "warehouse parameter is required",
-            result.message,
-        )
-
-    def test_successful_warehouse_selection_by_id(self):
-        """Test successful warehouse selection by ID."""
-        # Set up warehouse in stub
-        self.client_stub.add_warehouse(
-            name="Test Warehouse", state="RUNNING", size="2X-Small"
-        )
-        # The warehouse_id should be "warehouse_0" based on the stub implementation
-        warehouse_id = "warehouse_0"
-
-        # Call function with warehouse ID
-        result = handle_command(self.client_stub, warehouse=warehouse_id)
-
-        # Verify results
-        self.assertTrue(result.success)
-        self.assertIn(
-            "Active SQL warehouse is now set to 'Test Warehouse'", result.message
-        )
-        self.assertIn(f"(ID: {warehouse_id}", result.message)
-        self.assertIn("State: RUNNING", result.message)
-        self.assertEqual(result.data["warehouse_id"], warehouse_id)
-        self.assertEqual(result.data["warehouse_name"], "Test Warehouse")
-        self.assertEqual(result.data["state"], "RUNNING")
-
-        # Verify config was updated
-        self.assertEqual(get_warehouse_id(), warehouse_id)
-
-    def test_warehouse_selection_with_verification_failure(self):
-        """Test warehouse selection when verification fails."""
-        # Add a warehouse to stub but call with different ID - will cause verification failure
-        self.client_stub.add_warehouse(
-            name="Production Warehouse", state="RUNNING", size="2X-Small"
-        )
-
-        # Call function with non-existent warehouse ID that won't match by name
-        result = handle_command(
-            self.client_stub, warehouse="xyz-completely-different-name"
-        )
-
-        # Verify results - should now fail when warehouse is not found
-        self.assertFalse(result.success)
-        self.assertIn(
-            "No warehouse found matching 'xyz-completely-different-name'",
-            result.message,
-        )
-
-    def test_warehouse_selection_no_client(self):
-        """Test warehouse selection with no client available."""
-        # Call function with no client
-        result = handle_command(None, warehouse="abc123")
-
-        # Verify results - should now fail when no client is available
-        self.assertFalse(result.success)
-        self.assertIn(
-            "No API client available to verify warehouse",
-            result.message,
-        )
-
-    def test_warehouse_selection_exception(self):
-        """Test warehouse selection with unexpected exception."""
-
-        # Create a stub that raises an exception during warehouse verification
-        class FailingStub(DatabricksClientStub):
-            def get_warehouse(self, warehouse_id):
-                raise Exception("Failed to set warehouse")
-
-            def list_warehouses(self, **kwargs):
-                raise Exception("Failed to list warehouses")
-
-        failing_stub = FailingStub()
-
-        # Call function
-        result = handle_command(failing_stub, warehouse="abc123")
-
-        # Should fail when both get_warehouse and list_warehouses fail
-        self.assertFalse(result.success)
-        self.assertIn("Failed to list warehouses", result.message)
-
-    def test_warehouse_selection_by_name(self):
-        """Test warehouse selection by name parameter."""
-        # Set up warehouse in stub
-        self.client_stub.add_warehouse(
-            name="Test Warehouse", state="RUNNING", size="2X-Small"
-        )
-
-        # Call function with warehouse name
-        result = handle_command(self.client_stub, warehouse="Test Warehouse")
-
-        # Verify results
-        self.assertTrue(result.success)
-        self.assertIn(
-            "Active SQL warehouse is now set to 'Test Warehouse'", result.message
-        )
-        self.assertEqual(result.data["warehouse_name"], "Test Warehouse")
-
-    def test_warehouse_selection_fuzzy_matching(self):
-        """Test warehouse selection with fuzzy name matching."""
-        # Set up warehouse in stub
-        self.client_stub.add_warehouse(
-            name="Starter Warehouse", state="RUNNING", size="2X-Small"
-        )
-
-        # Call function with partial name match
-        result = handle_command(self.client_stub, warehouse="Starter")
-
-        # Verify results
-        self.assertTrue(result.success)
-        self.assertIn(
-            "Active SQL warehouse is now set to 'Starter Warehouse'", result.message
-        )
-        self.assertEqual(result.data["warehouse_name"], "Starter Warehouse")
diff --git a/tests/commands/test_workspace_selection.py b/tests/commands/test_workspace_selection.py
deleted file mode 100644
index ed015e3..0000000
--- a/tests/commands/test_workspace_selection.py
+++ /dev/null
@@ -1,90 +0,0 @@
-"""
-Tests for workspace_selection command handler.
-
-This module contains tests for the workspace selection command handler.
-"""
-
-import unittest
-from unittest.mock import patch
-
-from chuck_data.commands.workspace_selection import handle_command
-
-
-class TestWorkspaceSelection(unittest.TestCase):
-    """Tests for workspace selection command handler."""
-
-    def test_missing_workspace_url(self):
-        """Test handling when workspace_url is not provided."""
-        result = handle_command(None)
-        self.assertFalse(result.success)
-        self.assertIn("workspace_url parameter is required", result.message)
-
-    @patch("chuck_data.databricks.url_utils.validate_workspace_url")
-    def test_invalid_workspace_url(self, mock_validate_workspace_url):
-        """Test handling when workspace_url is invalid."""
-        # Setup mocks
-        mock_validate_workspace_url.return_value = (False, "Invalid URL format")
-
-        # Call function
-        result = handle_command(None, workspace_url="invalid-url")
-
-        # Verify results
-        self.assertFalse(result.success)
-        self.assertIn("Error: Invalid URL format", result.message)
-        mock_validate_workspace_url.assert_called_once_with("invalid-url")
-
-    @patch("chuck_data.databricks.url_utils.validate_workspace_url")
-    @patch("chuck_data.databricks.url_utils.normalize_workspace_url")
-    @patch("chuck_data.databricks.url_utils.detect_cloud_provider")
-    @patch("chuck_data.databricks.url_utils.format_workspace_url_for_display")
-    @patch("chuck_data.commands.workspace_selection.set_workspace_url")
-    def test_successful_workspace_selection(
-        self,
-        mock_set_workspace_url,
-        mock_format_url,
-        mock_detect_cloud,
-        mock_normalize_url,
-        mock_validate_url,
-    ):
-        """Test successful workspace selection."""
-        # Setup mocks
-        mock_validate_url.return_value = (True, "")
-        mock_normalize_url.return_value = "dbc-example.cloud.databricks.com"
-        mock_detect_cloud.return_value = "Azure"
-        mock_format_url.return_value = "dbc-example (Azure)"
-
-        # Call function
-        result = handle_command(
-            None, workspace_url="https://dbc-example.cloud.databricks.com"
-        )
-
-        # Verify results
-        self.assertTrue(result.success)
-        self.assertIn(
-            "Workspace URL is now set to 'dbc-example (Azure)'", result.message
-        )
-        self.assertIn("Restart may be needed", result.message)
-        self.assertEqual(
-            result.data["workspace_url"], "https://dbc-example.cloud.databricks.com"
-        )
-        self.assertEqual(result.data["display_url"], "dbc-example (Azure)")
-        self.assertEqual(result.data["cloud_provider"], "Azure")
-        self.assertTrue(result.data["requires_restart"])
-        mock_set_workspace_url.assert_called_once_with(
-            "https://dbc-example.cloud.databricks.com"
-        )
-
-    @patch("chuck_data.databricks.url_utils.validate_workspace_url")
-    def test_workspace_url_exception(self, mock_validate_workspace_url):
-        """Test handling when an exception occurs."""
-        # Setup mocks
-        mock_validate_workspace_url.side_effect = Exception("Validation error")
-
-        # Call function
-        result = handle_command(
-            None, workspace_url="https://dbc-example.databricks.com"
-        )
-
-        # Verify results
-        self.assertFalse(result.success)
-        self.assertEqual(str(result.error), "Validation error")
diff --git a/tests/conftest.py b/tests/conftest.py
new file mode 100644
index 0000000..de6abab
--- /dev/null
+++ b/tests/conftest.py
@@ -0,0 +1,66 @@
+"""Pytest fixtures for Chuck tests."""
+
+import pytest
+import tempfile
+import os
+from unittest.mock import MagicMock
+
+from tests.fixtures.databricks.client import DatabricksClientStub
+from tests.fixtures.amperity import AmperityClientStub
+from tests.fixtures.llm import LLMClientStub
+from tests.fixtures.collectors import MetricsCollectorStub
+from chuck_data.config import ConfigManager
+
+# Import environment fixtures to make them available globally
+
+
+@pytest.fixture
+def databricks_client_stub():
+    """Create a fresh DatabricksClientStub for each test."""
+    return DatabricksClientStub()
+
+
+@pytest.fixture
+def databricks_client_stub_with_data():
+    """Create a DatabricksClientStub with default test data."""
+    stub = DatabricksClientStub()
+    # Add some default test data
+    stub.add_catalog("test_catalog", catalog_type="MANAGED")
+    stub.add_schema("test_catalog", "test_schema")
+    stub.add_table("test_catalog", "test_schema", "test_table")
+    stub.add_warehouse(warehouse_id="test-warehouse", name="Test Warehouse")
+    return stub
+
+
+@pytest.fixture
+def amperity_client_stub():
+    """Create a fresh AmperityClientStub for each test."""
+    return AmperityClientStub()
+
+
+@pytest.fixture
+def llm_client_stub():
+    """Create a fresh LLMClientStub for each test."""
+    return LLMClientStub()
+
+
+@pytest.fixture
+def metrics_collector_stub():
+    """Create a fresh MetricsCollectorStub for each test."""
+    return MetricsCollectorStub()
+
+
+@pytest.fixture
+def temp_config():
+    """Create a temporary config file for testing."""
+    temp_dir = tempfile.TemporaryDirectory()
+    config_path = os.path.join(temp_dir.name, "test_config.json")
+    config_manager = ConfigManager(config_path)
+    yield config_manager
+    temp_dir.cleanup()
+
+
+@pytest.fixture
+def mock_console():
+    """Create a mock console for TUI testing."""
+    return MagicMock()
diff --git a/tests/fixtures.py b/tests/fixtures.py
deleted file mode 100644
index 101854d..0000000
--- a/tests/fixtures.py
+++ /dev/null
@@ -1,807 +0,0 @@
-"""Test fixtures for Chuck tests."""
-
-
-class AmperityClientStub:
-    """Comprehensive stub for AmperityAPIClient with predictable responses."""
-
-    def __init__(self):
-        self.base_url = "chuck.amperity.com"
-        self.nonce = None
-        self.token = None
-        self.state = "pending"
-        self.auth_thread = None
-
-        # Test configuration
-        self.should_fail_auth_start = False
-        self.should_fail_auth_completion = False
-        self.should_fail_metrics = False
-        self.should_fail_bug_report = False
-        self.should_raise_exception = False
-        self.auth_completion_delay = 0
-
-        # Track method calls for testing
-        self.metrics_calls = []
-
-    def start_auth(self) -> tuple[bool, str]:
-        """Start the authentication process."""
-        if self.should_fail_auth_start:
-            return False, "Failed to start auth: 500 - Server Error"
-
-        self.nonce = "test-nonce-123"
-        self.state = "started"
-        return True, "Authentication started. Please log in via the browser."
-
-    def get_auth_status(self) -> dict:
-        """Return the current authentication status."""
-        return {"state": self.state, "nonce": self.nonce, "has_token": bool(self.token)}
-
-    def wait_for_auth_completion(
-        self, poll_interval: int = 1, timeout: int = None
-    ) -> tuple[bool, str]:
-        """Wait for authentication to complete in a blocking manner."""
-        if not self.nonce:
-            return False, "Authentication not started"
-
-        if self.should_fail_auth_completion:
-            self.state = "error"
-            return False, "Authentication failed: error"
-
-        # Simulate successful authentication
-        self.state = "success"
-        self.token = "test-auth-token-456"
-        return True, "Authentication completed successfully."
-
-    def submit_metrics(self, payload: dict, token: str) -> bool:
-        """Send usage metrics to the Amperity API."""
-        # Track the call
-        self.metrics_calls.append((payload, token))
-
-        if self.should_raise_exception:
-            raise Exception("Test exception")
-
-        if self.should_fail_metrics:
-            return False
-
-        # Validate basic payload structure
-        if not isinstance(payload, dict):
-            return False
-
-        if not token:
-            return False
-
-        return True
-
-    def submit_bug_report(self, payload: dict, token: str) -> tuple[bool, str]:
-        """Send a bug report to the Amperity API."""
-        if self.should_fail_bug_report:
-            return False, "Failed to submit bug report: 500"
-
-        # Validate basic payload structure
-        if not isinstance(payload, dict):
-            return False, "Invalid payload format"
-
-        if not token:
-            return False, "Authentication token required"
-
-        return True, "Bug report submitted successfully"
-
-    def _poll_auth_state(self) -> None:
-        """Poll the auth state endpoint until authentication is complete."""
-        # In stub, this is a no-op since we control state directly
-        pass
-
-    # Helper methods for test configuration
-    def set_auth_start_failure(self, should_fail: bool = True):
-        """Configure whether start_auth should fail."""
-        self.should_fail_auth_start = should_fail
-
-    def set_auth_completion_failure(self, should_fail: bool = True):
-        """Configure whether wait_for_auth_completion should fail."""
-        self.should_fail_auth_completion = should_fail
-
-    def set_metrics_failure(self, should_fail: bool = True):
-        """Configure whether submit_metrics should fail."""
-        self.should_fail_metrics = should_fail
-
-    def set_bug_report_failure(self, should_fail: bool = True):
-        """Configure whether submit_bug_report should fail."""
-        self.should_fail_bug_report = should_fail
-
-    def reset(self):
-        """Reset all state to initial values."""
-        self.nonce = None
-        self.token = None
-        self.state = "pending"
-        self.auth_thread = None
-        self.should_fail_auth_start = False
-        self.should_fail_auth_completion = False
-        self.should_fail_metrics = False
-        self.should_fail_bug_report = False
-        self.auth_completion_delay = 0
-
-
-class DatabricksClientStub:
-    """Comprehensive stub for DatabricksAPIClient with predictable responses."""
-
-    def __init__(self):
-        # Initialize with default data
-        self.catalogs = []
-        self.schemas = {}  # catalog_name -> [schemas]
-        self.tables = {}  # (catalog, schema) -> [tables]
-        self.models = []
-        self.warehouses = []
-        self.volumes = {}  # catalog_name -> [volumes]
-        self.connection_status = "connected"
-        self.permissions = {}
-        self.sql_results = {}  # sql -> results mapping
-        self.pii_scan_results = {}  # table_name -> pii results
-
-        # Call tracking
-        self.create_stitch_notebook_calls = []
-        self.list_catalogs_calls = []
-        self.get_catalog_calls = []
-        self.list_schemas_calls = []
-        self.get_schema_calls = []
-        self.list_tables_calls = []
-        self.get_table_calls = []
-
-    # Catalog operations
-    def list_catalogs(self, include_browse=False, max_results=None, page_token=None):
-        # Track the call
-        self.list_catalogs_calls.append((include_browse, max_results, page_token))
-        return {"catalogs": self.catalogs}
-
-    def get_catalog(self, catalog_name):
-        # Track the call
-        self.get_catalog_calls.append((catalog_name,))
-        catalog = next((c for c in self.catalogs if c["name"] == catalog_name), None)
-        if not catalog:
-            raise Exception(f"Catalog {catalog_name} not found")
-        return catalog
-
-    # Schema operations
-    def list_schemas(
-        self,
-        catalog_name,
-        include_browse=False,
-        max_results=None,
-        page_token=None,
-        **kwargs,
-    ):
-        # Track the call
-        self.list_schemas_calls.append(
-            (catalog_name, include_browse, max_results, page_token)
-        )
-        return {"schemas": self.schemas.get(catalog_name, [])}
-
-    def get_schema(self, full_name):
-        # Track the call
-        self.get_schema_calls.append((full_name,))
-        # Parse full_name in format "catalog_name.schema_name"
-        parts = full_name.split(".")
-        if len(parts) != 2:
-            raise Exception("Invalid schema name format")
-
-        catalog_name, schema_name = parts
-        schemas = self.schemas.get(catalog_name, [])
-        schema = next((s for s in schemas if s["name"] == schema_name), None)
-        if not schema:
-            raise Exception(f"Schema {full_name} not found")
-        return schema
-
-    # Table operations
-    def list_tables(
-        self,
-        catalog_name,
-        schema_name,
-        max_results=None,
-        page_token=None,
-        include_delta_metadata=False,
-        omit_columns=False,
-        omit_properties=False,
-        omit_username=False,
-        include_browse=False,
-        include_manifest_capabilities=False,
-        **kwargs,
-    ):
-        # Track the call
-        self.list_tables_calls.append(
-            (
-                catalog_name,
-                schema_name,
-                max_results,
-                page_token,
-                include_delta_metadata,
-                omit_columns,
-                omit_properties,
-                omit_username,
-                include_browse,
-                include_manifest_capabilities,
-            )
-        )
-        key = (catalog_name, schema_name)
-        tables = self.tables.get(key, [])
-        return {"tables": tables, "next_page_token": None}
-
-    def get_table(
-        self,
-        full_name,
-        include_delta_metadata=False,
-        include_browse=False,
-        include_manifest_capabilities=False,
-        full_table_name=None,
-        **kwargs,
-    ):
-        # Track the call
-        self.get_table_calls.append(
-            (
-                full_name or full_table_name,
-                include_delta_metadata,
-                include_browse,
-                include_manifest_capabilities,
-            )
-        )
-        # Support both parameter names for compatibility
-        table_name = full_name or full_table_name
-        if not table_name:
-            raise Exception("Table name is required")
-
-        # Parse full_table_name and return table details
-        parts = table_name.split(".")
-        if len(parts) != 3:
-            raise Exception("Invalid table name format")
-
-        catalog, schema, table = parts
-        key = (catalog, schema)
-        tables = self.tables.get(key, [])
-        table_info = next((t for t in tables if t["name"] == table), None)
-        if not table_info:
-            raise Exception(f"Table {table_name} not found")
-        return table_info
-
-    # Model operations
-    def list_models(self, **kwargs):
-        if hasattr(self, "_list_models_error"):
-            raise self._list_models_error
-        return self.models
-
-    def get_model(self, model_name):
-        if hasattr(self, "_get_model_error"):
-            raise self._get_model_error
-        model = next((m for m in self.models if m["name"] == model_name), None)
-        return model
-
-    # Warehouse operations
-    def list_warehouses(self, **kwargs):
-        return self.warehouses
-
-    def get_warehouse(self, warehouse_id):
-        warehouse = next((w for w in self.warehouses if w["id"] == warehouse_id), None)
-        if not warehouse:
-            raise Exception(f"Warehouse {warehouse_id} not found")
-        return warehouse
-
-    def start_warehouse(self, warehouse_id):
-        warehouse = self.get_warehouse(warehouse_id)
-        warehouse["state"] = "STARTING"
-        return warehouse
-
-    def stop_warehouse(self, warehouse_id):
-        warehouse = self.get_warehouse(warehouse_id)
-        warehouse["state"] = "STOPPING"
-        return warehouse
-
-    # Volume operations
-    def list_volumes(self, catalog_name, **kwargs):
-        return {"volumes": self.volumes.get(catalog_name, [])}
-
-    def create_volume(
-        self, catalog_name, schema_name, volume_name, volume_type="MANAGED", **kwargs
-    ):
-        key = catalog_name
-        if key not in self.volumes:
-            self.volumes[key] = []
-
-        volume = {
-            "name": volume_name,
-            "full_name": f"{catalog_name}.{schema_name}.{volume_name}",
-            "volume_type": volume_type,
-            "catalog_name": catalog_name,
-            "schema_name": schema_name,
-            **kwargs,
-        }
-        self.volumes[key].append(volume)
-        return volume
-
-    # SQL operations
-    def execute_sql(self, sql, **kwargs):
-        # Return pre-configured results or default
-        if sql in self.sql_results:
-            return self.sql_results[sql]
-
-        # Default response
-        return {
-            "result": {
-                "data_array": [["row1_col1", "row1_col2"], ["row2_col1", "row2_col2"]],
-                "column_names": ["col1", "col2"],
-            },
-            "next_page_token": kwargs.get("return_next_page") and "next_token" or None,
-        }
-
-    def submit_sql_statement(self, sql_text=None, sql=None, **kwargs):
-        # Support both parameter names for compatibility
-        # Return successful SQL submission by default
-        return {"status": {"state": "SUCCEEDED"}}
-
-    # PII scanning
-    def scan_table_pii(self, table_name):
-        if table_name in self.pii_scan_results:
-            return self.pii_scan_results[table_name]
-
-        return {
-            "table_name": table_name,
-            "pii_columns": ["email", "phone"],
-            "scan_timestamp": "2023-01-01T00:00:00Z",
-        }
-
-    def tag_columns_pii(self, table_name, columns, pii_type):
-        return {
-            "table_name": table_name,
-            "tagged_columns": columns,
-            "pii_type": pii_type,
-            "status": "success",
-        }
-
-    # Connection/status
-    def test_connection(self):
-        if self.connection_status == "connected":
-            return {"status": "success", "workspace": "test-workspace"}
-        else:
-            raise Exception("Connection failed")
-
-    def get_current_user(self):
-        return {"userName": "test.user@example.com", "displayName": "Test User"}
-
-    # File upload operations
-    def upload_file(self, file_path, destination_path):
-        return {
-            "source_path": file_path,
-            "destination_path": destination_path,
-            "status": "uploaded",
-            "size_bytes": 1024,
-        }
-
-    # Job operations
-    def list_jobs(self, **kwargs):
-        return {"jobs": []}
-
-    def get_job(self, job_id):
-        return {
-            "job_id": job_id,
-            "settings": {"name": f"test_job_{job_id}"},
-            "state": "TERMINATED",
-        }
-
-    def run_job(self, job_id):
-        return {"run_id": f"run_{job_id}_001", "job_id": job_id, "state": "RUNNING"}
-
-    def submit_job_run(self, config_path, init_script_path, run_name=None):
-        """Submit a job run and return run_id."""
-        from datetime import datetime
-
-        if not run_name:
-            run_name = (
-                f"Chuck AI One-Time Run {datetime.now().strftime('%Y-%m-%d %H:%M:%S')}"
-            )
-
-        # Return a successful job submission
-        return {"run_id": 123456}
-
-    def get_job_run_status(self, run_id):
-        """Get job run status."""
-        return {
-            "state": {"life_cycle_state": "RUNNING"},
-            "run_id": int(run_id),
-            "run_name": "Test Run",
-            "creator_user_name": "test@example.com",
-        }
-
-    # Helper methods to set up test data
-    def add_catalog(self, name, catalog_type="MANAGED", **kwargs):
-        catalog = {"name": name, "type": catalog_type, **kwargs}
-        self.catalogs.append(catalog)
-        return catalog
-
-    def add_schema(self, catalog_name, schema_name, **kwargs):
-        if catalog_name not in self.schemas:
-            self.schemas[catalog_name] = []
-        schema = {"name": schema_name, "catalog_name": catalog_name, **kwargs}
-        self.schemas[catalog_name].append(schema)
-        return schema
-
-    def add_table(
-        self, catalog_name, schema_name, table_name, table_type="MANAGED", **kwargs
-    ):
-        key = (catalog_name, schema_name)
-        if key not in self.tables:
-            self.tables[key] = []
-
-        table = {
-            "name": table_name,
-            "full_name": f"{catalog_name}.{schema_name}.{table_name}",
-            "table_type": table_type,
-            "catalog_name": catalog_name,
-            "schema_name": schema_name,
-            "comment": kwargs.get("comment", ""),
-            "created_at": kwargs.get("created_at", "2023-01-01T00:00:00Z"),
-            "created_by": kwargs.get("created_by", "test.user@example.com"),
-            "owner": kwargs.get("owner", "test.user@example.com"),
-            "columns": kwargs.get("columns", []),
-            "properties": kwargs.get("properties", {}),
-            **kwargs,
-        }
-        self.tables[key].append(table)
-        return table
-
-    def add_model(self, name, status="READY", **kwargs):
-        model = {"name": name, "status": status, **kwargs}
-        self.models.append(model)
-        return model
-
-    def add_warehouse(
-        self,
-        warehouse_id=None,
-        name="Test Warehouse",
-        state="RUNNING",
-        size="SMALL",
-        enable_serverless_compute=False,
-        warehouse_type="PRO",
-        creator_name="test.user@example.com",
-        auto_stop_mins=60,
-        **kwargs,
-    ):
-        if warehouse_id is None:
-            warehouse_id = f"warehouse_{len(self.warehouses)}"
-
-        warehouse = {
-            "id": warehouse_id,
-            "name": name,
-            "state": state,
-            "size": size,  # Use size instead of cluster_size for the main field
-            "cluster_size": size,  # Keep cluster_size for backward compatibility
-            "enable_serverless_compute": enable_serverless_compute,
-            "warehouse_type": warehouse_type,
-            "creator_name": creator_name,
-            "auto_stop_mins": auto_stop_mins,
-            "jdbc_url": f"jdbc:databricks://test.cloud.databricks.com:443/default;transportMode=http;ssl=1;httpPath=/sql/1.0/warehouses/{warehouse_id}",
-            **kwargs,
-        }
-        self.warehouses.append(warehouse)
-        return warehouse
-
-    def add_volume(
-        self, catalog_name, schema_name, volume_name, volume_type="MANAGED", **kwargs
-    ):
-        key = catalog_name
-        if key not in self.volumes:
-            self.volumes[key] = []
-
-        volume = {
-            "name": volume_name,
-            "full_name": f"{catalog_name}.{schema_name}.{volume_name}",
-            "volume_type": volume_type,
-            "catalog_name": catalog_name,
-            "schema_name": schema_name,
-            **kwargs,
-        }
-        self.volumes[key].append(volume)
-        return volume
-
-    def set_sql_result(self, sql, result):
-        """Set a specific result for a SQL query."""
-        self.sql_results[sql] = result
-
-    def set_pii_scan_result(self, table_name, result):
-        """Set a specific PII scan result for a table."""
-        self.pii_scan_results[table_name] = result
-
-    def set_connection_status(self, status):
-        """Set the connection status for testing."""
-        self.connection_status = status
-
-    def set_list_models_error(self, error):
-        """Configure list_models to raise an error."""
-        self._list_models_error = error
-
-    def set_get_model_error(self, error):
-        """Configure get_model to raise an error."""
-        self._get_model_error = error
-
-    def create_stitch_notebook(self, *args, **kwargs):
-        """Create a stitch notebook (simulate successful creation)."""
-        # Track the call
-        self.create_stitch_notebook_calls.append((args, kwargs))
-
-        if hasattr(self, "_create_stitch_notebook_result"):
-            return self._create_stitch_notebook_result
-        if hasattr(self, "_create_stitch_notebook_error"):
-            raise self._create_stitch_notebook_error
-        return {
-            "notebook_id": "test-notebook-123",
-            "path": "/Workspace/Stitch/test_notebook.py",
-        }
-
-    def set_create_stitch_notebook_result(self, result):
-        """Configure create_stitch_notebook return value."""
-        self._create_stitch_notebook_result = result
-
-    def set_create_stitch_notebook_error(self, error):
-        """Configure create_stitch_notebook to raise error."""
-        self._create_stitch_notebook_error = error
-
-    def reset(self):
-        """Reset all data to initial state."""
-        self.catalogs = []
-        self.schemas = {}
-        self.tables = {}
-        self.models = []
-        self.warehouses = []
-        self.volumes = {}
-        self.connection_status = "connected"
-        self.permissions = {}
-        self.sql_results = {}
-        self.pii_scan_results = {}
-
-
-# Model response fixtures
-MODEL_FIXTURES = {
-    "endpoints": [
-        {
-            "name": "databricks-llama-4-maverick",
-            "config": {
-                "served_entities": [
-                    {
-                        "name": "databricks-llama-4-maverick",
-                        "foundation_model": {"name": "Llama 4 Maverick"},
-                    }
-                ],
-            },
-        },
-        {
-            "name": "databricks-claude-3-7-sonnet",
-            "config": {
-                "served_entities": [
-                    {
-                        "name": "databricks-claude-3-7-sonnet",
-                        "foundation_model": {"name": "Claude 3.7 Sonnet"},
-                    }
-                ],
-            },
-        },
-    ]
-}
-
-# Expected model list after parsing
-EXPECTED_MODEL_LIST = [
-    {
-        "name": "databricks-llama-4-maverick",
-        "config": {
-            "served_entities": [
-                {
-                    "name": "databricks-llama-4-maverick",
-                    "foundation_model": {"name": "Llama 4 Maverick"},
-                }
-            ],
-        },
-    },
-    {
-        "name": "databricks-claude-3-7-sonnet",
-        "config": {
-            "served_entities": [
-                {
-                    "name": "databricks-claude-3-7-sonnet",
-                    "foundation_model": {"name": "Claude 3.7 Sonnet"},
-                }
-            ],
-        },
-    },
-]
-
-# Empty model response
-EMPTY_MODEL_RESPONSE = {"endpoints": []}
-
-# For TUI tests
-SIMPLE_MODEL_LIST = [
-    {"name": "databricks-llama-4-maverick"},
-    {"name": "databricks-claude-3-7-sonnet"},
-]
-
-
-class LLMClientStub:
-    """Comprehensive stub for LLMClient with predictable responses."""
-
-    def __init__(self):
-        self.databricks_token = "test-token"
-        self.base_url = "https://test.databricks.com"
-
-        # Test configuration
-        self.should_fail_chat = False
-        self.should_raise_exception = False
-        self.response_content = "Test LLM response"
-        self.tool_calls = []
-        self.streaming_responses = []
-
-        # Track method calls for testing
-        self.chat_calls = []
-
-        # Pre-configured responses for specific scenarios
-        self.configured_responses = {}
-
-    def chat(self, messages, model=None, tools=None, stream=False, tool_choice="auto"):
-        """Simulate LLM chat completion."""
-        # Track the call
-        call_info = {
-            "messages": messages,
-            "model": model,
-            "tools": tools,
-            "stream": stream,
-            "tool_choice": tool_choice,
-        }
-        self.chat_calls.append(call_info)
-
-        if self.should_raise_exception:
-            raise Exception("Test LLM exception")
-
-        if self.should_fail_chat:
-            raise Exception("LLM API error")
-
-        # Check for configured response based on messages
-        messages_key = str(messages)
-        if messages_key in self.configured_responses:
-            return self.configured_responses[messages_key]
-
-        # Create mock response structure
-        mock_choice = MockChoice()
-        mock_choice.message = MockMessage()
-
-        if self.tool_calls:
-            # Return tool calls if configured
-            mock_choice.message.tool_calls = self.tool_calls
-            mock_choice.message.content = None
-        else:
-            # Return content response
-            mock_choice.message.content = self.response_content
-            mock_choice.message.tool_calls = None
-
-        mock_response = MockChatResponse()
-        mock_response.choices = [mock_choice]
-
-        return mock_response
-
-    def set_response_content(self, content):
-        """Set the content for the next chat response."""
-        self.response_content = content
-
-    def set_tool_calls(self, tool_calls):
-        """Set tool calls for the next chat response."""
-        self.tool_calls = tool_calls
-
-    def configure_response_for_messages(self, messages, response):
-        """Configure a specific response for specific messages."""
-        self.configured_responses[str(messages)] = response
-
-    def set_chat_failure(self, should_fail=True):
-        """Configure chat to fail."""
-        self.should_fail_chat = should_fail
-
-    def set_exception(self, should_raise=True):
-        """Configure chat to raise exception."""
-        self.should_raise_exception = should_raise
-
-
-class MockMessage:
-    """Mock LLM message object."""
-
-    def __init__(self):
-        self.content = None
-        self.tool_calls = None
-
-
-class MockChoice:
-    """Mock LLM choice object."""
-
-    def __init__(self):
-        self.message = None
-
-
-class MockChatResponse:
-    """Mock LLM chat response object."""
-
-    def __init__(self):
-        self.choices = []
-
-
-class MockToolCall:
-    """Mock LLM tool call object."""
-
-    def __init__(self, id="test-id", name="test-function", arguments="{}"):
-        self.id = id
-        self.function = MockFunction(name, arguments)
-
-
-class MockFunction:
-    """Mock LLM function object."""
-
-    def __init__(self, name, arguments):
-        self.name = name
-        self.arguments = arguments
-
-
-class MetricsCollectorStub:
-    """Comprehensive stub for MetricsCollector with predictable responses."""
-
-    def __init__(self):
-        # Track method calls for testing
-        self.track_event_calls = []
-
-        # Test configuration
-        self.should_fail_track_event = False
-        self.should_return_false = False
-
-    def track_event(
-        self,
-        prompt=None,
-        tools=None,
-        conversation_history=None,
-        error=None,
-        additional_data=None,
-    ):
-        """Track an event (simulate metrics collection)."""
-        call_info = {
-            "prompt": prompt,
-            "tools": tools,
-            "conversation_history": conversation_history,
-            "error": error,
-            "additional_data": additional_data,
-        }
-        self.track_event_calls.append(call_info)
-
-        if self.should_fail_track_event:
-            raise Exception("Metrics collection failed")
-
-        return not self.should_return_false
-
-    def set_track_event_failure(self, should_fail=True):
-        """Configure track_event to fail."""
-        self.should_fail_track_event = should_fail
-
-    def set_return_false(self, should_return_false=True):
-        """Configure track_event to return False."""
-        self.should_return_false = should_return_false
-
-
-class ConfigManagerStub:
-    """Comprehensive stub for ConfigManager with predictable responses."""
-
-    def __init__(self):
-        self.config = ConfigStub()
-
-    def get_config(self):
-        """Return the config stub."""
-        return self.config
-
-
-class ConfigStub:
-    """Comprehensive stub for Config objects with predictable responses."""
-
-    def __init__(self):
-        # Default config values
-        self.workspace_url = "https://test.databricks.com"
-        self.active_catalog = "test_catalog"
-        self.active_schema = "test_schema"
-        self.active_model = "test_model"
-        self.usage_tracking_consent = True
-
-        # Additional config properties as needed
-        self.databricks_token = "test-token"
-        self.host = "test.databricks.com"
diff --git a/tests/fixtures/__init__.py b/tests/fixtures/__init__.py
new file mode 100644
index 0000000..e69de29
diff --git a/tests/fixtures/amperity.py b/tests/fixtures/amperity.py
new file mode 100644
index 0000000..94069b9
--- /dev/null
+++ b/tests/fixtures/amperity.py
@@ -0,0 +1,120 @@
+"""Amperity client fixtures."""
+
+
+class AmperityClientStub:
+    """Comprehensive stub for AmperityAPIClient with predictable responses."""
+
+    def __init__(self):
+        self.base_url = "chuck.amperity.com"
+        self.nonce = None
+        self.token = None
+        self.state = "pending"
+        self.auth_thread = None
+
+        # Test configuration
+        self.should_fail_auth_start = False
+        self.should_fail_auth_completion = False
+        self.should_fail_metrics = False
+        self.should_fail_bug_report = False
+        self.should_raise_exception = False
+        self.auth_completion_delay = 0
+
+        # Track method calls for testing
+        self.metrics_calls = []
+
+    def start_auth(self) -> tuple[bool, str]:
+        """Start the authentication process."""
+        if self.should_fail_auth_start:
+            return False, "Failed to start auth: 500 - Server Error"
+
+        self.nonce = "test-nonce-123"
+        self.state = "started"
+        return True, "Authentication started. Please log in via the browser."
+
+    def get_auth_status(self) -> dict:
+        """Return the current authentication status."""
+        return {"state": self.state, "nonce": self.nonce, "has_token": bool(self.token)}
+
+    def wait_for_auth_completion(
+        self, poll_interval: int = 1, timeout: int = None
+    ) -> tuple[bool, str]:
+        """Wait for authentication to complete in a blocking manner."""
+        if not self.nonce:
+            return False, "Authentication not started"
+
+        if self.should_fail_auth_completion:
+            self.state = "error"
+            return False, "Authentication failed: error"
+
+        # Simulate successful authentication
+        self.state = "success"
+        self.token = "test-auth-token-456"
+        return True, "Authentication completed successfully."
+
+    def submit_metrics(self, payload: dict, token: str) -> bool:
+        """Send usage metrics to the Amperity API."""
+        # Track the call
+        self.metrics_calls.append((payload, token))
+
+        if self.should_raise_exception:
+            raise Exception("Test exception")
+
+        if self.should_fail_metrics:
+            return False
+
+        # Validate basic payload structure
+        if not isinstance(payload, dict):
+            return False
+
+        if not token:
+            return False
+
+        return True
+
+    def submit_bug_report(self, payload: dict, token: str) -> tuple[bool, str]:
+        """Send a bug report to the Amperity API."""
+        if self.should_fail_bug_report:
+            return False, "Failed to submit bug report: 500"
+
+        # Validate basic payload structure
+        if not isinstance(payload, dict):
+            return False, "Invalid payload format"
+
+        if not token:
+            return False, "Authentication token required"
+
+        return True, "Bug report submitted successfully"
+
+    def _poll_auth_state(self) -> None:
+        """Poll the auth state endpoint until authentication is complete."""
+        # In stub, this is a no-op since we control state directly
+        pass
+
+    # Helper methods for test configuration
+    def set_auth_start_failure(self, should_fail: bool = True):
+        """Configure whether start_auth should fail."""
+        self.should_fail_auth_start = should_fail
+
+    def set_auth_completion_failure(self, should_fail: bool = True):
+        """Configure whether wait_for_auth_completion should fail."""
+        self.should_fail_auth_completion = should_fail
+
+    def set_metrics_failure(self, should_fail: bool = True):
+        """Configure whether submit_metrics should fail."""
+        self.should_fail_metrics = should_fail
+
+    def set_bug_report_failure(self, should_fail: bool = True):
+        """Configure whether submit_bug_report should fail."""
+        self.should_fail_bug_report = should_fail
+
+    def reset(self):
+        """Reset all state to initial values."""
+        self.nonce = None
+        self.token = None
+        self.state = "pending"
+        self.auth_thread = None
+        self.should_fail_auth_start = False
+        self.should_fail_auth_completion = False
+        self.should_fail_metrics = False
+        self.should_fail_bug_report = False
+        self.auth_completion_delay = 0
diff --git a/tests/fixtures/collectors.py b/tests/fixtures/collectors.py
new file mode 100644
index 0000000..2de7b8d
--- /dev/null
+++ b/tests/fixtures/collectors.py
@@ -0,0 +1,71 @@
+"""Metrics collector and related fixtures."""
+
+
+class MetricsCollectorStub:
+    """Comprehensive stub for MetricsCollector with predictable responses."""
+
+    def __init__(self):
+        # Track method calls for testing
+        self.track_event_calls = []
+
+        # Test configuration
+        self.should_fail_track_event = False
+        self.should_return_false = False
+
+    def track_event(
+        self,
+        prompt=None,
+        tools=None,
+        conversation_history=None,
+        error=None,
+        additional_data=None,
+    ):
+        """Track an event (simulate metrics collection)."""
+        call_info = {
+            "prompt": prompt,
+            "tools": tools,
+            "conversation_history": conversation_history,
+            "error": error,
+            "additional_data": additional_data,
+        }
+        self.track_event_calls.append(call_info)
+
+        if self.should_fail_track_event:
+            raise Exception("Metrics collection failed")
+
+        return not self.should_return_false
+
+    def set_track_event_failure(self, should_fail=True):
+        """Configure track_event to fail."""
+        self.should_fail_track_event = should_fail
+
+    def set_return_false(self, should_return_false=True):
+        """Configure track_event to return False."""
+        self.should_return_false = should_return_false
+
+
+class ConfigManagerStub:
+    """Comprehensive stub for ConfigManager with predictable responses."""
+
+    def __init__(self):
+        self.config = ConfigStub()
+
+    def get_config(self):
+        """Return the config stub."""
+        return self.config
+
+
+class ConfigStub:
+    """Comprehensive stub for Config objects with predictable responses."""
+
+    def __init__(self):
+        # Default config values
+        self.workspace_url = "https://test.databricks.com"
+        self.active_catalog = "test_catalog"
+        self.active_schema = "test_schema"
+        self.active_model = "test_model"
+        self.usage_tracking_consent = True
+
+        # Additional config properties as needed
+        self.databricks_token = "test-token"
+        self.host = "test.databricks.com"
diff --git a/tests/fixtures/databricks/__init__.py b/tests/fixtures/databricks/__init__.py
new file mode 100644
index 0000000..d7538e4
--- /dev/null
+++ b/tests/fixtures/databricks/__init__.py
@@ -0,0 +1,29 @@
+"""Databricks client fixtures organized by functionality."""
+
+from .catalog_stub import CatalogStubMixin
+from .schema_stub import SchemaStubMixin
+from .table_stub import TableStubMixin
+from .model_stub import ModelStubMixin
+from .warehouse_stub import WarehouseStubMixin
+from .volume_stub import VolumeStubMixin
+from .sql_stub import SQLStubMixin
+from .job_stub import JobStubMixin
+from .pii_stub import PIIStubMixin
+from .connection_stub import ConnectionStubMixin
+from .file_stub import FileStubMixin
+from .client import DatabricksClientStub
+
+__all__ = [
+    "CatalogStubMixin",
+    "SchemaStubMixin",
+    "TableStubMixin",
+    "ModelStubMixin",
+    "WarehouseStubMixin",
+    "VolumeStubMixin",
+    "SQLStubMixin",
+    "JobStubMixin",
+    "PIIStubMixin",
+    "ConnectionStubMixin",
+    "FileStubMixin",
+    "DatabricksClientStub",
+]
diff --git a/tests/fixtures/databricks/catalog_stub.py b/tests/fixtures/databricks/catalog_stub.py
new file mode 100644
index 0000000..af7ff0c
--- /dev/null
+++ b/tests/fixtures/databricks/catalog_stub.py
@@ -0,0 +1,29 @@
+"""Catalog operations mixin for DatabricksClientStub."""
+
+
+class CatalogStubMixin:
+    """Mixin providing catalog operations for DatabricksClientStub."""
+
+    def __init__(self):
+        self.catalogs = []
+        self.get_catalog_calls = []
+        self.list_catalogs_calls = []
+
+    def list_catalogs(self, include_browse=False, max_results=None, page_token=None):
+        """List catalogs with optional parameters."""
+        self.list_catalogs_calls.append((include_browse, max_results, page_token))
+        return {"catalogs": self.catalogs}
+
+    def get_catalog(self, catalog_name):
+        """Get a specific catalog by name."""
+        self.get_catalog_calls.append((catalog_name,))
+        catalog = next((c for c in self.catalogs if c["name"] == catalog_name), None)
+        if not catalog:
+            raise Exception(f"Catalog {catalog_name} not found")
+        return catalog
+
+    def add_catalog(self, name, catalog_type="MANAGED", **kwargs):
+        """Add a catalog to the test data."""
+        catalog = {"name": name, "type": catalog_type, **kwargs}
+        self.catalogs.append(catalog)
+        return catalog
diff --git a/tests/fixtures/databricks/client.py b/tests/fixtures/databricks/client.py
new file mode 100644
index 0000000..78cb8cc
--- /dev/null
+++ b/tests/fixtures/databricks/client.py
@@ -0,0 +1,70 @@
+"""Main DatabricksClientStub that combines all functionality mixins."""
+
+from .catalog_stub import CatalogStubMixin
+from .schema_stub import SchemaStubMixin
+from .table_stub import TableStubMixin
+from .model_stub import ModelStubMixin
+from .warehouse_stub import WarehouseStubMixin
+from .volume_stub import VolumeStubMixin
+from .sql_stub import SQLStubMixin
+from .job_stub import JobStubMixin
+from .pii_stub import PIIStubMixin
+from .connection_stub import ConnectionStubMixin
+from .file_stub import FileStubMixin
+
+
+class DatabricksClientStub(
+    CatalogStubMixin,
+    SchemaStubMixin,
+    TableStubMixin,
+    ModelStubMixin,
+    WarehouseStubMixin,
+    VolumeStubMixin,
+    SQLStubMixin,
+    JobStubMixin,
+    PIIStubMixin,
+    ConnectionStubMixin,
+    FileStubMixin,
+):
+    """Comprehensive stub for DatabricksAPIClient with predictable responses.
+
+    This stub combines all functionality mixins to provide a complete test double
+    for the Databricks API client.
+    """
+
+    def __init__(self):
+        # Initialize all mixins
+        CatalogStubMixin.__init__(self)
+        SchemaStubMixin.__init__(self)
+        TableStubMixin.__init__(self)
+        ModelStubMixin.__init__(self)
+        WarehouseStubMixin.__init__(self)
+        VolumeStubMixin.__init__(self)
+        SQLStubMixin.__init__(self)
+        JobStubMixin.__init__(self)
+        PIIStubMixin.__init__(self)
+        ConnectionStubMixin.__init__(self)
+        FileStubMixin.__init__(self)
+
+    def reset(self):
+        """Reset all data to initial state."""
+        self.catalogs = []
+        self.schemas = {}
+        self.tables = {}
+        self.models = []
+        self.warehouses = []
+        self.volumes = {}
+        self.connection_status = "connected"
+        self.permissions = {}
+        self.token_validation_result = True
+        self.sql_results = {}
+        self.pii_scan_results = {}
+
+        # Reset call tracking
+        self.create_stitch_notebook_calls = []
+        self.list_catalogs_calls = []
+        self.get_catalog_calls = []
+        self.list_schemas_calls = []
+        self.get_schema_calls = []
+        self.list_tables_calls = []
+        self.get_table_calls = []
diff --git a/tests/fixtures/databricks/connection_stub.py b/tests/fixtures/databricks/connection_stub.py
new file mode 100644
index 0000000..2c04b1f
--- /dev/null
+++ b/tests/fixtures/databricks/connection_stub.py
@@ -0,0 +1,44 @@
+"""Connection operations mixin for DatabricksClientStub."""
+
+
+class ConnectionStubMixin:
+    """Mixin providing connection operations for DatabricksClientStub."""
+
+    def __init__(self):
+        self.connection_status = "connected"
+        self.permissions = {}
+        self.token_validation_result = True
+
+    def test_connection(self):
+        """Test the connection."""
+        if self.connection_status == "connected":
+            return {"status": "success", "workspace": "test-workspace"}
+        else:
+            raise Exception("Connection failed")
+
+    def get_current_user(self):
+        """Get current user information."""
+        return {"userName": "test.user@example.com", "displayName": "Test User"}
+
+    def set_connection_status(self, status):
+        """Set the connection status for testing."""
+        self.connection_status = status
+
+    def validate_token(self):
+        """Validate the token."""
+        if self.token_validation_result is True:
+            return True
+        elif self.token_validation_result is False:
+            return False
+        else:
+            # If it's an exception, raise it
+            raise self.token_validation_result
+
+    def set_token_validation_result(self, result):
+        """Set the token validation result for testing.
+
+        Args:
+            result: True for valid token, False for invalid token,
+                   or Exception instance to raise an exception
+        """
+        self.token_validation_result = result
diff --git a/tests/fixtures/databricks/file_stub.py b/tests/fixtures/databricks/file_stub.py
new file mode 100644
index 0000000..0488de6
--- /dev/null
+++ b/tests/fixtures/databricks/file_stub.py
@@ -0,0 +1,14 @@
+"""File operations mixin for DatabricksClientStub."""
+
+
+class FileStubMixin:
+    """Mixin providing file operations for DatabricksClientStub."""
+
+    def upload_file(self, file_path, destination_path):
+        """Upload a file."""
+        return {
+            "source_path": file_path,
+            "destination_path": destination_path,
+            "status": "uploaded",
+            "size_bytes": 1024,
+        }
diff --git a/tests/fixtures/databricks/job_stub.py b/tests/fixtures/databricks/job_stub.py
new file mode 100644
index 0000000..073c5e4
--- /dev/null
+++ b/tests/fixtures/databricks/job_stub.py
@@ -0,0 +1,67 @@
+"""Job operations mixin for DatabricksClientStub."""
+
+
+class JobStubMixin:
+    """Mixin providing job operations for DatabricksClientStub."""
+
+    def __init__(self):
+        self.create_stitch_notebook_calls = []
+
+    def list_jobs(self, **kwargs):
+        """List jobs."""
+        return {"jobs": []}
+
+    def get_job(self, job_id):
+        """Get job by ID."""
+        return {
+            "job_id": job_id,
+            "settings": {"name": f"test_job_{job_id}"},
+            "state": "TERMINATED",
+        }
+
+    def run_job(self, job_id):
+        """Run a job."""
+        return {"run_id": f"run_{job_id}_001", "job_id": job_id, "state": "RUNNING"}
+
+    def submit_job_run(self, config_path, init_script_path, run_name=None):
+        """Submit a job run and return run_id."""
+        from datetime import datetime
+
+        if not run_name:
+            run_name = (
+                f"Chuck AI One-Time Run {datetime.now().strftime('%Y-%m-%d %H:%M:%S')}"
+            )
+
+        # Return a successful job submission
+        return {"run_id": 123456}
+
+    def get_job_run_status(self, run_id):
+        """Get job run status."""
+        return {
+            "state": {"life_cycle_state": "RUNNING"},
+            "run_id": int(run_id),
+            "run_name": "Test Run",
+            "creator_user_name": "test@example.com",
+        }
+
+    def create_stitch_notebook(self, *args, **kwargs):
+        """Create a stitch notebook (simulate successful creation)."""
+        # Track the call
+        self.create_stitch_notebook_calls.append((args, kwargs))
+
+        if hasattr(self, "_create_stitch_notebook_result"):
+            return self._create_stitch_notebook_result
+        if hasattr(self, "_create_stitch_notebook_error"):
+            raise self._create_stitch_notebook_error
+        return {
+            "notebook_id": "test-notebook-123",
+            "path": "/Workspace/Stitch/test_notebook.py",
+        }
+
+    def set_create_stitch_notebook_result(self, result):
+        """Configure create_stitch_notebook return value."""
+        self._create_stitch_notebook_result = result
+
+    def set_create_stitch_notebook_error(self, error):
+        """Configure create_stitch_notebook to raise error."""
+        self._create_stitch_notebook_error = error
diff --git a/tests/fixtures/databricks/model_stub.py b/tests/fixtures/databricks/model_stub.py
new file mode 100644
index 0000000..f24ca47
--- /dev/null
+++ b/tests/fixtures/databricks/model_stub.py
@@ -0,0 +1,35 @@
+"""Model operations mixin for DatabricksClientStub."""
+
+
+class ModelStubMixin:
+    """Mixin providing model operations for DatabricksClientStub."""
+
+    def __init__(self):
+        self.models = []
+
+    def list_models(self, **kwargs):
+        """List available models."""
+        if hasattr(self, "_list_models_error"):
+            raise self._list_models_error
+        return self.models
+
+    def get_model(self, model_name):
+        """Get a specific model by name."""
+        if hasattr(self, "_get_model_error"):
+            raise self._get_model_error
+        model = next((m for m in self.models if m["name"] == model_name), None)
+        return model
+
+    def add_model(self, name, status="READY", **kwargs):
+        """Add a model to the test data."""
+        model = {"name": name, "status": status, **kwargs}
+        self.models.append(model)
+        return model
+
+    def set_list_models_error(self, error):
+        """Configure list_models to raise an error."""
+        self._list_models_error = error
+
+    def set_get_model_error(self, error):
+        """Configure get_model to raise an error."""
+        self._get_model_error = error
diff --git a/tests/fixtures/databricks/pii_stub.py b/tests/fixtures/databricks/pii_stub.py
new file mode 100644
index 0000000..e6029e0
--- /dev/null
+++ b/tests/fixtures/databricks/pii_stub.py
@@ -0,0 +1,32 @@
+"""PII operations mixin for DatabricksClientStub."""
+
+
+class PIIStubMixin:
+    """Mixin providing PII operations for DatabricksClientStub."""
+
+    def __init__(self):
+        self.pii_scan_results = {}  # table_name -> pii results
+
+    def scan_table_pii(self, table_name):
+        """Scan table for PII data."""
+        if table_name in self.pii_scan_results:
+            return self.pii_scan_results[table_name]
+
+        return {
+            "table_name": table_name,
+            "pii_columns": ["email", "phone"],
+            "scan_timestamp": "2023-01-01T00:00:00Z",
+        }
+
+    def tag_columns_pii(self, table_name, columns, pii_type):
+        """Tag columns as PII."""
+        return {
+            "table_name": table_name,
+            "tagged_columns": columns,
+            "pii_type": pii_type,
+            "status": "success",
+        }
+
+    def set_pii_scan_result(self, table_name, result):
+        """Set a specific PII scan result for a table."""
+        self.pii_scan_results[table_name] = result
diff --git a/tests/fixtures/databricks/schema_stub.py b/tests/fixtures/databricks/schema_stub.py
new file mode 100644
index 0000000..f3cfb29
--- /dev/null
+++ b/tests/fixtures/databricks/schema_stub.py
@@ -0,0 +1,47 @@
+"""Schema operations mixin for DatabricksClientStub."""
+
+
+class SchemaStubMixin:
+    """Mixin providing schema operations for DatabricksClientStub."""
+
+    def __init__(self):
+        self.schemas = {}  # catalog_name -> [schemas]
+        self.list_schemas_calls = []
+        self.get_schema_calls = []
+
+    def list_schemas(
+        self,
+        catalog_name,
+        include_browse=False,
+        max_results=None,
+        page_token=None,
+        **kwargs,
+    ):
+        """List schemas in a catalog."""
+        self.list_schemas_calls.append(
+            (catalog_name, include_browse, max_results, page_token)
+        )
+        return {"schemas": self.schemas.get(catalog_name, [])}
+
+    def get_schema(self, full_name):
+        """Get a specific schema by full name."""
+        self.get_schema_calls.append((full_name,))
+        # Parse full_name in format "catalog_name.schema_name"
+        parts = full_name.split(".")
+        if len(parts) != 2:
+            raise Exception("Invalid schema name format")
+
+        catalog_name, schema_name = parts
+        schemas = self.schemas.get(catalog_name, [])
+        schema = next((s for s in schemas if s["name"] == schema_name), None)
+        if not schema:
+            raise Exception(f"Schema {full_name} not found")
+        return schema
+
+    def add_schema(self, catalog_name, schema_name, **kwargs):
+        """Add a schema to the test data."""
+        if catalog_name not in self.schemas:
+            self.schemas[catalog_name] = []
+        schema = {"name": schema_name, "catalog_name": catalog_name, **kwargs}
+        self.schemas[catalog_name].append(schema)
+        return schema
diff --git a/tests/fixtures/databricks/sql_stub.py b/tests/fixtures/databricks/sql_stub.py
new file mode 100644
index 0000000..0496793
--- /dev/null
+++ b/tests/fixtures/databricks/sql_stub.py
@@ -0,0 +1,33 @@
+"""SQL operations mixin for DatabricksClientStub."""
+
+
+class SQLStubMixin:
+    """Mixin providing SQL operations for DatabricksClientStub."""
+
+    def __init__(self):
+        self.sql_results = {}  # sql -> results mapping
+
+    def execute_sql(self, sql, **kwargs):
+        """Execute SQL and return results."""
+        # Return pre-configured results or default
+        if sql in self.sql_results:
+            return self.sql_results[sql]
+
+        # Default response
+        return {
+            "result": {
+                "data_array": [["row1_col1", "row1_col2"], ["row2_col1", "row2_col2"]],
+                "column_names": ["col1", "col2"],
+            },
+            "next_page_token": kwargs.get("return_next_page") and "next_token" or None,
+        }
+
+    def submit_sql_statement(self, sql_text=None, sql=None, **kwargs):
+        """Submit SQL statement for execution."""
+        # Support both parameter names for compatibility
+        # Return successful SQL submission by default
+        return {"status": {"state": "SUCCEEDED"}}
+
+    def set_sql_result(self, sql, result):
+        """Set a specific result for a SQL query."""
+        self.sql_results[sql] = result
diff --git a/tests/fixtures/databricks/table_stub.py b/tests/fixtures/databricks/table_stub.py
new file mode 100644
index 0000000..29b811e
--- /dev/null
+++ b/tests/fixtures/databricks/table_stub.py
@@ -0,0 +1,104 @@
+"""Table operations mixin for DatabricksClientStub."""
+
+
+class TableStubMixin:
+    """Mixin providing table operations for DatabricksClientStub."""
+
+    def __init__(self):
+        self.tables = {}  # (catalog, schema) -> [tables]
+        self.list_tables_calls = []
+        self.get_table_calls = []
+
+    def list_tables(
+        self,
+        catalog_name,
+        schema_name,
+        max_results=None,
+        page_token=None,
+        include_delta_metadata=False,
+        omit_columns=False,
+        omit_properties=False,
+        omit_username=False,
+        include_browse=False,
+        include_manifest_capabilities=False,
+        **kwargs,
+    ):
+        """List tables in a schema."""
+        self.list_tables_calls.append(
+            (
+                catalog_name,
+                schema_name,
+                max_results,
+                page_token,
+                include_delta_metadata,
+                omit_columns,
+                omit_properties,
+                omit_username,
+                include_browse,
+                include_manifest_capabilities,
+            )
+        )
+        key = (catalog_name, schema_name)
+        tables = self.tables.get(key, [])
+        return {"tables": tables, "next_page_token": None}
+
+    def get_table(
+        self,
+        full_name,
+        include_delta_metadata=False,
+        include_browse=False,
+        include_manifest_capabilities=False,
+        full_table_name=None,
+        **kwargs,
+    ):
+        """Get a specific table by full name."""
+        self.get_table_calls.append(
+            (
+                full_name or full_table_name,
+                include_delta_metadata,
+                include_browse,
+                include_manifest_capabilities,
+            )
+        )
+        # Support both parameter names for compatibility
+        table_name = full_name or full_table_name
+        if not table_name:
+            raise Exception("Table name is required")
+
+        # Parse full_table_name and return table details
+        parts = table_name.split(".")
+        if len(parts) != 3:
+            raise Exception("Invalid table name format")
+
+        catalog, schema, table = parts
+        key = (catalog, schema)
+        tables = self.tables.get(key, [])
+        table_info = next((t for t in tables if t["name"] == table), None)
+        if not table_info:
+            raise Exception(f"Table {table_name} not found")
+        return table_info
+
+    def add_table(
+        self, catalog_name, schema_name, table_name, table_type="MANAGED", **kwargs
+    ):
+        """Add a table to the test data."""
+        key = (catalog_name, schema_name)
+        if key not in self.tables:
+            self.tables[key] = []
+
+        table = {
+            "name": table_name,
+            "full_name": f"{catalog_name}.{schema_name}.{table_name}",
+            "table_type": table_type,
+            "catalog_name": catalog_name,
+            "schema_name": schema_name,
+            "comment": kwargs.get("comment", ""),
+            "created_at": kwargs.get("created_at", "2023-01-01T00:00:00Z"),
+            "created_by": kwargs.get("created_by", "test.user@example.com"),
+            "owner": kwargs.get("owner", "test.user@example.com"),
+            "columns": kwargs.get("columns", []),
+            "properties": kwargs.get("properties", {}),
+            **kwargs,
+        }
+        self.tables[key].append(table)
+        return table
diff --git a/tests/fixtures/databricks/volume_stub.py b/tests/fixtures/databricks/volume_stub.py
new file mode 100644
index 0000000..f0aff41
--- /dev/null
+++ b/tests/fixtures/databricks/volume_stub.py
@@ -0,0 +1,50 @@
+"""Volume operations mixin for DatabricksClientStub."""
+
+
+class VolumeStubMixin:
+    """Mixin providing volume operations for DatabricksClientStub."""
+
+    def __init__(self):
+        self.volumes = {}  # catalog_name -> [volumes]
+
+    def list_volumes(self, catalog_name, **kwargs):
+        """List volumes in a catalog."""
+        return {"volumes": self.volumes.get(catalog_name, [])}
+
+    def create_volume(
+        self, catalog_name, schema_name, volume_name, volume_type="MANAGED", **kwargs
+    ):
+        """Create a volume."""
+        key = catalog_name
+        if key not in self.volumes:
+            self.volumes[key] = []
+
+        volume = {
+            "name": volume_name,
+            "full_name": f"{catalog_name}.{schema_name}.{volume_name}",
+            "volume_type": volume_type,
+            "catalog_name": catalog_name,
+            "schema_name": schema_name,
+            **kwargs,
+        }
+        self.volumes[key].append(volume)
+        return volume
+
+    def add_volume(
+        self, catalog_name, schema_name, volume_name, volume_type="MANAGED", **kwargs
+    ):
+        """Add a volume to the test data."""
+        key = catalog_name
+        if key not in self.volumes:
+            self.volumes[key] = []
+
+        volume = {
+            "name": volume_name,
+            "full_name": f"{catalog_name}.{schema_name}.{volume_name}",
+            "volume_type": volume_type,
+            "catalog_name": catalog_name,
+            "schema_name": schema_name,
+            **kwargs,
+        }
+        self.volumes[key].append(volume)
+        return volume
diff --git a/tests/fixtures/databricks/warehouse_stub.py b/tests/fixtures/databricks/warehouse_stub.py
new file mode 100644
index 0000000..3efba06
--- /dev/null
+++ b/tests/fixtures/databricks/warehouse_stub.py
@@ -0,0 +1,63 @@
+"""Warehouse operations mixin for DatabricksClientStub."""
+
+
+class WarehouseStubMixin:
+    """Mixin providing warehouse operations for DatabricksClientStub."""
+
+    def __init__(self):
+        self.warehouses = []
+
+    def list_warehouses(self, **kwargs):
+        """List available warehouses."""
+        return self.warehouses
+
+    def get_warehouse(self, warehouse_id):
+        """Get a specific warehouse by ID."""
+        warehouse = next((w for w in self.warehouses if w["id"] == warehouse_id), None)
+        if not warehouse:
+            raise Exception(f"Warehouse {warehouse_id} not found")
+        return warehouse
+
+    def start_warehouse(self, warehouse_id):
+        """Start a warehouse."""
+        warehouse = self.get_warehouse(warehouse_id)
+        warehouse["state"] = "STARTING"
+        return warehouse
+
+    def stop_warehouse(self, warehouse_id):
+        """Stop a warehouse."""
+        warehouse = self.get_warehouse(warehouse_id)
+        warehouse["state"] = "STOPPING"
+        return warehouse
+
+    def add_warehouse(
+        self,
+        warehouse_id=None,
+        name="Test Warehouse",
+        state="RUNNING",
+        size="SMALL",
+        enable_serverless_compute=False,
+        warehouse_type="PRO",
+        creator_name="test.user@example.com",
+        auto_stop_mins=60,
+        **kwargs,
+    ):
+        """Add a warehouse to the test data."""
+        if warehouse_id is None:
+            warehouse_id = f"warehouse_{len(self.warehouses)}"
+
+        warehouse = {
+            "id": warehouse_id,
+            "name": name,
+            "state": state,
+            "size": size,  # Use size instead of cluster_size for the main field
+            "cluster_size": size,  # Keep cluster_size for backward compatibility
+            "enable_serverless_compute": enable_serverless_compute,
+            "warehouse_type": warehouse_type,
+            "creator_name": creator_name,
+            "auto_stop_mins": auto_stop_mins,
+            "jdbc_url": f"jdbc:databricks://test.cloud.databricks.com:443/default;transportMode=http;ssl=1;httpPath=/sql/1.0/warehouses/{warehouse_id}",
+            **kwargs,
+        }
+        self.warehouses.append(warehouse)
+        return warehouse
diff --git a/tests/fixtures/environment.py b/tests/fixtures/environment.py
new file mode 100644
index 0000000..74d6fe4
--- /dev/null
+++ b/tests/fixtures/environment.py
@@ -0,0 +1,102 @@
+"""Environment fixtures for Chuck tests.
+
+These fixtures provide clean, isolated environment setups for different test scenarios,
+replacing scattered @patch.dict calls throughout the test suite.
+"""
+
+import pytest
+import os
+from unittest.mock import patch
+
+
+@pytest.fixture
+def clean_env():
+    """
+    Provide completely clean environment for config tests.
+
+    This fixture clears all environment variables to ensure config tests
+    get predictable behavior without interference from host environment
+    CHUCK_* variables or other system settings.
+
+    Usage:
+        def test_config_behavior(clean_env):
+            # Test runs with empty environment
+            # Config values come only from test setup, not env vars
+    """
+    with patch.dict(os.environ, {}, clear=True):
+        yield
+
+
+@pytest.fixture
+def mock_databricks_env():
+    """
+    Provide standard Databricks test environment variables.
+
+    Sets up common Databricks environment variables needed for
+    authentication and workspace tests.
+
+    Usage:
+        def test_databricks_auth(mock_databricks_env):
+            # DATABRICKS_TOKEN and DATABRICKS_WORKSPACE_URL are set
+    """
+    test_env = {
+        "DATABRICKS_TOKEN": "test_token",
+        "DATABRICKS_WORKSPACE_URL": "test-workspace",
+    }
+    with patch.dict(os.environ, test_env, clear=True):
+        yield
+
+
+@pytest.fixture
+def no_color_env():
+    """
+    Provide NO_COLOR environment for display tests.
+
+    Sets NO_COLOR environment variable to test color output behavior.
+
+    Usage:
+        def test_no_color_output(no_color_env):
+            # NO_COLOR is set, color output should be disabled
+    """
+    with patch.dict(os.environ, {"NO_COLOR": "1"}, clear=True):
+        yield
+
+
+@pytest.fixture
+def no_color_true_env():
+    """
+    Provide NO_COLOR=true environment for display tests.
+
+    Sets NO_COLOR=true to test alternative true value handling.
+
+    Usage:
+        def test_no_color_true_output(no_color_true_env):
+            # NO_COLOR=true, color output should be disabled
+    """
+    with patch.dict(os.environ, {"NO_COLOR": "true"}, clear=True):
+        yield
+
+
+@pytest.fixture
+def chuck_env_vars():
+    """
+    Provide specific CHUCK_* environment variables for config override tests.
+
+    Sets up CHUCK_* prefixed environment variables to test the config system's
+    environment variable override behavior.
+
+    Usage:
+        def test_config_env_override(chuck_env_vars):
+            # CHUCK_WORKSPACE_URL and other vars are set
+            # Config system should read from these env vars
+    """
+    test_env = {
+        "CHUCK_WORKSPACE_URL": "env-workspace",
+        "CHUCK_ACTIVE_MODEL": "env-model",
+        "CHUCK_WAREHOUSE_ID": "env-warehouse",
+        "CHUCK_ACTIVE_CATALOG": "env-catalog",
+        "CHUCK_ACTIVE_SCHEMA": "env-schema",
+        "CHUCK_DATABRICKS_TOKEN": "env-token",
+    }
+    with patch.dict(os.environ, test_env, clear=True):
+        yield
diff --git a/tests/fixtures/llm.py b/tests/fixtures/llm.py
new file mode 100644
index 0000000..a447380
--- /dev/null
+++ b/tests/fixtures/llm.py
@@ -0,0 +1,121 @@
+"""LLM client fixtures."""
+
+
+class LLMClientStub:
+    """Comprehensive stub for LLMClient with predictable responses."""
+
+    def __init__(self):
+        self.databricks_token = "test-token"
+        self.base_url = "https://test.databricks.com"
+
+        # Test configuration
+        self.should_fail_chat = False
+        self.should_raise_exception = False
+        self.response_content = "Test LLM response"
+        self.tool_calls = []
+        self.streaming_responses = []
+
+        # Track method calls for testing
+        self.chat_calls = []
+
+        # Pre-configured responses for specific scenarios
+        self.configured_responses = {}
+
+    def chat(self, messages, model=None, tools=None, stream=False, tool_choice="auto"):
+        """Simulate LLM chat completion."""
+        # Track the call
+        call_info = {
+            "messages": messages,
+            "model": model,
+            "tools": tools,
+            "stream": stream,
+            "tool_choice": tool_choice,
+        }
+        self.chat_calls.append(call_info)
+
+        if self.should_raise_exception:
+            raise Exception("Test LLM exception")
+
+        if self.should_fail_chat:
+            raise Exception("LLM API error")
+
+        # Check for configured response based on messages
+        messages_key = str(messages)
+        if messages_key in self.configured_responses:
+            return self.configured_responses[messages_key]
+
+        # Create mock response structure
+        mock_choice = MockChoice()
+        mock_choice.message = MockMessage()
+
+        if self.tool_calls:
+            # Return tool calls if configured
+            mock_choice.message.tool_calls = self.tool_calls
+            mock_choice.message.content = None
+        else:
+            # Return content response
+            mock_choice.message.content = self.response_content
+            mock_choice.message.tool_calls = None
+
+        mock_response = MockChatResponse()
+        mock_response.choices = [mock_choice]
+
+        return mock_response
+
+    def set_response_content(self, content):
+        """Set the content for the next chat response."""
+        self.response_content = content
+
+    def set_tool_calls(self, tool_calls):
+        """Set tool calls for the next chat response."""
+        self.tool_calls = tool_calls
+
+    def configure_response_for_messages(self, messages, response):
+        """Configure a specific response for specific messages."""
+        self.configured_responses[str(messages)] = response
+
+    def set_chat_failure(self, should_fail=True):
+        """Configure chat to fail."""
+        self.should_fail_chat = should_fail
+
+    def set_exception(self, should_raise=True):
+        """Configure chat to raise exception."""
+        self.should_raise_exception = should_raise
+
+
+class MockMessage:
+    """Mock LLM message object."""
+
+    def __init__(self):
+        self.content = None
+        self.tool_calls = None
+
+
+class MockChoice:
+    """Mock LLM choice object."""
+
+    def __init__(self):
+        self.message = None
+
+
+class MockChatResponse:
+    """Mock LLM chat response object."""
+
+    def __init__(self):
+        self.choices = []
+
+
+class MockToolCall:
+    """Mock LLM tool call object."""
+
+    def __init__(self, id="test-id", name="test-function", arguments="{}"):
+        self.id = id
+        self.function = MockFunction(name, arguments)
+
+
+class MockFunction:
+    """Mock LLM function object."""
+
+    def __init__(self, name, arguments):
+        self.name = name
+        self.arguments = arguments
diff --git a/tests/integration/test_integration.py b/tests/integration/test_integration.py
new file mode 100644
index 0000000..b466567
--- /dev/null
+++ b/tests/integration/test_integration.py
@@ -0,0 +1,109 @@
+"""Integration tests for the Chuck application."""
+
+import pytest
+from unittest.mock import patch
+from chuck_data.config import (
+    set_active_model,
+    get_active_model,
+    ConfigManager,
+)
+import os
+import json
+
+
+@pytest.fixture
+def integration_setup():
+    """Set up the test environment with controlled configuration."""
+    # Set up test environment
+    test_config_path = "/tmp/.test_chuck_integration_config.json"
+
+    # Create a test config manager instance
+    config_manager = ConfigManager(config_path=test_config_path)
+
+    # Replace the global config manager with our test instance
+    config_manager_patcher = patch("chuck_data.config._config_manager", config_manager)
+    config_manager_patcher.start()
+
+    # Mock environment for authentication
+    env_patcher = patch.dict(
+        "os.environ",
+        {
+            "DATABRICKS_TOKEN": "test_token",
+            "DATABRICKS_WORKSPACE_URL": "test-workspace",
+        },
+    )
+    env_patcher.start()
+
+    # Initialize the config with workspace_url
+    config_manager.update(workspace_url="test-workspace")
+
+    yield {
+        "test_config_path": test_config_path,
+        "config_manager": config_manager,
+        "config_manager_patcher": config_manager_patcher,
+        "env_patcher": env_patcher,
+    }
+
+    # Cleanup
+    if os.path.exists(test_config_path):
+        os.remove(test_config_path)
+    config_manager_patcher.stop()
+    env_patcher.stop()
+
+
+def test_config_operations(integration_setup):
+    """Test that config operations work properly."""
+    test_config_path = integration_setup["test_config_path"]
+
+    # Test writing and reading config
+    set_active_model("test-model")
+
+    # Verify the config file was actually created with correct content
+    assert os.path.exists(test_config_path)
+    with open(test_config_path, "r") as f:
+        saved_config = json.load(f)
+    assert saved_config["active_model"] == "test-model"
+
+    # Test reading the config
+    active_model = get_active_model()
+    assert active_model == "test-model"
+
+
+def test_catalog_config_operations(integration_setup):
+    """Test catalog config operations."""
+    test_config_path = integration_setup["test_config_path"]
+
+    # Test writing and reading catalog config
+    from chuck_data.config import set_active_catalog, get_active_catalog
+
+    test_catalog = "test-catalog"
+    set_active_catalog(test_catalog)
+
+    # Verify the config file was updated with catalog
+    with open(test_config_path, "r") as f:
+        saved_config = json.load(f)
+    assert saved_config["active_catalog"] == test_catalog
+
+    # Test reading the catalog config
+    active_catalog = get_active_catalog()
+    assert active_catalog == test_catalog
+
+
+def test_schema_config_operations(integration_setup):
+    """Test schema config operations."""
+    test_config_path = integration_setup["test_config_path"]
+
+    # Test writing and reading schema config
+    from chuck_data.config import set_active_schema, get_active_schema
+
+    test_schema = "test-schema"
+    set_active_schema(test_schema)
+
+    # Verify the config file was updated with schema
+    with open(test_config_path, "r") as f:
+        saved_config = json.load(f)
+    assert saved_config["active_schema"] == test_schema
+
+    # Test reading the schema config
+    active_schema = get_active_schema()
+    assert active_schema == test_schema
diff --git a/tests/test_agent_manager.py b/tests/test_agent_manager.py
deleted file mode 100644
index 82db870..0000000
--- a/tests/test_agent_manager.py
+++ /dev/null
@@ -1,283 +0,0 @@
-"""
-Tests for the AgentManager class.
-"""
-
-import unittest
-import sys
-from unittest.mock import patch, MagicMock
-
-# Mock the optional openai dependency used by LLMClient if it is not
-# installed. This prevents import errors during test collection.
-sys.modules.setdefault("openai", MagicMock())
-
-from chuck_data.agent import AgentManager  # noqa: E402
-from tests.fixtures import LLMClientStub, MockToolCall  # noqa: E402
-from chuck_data.agent.prompts import (  # noqa: E402
-    PII_AGENT_SYSTEM_MESSAGE,
-    BULK_PII_AGENT_SYSTEM_MESSAGE,
-    STITCH_AGENT_SYSTEM_MESSAGE,
-)
-
-
-class TestAgentManager(unittest.TestCase):
-    """Test cases for the AgentManager."""
-
-    def setUp(self):
-        """Set up common test fixtures."""
-        # Mock the API client that might be passed to AgentManager
-        self.mock_api_client = MagicMock()
-
-        # Use LLMClientStub instead of MagicMock
-        self.llm_client_stub = LLMClientStub()
-        self.patcher = patch(
-            "chuck_data.agent.manager.LLMClient", return_value=self.llm_client_stub
-        )
-        self.MockLLMClient = self.patcher.start()
-
-        # Mock tool functions used within AgentManager
-        self.patcher_get_schemas = patch("chuck_data.agent.manager.get_tool_schemas")
-        self.MockGetToolSchemas = self.patcher_get_schemas.start()
-        self.patcher_execute_tool = patch("chuck_data.agent.manager.execute_tool")
-        self.MockExecuteTool = self.patcher_execute_tool.start()
-
-        # Create a mock callback for testing
-        self.mock_callback = MagicMock()
-
-        # Instantiate AgentManager
-        self.agent_manager = AgentManager(self.mock_api_client, model="test-model")
-
-    def tearDown(self):
-        """Clean up after tests."""
-        self.patcher.stop()
-        self.patcher_get_schemas.stop()
-        self.patcher_execute_tool.stop()
-
-    def test_agent_manager_initialization(self):
-        """Test that AgentManager initializes correctly."""
-        self.MockLLMClient.assert_called_once()  # Check LLMClient was instantiated
-        self.assertEqual(self.agent_manager.api_client, self.mock_api_client)
-        self.assertEqual(self.agent_manager.model, "test-model")
-        self.assertIsNone(self.agent_manager.tool_output_callback)  # Default to None
-        expected_history = [
-            {
-                "role": "system",
-                "content": self.agent_manager.conversation_history[0]["content"],
-            }
-        ]
-        self.assertEqual(self.agent_manager.conversation_history, expected_history)
-        self.assertIs(self.agent_manager.llm_client, self.llm_client_stub)
-
-    def test_agent_manager_initialization_with_callback(self):
-        """Test that AgentManager initializes correctly with a callback."""
-        agent_with_callback = AgentManager(
-            self.mock_api_client,
-            model="test-model",
-            tool_output_callback=self.mock_callback,
-        )
-        self.assertEqual(agent_with_callback.api_client, self.mock_api_client)
-        self.assertEqual(agent_with_callback.model, "test-model")
-        self.assertEqual(agent_with_callback.tool_output_callback, self.mock_callback)
-
-    def test_add_user_message(self):
-        """Test adding a user message."""
-        # Reset conversation history for this test
-        self.agent_manager.conversation_history = []
-
-        self.agent_manager.add_user_message("Hello agent!")
-        expected_history = [
-            {"role": "user", "content": "Hello agent!"},
-        ]
-        self.assertEqual(self.agent_manager.conversation_history, expected_history)
-
-        self.agent_manager.add_user_message("Another message.")
-        expected_history.append({"role": "user", "content": "Another message."})
-        self.assertEqual(self.agent_manager.conversation_history, expected_history)
-
-    def test_add_assistant_message(self):
-        """Test adding an assistant message."""
-        # Reset conversation history for this test
-        self.agent_manager.conversation_history = []
-
-        self.agent_manager.add_assistant_message("Hello user!")
-        expected_history = [
-            {"role": "assistant", "content": "Hello user!"},
-        ]
-        self.assertEqual(self.agent_manager.conversation_history, expected_history)
-
-        self.agent_manager.add_assistant_message("How can I help?")
-        expected_history.append({"role": "assistant", "content": "How can I help?"})
-        self.assertEqual(self.agent_manager.conversation_history, expected_history)
-
-    def test_add_system_message_new(self):
-        """Test adding a system message when none exists."""
-        self.agent_manager.add_system_message("You are a helpful assistant.")
-        expected_history = [
-            {"role": "system", "content": "You are a helpful assistant."}
-        ]
-        self.assertEqual(self.agent_manager.conversation_history, expected_history)
-
-        # Add another message to ensure system message stays at the start
-        self.agent_manager.add_user_message("User query")
-        expected_history.append({"role": "user", "content": "User query"})
-        self.assertEqual(self.agent_manager.conversation_history, expected_history)
-
-    def test_add_system_message_replace(self):
-        """Test adding a system message replaces an existing one."""
-        self.agent_manager.add_system_message("Initial system message.")
-        self.agent_manager.add_user_message("User query")
-        self.agent_manager.add_system_message("Updated system message.")
-
-        expected_history = [
-            {"role": "system", "content": "Updated system message."},
-            {"role": "user", "content": "User query"},
-        ]
-        self.assertEqual(self.agent_manager.conversation_history, expected_history)
-
-    # --- Tests for process_with_tools ---
-
-    def test_process_with_tools_no_tool_calls(self):
-        """Test processing when the LLM responds with content only."""
-        # Setup
-        mock_tools = [{"type": "function", "function": {"name": "dummy_tool"}}]
-
-        # Mock the LLM client response - content only, no tool calls
-        mock_resp = MagicMock()
-        mock_resp.choices = [MagicMock()]
-        mock_resp.choices[0].delta = MagicMock(content="Final answer.", tool_calls=None)
-        # Configure stub to return the mock response directly
-        self.llm_client_stub.set_response_content("Final answer.")
-
-        # Run the method
-        self.agent_manager.process_with_tools = MagicMock(return_value="Final answer.")
-
-        # Call the method
-        result = self.agent_manager.process_with_tools(mock_tools)
-
-        # Assertions
-        self.assertEqual(result, "Final answer.")
-
-    def test_process_with_tools_iteration_limit(self):
-        """Ensure process_with_tools stops after the max iteration limit."""
-        mock_tools = [{"type": "function", "function": {"name": "dummy_tool"}}]
-
-        tool_call = MagicMock()
-        tool_call.function.name = "dummy_tool"
-        tool_call.id = "1"
-        tool_call.function.arguments = "{}"
-
-        mock_resp = MagicMock()
-        mock_resp.choices = [MagicMock()]
-        mock_resp.choices[0].message = MagicMock(tool_calls=[tool_call])
-
-        # Configure stub to return tool calls
-        mock_tool_call = MockToolCall(id="1", name="dummy_tool", arguments="{}")
-        self.llm_client_stub.set_tool_calls([mock_tool_call])
-        self.MockExecuteTool.return_value = {"result": "ok"}
-
-        result = self.agent_manager.process_with_tools(mock_tools, max_iterations=2)
-
-        self.assertEqual(result, "Error: maximum iterations reached.")
-
-    @patch("chuck_data.agent.manager.AgentManager.process_with_tools")
-    def test_process_pii_detection(self, mock_process):
-        """Test process_pii_detection sets up context and calls process_with_tools."""
-        mock_tools = [{"schema": "tool1"}]
-        self.MockGetToolSchemas.return_value = mock_tools
-        mock_process.return_value = "PII analysis complete."
-
-        result = self.agent_manager.process_pii_detection("my_table")
-
-        self.assertEqual(result, "PII analysis complete.")
-        # Check system message
-        self.assertEqual(self.agent_manager.conversation_history[0]["role"], "system")
-        self.assertEqual(
-            self.agent_manager.conversation_history[0]["content"],
-            PII_AGENT_SYSTEM_MESSAGE,
-        )
-        # Check user message
-        self.assertEqual(self.agent_manager.conversation_history[1]["role"], "user")
-        self.assertEqual(
-            self.agent_manager.conversation_history[1]["content"],
-            "Analyze the table 'my_table' for PII data.",
-        )
-        # Check call to process_with_tools
-        mock_process.assert_called_once_with(mock_tools)
-
-    @patch("chuck_data.agent.manager.AgentManager.process_with_tools")
-    def test_process_bulk_pii_scan(self, mock_process):
-        """Test process_bulk_pii_scan sets up context and calls process_with_tools."""
-        mock_tools = [{"schema": "tool2"}]
-        self.MockGetToolSchemas.return_value = mock_tools
-        mock_process.return_value = "Bulk PII scan complete."
-
-        result = self.agent_manager.process_bulk_pii_scan(
-            catalog_name="cat", schema_name="sch"
-        )
-
-        self.assertEqual(result, "Bulk PII scan complete.")
-        # Check system message
-        self.assertEqual(self.agent_manager.conversation_history[0]["role"], "system")
-        self.assertEqual(
-            self.agent_manager.conversation_history[0]["content"],
-            BULK_PII_AGENT_SYSTEM_MESSAGE,
-        )
-        # Check user message
-        self.assertEqual(self.agent_manager.conversation_history[1]["role"], "user")
-        self.assertEqual(
-            self.agent_manager.conversation_history[1]["content"],
-            "Scan all tables in catalog 'cat' and schema 'sch' for PII data.",
-        )
-        # Check call to process_with_tools
-        mock_process.assert_called_once_with(mock_tools)
-
-    @patch("chuck_data.agent.manager.AgentManager.process_with_tools")
-    def test_process_setup_stitch(self, mock_process):
-        """Test process_setup_stitch sets up context and calls process_with_tools."""
-        mock_tools = [{"schema": "tool3"}]
-        self.MockGetToolSchemas.return_value = mock_tools
-        mock_process.return_value = "Stitch setup complete."
-
-        result = self.agent_manager.process_setup_stitch(
-            catalog_name="cat", schema_name="sch"
-        )
-
-        self.assertEqual(result, "Stitch setup complete.")
-        # Check system message
-        self.assertEqual(self.agent_manager.conversation_history[0]["role"], "system")
-        self.assertEqual(
-            self.agent_manager.conversation_history[0]["content"],
-            STITCH_AGENT_SYSTEM_MESSAGE,
-        )
-        # Check user message
-        self.assertEqual(self.agent_manager.conversation_history[1]["role"], "user")
-        self.assertEqual(
-            self.agent_manager.conversation_history[1]["content"],
-            "Set up a Stitch integration for catalog 'cat' and schema 'sch'.",
-        )
-        # Check call to process_with_tools
-        mock_process.assert_called_once_with(mock_tools)
-
-    @patch("chuck_data.agent.manager.AgentManager.process_with_tools")
-    def test_process_query(self, mock_process):
-        """Test process_query adds user message and calls process_with_tools."""
-        mock_tools = [{"schema": "tool4"}]
-        self.MockGetToolSchemas.return_value = mock_tools
-        mock_process.return_value = "Query processed."
-
-        # Reset the conversation history to a clean state for this test
-        self.agent_manager.conversation_history = []
-        self.agent_manager.add_system_message("General assistant.")
-        self.agent_manager.add_user_message("Previous question.")
-        self.agent_manager.add_assistant_message("Previous answer.")
-
-        result = self.agent_manager.process_query("What is the weather?")
-
-        self.assertEqual(result, "Query processed.")
-        # Check latest user message
-        self.assertEqual(self.agent_manager.conversation_history[-1]["role"], "user")
-        self.assertEqual(
-            self.agent_manager.conversation_history[-1]["content"],
-            "What is the weather?",
-        )
-        # Check call to process_with_tools
-        mock_process.assert_called_once_with(mock_tools)
diff --git a/tests/test_agent_tool_display_routing.py b/tests/test_agent_tool_display_routing.py
deleted file mode 100644
index 902c343..0000000
--- a/tests/test_agent_tool_display_routing.py
+++ /dev/null
@@ -1,452 +0,0 @@
-"""
-Tests for agent tool display routing in the TUI.
-
-These tests ensure that when agents use list-* commands, they display
-the same formatted tables as when users use equivalent slash commands.
-"""
-
-import unittest
-from unittest.mock import patch
-from chuck_data.ui.tui import ChuckTUI
-from chuck_data.commands.base import CommandResult
-from chuck_data.agent.tool_executor import execute_tool
-
-
-class TestAgentToolDisplayRouting(unittest.TestCase):
-    """Test cases for agent tool display routing."""
-
-    def setUp(self):
-        """Set up test fixtures."""
-        # Use a real TUI instance but capture console output
-        self.tui = ChuckTUI()
-        # We'll capture calls to console.print to verify table display
-
-    def test_agent_list_commands_display_tables_not_raw_json(self):
-        """
-        End-to-end test: Agent tool calls should display formatted tables, not raw JSON.
-
-        This is the critical test that prevents the regression where agents
-        would see raw JSON instead of formatted tables.
-        """
-        from chuck_data.commands import register_all_commands
-        from chuck_data.command_registry import get_command
-        from unittest.mock import MagicMock
-
-        # Register all commands
-        register_all_commands()
-
-        # Test data that would normally be returned by list commands
-        test_cases = [
-            {
-                "tool_name": "list-schemas",
-                "test_data": {
-                    "schemas": [
-                        {"name": "bronze", "comment": "Bronze layer"},
-                        {"name": "silver", "comment": "Silver layer"},
-                    ],
-                    "catalog_name": "test_catalog",
-                    "total_count": 2,
-                },
-                "expected_table_indicators": ["Schemas in catalog", "bronze", "silver"],
-            },
-            {
-                "tool_name": "list-catalogs",
-                "test_data": {
-                    "catalogs": [
-                        {
-                            "name": "catalog1",
-                            "type": "MANAGED",
-                            "comment": "First catalog",
-                        },
-                        {
-                            "name": "catalog2",
-                            "type": "EXTERNAL",
-                            "comment": "Second catalog",
-                        },
-                    ],
-                    "total_count": 2,
-                },
-                "expected_table_indicators": [
-                    "Available Catalogs",
-                    "catalog1",
-                    "catalog2",
-                ],
-            },
-            {
-                "tool_name": "list-tables",
-                "test_data": {
-                    "tables": [
-                        {"name": "table1", "table_type": "MANAGED"},
-                        {"name": "table2", "table_type": "EXTERNAL"},
-                    ],
-                    "catalog_name": "test_catalog",
-                    "schema_name": "test_schema",
-                    "total_count": 2,
-                },
-                "expected_table_indicators": [
-                    "Tables in test_catalog.test_schema",
-                    "table1",
-                    "table2",
-                ],
-            },
-        ]
-
-        for case in test_cases:
-            with self.subTest(tool=case["tool_name"]):
-                # Mock console to capture output
-                mock_console = MagicMock()
-                self.tui.console = mock_console
-
-                # Get the command definition
-                cmd_def = get_command(case["tool_name"])
-                self.assertIsNotNone(cmd_def, f"Command {case['tool_name']} not found")
-
-                # Verify agent_display setting based on command type
-                if case["tool_name"] in [
-                    "list-catalogs",
-                    "list-schemas",
-                    "list-tables",
-                ]:
-                    # list-catalogs, list-schemas, and list-tables use conditional display
-                    self.assertEqual(
-                        cmd_def.agent_display,
-                        "conditional",
-                        f"Command {case['tool_name']} must have agent_display='conditional'",
-                    )
-                    # For conditional display, we need to test with display=true to see the table
-                    test_data_with_display = case["test_data"].copy()
-                    test_data_with_display["display"] = True
-                    from chuck_data.exceptions import PaginationCancelled
-
-                    with self.assertRaises(PaginationCancelled):
-                        self.tui.display_tool_output(
-                            case["tool_name"], test_data_with_display
-                        )
-                else:
-                    # Other commands use full display
-                    self.assertEqual(
-                        cmd_def.agent_display,
-                        "full",
-                        f"Command {case['tool_name']} must have agent_display='full'",
-                    )
-                    # Call the display method with test data - should raise PaginationCancelled
-                    from chuck_data.exceptions import PaginationCancelled
-
-                    with self.assertRaises(PaginationCancelled):
-                        self.tui.display_tool_output(
-                            case["tool_name"], case["test_data"]
-                        )
-
-                # Verify console.print was called (indicates table display, not raw JSON)
-                mock_console.print.assert_called()
-
-                # Verify the output was processed by checking the call arguments
-                print_calls = mock_console.print.call_args_list
-
-                # Verify that Rich Table objects were printed (not raw JSON strings)
-                table_objects_found = False
-                raw_json_found = False
-
-                for call in print_calls:
-                    args, kwargs = call
-                    for arg in args:
-                        # Check if we're printing Rich Table objects (good)
-                        if hasattr(arg, "__class__") and "Table" in str(type(arg)):
-                            table_objects_found = True
-                        # Check if we're printing raw JSON strings (bad)
-                        elif isinstance(arg, str) and (
-                            '"schemas":' in arg
-                            or '"catalogs":' in arg
-                            or '"tables":' in arg
-                        ):
-                            raw_json_found = True
-
-                # Verify we're displaying tables, not raw JSON
-                self.assertTrue(
-                    table_objects_found,
-                    f"No Rich Table objects found in {case['tool_name']} output - this indicates the regression",
-                )
-                self.assertFalse(
-                    raw_json_found,
-                    f"Raw JSON strings found in {case['tool_name']} output - this indicates the regression",
-                )
-
-    def test_unknown_tool_falls_back_to_generic_display(self):
-        """Test that unknown tools fall back to generic display."""
-        from unittest.mock import MagicMock
-
-        test_data = {"some": "data"}
-
-        mock_console = MagicMock()
-        self.tui.console = mock_console
-
-        self.tui._display_full_tool_output("unknown-tool", test_data)
-        # Should create a generic panel
-        mock_console.print.assert_called()
-
-    def test_command_name_mapping_prevents_regression(self):
-        """
-        Test that ensures command name mapping in TUI covers both hyphenated and underscore versions.
-
-        This test specifically prevents the regression where agent tool names with hyphens
-        (like 'list-schemas') weren't being mapped to the correct display methods.
-        """
-
-        # Test cases: agent tool name -> expected display method call
-        command_mappings = [
-            ("list-schemas", "_display_schemas"),
-            ("list-catalogs", "_display_catalogs"),
-            ("list-tables", "_display_tables"),
-            ("list-warehouses", "_display_warehouses"),
-            ("list-volumes", "_display_volumes"),
-            ("detailed-models", "_display_detailed_models"),
-            ("list-models", "_display_models"),
-        ]
-
-        for tool_name, expected_method in command_mappings:
-            with self.subTest(tool_name=tool_name):
-                # Mock the expected display method
-                with patch.object(self.tui, expected_method) as mock_method:
-                    # Call with appropriate test data structure based on what the TUI routing expects
-                    if tool_name == "list-models":
-                        # For list-models, the TUI checks if "models" key exists in the dict
-                        # If not, it calls _display_models with the dict itself
-                        # (which seems like a bug, but we're testing the current behavior)
-                        test_data = [
-                            {"name": "test_model", "creator": "test"}
-                        ]  # This will be passed to _display_models
-                    elif tool_name == "detailed-models":
-                        # For detailed-models, it expects "models" key in the dict
-                        test_data = {
-                            "models": [{"name": "test_model", "creator": "test"}]
-                        }
-                    else:
-                        test_data = {"test": "data"}
-                    self.tui._display_full_tool_output(tool_name, test_data)
-
-                    # Verify the correct method was called
-                    mock_method.assert_called_once_with(test_data)
-
-    def test_agent_display_setting_validation(self):
-        """
-        Test that validates ALL list commands have agent_display='full'.
-
-        This prevents regressions where commands might be added without proper display settings.
-        """
-        from chuck_data.commands import register_all_commands
-        from chuck_data.command_registry import get_command, get_agent_commands
-
-        register_all_commands()
-
-        # Get all agent-visible commands
-        agent_commands = get_agent_commands()
-
-        # Find all list-* commands
-        list_commands = [
-            name
-            for name in agent_commands.keys()
-            if name.startswith("list-") or name == "detailed-models"
-        ]
-
-        # Ensure we have the expected list commands
-        expected_list_commands = {
-            "list-schemas",
-            "list-catalogs",
-            "list-tables",
-            "list-warehouses",
-            "list-volumes",
-            "detailed-models",
-            "list-models",
-        }
-
-        found_commands = set(list_commands)
-        self.assertEqual(
-            found_commands,
-            expected_list_commands,
-            f"Expected list commands changed. Found: {found_commands}, Expected: {expected_list_commands}",
-        )
-
-        # Verify each has agent_display="full" (except list-warehouses, list-catalogs, list-schemas, and list-tables which use conditional display)
-        for cmd_name in list_commands:
-            with self.subTest(command=cmd_name):
-                cmd_def = get_command(cmd_name)
-                if cmd_name in [
-                    "list-warehouses",
-                    "list-catalogs",
-                    "list-schemas",
-                    "list-tables",
-                ]:
-                    # list-warehouses, list-catalogs, list-schemas, and list-tables use conditional display with display parameter
-                    self.assertEqual(
-                        cmd_def.agent_display,
-                        "conditional",
-                        f"Command {cmd_name} should use conditional display with display parameter control",
-                    )
-                    # Verify it has a display_condition function
-                    self.assertIsNotNone(
-                        cmd_def.display_condition,
-                        f"Command {cmd_name} with conditional display must have display_condition function",
-                    )
-                else:
-                    self.assertEqual(
-                        cmd_def.agent_display,
-                        "full",
-                        f"Command {cmd_name} must have agent_display='full' for table display",
-                    )
-
-    def test_end_to_end_agent_tool_execution_with_table_display(self):
-        """
-        Full end-to-end test: Execute an agent tool and verify it displays tables.
-
-        This test goes through the complete flow: agent calls tool -> tool executes ->
-        output callback triggers -> TUI displays formatted table.
-        """
-        from unittest.mock import MagicMock
-
-        # Mock an API client
-        mock_client = MagicMock()
-
-        # Mock console to capture display output
-        mock_console = MagicMock()
-        self.tui.console = mock_console
-
-        # Create a simple output callback that mimics agent behavior
-        def output_callback(tool_name, tool_data):
-            """This mimics how agents call display_tool_output"""
-            self.tui.display_tool_output(tool_name, tool_data)
-
-        # Test with list-schemas command
-        with patch("chuck_data.agent.tool_executor.get_command") as mock_get_command:
-            # Get the real command definition
-            from chuck_data.commands.list_schemas import DEFINITION as schemas_def
-            from chuck_data.commands import register_all_commands
-
-            register_all_commands()
-
-            mock_get_command.return_value = schemas_def
-
-            # Mock the handler to return test data
-            with patch.object(schemas_def, "handler") as mock_handler:
-                mock_handler.__name__ = "mock_handler"
-                mock_handler.return_value = CommandResult(
-                    True,
-                    data={
-                        "schemas": [
-                            {"name": "bronze", "comment": "Bronze layer"},
-                            {"name": "silver", "comment": "Silver layer"},
-                        ],
-                        "catalog_name": "test_catalog",
-                        "total_count": 2,
-                        "display": True,  # This triggers the display
-                    },
-                    message="Found 2 schemas",
-                )
-
-                # Execute the tool with output callback (mimics agent behavior)
-                # The output callback should raise PaginationCancelled which bubbles up
-                from chuck_data.exceptions import PaginationCancelled
-
-                with patch("chuck_data.agent.tool_executor.jsonschema.validate"):
-                    with self.assertRaises(PaginationCancelled):
-                        execute_tool(
-                            mock_client,
-                            "list-schemas",
-                            {"catalog_name": "test_catalog", "display": True},
-                            output_callback=output_callback,
-                        )
-
-                # Verify the callback triggered table display (not raw JSON)
-                mock_console.print.assert_called()
-
-                # Verify table-formatted output was displayed (use same approach as main test)
-                print_calls = mock_console.print.call_args_list
-
-                # Verify that Rich Table objects were printed (not raw JSON strings)
-                table_objects_found = False
-                raw_json_found = False
-
-                for call in print_calls:
-                    args, kwargs = call
-                    for arg in args:
-                        # Check if we're printing Rich Table objects (good)
-                        if hasattr(arg, "__class__") and "Table" in str(type(arg)):
-                            table_objects_found = True
-                        # Check if we're printing raw JSON strings (bad)
-                        elif isinstance(arg, str) and (
-                            '"schemas":' in arg or '"total_count":' in arg
-                        ):
-                            raw_json_found = True
-
-                # Verify we're displaying tables, not raw JSON
-                self.assertTrue(
-                    table_objects_found,
-                    "No Rich Table objects found - this indicates the regression",
-                )
-                self.assertFalse(
-                    raw_json_found,
-                    "Raw JSON strings found - this indicates the regression",
-                )
-
-    def test_list_commands_raise_pagination_cancelled_like_run_sql(self):
-        """
-        Test that list-* commands raise PaginationCancelled to return to chuck > prompt,
-        just like run-sql does.
-
-        This is the key behavior the user requested - list commands should show tables
-        and immediately return to chuck > prompt, not continue with agent processing.
-        """
-        from chuck_data.exceptions import PaginationCancelled
-        from unittest.mock import MagicMock
-
-        list_display_methods = [
-            (
-                "_display_schemas",
-                {"schemas": [{"name": "test"}], "catalog_name": "test"},
-            ),
-            ("_display_catalogs", {"catalogs": [{"name": "test"}]}),
-            (
-                "_display_tables",
-                {
-                    "tables": [{"name": "test"}],
-                    "catalog_name": "test",
-                    "schema_name": "test",
-                },
-            ),
-            ("_display_warehouses", {"warehouses": [{"name": "test", "id": "test"}]}),
-            (
-                "_display_volumes",
-                {
-                    "volumes": [{"name": "test"}],
-                    "catalog_name": "test",
-                    "schema_name": "test",
-                },
-            ),
-            (
-                "_display_models",
-                [{"name": "test", "creator": "test"}],
-            ),  # models expects a list directly
-            ("_display_detailed_models", {"models": [{"name": "test"}]}),
-        ]
-
-        for method_name, test_data in list_display_methods:
-            with self.subTest(method=method_name):
-                # Mock console to prevent actual output
-                mock_console = MagicMock()
-                self.tui.console = mock_console
-
-                # Get the display method
-                display_method = getattr(self.tui, method_name)
-
-                # Call the method and verify it raises PaginationCancelled
-                with self.assertRaises(
-                    PaginationCancelled,
-                    msg=f"{method_name} should raise PaginationCancelled to return to chuck > prompt",
-                ):
-                    display_method(test_data)
-
-                # Verify console output was called (table was displayed)
-                mock_console.print.assert_called()
-
-
-if __name__ == "__main__":
-    unittest.main()
diff --git a/tests/test_agent_tools.py b/tests/test_agent_tools.py
deleted file mode 100644
index 589940f..0000000
--- a/tests/test_agent_tools.py
+++ /dev/null
@@ -1,296 +0,0 @@
-"""
-Tests for the agent tool implementations.
-"""
-
-import unittest
-from unittest.mock import patch, MagicMock, Mock
-from jsonschema.exceptions import ValidationError
-from chuck_data.agent import (
-    execute_tool,
-    get_tool_schemas,
-)
-from chuck_data.commands.base import CommandResult
-
-
-class TestAgentTools(unittest.TestCase):
-    """Test cases for agent tool implementations."""
-
-    def setUp(self):
-        """Set up common test fixtures."""
-        self.mock_client = MagicMock()
-        self.mock_callback = MagicMock()
-
-    @patch("chuck_data.agent.tool_executor.get_command")
-    def test_execute_tool_unknown(self, mock_get_command):
-        """Test execute_tool with unknown tool name."""
-        # Configure the mock to return None for the unknown tool
-        mock_get_command.return_value = None
-
-        result = execute_tool(self.mock_client, "unknown_tool", {})
-
-        # Verify the command was looked up
-        mock_get_command.assert_called_once_with("unknown_tool")
-        # Verify the expected error response
-        self.assertEqual(result, {"error": "Tool 'unknown_tool' not found."})
-
-    @patch("chuck_data.agent.tool_executor.get_command")
-    def test_execute_tool_not_visible_to_agent(self, mock_get_command):
-        """Test execute_tool with a tool that's not visible to the agent."""
-        # Create a mock command definition that's not visible to agents
-        mock_command_def = Mock()
-        mock_command_def.visible_to_agent = False
-        mock_get_command.return_value = mock_command_def
-
-        result = execute_tool(self.mock_client, "hidden_tool", {})
-
-        # Verify proper error is returned
-        self.assertEqual(
-            result, {"error": "Tool 'hidden_tool' is not available to the agent."}
-        )
-        mock_get_command.assert_called_once_with("hidden_tool")
-
-    @patch("chuck_data.agent.tool_executor.get_command")
-    @patch("chuck_data.agent.tool_executor.jsonschema.validate")
-    def test_execute_tool_validation_error(self, mock_validate, mock_get_command):
-        """Test execute_tool with validation error."""
-        # Setup mock command definition
-        mock_command_def = Mock()
-        mock_command_def.visible_to_agent = True
-        mock_command_def.parameters = {"param1": {"type": "string"}}
-        mock_command_def.required_params = ["param1"]
-        mock_get_command.return_value = mock_command_def
-
-        # Setup validation error
-        mock_validate.side_effect = ValidationError(
-            "Invalid arguments", schema={"type": "object"}
-        )
-
-        result = execute_tool(self.mock_client, "test_tool", {})
-
-        # Verify an error response is returned containing the validation message
-        self.assertIn("error", result)
-        self.assertIn("Invalid arguments", result["error"])
-
-    @patch("chuck_data.agent.tool_executor.get_command")
-    @patch("chuck_data.agent.tool_executor.jsonschema.validate")
-    def test_execute_tool_success(self, mock_validate, mock_get_command):
-        """Test execute_tool with successful execution."""
-        # Setup mock command definition with handler name
-        mock_command_def = Mock()
-        mock_command_def.visible_to_agent = True
-        mock_command_def.parameters = {"param1": {"type": "string"}}
-        mock_command_def.required_params = ["param1"]
-        mock_command_def.needs_api_client = True
-        mock_command_def.output_formatter = None  # No output formatter
-
-        # Create a handler with a __name__ attribute
-        mock_handler = Mock()
-        mock_handler.__name__ = "mock_success_handler"
-        mock_command_def.handler = mock_handler
-
-        mock_get_command.return_value = mock_command_def
-
-        # Setup handler to return success
-        mock_handler.return_value = CommandResult(
-            True, data={"result": "success"}, message="Success"
-        )
-
-        result = execute_tool(self.mock_client, "test_tool", {"param1": "test"})
-
-        # Verify the handler was called with correct arguments
-        mock_handler.assert_called_once_with(self.mock_client, param1="test")
-        # Verify the successful result is returned
-        self.assertEqual(result, {"result": "success"})
-
-    @patch("chuck_data.agent.tool_executor.get_command")
-    @patch("chuck_data.agent.tool_executor.jsonschema.validate")
-    def test_execute_tool_success_with_callback(self, mock_validate, mock_get_command):
-        """Test execute_tool with successful execution and callback."""
-        # Setup mock command definition with handler name
-        mock_command_def = Mock()
-        mock_command_def.visible_to_agent = True
-        mock_command_def.parameters = {"param1": {"type": "string"}}
-        mock_command_def.required_params = ["param1"]
-        mock_command_def.needs_api_client = True
-        mock_command_def.output_formatter = None  # No output formatter
-
-        # Create a handler with a __name__ attribute
-        mock_handler = Mock()
-        mock_handler.__name__ = "mock_callback_handler"
-        mock_command_def.handler = mock_handler
-
-        mock_get_command.return_value = mock_command_def
-
-        # Setup handler to return success with data
-        mock_handler.return_value = CommandResult(
-            True, data={"result": "callback_test"}, message="Success"
-        )
-
-        result = execute_tool(
-            self.mock_client,
-            "test_tool",
-            {"param1": "test"},
-            output_callback=self.mock_callback,
-        )
-
-        # Verify the handler was called with correct arguments (including tool_output_callback)
-        mock_handler.assert_called_once_with(
-            self.mock_client, param1="test", tool_output_callback=self.mock_callback
-        )
-        # Verify the callback was called with tool name and data
-        self.mock_callback.assert_called_once_with(
-            "test_tool", {"result": "callback_test"}
-        )
-        # Verify the successful result is returned
-        self.assertEqual(result, {"result": "callback_test"})
-
-    @patch("chuck_data.agent.tool_executor.get_command")
-    @patch("chuck_data.agent.tool_executor.jsonschema.validate")
-    def test_execute_tool_success_callback_exception(
-        self, mock_validate, mock_get_command
-    ):
-        """Test execute_tool with callback that throws exception."""
-        # Setup mock command definition with handler name
-        mock_command_def = Mock()
-        mock_command_def.visible_to_agent = True
-        mock_command_def.parameters = {"param1": {"type": "string"}}
-        mock_command_def.required_params = ["param1"]
-        mock_command_def.needs_api_client = True
-        mock_command_def.output_formatter = None  # No output formatter
-
-        # Create a handler with a __name__ attribute
-        mock_handler = Mock()
-        mock_handler.__name__ = "mock_callback_exception_handler"
-        mock_command_def.handler = mock_handler
-
-        mock_get_command.return_value = mock_command_def
-
-        # Setup handler to return success with data
-        mock_handler.return_value = CommandResult(
-            True, data={"result": "callback_exception_test"}, message="Success"
-        )
-
-        # Setup callback to throw exception
-        self.mock_callback.side_effect = Exception("Callback failed")
-
-        result = execute_tool(
-            self.mock_client,
-            "test_tool",
-            {"param1": "test"},
-            output_callback=self.mock_callback,
-        )
-
-        # Verify the handler was called with correct arguments (including tool_output_callback)
-        mock_handler.assert_called_once_with(
-            self.mock_client, param1="test", tool_output_callback=self.mock_callback
-        )
-        # Verify the callback was called (and failed)
-        self.mock_callback.assert_called_once_with(
-            "test_tool", {"result": "callback_exception_test"}
-        )
-        # Verify the successful result is still returned despite callback failure
-        self.assertEqual(result, {"result": "callback_exception_test"})
-
-    @patch("chuck_data.agent.tool_executor.get_command")
-    @patch("chuck_data.agent.tool_executor.jsonschema.validate")
-    def test_execute_tool_success_no_data(self, mock_validate, mock_get_command):
-        """Test execute_tool with successful execution but no data."""
-        # Setup mock command definition with handler name
-        mock_command_def = Mock()
-        mock_command_def.visible_to_agent = True
-        mock_command_def.parameters = {"param1": {"type": "string"}}
-        mock_command_def.required_params = ["param1"]
-        mock_command_def.needs_api_client = True
-        mock_command_def.output_formatter = None  # No output formatter
-
-        # Create a handler with a __name__ attribute
-        mock_handler = Mock()
-        mock_handler.__name__ = "mock_no_data_handler"
-        mock_command_def.handler = mock_handler
-
-        mock_get_command.return_value = mock_command_def
-
-        # Setup handler to return success but no data
-        mock_handler.return_value = CommandResult(True, data=None, message="Success")
-
-        result = execute_tool(self.mock_client, "test_tool", {"param1": "test"})
-
-        # Verify the default success response is returned when no data
-        self.assertEqual(result, {"success": True, "message": "Success"})
-
-    @patch("chuck_data.agent.tool_executor.get_command")
-    @patch("chuck_data.agent.tool_executor.jsonschema.validate")
-    def test_execute_tool_failure(self, mock_validate, mock_get_command):
-        """Test execute_tool with handler failure."""
-        # Setup mock command definition with handler name
-        mock_command_def = Mock()
-        mock_command_def.visible_to_agent = True
-        mock_command_def.parameters = {"param1": {"type": "string"}}
-        mock_command_def.required_params = ["param1"]
-        mock_command_def.needs_api_client = True
-
-        # Create a handler with a __name__ attribute
-        mock_handler = Mock()
-        mock_handler.__name__ = "mock_failure_handler"
-        mock_command_def.handler = mock_handler
-
-        mock_get_command.return_value = mock_command_def
-
-        # Setup handler to return failure
-        error = ValueError("Test error")
-        mock_handler.return_value = CommandResult(False, error=error, message="Failed")
-
-        result = execute_tool(self.mock_client, "test_tool", {"param1": "test"})
-
-        # Verify error details are included in response
-        self.assertEqual(result, {"error": "Failed", "details": "Test error"})
-
-    @patch("chuck_data.agent.tool_executor.get_command")
-    @patch("chuck_data.agent.tool_executor.jsonschema.validate")
-    def test_execute_tool_handler_exception(self, mock_validate, mock_get_command):
-        """Test execute_tool with handler throwing exception."""
-        # Setup mock command definition with handler name
-        mock_command_def = Mock()
-        mock_command_def.visible_to_agent = True
-        mock_command_def.parameters = {"param1": {"type": "string"}}
-        mock_command_def.required_params = ["param1"]
-        mock_command_def.needs_api_client = True
-        mock_command_def.output_formatter = None  # No output formatter
-
-        # Create a handler with a __name__ attribute
-        mock_handler = Mock()
-        mock_handler.__name__ = "mock_exception_handler"
-        mock_command_def.handler = mock_handler
-
-        mock_get_command.return_value = mock_command_def
-
-        # Setup handler to throw exception
-        mock_handler.side_effect = Exception("Unexpected error")
-
-        result = execute_tool(self.mock_client, "test_tool", {"param1": "test"})
-
-        # Verify exception is caught and returned as error
-        self.assertIn("error", result)
-        self.assertIn("Unexpected error", result["error"])
-
-    @patch("chuck_data.agent.tool_executor.get_command_registry_tool_schemas")
-    def test_get_tool_schemas(self, mock_get_schemas):
-        """Test get_tool_schemas returns schemas from command registry."""
-        # Setup mock schemas
-        mock_schemas = [
-            {
-                "type": "function",
-                "function": {
-                    "name": "test_tool",
-                    "description": "Test tool",
-                    "parameters": {"type": "object", "properties": {}},
-                },
-            }
-        ]
-        mock_get_schemas.return_value = mock_schemas
-
-        schemas = get_tool_schemas()
-
-        # Verify schemas are returned correctly
-        self.assertEqual(schemas, mock_schemas)
-        mock_get_schemas.assert_called_once()
diff --git a/tests/test_catalogs.py b/tests/test_catalogs.py
deleted file mode 100644
index dab53d0..0000000
--- a/tests/test_catalogs.py
+++ /dev/null
@@ -1,317 +0,0 @@
-"""
-Tests for the catalogs module.
-"""
-
-import unittest
-from chuck_data.catalogs import (
-    list_catalogs,
-    get_catalog,
-    list_schemas,
-    get_schema,
-    list_tables,
-    get_table,
-)
-from tests.fixtures import DatabricksClientStub
-
-
-class TestCatalogs(unittest.TestCase):
-    """Test cases for the catalog-related functions."""
-
-    def setUp(self):
-        """Set up common test fixtures."""
-        self.client = DatabricksClientStub()
-
-    def test_list_catalogs_no_params(self):
-        """Test listing catalogs with no parameters."""
-        # Set up stub data
-        self.client.add_catalog("catalog1", type="MANAGED")
-        self.client.add_catalog("catalog2", type="EXTERNAL")
-        expected_response = {
-            "catalogs": [
-                {"name": "catalog1", "type": "MANAGED"},
-                {"name": "catalog2", "type": "EXTERNAL"},
-            ]
-        }
-
-        # Call the function
-        result = list_catalogs(self.client)
-
-        # Verify the result
-        self.assertEqual(result, expected_response)
-        # Verify the call was made with correct parameters
-        self.assertEqual(len(self.client.list_catalogs_calls), 1)
-        self.assertEqual(self.client.list_catalogs_calls[0], (False, None, None))
-
-    def test_list_catalogs_with_params(self):
-        """Test listing catalogs with all parameters."""
-        # Set up stub data (empty list)
-        expected_response = {"catalogs": []}
-
-        # Call the function with all parameters
-        result = list_catalogs(
-            self.client, include_browse=True, max_results=10, page_token="abc123"
-        )
-
-        # Verify the result
-        self.assertEqual(result, expected_response)
-        # Verify the call was made with correct parameters
-        self.assertEqual(len(self.client.list_catalogs_calls), 1)
-        self.assertEqual(self.client.list_catalogs_calls[0], (True, 10, "abc123"))
-
-    def test_get_catalog(self):
-        """Test getting a specific catalog."""
-        # Set up stub data
-        self.client.add_catalog("test-catalog", type="MANAGED")
-        catalog_detail = {"name": "test-catalog", "type": "MANAGED"}
-
-        # Call the function
-        result = get_catalog(self.client, "test-catalog")
-
-        # Verify the result
-        self.assertEqual(result, catalog_detail)
-        # Verify the call was made with correct parameters
-        self.assertEqual(len(self.client.get_catalog_calls), 1)
-        self.assertEqual(self.client.get_catalog_calls[0], ("test-catalog",))
-
-    def test_list_schemas_basic(self):
-        """Test listing schemas with only required parameters."""
-        # Set up stub data
-        self.client.add_catalog("catalog1")
-        self.client.add_schema("catalog1", "schema1", full_name="catalog1.schema1")
-        self.client.add_schema("catalog1", "schema2", full_name="catalog1.schema2")
-        expected_response = {
-            "schemas": [
-                {
-                    "name": "schema1",
-                    "catalog_name": "catalog1",
-                    "full_name": "catalog1.schema1",
-                },
-                {
-                    "name": "schema2",
-                    "catalog_name": "catalog1",
-                    "full_name": "catalog1.schema2",
-                },
-            ]
-        }
-
-        # Call the function with just the catalog name
-        result = list_schemas(self.client, "catalog1")
-
-        # Verify the result
-        self.assertEqual(result, expected_response)
-        # Verify the call was made with correct parameters
-        self.assertEqual(len(self.client.list_schemas_calls), 1)
-        self.assertEqual(
-            self.client.list_schemas_calls[0], ("catalog1", False, None, None)
-        )
-
-    def test_list_schemas_all_params(self):
-        """Test listing schemas with all parameters."""
-        # Set up stub data (empty catalog)
-        self.client.add_catalog("catalog1")
-        expected_response = {"schemas": []}
-
-        # Call the function with all parameters
-        result = list_schemas(
-            self.client,
-            catalog_name="catalog1",
-            include_browse=True,
-            max_results=20,
-            page_token="xyz789",
-        )
-
-        # Verify the result
-        self.assertEqual(result, expected_response)
-        # Verify the call was made with correct parameters
-        self.assertEqual(len(self.client.list_schemas_calls), 1)
-        self.assertEqual(
-            self.client.list_schemas_calls[0], ("catalog1", True, 20, "xyz789")
-        )
-
-    def test_get_schema(self):
-        """Test getting a specific schema."""
-        # Set up stub data
-        self.client.add_catalog("test-catalog")
-        self.client.add_schema(
-            "test-catalog", "test-schema", full_name="test-catalog.test-schema"
-        )
-        schema_detail = {
-            "name": "test-schema",
-            "catalog_name": "test-catalog",
-            "full_name": "test-catalog.test-schema",
-        }
-
-        # Call the function
-        result = get_schema(self.client, "test-catalog.test-schema")
-
-        # Verify the result
-        self.assertEqual(result, schema_detail)
-        # Verify the call was made with correct parameters
-        self.assertEqual(len(self.client.get_schema_calls), 1)
-        self.assertEqual(self.client.get_schema_calls[0], ("test-catalog.test-schema",))
-
-    def test_list_tables_basic(self):
-        """Test listing tables with only required parameters."""
-        # Set up stub data
-        self.client.add_catalog("test-catalog")
-        self.client.add_schema("test-catalog", "test-schema")
-        self.client.add_table(
-            "test-catalog", "test-schema", "table1", table_type="MANAGED"
-        )
-        expected_response = {
-            "tables": [
-                {
-                    "name": "table1",
-                    "table_type": "MANAGED",
-                    "full_name": "test-catalog.test-schema.table1",
-                    "catalog_name": "test-catalog",
-                    "schema_name": "test-schema",
-                    "comment": "",
-                    "created_at": "2023-01-01T00:00:00Z",
-                    "created_by": "test.user@example.com",
-                    "owner": "test.user@example.com",
-                    "columns": [],
-                    "properties": {},
-                }
-            ],
-            "next_page_token": None,
-        }
-
-        # Call the function with just the required parameters
-        result = list_tables(self.client, "test-catalog", "test-schema")
-
-        # Verify the result
-        self.assertEqual(result, expected_response)
-        # Verify the call was made with correct parameters
-        self.assertEqual(len(self.client.list_tables_calls), 1)
-        self.assertEqual(
-            self.client.list_tables_calls[0],
-            (
-                "test-catalog",
-                "test-schema",
-                None,
-                None,
-                False,
-                False,
-                False,
-                False,
-                False,
-                False,
-            ),
-        )
-
-    def test_list_tables_all_params(self):
-        """Test listing tables with all parameters."""
-        # Set up stub data (empty schema)
-        self.client.add_catalog("test-catalog")
-        self.client.add_schema("test-catalog", "test-schema")
-        expected_response = {"tables": [], "next_page_token": None}
-
-        # Call the function with all parameters
-        result = list_tables(
-            self.client,
-            catalog_name="test-catalog",
-            schema_name="test-schema",
-            max_results=30,
-            page_token="page123",
-            include_delta_metadata=True,
-            omit_columns=True,
-            omit_properties=True,
-            omit_username=True,
-            include_browse=True,
-            include_manifest_capabilities=True,
-        )
-
-        # Verify the result
-        self.assertEqual(result, expected_response)
-        # Verify the call was made with correct parameters
-        self.assertEqual(len(self.client.list_tables_calls), 1)
-        self.assertEqual(
-            self.client.list_tables_calls[0],
-            (
-                "test-catalog",
-                "test-schema",
-                30,
-                "page123",
-                True,
-                True,
-                True,
-                True,
-                True,
-                True,
-            ),
-        )
-
-    def test_get_table_basic(self):
-        """Test getting a specific table with no parameters."""
-        # Set up stub data
-        self.client.add_catalog("test-catalog")
-        self.client.add_schema("test-catalog", "test-schema")
-        self.client.add_table(
-            "test-catalog", "test-schema", "test-table", table_type="MANAGED"
-        )
-        table_detail = {
-            "name": "test-table",
-            "full_name": "test-catalog.test-schema.test-table",
-            "table_type": "MANAGED",
-            "catalog_name": "test-catalog",
-            "schema_name": "test-schema",
-            "comment": "",
-            "created_at": "2023-01-01T00:00:00Z",
-            "created_by": "test.user@example.com",
-            "owner": "test.user@example.com",
-            "columns": [],
-            "properties": {},
-        }
-
-        # Call the function with just the table name
-        result = get_table(self.client, "test-catalog.test-schema.test-table")
-
-        # Verify the result
-        self.assertEqual(result, table_detail)
-        # Verify the call was made with correct parameters
-        self.assertEqual(len(self.client.get_table_calls), 1)
-        self.assertEqual(
-            self.client.get_table_calls[0],
-            ("test-catalog.test-schema.test-table", False, False, False),
-        )
-
-    def test_get_table_all_params(self):
-        """Test getting a specific table with all parameters."""
-        # Set up stub data
-        self.client.add_catalog("test-catalog")
-        self.client.add_schema("test-catalog", "test-schema")
-        self.client.add_table(
-            "test-catalog", "test-schema", "test-table", table_type="MANAGED"
-        )
-        table_detail = {
-            "name": "test-table",
-            "table_type": "MANAGED",
-            "full_name": "test-catalog.test-schema.test-table",
-            "catalog_name": "test-catalog",
-            "schema_name": "test-schema",
-            "comment": "",
-            "created_at": "2023-01-01T00:00:00Z",
-            "created_by": "test.user@example.com",
-            "owner": "test.user@example.com",
-            "columns": [],
-            "properties": {},
-        }
-
-        # Call the function with all parameters
-        result = get_table(
-            self.client,
-            "test-catalog.test-schema.test-table",
-            include_delta_metadata=True,
-            include_browse=True,
-            include_manifest_capabilities=True,
-        )
-
-        # Verify the result
-        self.assertEqual(result, table_detail)
-        # Verify the call was made with correct parameters
-        self.assertEqual(len(self.client.get_table_calls), 1)
-        self.assertEqual(
-            self.client.get_table_calls[0],
-            ("test-catalog.test-schema.test-table", True, True, True),
-        )
diff --git a/tests/test_chuck.py b/tests/test_chuck.py
deleted file mode 100644
index 8f35653..0000000
--- a/tests/test_chuck.py
+++ /dev/null
@@ -1,38 +0,0 @@
-"""Unit tests for the Chuck TUI."""
-
-import unittest
-from unittest.mock import patch, MagicMock
-
-
-class TestChuckTUI(unittest.TestCase):
-    """Test cases for the Chuck TUI."""
-
-    @patch("chuck_data.__main__.ChuckTUI")
-    @patch("chuck_data.__main__.setup_logging")
-    def test_main_runs_tui(self, mock_setup_logging, mock_chuck_tui):
-        """Test that the main function calls ChuckTUI.run()."""
-        mock_instance = MagicMock()
-        mock_chuck_tui.return_value = mock_instance
-
-        from chuck_data.__main__ import main
-
-        main([])
-
-        mock_chuck_tui.assert_called_once_with(no_color=False)
-        mock_instance.run.assert_called_once()
-
-    def test_version_flag(self):
-        """Running with --version should exit after printing version."""
-        import io
-        from chuck_data.__main__ import main
-        from chuck_data.version import __version__
-
-        with patch("sys.stdout", new_callable=io.StringIO) as mock_stdout:
-            with self.assertRaises(SystemExit) as cm:
-                main(["--version"])
-            self.assertEqual(cm.exception.code, 0)
-        self.assertIn(f"chuck-data {__version__}", mock_stdout.getvalue())
-
-
-if __name__ == "__main__":
-    unittest.main()
diff --git a/tests/test_clients_databricks.py b/tests/test_clients_databricks.py
deleted file mode 100644
index 7c94811..0000000
--- a/tests/test_clients_databricks.py
+++ /dev/null
@@ -1,174 +0,0 @@
-"""Tests for the DatabricksAPIClient class."""
-
-import unittest
-from unittest.mock import patch, MagicMock
-import requests
-from chuck_data.clients.databricks import DatabricksAPIClient
-
-
-class TestDatabricksClient(unittest.TestCase):
-    """Unit tests for the DatabricksAPIClient class."""
-
-    def setUp(self):
-        """Set up the test environment."""
-        self.workspace_url = "test-workspace"
-        self.token = "fake-token"
-        self.client = DatabricksAPIClient(self.workspace_url, self.token)
-
-    def test_workspace_url_normalization(self):
-        """Test that workspace URLs are normalized correctly."""
-        test_cases = [
-            ("workspace", "workspace"),
-            ("https://workspace", "workspace"),
-            ("http://workspace", "workspace"),
-            ("workspace.cloud.databricks.com", "workspace"),
-            ("https://workspace.cloud.databricks.com", "workspace"),
-            ("https://workspace.cloud.databricks.com/", "workspace"),
-            ("dbc-12345-ab", "dbc-12345-ab"),
-            # Azure test cases
-            ("adb-3856707039489412.12.azuredatabricks.net", "adb-3856707039489412.12"),
-            (
-                "https://adb-3856707039489412.12.azuredatabricks.net",
-                "adb-3856707039489412.12",
-            ),
-            ("workspace.azuredatabricks.net", "workspace"),
-            # GCP test cases
-            ("workspace.gcp.databricks.com", "workspace"),
-            ("https://workspace.gcp.databricks.com", "workspace"),
-        ]
-
-        for input_url, expected_url in test_cases:
-            client = DatabricksAPIClient(input_url, "token")
-            self.assertEqual(
-                client.workspace_url,
-                expected_url,
-                f"URL should be normalized: {input_url} -> {expected_url}",
-            )
-
-    def test_azure_domain_detection_and_url_construction(self):
-        """Test that Azure domains are detected correctly and URLs are constructed properly."""
-        azure_client = DatabricksAPIClient(
-            "adb-3856707039489412.12.azuredatabricks.net", "token"
-        )
-
-        # Check that cloud provider is detected correctly
-        self.assertEqual(azure_client.cloud_provider, "Azure")
-        self.assertEqual(azure_client.base_domain, "azuredatabricks.net")
-        self.assertEqual(azure_client.workspace_url, "adb-3856707039489412.12")
-
-    def test_gcp_domain_detection_and_url_construction(self):
-        """Test that GCP domains are detected correctly and URLs are constructed properly."""
-        gcp_client = DatabricksAPIClient("workspace.gcp.databricks.com", "token")
-
-        # Check that cloud provider is detected correctly
-        self.assertEqual(gcp_client.cloud_provider, "GCP")
-        self.assertEqual(gcp_client.base_domain, "gcp.databricks.com")
-        self.assertEqual(gcp_client.workspace_url, "workspace")
-
-    @patch("chuck_data.clients.databricks.requests.get")
-    def test_get_success(self, mock_get):
-        """Test successful GET request."""
-        mock_response = MagicMock()
-        mock_response.json.return_value = {"key": "value"}
-        mock_get.return_value = mock_response
-
-        response = self.client.get("/test-endpoint")
-        self.assertEqual(response, {"key": "value"})
-        mock_get.assert_called_once_with(
-            "https://test-workspace.cloud.databricks.com/test-endpoint",
-            headers={
-                "Authorization": "Bearer fake-token",
-                "User-Agent": "amperity",
-            },
-        )
-
-    @patch("chuck_data.clients.databricks.requests.get")
-    def test_get_http_error(self, mock_get):
-        """Test GET request with HTTP error."""
-        mock_response = MagicMock()
-        mock_response.raise_for_status.side_effect = requests.exceptions.HTTPError(
-            "HTTP 404"
-        )
-        mock_response.text = "Not Found"
-        mock_get.return_value = mock_response
-
-        with self.assertRaises(ValueError) as context:
-            self.client.get("/test-endpoint")
-
-        self.assertIn("HTTP error occurred", str(context.exception))
-        self.assertIn("Not Found", str(context.exception))
-
-    @patch("chuck_data.clients.databricks.requests.get")
-    def test_get_connection_error(self, mock_get):
-        """Test GET request with connection error."""
-        mock_get.side_effect = requests.exceptions.ConnectionError("Connection failed")
-
-        with self.assertRaises(ConnectionError) as context:
-            self.client.get("/test-endpoint")
-
-        self.assertIn("Connection error occurred", str(context.exception))
-
-    @patch("chuck_data.clients.databricks.requests.post")
-    def test_post_success(self, mock_post):
-        """Test successful POST request."""
-        mock_response = MagicMock()
-        mock_response.json.return_value = {"key": "value"}
-        mock_post.return_value = mock_response
-
-        response = self.client.post("/test-endpoint", {"data": "test"})
-        self.assertEqual(response, {"key": "value"})
-        mock_post.assert_called_once_with(
-            "https://test-workspace.cloud.databricks.com/test-endpoint",
-            headers={
-                "Authorization": "Bearer fake-token",
-                "User-Agent": "amperity",
-            },
-            json={"data": "test"},
-        )
-
-    @patch("chuck_data.clients.databricks.requests.post")
-    def test_post_http_error(self, mock_post):
-        """Test POST request with HTTP error."""
-        mock_response = MagicMock()
-        mock_response.raise_for_status.side_effect = requests.exceptions.HTTPError(
-            "HTTP 400"
-        )
-        mock_response.text = "Bad Request"
-        mock_post.return_value = mock_response
-
-        with self.assertRaises(ValueError) as context:
-            self.client.post("/test-endpoint", {"data": "test"})
-
-        self.assertIn("HTTP error occurred", str(context.exception))
-        self.assertIn("Bad Request", str(context.exception))
-
-    @patch("chuck_data.clients.databricks.requests.post")
-    def test_post_connection_error(self, mock_post):
-        """Test POST request with connection error."""
-        mock_post.side_effect = requests.exceptions.ConnectionError("Connection failed")
-
-        with self.assertRaises(ConnectionError) as context:
-            self.client.post("/test-endpoint", {"data": "test"})
-
-        self.assertIn("Connection error occurred", str(context.exception))
-
-    @patch("chuck_data.clients.databricks.requests.post")
-    def test_fetch_amperity_job_init_http_error(self, mock_post):
-        """fetch_amperity_job_init should show helpful message on HTTP errors."""
-        mock_response = MagicMock()
-        mock_response.raise_for_status.side_effect = requests.exceptions.HTTPError(
-            "HTTP 401", response=mock_response
-        )
-        mock_response.status_code = 401
-        mock_response.text = '{"status":401,"message":"Unauthorized"}'
-        mock_response.json.return_value = {
-            "status": 401,
-            "message": "Unauthorized",
-        }
-        mock_post.return_value = mock_response
-
-        with self.assertRaises(ValueError) as context:
-            self.client.fetch_amperity_job_init("fake-token")
-
-        self.assertIn("401 Error", str(context.exception))
-        self.assertIn("Please /logout and /login again", str(context.exception))
diff --git a/tests/test_config.py b/tests/test_config.py
deleted file mode 100644
index b1ffa8f..0000000
--- a/tests/test_config.py
+++ /dev/null
@@ -1,289 +0,0 @@
-"""Tests for the configuration functionality in Chuck."""
-
-import unittest
-import os
-import json
-import tempfile
-from unittest.mock import patch
-
-from chuck_data.config import (
-    ConfigManager,
-    get_workspace_url,
-    set_workspace_url,
-    get_active_model,
-    set_active_model,
-    get_warehouse_id,
-    set_warehouse_id,
-    get_active_catalog,
-    set_active_catalog,
-    get_active_schema,
-    set_active_schema,
-    get_databricks_token,
-    set_databricks_token,
-)
-
-
-class TestPydanticConfig(unittest.TestCase):
-    """Test cases for Pydantic-based configuration."""
-
-    def setUp(self):
-        """Set up the test environment."""
-        # Create a temporary file for testing
-        self.temp_dir = tempfile.TemporaryDirectory()
-        self.config_path = os.path.join(self.temp_dir.name, "test_config.json")
-
-        # Create a test-specific config manager
-        self.config_manager = ConfigManager(self.config_path)
-
-        # Mock the global config manager
-        self.patcher = patch("chuck_data.config._config_manager", self.config_manager)
-        self.mock_manager = self.patcher.start()
-
-    def tearDown(self):
-        """Clean up after tests."""
-        self.patcher.stop()
-        self.temp_dir.cleanup()
-
-    def test_default_config(self):
-        """Test default configuration values."""
-        config = self.config_manager.get_config()
-        # No longer expecting a specific default workspace URL since we now preserve full URLs
-        # and the default might be None until explicitly set
-        self.assertIsNone(config.active_model)
-        self.assertIsNone(config.warehouse_id)
-        self.assertIsNone(config.active_catalog)
-        self.assertIsNone(config.active_schema)
-
-    def test_config_update(self):
-        """Test updating configuration values."""
-        # Mock out environment variables that could interfere
-        with patch.dict(os.environ, {}, clear=True):
-            # Update values
-            self.config_manager.update(
-                workspace_url="test-workspace",
-                active_model="test-model",
-                warehouse_id="test-warehouse",
-                active_catalog="test-catalog",
-                active_schema="test-schema",
-            )
-
-            # Check values were updated in memory
-            config = self.config_manager.get_config()
-            self.assertEqual(config.workspace_url, "test-workspace")
-            self.assertEqual(config.active_model, "test-model")
-            self.assertEqual(config.warehouse_id, "test-warehouse")
-            self.assertEqual(config.active_catalog, "test-catalog")
-            self.assertEqual(config.active_schema, "test-schema")
-
-            # Check file was created
-            self.assertTrue(os.path.exists(self.config_path))
-
-            # Check file contents
-            with open(self.config_path, "r") as f:
-                saved_config = json.load(f)
-
-            self.assertEqual(saved_config["workspace_url"], "test-workspace")
-            self.assertEqual(saved_config["active_model"], "test-model")
-            self.assertEqual(saved_config["warehouse_id"], "test-warehouse")
-            self.assertEqual(saved_config["active_catalog"], "test-catalog")
-            self.assertEqual(saved_config["active_schema"], "test-schema")
-
-    def test_config_load_save_cycle(self):
-        """Test loading and saving configuration."""
-        # Mock out environment variables that could interfere
-        with patch.dict(os.environ, {}, clear=True):
-            # Set test values
-            test_url = (
-                "https://test-workspace.cloud.databricks.com"  # Need valid URL string
-            )
-            test_model = "test-model"
-            test_warehouse = "warehouse-id-123"
-
-            # Update config values using the update method
-            self.config_manager.update(
-                workspace_url=test_url,
-                active_model=test_model,
-                warehouse_id=test_warehouse,
-            )
-
-            # Create a new manager to load from disk
-            another_manager = ConfigManager(self.config_path)
-            config = another_manager.get_config()
-
-            # Verify saved values were loaded
-            self.assertEqual(config.workspace_url, test_url)
-            self.assertEqual(config.active_model, test_model)
-            self.assertEqual(config.warehouse_id, test_warehouse)
-
-    def test_api_functions(self):
-        """Test compatibility API functions."""
-        # Mock out environment variable that could interfere
-        with patch.dict(os.environ, {}, clear=True):
-            # Set values using API functions
-            set_workspace_url("api-workspace")
-            set_active_model("api-model")
-            set_warehouse_id("api-warehouse")
-            set_active_catalog("api-catalog")
-            set_active_schema("api-schema")
-
-            # Check values using API functions
-            self.assertEqual(get_workspace_url(), "api-workspace")
-            self.assertEqual(get_active_model(), "api-model")
-            self.assertEqual(get_warehouse_id(), "api-warehouse")
-            self.assertEqual(get_active_catalog(), "api-catalog")
-            self.assertEqual(get_active_schema(), "api-schema")
-
-    def test_environment_override(self):
-        """Test environment variable override for all config values."""
-        # Start with clean environment, set config values
-        with patch.dict(os.environ, {}, clear=True):
-            set_workspace_url("config-workspace")
-            set_active_model("config-model")
-            set_warehouse_id("config-warehouse")
-            set_active_catalog("config-catalog")
-            set_active_schema("config-schema")
-
-            # Test CHUCK_ prefix environment variables take precedence
-            with patch.dict(
-                os.environ,
-                {
-                    "CHUCK_WORKSPACE_URL": "chuck-workspace",
-                    "CHUCK_ACTIVE_MODEL": "chuck-model",
-                    "CHUCK_WAREHOUSE_ID": "chuck-warehouse",
-                    "CHUCK_ACTIVE_CATALOG": "chuck-catalog",
-                    "CHUCK_ACTIVE_SCHEMA": "chuck-schema",
-                    "CHUCK_USAGE_TRACKING_CONSENT": "true",
-                },
-            ):
-                config = self.config_manager.get_config()
-                self.assertEqual(config.workspace_url, "chuck-workspace")
-                self.assertEqual(config.active_model, "chuck-model")
-                self.assertEqual(config.warehouse_id, "chuck-warehouse")
-                self.assertEqual(config.active_catalog, "chuck-catalog")
-                self.assertEqual(config.active_schema, "chuck-schema")
-                self.assertTrue(config.usage_tracking_consent)
-
-            # Test without environment variables fall back to config
-            config = self.config_manager.get_config()
-            self.assertEqual(config.workspace_url, "config-workspace")
-
-    def test_graceful_validation(self):
-        """Test configuration validation is graceful."""
-        # Mock out environment variables that could interfere
-        with patch.dict(os.environ, {}, clear=True):
-            # Set a valid URL that we'll use for testing
-            test_url = "https://valid-workspace.cloud.databricks.com"
-
-            # First test with a valid configuration
-            self.config_manager.update(workspace_url=test_url)
-
-            # Verify the URL was saved correctly
-            reloaded_config = self.config_manager.get_config()
-            self.assertEqual(reloaded_config.workspace_url, test_url)
-
-            # Now test with an empty URL string
-            self.config_manager.update(workspace_url="")
-
-            # With empty string, config validation should handle it - either use default or keep empty
-            reloaded_config = self.config_manager.get_config()
-            # We don't assert exact value because validation might reject empty strings
-            self.assertTrue(
-                isinstance(reloaded_config.workspace_url, str),
-                "Workspace URL should be a string type",
-            )
-
-            # Test other fields
-            self.config_manager.update(
-                workspace_url=test_url,  # Reset to valid URL
-                active_model="",
-                warehouse_id=None,
-            )
-
-            # Verify the values were saved correctly
-            reloaded_config = self.config_manager.get_config()
-            self.assertEqual(reloaded_config.active_model, "")
-            self.assertIsNone(reloaded_config.warehouse_id)
-
-    def test_singleton_pattern(self):
-        """Test that ConfigManager follows singleton pattern."""
-        # Using same path should return same instance
-        test_path = os.path.join(self.temp_dir.name, "singleton_test.json")
-        manager1 = ConfigManager(test_path)
-        manager2 = ConfigManager(test_path)
-
-        # Same instance when using same path
-        self.assertIs(manager1, manager2)
-
-        # Different paths should be different instances in tests
-        other_path = os.path.join(self.temp_dir.name, "other_test.json")
-        manager3 = ConfigManager(other_path)
-        self.assertIsNot(manager1, manager3)
-
-    def test_databricks_token(self):
-        """Test Databricks token getter and setter functions."""
-        # Initialize config with a valid workspace URL to avoid validation errors
-        test_url = "test-workspace"
-        set_workspace_url(test_url)
-
-        # Test with no token set initially (should be None by default)
-        initial_token = get_databricks_token()
-        self.assertIsNone(initial_token)
-
-        # Set token and verify it's stored correctly
-        test_token = "dapi1234567890abcdef"
-        set_databricks_token(test_token)
-
-        # Check value was set in memory
-        self.assertEqual(get_databricks_token(), test_token)
-
-        # Check file was updated
-        with open(self.config_path, "r") as f:
-            saved_config = json.load(f)
-        self.assertEqual(saved_config["databricks_token"], test_token)
-
-        # Create a new manager to verify it loads from disk
-        another_manager = ConfigManager(self.config_path)
-        config = another_manager.get_config()
-        self.assertEqual(config.databricks_token, test_token)
-
-    def test_needs_setup_method(self):
-        """Test the needs_setup method for determining first-time setup requirement."""
-        # Test with no config - should need setup
-        with patch.dict(os.environ, {}, clear=True):
-            self.assertTrue(self.config_manager.needs_setup())
-
-        # Test with partial config - should still need setup
-        with patch.dict(
-            os.environ, {"CHUCK_WORKSPACE_URL": "test-workspace"}, clear=True
-        ):
-            self.assertTrue(self.config_manager.needs_setup())
-
-        # Test with complete config via environment variables - should not need setup
-        with patch.dict(
-            os.environ,
-            {
-                "CHUCK_WORKSPACE_URL": "test-workspace",
-                "CHUCK_AMPERITY_TOKEN": "test-amperity-token",
-                "CHUCK_DATABRICKS_TOKEN": "test-databricks-token",
-                "CHUCK_ACTIVE_MODEL": "test-model",
-            },
-            clear=True,
-        ):
-            self.assertFalse(self.config_manager.needs_setup())
-
-        # Test with complete config in file - should not need setup
-        with patch.dict(os.environ, {}, clear=True):
-            self.config_manager.update(
-                workspace_url="file-workspace",
-                amperity_token="file-amperity-token",
-                databricks_token="file-databricks-token",
-                active_model="file-model",
-            )
-            self.assertFalse(self.config_manager.needs_setup())
-
-    @patch("chuck_data.config.clear_agent_history")
-    def test_set_active_model_clears_history(self, mock_clear_history):
-        """Ensure agent history is cleared when switching models."""
-        with patch.dict(os.environ, {}, clear=True):
-            set_active_model("new-model")
-            mock_clear_history.assert_called_once()
diff --git a/tests/test_databricks_auth.py b/tests/test_databricks_auth.py
deleted file mode 100644
index 7fec5d3..0000000
--- a/tests/test_databricks_auth.py
+++ /dev/null
@@ -1,146 +0,0 @@
-"""Unit tests for the Databricks auth utilities."""
-
-import unittest
-import os
-from unittest.mock import patch, MagicMock
-from chuck_data.databricks_auth import get_databricks_token, validate_databricks_token
-
-
-class TestDatabricksAuth(unittest.TestCase):
-    """Test cases for authentication functionality."""
-
-    @patch("os.getenv", return_value="mock_env_token")
-    @patch("chuck_data.databricks_auth.get_token_from_config", return_value=None)
-    @patch("logging.info")
-    def test_get_databricks_token_from_env(
-        self, mock_log, mock_config_token, mock_getenv
-    ):
-        """
-        Test that the token is retrieved from environment when not in config.
-
-        This validates the fallback to environment variable when config doesn't have a token.
-        """
-        token = get_databricks_token()
-        self.assertEqual(token, "mock_env_token")
-        mock_config_token.assert_called_once()
-        mock_getenv.assert_called_once_with("DATABRICKS_TOKEN")
-        mock_log.assert_called_once()
-
-    @patch("os.getenv", return_value="mock_env_token")
-    @patch(
-        "chuck_data.databricks_auth.get_token_from_config",
-        return_value="mock_config_token",
-    )
-    def test_get_databricks_token_from_config(self, mock_config_token, mock_getenv):
-        """
-        Test that the token is retrieved from config first when available.
-
-        This validates that config is prioritized over environment variable.
-        """
-        token = get_databricks_token()
-        self.assertEqual(token, "mock_config_token")
-        mock_config_token.assert_called_once()
-        # Environment variable should not be checked when config has token
-        mock_getenv.assert_not_called()
-
-    @patch("os.getenv", return_value=None)
-    @patch("chuck_data.databricks_auth.get_token_from_config", return_value=None)
-    def test_get_databricks_token_missing(self, mock_config_token, mock_getenv):
-        """
-        Test behavior when token is not available in config or environment.
-
-        This validates error handling when the required token is missing from both sources.
-        """
-        with self.assertRaises(EnvironmentError) as context:
-            get_databricks_token()
-        self.assertIn("Databricks token not found", str(context.exception))
-        mock_config_token.assert_called_once()
-        mock_getenv.assert_called_once_with("DATABRICKS_TOKEN")
-
-    @patch("chuck_data.clients.databricks.DatabricksAPIClient.validate_token")
-    @patch(
-        "chuck_data.databricks_auth.get_workspace_url", return_value="test-workspace"
-    )
-    def test_validate_databricks_token_success(self, mock_workspace_url, mock_validate):
-        """
-        Test successful validation of a Databricks token.
-
-        This validates the API call structure and successful response handling.
-        """
-        mock_validate.return_value = True
-
-        result = validate_databricks_token("mock_token")
-
-        self.assertTrue(result)
-        mock_validate.assert_called_once()
-
-    def test_workspace_url_defined(self):
-        """
-        Test that the workspace URL can be retrieved from the configuration.
-
-        This is more of a smoke test to ensure the function exists and returns a value.
-        """
-        from chuck_data.config import get_workspace_url, _config_manager
-
-        # Patch the config manager to provide a workspace URL
-        mock_config = MagicMock()
-        mock_config.workspace_url = "test-workspace"
-        with patch.object(_config_manager, "get_config", return_value=mock_config):
-            workspace_url = get_workspace_url()
-            self.assertEqual(workspace_url, "test-workspace")
-
-    @patch("chuck_data.clients.databricks.DatabricksAPIClient.validate_token")
-    @patch(
-        "chuck_data.databricks_auth.get_workspace_url", return_value="test-workspace"
-    )
-    @patch("logging.error")
-    def test_validate_databricks_token_failure(
-        self, mock_log, mock_workspace_url, mock_validate
-    ):
-        """
-        Test failed validation of a Databricks token.
-
-        This validates error handling for invalid or expired tokens.
-        """
-        mock_validate.return_value = False
-
-        result = validate_databricks_token("mock_token")
-
-        self.assertFalse(result)
-        mock_validate.assert_called_once()
-
-    @patch("chuck_data.clients.databricks.DatabricksAPIClient.validate_token")
-    @patch(
-        "chuck_data.databricks_auth.get_workspace_url", return_value="test-workspace"
-    )
-    @patch("logging.error")
-    def test_validate_databricks_token_connection_error(
-        self, mock_log, mock_workspace_url, mock_validate
-    ):
-        """
-        Test failed validation due to connection error.
-
-        This validates network error handling during token validation.
-        """
-        mock_validate.side_effect = ConnectionError("Connection Error")
-
-        # The function should still raise ConnectionError for connection errors
-        with self.assertRaises(ConnectionError) as context:
-            validate_databricks_token("mock_token")
-
-        self.assertIn("Connection Error", str(context.exception))
-        # Verify errors were logged - may be multiple logs for connection errors
-        self.assertTrue(mock_log.call_count >= 1, "Error logging was expected")
-
-    @patch.dict(os.environ, {"DATABRICKS_TOKEN": "test_env_token"})
-    @patch("chuck_data.databricks_auth.get_token_from_config", return_value=None)
-    @patch("logging.info")
-    def test_get_databricks_token_from_real_env(self, mock_log, mock_config_token):
-        """
-        Test retrieving token from actual environment variable when not in config.
-
-        This test checks actual environment integration rather than mocked calls.
-        """
-        token = get_databricks_token()
-        self.assertEqual(token, "test_env_token")
-        mock_config_token.assert_called_once()
diff --git a/tests/test_databricks_client.py b/tests/test_databricks_client.py
deleted file mode 100644
index 03bc8d6..0000000
--- a/tests/test_databricks_client.py
+++ /dev/null
@@ -1,408 +0,0 @@
-"""Tests for the DatabricksAPIClient class."""
-
-import unittest
-from unittest.mock import patch, MagicMock, mock_open
-import requests
-from chuck_data.clients.databricks import DatabricksAPIClient
-
-
-class TestDatabricksAPIClient(unittest.TestCase):
-    """Unit tests for the DatabricksAPIClient class."""
-
-    def setUp(self):
-        """Set up the test environment."""
-        self.workspace_url = "test-workspace"
-        self.token = "fake-token"
-        self.client = DatabricksAPIClient(self.workspace_url, self.token)
-
-    def test_normalize_workspace_url(self):
-        """Test URL normalization."""
-        test_cases = [
-            ("workspace", "workspace"),
-            ("https://workspace", "workspace"),
-            ("http://workspace", "workspace"),
-            ("workspace.cloud.databricks.com", "workspace"),
-            ("https://workspace.cloud.databricks.com", "workspace"),
-            ("https://workspace.cloud.databricks.com/", "workspace"),
-            ("dbc-12345-ab", "dbc-12345-ab"),
-            # Azure test cases
-            ("adb-3856707039489412.12.azuredatabricks.net", "adb-3856707039489412.12"),
-            (
-                "https://adb-3856707039489412.12.azuredatabricks.net",
-                "adb-3856707039489412.12",
-            ),
-            ("workspace.azuredatabricks.net", "workspace"),
-            # GCP test cases
-            ("workspace.gcp.databricks.com", "workspace"),
-            ("https://workspace.gcp.databricks.com", "workspace"),
-        ]
-
-        for input_url, expected_url in test_cases:
-            result = self.client._normalize_workspace_url(input_url)
-            self.assertEqual(result, expected_url)
-
-    def test_azure_client_url_construction(self):
-        """Test that Azure client constructs URLs with correct domain."""
-        azure_client = DatabricksAPIClient(
-            "adb-3856707039489412.12.azuredatabricks.net", "token"
-        )
-
-        # Check that cloud provider is detected correctly
-        self.assertEqual(azure_client.cloud_provider, "Azure")
-        self.assertEqual(azure_client.base_domain, "azuredatabricks.net")
-        self.assertEqual(azure_client.workspace_url, "adb-3856707039489412.12")
-
-    def test_base_domain_map(self):
-        """Ensure _get_base_domain uses the shared domain map."""
-        from chuck_data.databricks.url_utils import DATABRICKS_DOMAIN_MAP
-
-        for provider, domain in DATABRICKS_DOMAIN_MAP.items():
-            with self.subTest(provider=provider):
-                client = DatabricksAPIClient("workspace", "token")
-                client.cloud_provider = provider
-                self.assertEqual(client._get_base_domain(), domain)
-
-    @patch("requests.get")
-    def test_azure_get_request_url(self, mock_get):
-        """Test that Azure client constructs correct URLs for GET requests."""
-        azure_client = DatabricksAPIClient(
-            "adb-3856707039489412.12.azuredatabricks.net", "token"
-        )
-        mock_response = MagicMock()
-        mock_response.json.return_value = {"key": "value"}
-        mock_get.return_value = mock_response
-
-        azure_client.get("/test-endpoint")
-
-        mock_get.assert_called_once_with(
-            "https://adb-3856707039489412.12.azuredatabricks.net/test-endpoint",
-            headers={
-                "Authorization": "Bearer token",
-                "User-Agent": "amperity",
-            },
-        )
-
-    def test_compute_node_types(self):
-        """Test that appropriate compute node types are returned for each cloud provider."""
-        test_cases = [
-            ("workspace.cloud.databricks.com", "AWS", "r5d.4xlarge"),
-            ("workspace.azuredatabricks.net", "Azure", "Standard_E16ds_v4"),
-            ("workspace.gcp.databricks.com", "GCP", "n2-standard-16"),
-            ("workspace.databricks.com", "Generic", "r5d.4xlarge"),
-        ]
-
-        for url, expected_provider, expected_node_type in test_cases:
-            with self.subTest(url=url):
-                client = DatabricksAPIClient(url, "token")
-                self.assertEqual(client.cloud_provider, expected_provider)
-                self.assertEqual(client.get_compute_node_type(), expected_node_type)
-
-    def test_cloud_attributes(self):
-        """Test that appropriate cloud attributes are returned for each provider."""
-        # Test AWS attributes
-        aws_client = DatabricksAPIClient("workspace.cloud.databricks.com", "token")
-        aws_attrs = aws_client.get_cloud_attributes()
-        self.assertIn("aws_attributes", aws_attrs)
-        self.assertEqual(
-            aws_attrs["aws_attributes"]["availability"], "SPOT_WITH_FALLBACK"
-        )
-
-        # Test Azure attributes
-        azure_client = DatabricksAPIClient("workspace.azuredatabricks.net", "token")
-        azure_attrs = azure_client.get_cloud_attributes()
-        self.assertIn("azure_attributes", azure_attrs)
-        self.assertEqual(
-            azure_attrs["azure_attributes"]["availability"], "SPOT_WITH_FALLBACK_AZURE"
-        )
-
-        # Test GCP attributes
-        gcp_client = DatabricksAPIClient("workspace.gcp.databricks.com", "token")
-        gcp_attrs = gcp_client.get_cloud_attributes()
-        self.assertIn("gcp_attributes", gcp_attrs)
-        self.assertEqual(gcp_attrs["gcp_attributes"]["use_preemptible_executors"], True)
-
-    @patch.object(DatabricksAPIClient, "post")
-    def test_job_submission_uses_correct_node_type(self, mock_post):
-        """Test that job submission uses the correct node type for Azure."""
-        mock_post.return_value = {"run_id": "12345"}
-
-        azure_client = DatabricksAPIClient("workspace.azuredatabricks.net", "token")
-        azure_client.submit_job_run("/config/path", "/init/script/path")
-
-        # Verify that post was called and get the payload
-        mock_post.assert_called_once()
-        call_args = mock_post.call_args
-        payload = call_args[0][1]  # Second argument is the data payload
-
-        # Check that the cluster config uses Azure node type
-        cluster_config = payload["tasks"][0]["new_cluster"]
-        self.assertEqual(cluster_config["node_type_id"], "Standard_E16ds_v4")
-
-        # Check that Azure attributes are present
-        self.assertIn("azure_attributes", cluster_config)
-        self.assertEqual(
-            cluster_config["azure_attributes"]["availability"],
-            "SPOT_WITH_FALLBACK_AZURE",
-        )
-
-    # Base API request tests
-
-    @patch("requests.get")
-    def test_get_success(self, mock_get):
-        """Test successful GET request."""
-        mock_response = MagicMock()
-        mock_response.json.return_value = {"key": "value"}
-        mock_get.return_value = mock_response
-
-        response = self.client.get("/test-endpoint")
-        self.assertEqual(response, {"key": "value"})
-        mock_get.assert_called_once_with(
-            "https://test-workspace.cloud.databricks.com/test-endpoint",
-            headers={
-                "Authorization": "Bearer fake-token",
-                "User-Agent": "amperity",
-            },
-        )
-
-    @patch("requests.get")
-    def test_get_http_error(self, mock_get):
-        """Test GET request with HTTP error."""
-        mock_response = MagicMock()
-        mock_response.raise_for_status.side_effect = requests.exceptions.HTTPError(
-            "HTTP 404"
-        )
-        mock_response.text = "Not Found"
-        mock_get.return_value = mock_response
-
-        with self.assertRaises(ValueError) as context:
-            self.client.get("/test-endpoint")
-
-        self.assertIn("HTTP error occurred", str(context.exception))
-        self.assertIn("Not Found", str(context.exception))
-
-    @patch("requests.get")
-    def test_get_connection_error(self, mock_get):
-        """Test GET request with connection error."""
-        mock_get.side_effect = requests.exceptions.ConnectionError("Connection failed")
-
-        with self.assertRaises(ConnectionError) as context:
-            self.client.get("/test-endpoint")
-
-        self.assertIn("Connection error occurred", str(context.exception))
-
-    @patch("requests.post")
-    def test_post_success(self, mock_post):
-        """Test successful POST request."""
-        mock_response = MagicMock()
-        mock_response.json.return_value = {"key": "value"}
-        mock_post.return_value = mock_response
-
-        response = self.client.post("/test-endpoint", {"data": "test"})
-        self.assertEqual(response, {"key": "value"})
-        mock_post.assert_called_once_with(
-            "https://test-workspace.cloud.databricks.com/test-endpoint",
-            headers={
-                "Authorization": "Bearer fake-token",
-                "User-Agent": "amperity",
-            },
-            json={"data": "test"},
-        )
-
-    @patch("requests.post")
-    def test_post_http_error(self, mock_post):
-        """Test POST request with HTTP error."""
-        mock_response = MagicMock()
-        mock_response.raise_for_status.side_effect = requests.exceptions.HTTPError(
-            "HTTP 400"
-        )
-        mock_response.text = "Bad Request"
-        mock_post.return_value = mock_response
-
-        with self.assertRaises(ValueError) as context:
-            self.client.post("/test-endpoint", {"data": "test"})
-
-        self.assertIn("HTTP error occurred", str(context.exception))
-        self.assertIn("Bad Request", str(context.exception))
-
-    @patch("requests.post")
-    def test_post_connection_error(self, mock_post):
-        """Test POST request with connection error."""
-        mock_post.side_effect = requests.exceptions.ConnectionError("Connection failed")
-
-        with self.assertRaises(ConnectionError) as context:
-            self.client.post("/test-endpoint", {"data": "test"})
-
-        self.assertIn("Connection error occurred", str(context.exception))
-
-    # Authentication method tests
-
-    @patch.object(DatabricksAPIClient, "get")
-    def test_validate_token_success(self, mock_get):
-        """Test successful token validation."""
-        mock_get.return_value = {"user_name": "test-user"}
-
-        result = self.client.validate_token()
-
-        self.assertTrue(result)
-        mock_get.assert_called_once_with("/api/2.0/preview/scim/v2/Me")
-
-    @patch.object(DatabricksAPIClient, "get")
-    def test_validate_token_failure(self, mock_get):
-        """Test failed token validation."""
-        mock_get.side_effect = Exception("Token validation failed")
-
-        result = self.client.validate_token()
-
-        self.assertFalse(result)
-        mock_get.assert_called_once_with("/api/2.0/preview/scim/v2/Me")
-
-    # Unity Catalog method tests
-
-    @patch.object(DatabricksAPIClient, "get")
-    @patch.object(DatabricksAPIClient, "get_with_params")
-    def test_list_catalogs(self, mock_get_with_params, mock_get):
-        """Test list_catalogs with and without parameters."""
-        # Without parameters
-        mock_get.return_value = {"catalogs": [{"name": "test_catalog"}]}
-        result = self.client.list_catalogs()
-        self.assertEqual(result, {"catalogs": [{"name": "test_catalog"}]})
-        mock_get.assert_called_once_with("/api/2.1/unity-catalog/catalogs")
-
-        # With parameters
-        mock_get_with_params.return_value = {"catalogs": [{"name": "test_catalog"}]}
-        result = self.client.list_catalogs(include_browse=True, max_results=10)
-        self.assertEqual(result, {"catalogs": [{"name": "test_catalog"}]})
-        mock_get_with_params.assert_called_once_with(
-            "/api/2.1/unity-catalog/catalogs",
-            {"include_browse": "true", "max_results": "10"},
-        )
-
-    @patch.object(DatabricksAPIClient, "get")
-    def test_get_catalog(self, mock_get):
-        """Test get_catalog method."""
-        mock_get.return_value = {"name": "test_catalog", "comment": "Test catalog"}
-
-        result = self.client.get_catalog("test_catalog")
-
-        self.assertEqual(result, {"name": "test_catalog", "comment": "Test catalog"})
-        mock_get.assert_called_once_with("/api/2.1/unity-catalog/catalogs/test_catalog")
-
-    # File system method tests
-
-    @patch("requests.put")
-    def test_upload_file_with_content(self, mock_put):
-        """Test successful file upload with content."""
-        mock_response = MagicMock()
-        mock_response.status_code = 204
-        mock_put.return_value = mock_response
-
-        result = self.client.upload_file("/test/path.txt", content="Test content")
-
-        self.assertTrue(result)
-        mock_put.assert_called_once()
-        # Check URL and headers
-        call_args = mock_put.call_args
-        self.assertIn(
-            "https://test-workspace.cloud.databricks.com/api/2.0/fs/files/test/path.txt",
-            call_args[0][0],
-        )
-        self.assertEqual(
-            call_args[1]["headers"]["Content-Type"], "application/octet-stream"
-        )
-        # Check that content was encoded to bytes
-        self.assertEqual(call_args[1]["data"], b"Test content")
-
-    @patch("builtins.open", new_callable=mock_open, read_data=b"file content")
-    @patch("requests.put")
-    def test_upload_file_with_file_path(self, mock_put, mock_file):
-        """Test successful file upload with file path."""
-        mock_response = MagicMock()
-        mock_response.status_code = 204
-        mock_put.return_value = mock_response
-
-        result = self.client.upload_file("/test/path.txt", file_path="/local/file.txt")
-
-        self.assertTrue(result)
-        mock_file.assert_called_once_with("/local/file.txt", "rb")
-        mock_put.assert_called_once()
-        # Check that file content was read
-        call_args = mock_put.call_args
-        self.assertEqual(call_args[1]["data"], b"file content")
-
-    def test_upload_file_invalid_args(self):
-        """Test upload_file with invalid arguments."""
-        # Test when both file_path and content are provided
-        with self.assertRaises(ValueError) as context:
-            self.client.upload_file(
-                "/test/path.txt", file_path="/local.txt", content="content"
-            )
-        self.assertIn(
-            "Exactly one of file_path or content must be provided",
-            str(context.exception),
-        )
-
-        # Test when neither file_path nor content is provided
-        with self.assertRaises(ValueError) as context:
-            self.client.upload_file("/test/path.txt")
-        self.assertIn(
-            "Exactly one of file_path or content must be provided",
-            str(context.exception),
-        )
-
-    # Model serving tests
-
-    @patch.object(DatabricksAPIClient, "get")
-    def test_list_models(self, mock_get):
-        """Test list_models method."""
-        mock_response = {"endpoints": [{"name": "model1"}, {"name": "model2"}]}
-        mock_get.return_value = mock_response
-
-        result = self.client.list_models()
-
-        self.assertEqual(result, [{"name": "model1"}, {"name": "model2"}])
-        mock_get.assert_called_once_with("/api/2.0/serving-endpoints")
-
-    @patch.object(DatabricksAPIClient, "get")
-    def test_get_model(self, mock_get):
-        """Test get_model method."""
-        mock_response = {"name": "model1", "status": "ready"}
-        mock_get.return_value = mock_response
-
-        result = self.client.get_model("model1")
-
-        self.assertEqual(result, {"name": "model1", "status": "ready"})
-        mock_get.assert_called_once_with("/api/2.0/serving-endpoints/model1")
-
-    @patch.object(DatabricksAPIClient, "get")
-    def test_get_model_not_found(self, mock_get):
-        """Test get_model with 404 error."""
-        mock_get.side_effect = ValueError("HTTP error occurred: 404 Not Found")
-
-        result = self.client.get_model("nonexistent-model")
-
-        self.assertIsNone(result)
-        mock_get.assert_called_once_with("/api/2.0/serving-endpoints/nonexistent-model")
-
-    # SQL warehouse tests
-
-    @patch.object(DatabricksAPIClient, "get")
-    def test_list_warehouses(self, mock_get):
-        """Test list_warehouses method."""
-        mock_response = {"warehouses": [{"id": "123"}, {"id": "456"}]}
-        mock_get.return_value = mock_response
-
-        result = self.client.list_warehouses()
-
-        self.assertEqual(result, [{"id": "123"}, {"id": "456"}])
-        mock_get.assert_called_once_with("/api/2.0/sql/warehouses")
-
-    @patch.object(DatabricksAPIClient, "get")
-    def test_get_warehouse(self, mock_get):
-        """Test get_warehouse method."""
-        mock_response = {"id": "123", "name": "Test Warehouse"}
-        mock_get.return_value = mock_response
-
-        result = self.client.get_warehouse("123")
-
-        self.assertEqual(result, {"id": "123", "name": "Test Warehouse"})
-        mock_get.assert_called_once_with("/api/2.0/sql/warehouses/123")
diff --git a/tests/test_integration.py b/tests/test_integration.py
deleted file mode 100644
index e350214..0000000
--- a/tests/test_integration.py
+++ /dev/null
@@ -1,98 +0,0 @@
-"""Integration tests for the Chuck application."""
-
-import unittest
-from unittest.mock import patch
-from chuck_data.config import (
-    set_active_model,
-    get_active_model,
-    ConfigManager,
-)
-import os
-import json
-
-
-class TestChuckIntegration(unittest.TestCase):
-    """Integration test cases for Chuck application."""
-
-    def setUp(self):
-        """Set up the test environment with controlled configuration."""
-        # Set up test environment
-        self.test_config_path = "/tmp/.test_chuck_integration_config.json"
-
-        # Create a test config manager instance
-        self.config_manager = ConfigManager(config_path=self.test_config_path)
-
-        # Replace the global config manager with our test instance
-        self.config_manager_patcher = patch(
-            "chuck_data.config._config_manager", self.config_manager
-        )
-        self.mock_config_manager = self.config_manager_patcher.start()
-
-        # Mock environment for authentication
-        self.env_patcher = patch.dict(
-            "os.environ",
-            {
-                "DATABRICKS_TOKEN": "test_token",
-                "DATABRICKS_WORKSPACE_URL": "test-workspace",
-            },
-        )
-        self.env_patcher.start()
-
-        # Initialize the config with workspace_url
-        self.config_manager.update(workspace_url="test-workspace")
-
-    def tearDown(self):
-        """Clean up the test environment after tests."""
-        if os.path.exists(self.test_config_path):
-            os.remove(self.test_config_path)
-        self.config_manager_patcher.stop()
-        self.env_patcher.stop()
-
-    def test_config_operations(self):
-        """Test that config operations work properly."""
-        # Test writing and reading config
-        set_active_model("test-model")
-
-        # Verify the config file was actually created with correct content
-        self.assertTrue(os.path.exists(self.test_config_path))
-        with open(self.test_config_path, "r") as f:
-            saved_config = json.load(f)
-        self.assertEqual(saved_config["active_model"], "test-model")
-
-        # Test reading the config
-        active_model = get_active_model()
-        self.assertEqual(active_model, "test-model")
-
-    def test_catalog_config_operations(self):
-        """Test catalog config operations."""
-        # Test writing and reading catalog config
-        from chuck_data.config import set_active_catalog, get_active_catalog
-
-        test_catalog = "test-catalog"
-        set_active_catalog(test_catalog)
-
-        # Verify the config file was updated with catalog
-        with open(self.test_config_path, "r") as f:
-            saved_config = json.load(f)
-        self.assertEqual(saved_config["active_catalog"], test_catalog)
-
-        # Test reading the catalog config
-        active_catalog = get_active_catalog()
-        self.assertEqual(active_catalog, test_catalog)
-
-    def test_schema_config_operations(self):
-        """Test schema config operations."""
-        # Test writing and reading schema config
-        from chuck_data.config import set_active_schema, get_active_schema
-
-        test_schema = "test-schema"
-        set_active_schema(test_schema)
-
-        # Verify the config file was updated with schema
-        with open(self.test_config_path, "r") as f:
-            saved_config = json.load(f)
-        self.assertEqual(saved_config["active_schema"], test_schema)
-
-        # Test reading the schema config
-        active_schema = get_active_schema()
-        self.assertEqual(active_schema, test_schema)
diff --git a/tests/test_metrics_collector.py b/tests/test_metrics_collector.py
deleted file mode 100644
index 31157fe..0000000
--- a/tests/test_metrics_collector.py
+++ /dev/null
@@ -1,180 +0,0 @@
-"""
-Tests for the metrics collector.
-"""
-
-import unittest
-from unittest.mock import patch
-
-from chuck_data.metrics_collector import MetricsCollector, get_metrics_collector
-from tests.fixtures import AmperityClientStub, ConfigManagerStub
-
-
-class TestMetricsCollector(unittest.TestCase):
-    """Test cases for MetricsCollector."""
-
-    def setUp(self):
-        """Set up test fixtures."""
-        self.config_manager_stub = ConfigManagerStub()
-        self.config_stub = self.config_manager_stub.config
-
-        # Create the metrics collector with mocked config and AmperityClientStub
-        self.amperity_client_stub = AmperityClientStub()
-        with patch(
-            "chuck_data.metrics_collector.get_config_manager",
-            return_value=self.config_manager_stub,
-        ):
-            with patch(
-                "chuck_data.metrics_collector.AmperityAPIClient",
-                return_value=self.amperity_client_stub,
-            ):
-                self.metrics_collector = MetricsCollector()
-
-    def test_should_track_with_consent(self):
-        """Test that metrics are tracked when consent is given."""
-        self.config_stub.usage_tracking_consent = True
-        result = self.metrics_collector._should_track()
-        self.assertTrue(result)
-
-    def test_should_track_without_consent(self):
-        """Test that metrics are not tracked when consent is not given."""
-        self.config_stub.usage_tracking_consent = False
-        result = self.metrics_collector._should_track()
-        self.assertFalse(result)
-
-    def test_get_chuck_configuration(self):
-        """Test that configuration is retrieved correctly."""
-        self.config_stub.workspace_url = "test-workspace"
-        self.config_stub.active_catalog = "test-catalog"
-        self.config_stub.active_schema = "test-schema"
-        self.config_stub.active_model = "test-model"
-
-        result = self.metrics_collector._get_chuck_configuration_for_metric()
-
-        self.assertEqual(
-            result,
-            {
-                "workspace_url": "test-workspace",
-                "active_catalog": "test-catalog",
-                "active_schema": "test-schema",
-                "active_model": "test-model",
-            },
-        )
-
-    @patch("chuck_data.metrics_collector.get_amperity_token", return_value="test-token")
-    def test_track_event_no_consent(self, mock_get_token):
-        """Test that tracking is skipped when consent is not given."""
-        self.config_stub.usage_tracking_consent = False
-
-        # Reset stub metrics call count
-        self.amperity_client_stub.metrics_calls = []
-
-        result = self.metrics_collector.track_event(prompt="test prompt")
-
-        self.assertFalse(result)
-        # Ensure submit_metrics is not called
-        self.assertEqual(len(self.amperity_client_stub.metrics_calls), 0)
-
-    @patch("chuck_data.metrics_collector.get_amperity_token", return_value="test-token")
-    @patch("chuck_data.metrics_collector.MetricsCollector.send_metric")
-    def test_track_event_with_all_fields(self, mock_send_metric, mock_get_token):
-        """Test tracking with all fields provided."""
-        self.config_stub.usage_tracking_consent = True
-        mock_send_metric.return_value = True
-
-        # Prepare test data
-        prompt = "test prompt"
-        tools = [{"name": "test_tool", "arguments": {"arg1": "value1"}}]
-        conversation_history = [{"role": "assistant", "content": "test response"}]
-        error = "test error"
-        additional_data = {"event_context": "test_context"}
-
-        # Call track_event
-        result = self.metrics_collector.track_event(
-            prompt=prompt,
-            tools=tools,
-            conversation_history=conversation_history,
-            error=error,
-            additional_data=additional_data,
-        )
-
-        # Assert results
-        self.assertTrue(result)
-        mock_send_metric.assert_called_once()
-
-        # Check payload content
-        payload = mock_send_metric.call_args[0][0]
-        self.assertEqual(payload["event"], "USAGE")
-        self.assertEqual(payload["prompt"], prompt)
-        self.assertEqual(payload["tools"], tools)
-        self.assertEqual(payload["conversation_history"], conversation_history)
-        self.assertEqual(payload["error"], error)
-        self.assertEqual(payload["additional_data"], additional_data)
-
-    @patch("chuck_data.metrics_collector.get_amperity_token", return_value="test-token")
-    def test_send_metric_successful(self, mock_get_token):
-        """Test successful metrics sending."""
-        payload = {"event": "USAGE", "prompt": "test prompt"}
-
-        # Reset stub metrics call count
-        self.amperity_client_stub.metrics_calls = []
-
-        result = self.metrics_collector.send_metric(payload)
-
-        self.assertTrue(result)
-        self.assertEqual(len(self.amperity_client_stub.metrics_calls), 1)
-        self.assertEqual(
-            self.amperity_client_stub.metrics_calls[0], (payload, "test-token")
-        )
-
-    @patch("chuck_data.metrics_collector.get_amperity_token", return_value="test-token")
-    def test_send_metric_failure(self, mock_get_token):
-        """Test handling of metrics sending failure."""
-        # Configure stub to simulate failure
-        self.amperity_client_stub.should_fail_metrics = True
-        self.amperity_client_stub.metrics_calls = []
-
-        payload = {"event": "USAGE", "prompt": "test prompt"}
-
-        result = self.metrics_collector.send_metric(payload)
-
-        self.assertFalse(result)
-        self.assertEqual(len(self.amperity_client_stub.metrics_calls), 1)
-        self.assertEqual(
-            self.amperity_client_stub.metrics_calls[0], (payload, "test-token")
-        )
-
-    @patch("chuck_data.metrics_collector.get_amperity_token", return_value="test-token")
-    def test_send_metric_exception(self, mock_get_token):
-        """Test handling of exceptions during metrics sending."""
-        # Configure stub to raise exception
-        self.amperity_client_stub.should_raise_exception = True
-        self.amperity_client_stub.metrics_calls = []
-
-        payload = {"event": "USAGE", "prompt": "test prompt"}
-
-        result = self.metrics_collector.send_metric(payload)
-
-        self.assertFalse(result)
-        self.assertEqual(len(self.amperity_client_stub.metrics_calls), 1)
-        self.assertEqual(
-            self.amperity_client_stub.metrics_calls[0], (payload, "test-token")
-        )
-
-    @patch("chuck_data.metrics_collector.get_amperity_token", return_value=None)
-    def test_send_metric_no_token(self, mock_get_token):
-        """Test that metrics are not sent when no token is available."""
-        # Reset stub metrics call count
-        self.amperity_client_stub.metrics_calls = []
-
-        payload = {"event": "USAGE", "prompt": "test prompt"}
-
-        result = self.metrics_collector.send_metric(payload)
-
-        self.assertFalse(result)
-        self.assertEqual(len(self.amperity_client_stub.metrics_calls), 0)
-
-    def test_get_metrics_collector(self):
-        """Test that get_metrics_collector returns the singleton instance."""
-        with patch("chuck_data.metrics_collector._metrics_collector") as mock_collector:
-            collector = get_metrics_collector()
-            self.assertEqual(collector, mock_collector)
diff --git a/tests/test_models.py b/tests/test_models.py
deleted file mode 100644
index 7739e19..0000000
--- a/tests/test_models.py
+++ /dev/null
@@ -1,121 +0,0 @@
-"""Unit tests for the models module."""
-
-import unittest
-from chuck_data.models import list_models, get_model
-from tests.fixtures import (
-    EXPECTED_MODEL_LIST,
-    DatabricksClientStub,
-)
-
-
-class TestModels(unittest.TestCase):
-    """Test cases for the models module."""
-
-    def test_list_models_success(self):
-        """Test successful retrieval of model list."""
-        # Create a client stub
-        client_stub = DatabricksClientStub()
-        # Configure stub to return expected model list
-        client_stub.models = EXPECTED_MODEL_LIST
-
-        models = list_models(client_stub)
-
-        self.assertEqual(models, EXPECTED_MODEL_LIST)
-
-    def test_list_models_empty(self):
-        """Test retrieval with empty model list."""
-        # Create a client stub
-        client_stub = DatabricksClientStub()
-        # Configure stub to return empty list
-        client_stub.models = []
-
-        models = list_models(client_stub)
-        self.assertEqual(models, [])
-
-    def test_list_models_http_error(self):
-        """Test failure with HTTP error."""
-        # Create a client stub
-        client_stub = DatabricksClientStub()
-        # Configure stub to raise ValueError
-        client_stub.set_list_models_error(
-            ValueError("HTTP error occurred: 404 Not Found")
-        )
-
-        with self.assertRaises(ValueError) as context:
-            list_models(client_stub)
-        self.assertIn("Model serving API error", str(context.exception))
-
-    def test_list_models_connection_error(self):
-        """Test failure due to connection error."""
-        # Create a client stub
-        client_stub = DatabricksClientStub()
-        # Configure stub to raise ConnectionError
-        client_stub.set_list_models_error(ConnectionError("Connection failed"))
-
-        with self.assertRaises(ConnectionError) as context:
-            list_models(client_stub)
-        self.assertIn("Failed to connect to serving endpoint", str(context.exception))
-
-    def test_get_model_success(self):
-        """Test successful retrieval of a specific model."""
-        # Create client stub and configure model detail
-        client_stub = DatabricksClientStub()
-        model_detail = {
-            "name": "databricks-llama-4-maverick",
-            "creator": "user@example.com",
-            "creation_timestamp": 1645123456789,
-            "state": "READY",
-        }
-        client_stub.add_model(
-            "databricks-llama-4-maverick",
-            status="READY",
-            creator="user@example.com",
-            creation_timestamp=1645123456789,
-        )
-
-        # Call the function
-        result = get_model(client_stub, "databricks-llama-4-maverick")
-
-        # Verify results
-        self.assertEqual(result["name"], model_detail["name"])
-        self.assertEqual(result["creator"], model_detail["creator"])
-
-    def test_get_model_not_found(self):
-        """Test retrieval of a non-existent model."""
-        # Create client stub that returns None for not found models
-        client_stub = DatabricksClientStub()
-        # No model added, so get_model will return None
-
-        # Call the function
-        result = get_model(client_stub, "nonexistent-model")
-
-        # Verify result is None
-        self.assertIsNone(result)
-
-    def test_get_model_error(self):
-        """Test retrieval with a non-404 error."""
-        # Create client stub that raises a 500 error
-        client_stub = DatabricksClientStub()
-        client_stub.set_get_model_error(
-            ValueError("HTTP error occurred: 500 Internal Server Error")
-        )
-
-        # Call the function and expect an exception
-        with self.assertRaises(ValueError) as context:
-            get_model(client_stub, "error-model")
-
-        # Verify error handling
-        self.assertIn("Model serving API error", str(context.exception))
-
-    def test_get_model_connection_error(self):
-        """Test retrieval with connection error."""
-        # Create client stub that raises a connection error
-        client_stub = DatabricksClientStub()
-        client_stub.set_get_model_error(ConnectionError("Connection failed"))
-
-        # Call the function and expect an exception
-        with self.assertRaises(ConnectionError) as context:
-            get_model(client_stub, "network-error-model")
-
-        # Verify error handling
-        self.assertIn("Failed to connect to serving endpoint", str(context.exception))
diff --git a/tests/test_no_color_env.py b/tests/test_no_color_env.py
deleted file mode 100644
index 5d9c420..0000000
--- a/tests/test_no_color_env.py
+++ /dev/null
@@ -1,73 +0,0 @@
-"""Tests for the NO_COLOR environment variable."""
-
-import unittest
-from unittest.mock import patch, MagicMock
-import sys
-import os
-
-# Add the project root to sys.path so we can import chuck_data.__main__ as chuck
-sys.path.insert(0, os.path.dirname(os.path.dirname(os.path.abspath(__file__))))
-
-import chuck_data.__main__ as chuck
-
-
-class TestNoColorEnvVar(unittest.TestCase):
-    """Test cases for NO_COLOR environment variable functionality."""
-
-    @patch("chuck_data.__main__.ChuckTUI")
-    @patch("chuck_data.__main__.setup_logging")
-    def test_default_color_mode(self, mock_setup_logging, mock_chuck_tui):
-        """Test that default mode passes no_color=False to ChuckTUI constructor."""
-        mock_tui_instance = MagicMock()
-        mock_chuck_tui.return_value = mock_tui_instance
-
-        # Call main function (without NO_COLOR env var)
-        chuck.main([])
-
-        # Verify ChuckTUI was called with no_color=False
-        mock_chuck_tui.assert_called_once_with(no_color=False)
-        # Verify run was called
-        mock_tui_instance.run.assert_called_once()
-
-    @patch("chuck_data.__main__.ChuckTUI")
-    @patch("chuck_data.__main__.setup_logging")
-    @patch.dict(os.environ, {"NO_COLOR": "1"})
-    def test_no_color_env_var_1(self, mock_setup_logging, mock_chuck_tui):
-        """Test that NO_COLOR=1 enables no-color mode."""
-        mock_tui_instance = MagicMock()
-        mock_chuck_tui.return_value = mock_tui_instance
-
-        # Call main function
-        chuck.main([])
-
-        # Verify ChuckTUI was called with no_color=True due to env var
-        mock_chuck_tui.assert_called_once_with(no_color=True)
-
-    @patch("chuck_data.__main__.ChuckTUI")
-    @patch("chuck_data.__main__.setup_logging")
-    @patch.dict(os.environ, {"NO_COLOR": "true"})
-    def test_no_color_env_var_true(self, mock_setup_logging, mock_chuck_tui):
-        """Test that NO_COLOR=true enables no-color mode."""
-        mock_tui_instance = MagicMock()
-        mock_chuck_tui.return_value = mock_tui_instance
-
-        # Call main function
-        chuck.main([])
-
-        # Verify ChuckTUI was called with no_color=True due to env var
-        mock_chuck_tui.assert_called_once_with(no_color=True)
-
-    @patch("chuck_data.__main__.ChuckTUI")
-    @patch("chuck_data.__main__.setup_logging")
-    def test_no_color_flag(self, mock_setup_logging, mock_chuck_tui):
-        """The --no-color flag forces no_color=True."""
-        mock_tui_instance = MagicMock()
-        mock_chuck_tui.return_value = mock_tui_instance
-
-        chuck.main(["--no-color"])
-
-        mock_chuck_tui.assert_called_once_with(no_color=True)
-
-
-if __name__ == "__main__":
-    unittest.main()
diff --git a/tests/test_permission_validator.py b/tests/test_permission_validator.py
deleted file mode 100644
index 51e1dc7..0000000
--- a/tests/test_permission_validator.py
+++ /dev/null
@@ -1,428 +0,0 @@
-"""Tests for the permission validator module."""
-
-import unittest
-from unittest.mock import patch, MagicMock, call
-
-from chuck_data.databricks.permission_validator import (
-    validate_all_permissions,
-    check_basic_connectivity,
-    check_unity_catalog,
-    check_sql_warehouse,
-    check_jobs,
-    check_models,
-    check_volumes,
-)
-
-
-class TestPermissionValidator(unittest.TestCase):
-    """Test cases for permission validator module."""
-
-    def setUp(self):
-        """Set up common test fixtures."""
-        self.client = MagicMock()
-
-    def test_validate_all_permissions(self):
-        """Test that validate_all_permissions calls all check functions."""
-        with (
-            patch(
-                "chuck_data.databricks.permission_validator.check_basic_connectivity"
-            ) as mock_basic,
-            patch(
-                "chuck_data.databricks.permission_validator.check_unity_catalog"
-            ) as mock_catalog,
-            patch(
-                "chuck_data.databricks.permission_validator.check_sql_warehouse"
-            ) as mock_warehouse,
-            patch("chuck_data.databricks.permission_validator.check_jobs") as mock_jobs,
-            patch(
-                "chuck_data.databricks.permission_validator.check_models"
-            ) as mock_models,
-            patch(
-                "chuck_data.databricks.permission_validator.check_volumes"
-            ) as mock_volumes,
-        ):
-
-            # Set return values for mock functions
-            mock_basic.return_value = {"authorized": True}
-            mock_catalog.return_value = {"authorized": True}
-            mock_warehouse.return_value = {"authorized": True}
-            mock_jobs.return_value = {"authorized": True}
-            mock_models.return_value = {"authorized": True}
-            mock_volumes.return_value = {"authorized": True}
-
-            # Call the function
-            result = validate_all_permissions(self.client)
-
-            # Verify all check functions were called
-            mock_basic.assert_called_once_with(self.client)
-            mock_catalog.assert_called_once_with(self.client)
-            mock_warehouse.assert_called_once_with(self.client)
-            mock_jobs.assert_called_once_with(self.client)
-            mock_models.assert_called_once_with(self.client)
-            mock_volumes.assert_called_once_with(self.client)
-
-            # Verify result contains all categories
-            self.assertIn("basic_connectivity", result)
-            self.assertIn("unity_catalog", result)
-            self.assertIn("sql_warehouse", result)
-            self.assertIn("jobs", result)
-            self.assertIn("models", result)
-            self.assertIn("volumes", result)
-
-    @patch("logging.debug")
-    def test_check_basic_connectivity_success(self, mock_debug):
-        """Test basic connectivity check with successful response."""
-        # Set up mock response
-        self.client.get.return_value = {"userName": "test_user"}
-
-        # Call the function
-        result = check_basic_connectivity(self.client)
-
-        # Verify the API was called correctly
-        self.client.get.assert_called_once_with("/api/2.0/preview/scim/v2/Me")
-
-        # Verify the result
-        self.assertTrue(result["authorized"])
-        self.assertEqual(result["details"], "Connected as test_user")
-        self.assertEqual(result["api_path"], "/api/2.0/preview/scim/v2/Me")
-
-        # Verify logging occurred
-        mock_debug.assert_not_called()  # No errors, so no debug logging
-
-    @patch("logging.debug")
-    def test_check_basic_connectivity_error(self, mock_debug):
-        """Test basic connectivity check with error."""
-        # Set up mock response
-        self.client.get.side_effect = Exception("Connection failed")
-
-        # Call the function
-        result = check_basic_connectivity(self.client)
-
-        # Verify the API was called correctly
-        self.client.get.assert_called_once_with("/api/2.0/preview/scim/v2/Me")
-
-        # Verify the result
-        self.assertFalse(result["authorized"])
-        self.assertEqual(result["error"], "Connection failed")
-        self.assertEqual(result["api_path"], "/api/2.0/preview/scim/v2/Me")
-
-        # Verify logging occurred
-        mock_debug.assert_called_once()
-
-    @patch("logging.debug")
-    def test_check_unity_catalog_success(self, mock_debug):
-        """Test Unity Catalog check with successful response."""
-        # Set up mock response
-        self.client.get.return_value = {"catalogs": [{"name": "test_catalog"}]}
-
-        # Call the function
-        result = check_unity_catalog(self.client)
-
-        # Verify the API was called correctly
-        self.client.get.assert_called_once_with(
-            "/api/2.1/unity-catalog/catalogs?max_results=1"
-        )
-
-        # Verify the result
-        self.assertTrue(result["authorized"])
-        self.assertEqual(
-            result["details"], "Unity Catalog access granted (1 catalogs visible)"
-        )
-        self.assertEqual(result["api_path"], "/api/2.1/unity-catalog/catalogs")
-
-        # Verify logging occurred
-        mock_debug.assert_not_called()
-
-    @patch("logging.debug")
-    def test_check_unity_catalog_empty(self, mock_debug):
-        """Test Unity Catalog check with empty response."""
-        # Set up mock response
-        self.client.get.return_value = {"catalogs": []}
-
-        # Call the function
-        result = check_unity_catalog(self.client)
-
-        # Verify the API was called correctly
-        self.client.get.assert_called_once_with(
-            "/api/2.1/unity-catalog/catalogs?max_results=1"
-        )
-
-        # Verify the result
-        self.assertTrue(result["authorized"])
-        self.assertEqual(
-            result["details"], "Unity Catalog access granted (0 catalogs visible)"
-        )
-        self.assertEqual(result["api_path"], "/api/2.1/unity-catalog/catalogs")
-
-        # Verify logging occurred
-        mock_debug.assert_not_called()
-
-    @patch("logging.debug")
-    def test_check_unity_catalog_error(self, mock_debug):
-        """Test Unity Catalog check with error."""
-        # Set up mock response
-        self.client.get.side_effect = Exception("Access denied")
-
-        # Call the function
-        result = check_unity_catalog(self.client)
-
-        # Verify the API was called correctly
-        self.client.get.assert_called_once_with(
-            "/api/2.1/unity-catalog/catalogs?max_results=1"
-        )
-
-        # Verify the result
-        self.assertFalse(result["authorized"])
-        self.assertEqual(result["error"], "Access denied")
-        self.assertEqual(result["api_path"], "/api/2.1/unity-catalog/catalogs")
-
-        # Verify logging occurred
-        mock_debug.assert_called_once()
-
-    @patch("logging.debug")
-    def test_check_sql_warehouse_success(self, mock_debug):
-        """Test SQL warehouse check with successful response."""
-        # Set up mock response
-        self.client.get.return_value = {"warehouses": [{"id": "warehouse1"}]}
-
-        # Call the function
-        result = check_sql_warehouse(self.client)
-
-        # Verify the API was called correctly
-        self.client.get.assert_called_once_with("/api/2.0/sql/warehouses?page_size=1")
-
-        # Verify the result
-        self.assertTrue(result["authorized"])
-        self.assertEqual(
-            result["details"], "SQL Warehouse access granted (1 warehouses visible)"
-        )
-        self.assertEqual(result["api_path"], "/api/2.0/sql/warehouses")
-
-        # Verify logging occurred
-        mock_debug.assert_not_called()
-
-    @patch("logging.debug")
-    def test_check_sql_warehouse_error(self, mock_debug):
-        """Test SQL warehouse check with error."""
-        # Set up mock response
-        self.client.get.side_effect = Exception("Access denied")
-
-        # Call the function
-        result = check_sql_warehouse(self.client)
-
-        # Verify the API was called correctly
-        self.client.get.assert_called_once_with("/api/2.0/sql/warehouses?page_size=1")
-
-        # Verify the result
-        self.assertFalse(result["authorized"])
-        self.assertEqual(result["error"], "Access denied")
-        self.assertEqual(result["api_path"], "/api/2.0/sql/warehouses")
-
-        # Verify logging occurred
-        mock_debug.assert_called_once()
-
-    @patch("logging.debug")
-    def test_check_jobs_success(self, mock_debug):
-        """Test jobs check with successful response."""
-        # Set up mock response
-        self.client.get.return_value = {"jobs": [{"job_id": "job1"}]}
-
-        # Call the function
-        result = check_jobs(self.client)
-
-        # Verify the API was called correctly
-        self.client.get.assert_called_once_with("/api/2.1/jobs/list?limit=1")
-
-        # Verify the result
-        self.assertTrue(result["authorized"])
-        self.assertEqual(result["details"], "Jobs access granted (1 jobs visible)")
-        self.assertEqual(result["api_path"], "/api/2.1/jobs/list")
-
-        # Verify logging occurred
-        mock_debug.assert_not_called()
-
-    @patch("logging.debug")
-    def test_check_jobs_error(self, mock_debug):
-        """Test jobs check with error."""
-        # Set up mock response
-        self.client.get.side_effect = Exception("Access denied")
-
-        # Call the function
-        result = check_jobs(self.client)
-
-        # Verify the API was called correctly
-        self.client.get.assert_called_once_with("/api/2.1/jobs/list?limit=1")
-
-        # Verify the result
-        self.assertFalse(result["authorized"])
-        self.assertEqual(result["error"], "Access denied")
-        self.assertEqual(result["api_path"], "/api/2.1/jobs/list")
-
-        # Verify logging occurred
-        mock_debug.assert_called_once()
-
-    @patch("logging.debug")
-    def test_check_models_success(self, mock_debug):
-        """Test models check with successful response."""
-        # Set up mock response
-        self.client.get.return_value = {"registered_models": [{"name": "model1"}]}
-
-        # Call the function
-        result = check_models(self.client)
-
-        # Verify the API was called correctly
-        self.client.get.assert_called_once_with(
-            "/api/2.0/mlflow/registered-models/list?max_results=1"
-        )
-
-        # Verify the result
-        self.assertTrue(result["authorized"])
-        self.assertEqual(
-            result["details"], "ML Models access granted (1 models visible)"
-        )
-        self.assertEqual(result["api_path"], "/api/2.0/mlflow/registered-models/list")
-
-        # Verify logging occurred
-        mock_debug.assert_not_called()
-
-    @patch("logging.debug")
-    def test_check_models_error(self, mock_debug):
-        """Test models check with error."""
-        # Set up mock response
-        self.client.get.side_effect = Exception("Access denied")
-
-        # Call the function
-        result = check_models(self.client)
-
-        # Verify the API was called correctly
-        self.client.get.assert_called_once_with(
-            "/api/2.0/mlflow/registered-models/list?max_results=1"
-        )
-
-        # Verify the result
-        self.assertFalse(result["authorized"])
-        self.assertEqual(result["error"], "Access denied")
-        self.assertEqual(result["api_path"], "/api/2.0/mlflow/registered-models/list")
-
-        # Verify logging occurred
-        mock_debug.assert_called_once()
-
-    @patch("logging.debug")
-    def test_check_volumes_success_full_path(self, mock_debug):
-        """Test volumes check with successful response through the full path."""
-        # Set up mock responses for the multi-step process
-        catalog_response = {"catalogs": [{"name": "test_catalog"}]}
-        schema_response = {"schemas": [{"name": "test_schema"}]}
-        volume_response = {"volumes": [{"name": "test_volume"}]}
-
-        # Configure the client mock to return different responses for different calls
-        self.client.get.side_effect = [
-            catalog_response,
-            schema_response,
-            volume_response,
-        ]
-
-        # Call the function
-        result = check_volumes(self.client)
-
-        # Verify the API calls were made correctly
-        expected_calls = [
-            call("/api/2.1/unity-catalog/catalogs?max_results=1"),
-            call(
-                "/api/2.1/unity-catalog/schemas?catalog_name=test_catalog&max_results=1"
-            ),
-            call(
-                "/api/2.1/unity-catalog/volumes?catalog_name=test_catalog&schema_name=test_schema"
-            ),
-        ]
-        self.assertEqual(self.client.get.call_args_list, expected_calls)
-
-        # Verify the result
-        self.assertTrue(result["authorized"])
-        self.assertEqual(
-            result["details"],
-            "Volumes access granted in test_catalog.test_schema (1 volumes visible)",
-        )
-        self.assertEqual(result["api_path"], "/api/2.1/unity-catalog/volumes")
-
-        # Verify logging occurred
-        mock_debug.assert_not_called()
-
-    @patch("logging.debug")
-    def test_check_volumes_no_catalogs(self, mock_debug):
-        """Test volumes check when no catalogs are available."""
-        # Set up empty catalog response
-        self.client.get.return_value = {"catalogs": []}
-
-        # Call the function
-        result = check_volumes(self.client)
-
-        # Verify only the catalogs API was called
-        self.client.get.assert_called_once_with(
-            "/api/2.1/unity-catalog/catalogs?max_results=1"
-        )
-
-        # Verify the result
-        self.assertFalse(result["authorized"])
-        self.assertEqual(
-            result["error"], "No catalogs available to check volumes access"
-        )
-        self.assertEqual(result["api_path"], "/api/2.1/unity-catalog/volumes")
-
-        # Verify logging occurred
-        mock_debug.assert_not_called()
-
-    @patch("logging.debug")
-    def test_check_volumes_no_schemas(self, mock_debug):
-        """Test volumes check when no schemas are available."""
-        # Set up mock responses
-        catalog_response = {"catalogs": [{"name": "test_catalog"}]}
-        schema_response = {"schemas": []}
-
-        # Configure the client mock
-        self.client.get.side_effect = [catalog_response, schema_response]
-
-        # Call the function
-        result = check_volumes(self.client)
-
-        # Verify the APIs were called
-        expected_calls = [
-            call("/api/2.1/unity-catalog/catalogs?max_results=1"),
-            call(
-                "/api/2.1/unity-catalog/schemas?catalog_name=test_catalog&max_results=1"
-            ),
-        ]
-        self.assertEqual(self.client.get.call_args_list, expected_calls)
-
-        # Verify the result
-        self.assertFalse(result["authorized"])
-        self.assertEqual(
-            result["error"],
-            "No schemas available in catalog 'test_catalog' to check volumes access",
-        )
-        self.assertEqual(result["api_path"], "/api/2.1/unity-catalog/volumes")
-
-        # Verify logging occurred
-        mock_debug.assert_not_called()
-
-    @patch("logging.debug")
-    def test_check_volumes_error(self, mock_debug):
-        """Test volumes check with an API error."""
-        # Set up mock response to raise exception
-        self.client.get.side_effect = Exception("Access denied")
-
-        # Call the function
-        result = check_volumes(self.client)
-
-        # Verify the API was called
-        self.client.get.assert_called_once_with(
-            "/api/2.1/unity-catalog/catalogs?max_results=1"
-        )
-
-        # Verify the result
-        self.assertFalse(result["authorized"])
-        self.assertEqual(result["error"], "Access denied")
-        self.assertEqual(result["api_path"], "/api/2.1/unity-catalog/volumes")
-
-        # Verify logging occurred
-        mock_debug.assert_called_once()
diff --git a/tests/test_profiler.py b/tests/test_profiler.py
deleted file mode 100644
index 51b3f0a..0000000
--- a/tests/test_profiler.py
+++ /dev/null
@@ -1,220 +0,0 @@
-"""
-Tests for the profiler module.
-"""
-
-import unittest
-from unittest.mock import patch, MagicMock
-from chuck_data.profiler import (
-    list_tables,
-    query_llm,
-    generate_manifest,
-    store_manifest,
-    profile_table,
-)
-
-
-class TestProfiler(unittest.TestCase):
-    """Test cases for the profiler module."""
-
-    def setUp(self):
-        """Set up common test fixtures."""
-        self.client = MagicMock()
-        self.warehouse_id = "warehouse-123"
-
-    @patch("chuck_data.profiler.time.sleep")
-    def test_list_tables(self, mock_sleep):
-        """Test listing tables."""
-        # Set up mock responses
-        self.client.post.return_value = {"statement_id": "stmt-123"}
-
-        # Mock the get call to return a completed query status
-        self.client.get.return_value = {
-            "status": {"state": "SUCCEEDED"},
-            "result": {
-                "data": [
-                    ["table1", "catalog1", "schema1"],
-                    ["table2", "catalog1", "schema2"],
-                ]
-            },
-        }
-
-        # Call the function
-        result = list_tables(self.client, self.warehouse_id)
-
-        # Check the result
-        expected_tables = [
-            {
-                "table_name": "table1",
-                "catalog_name": "catalog1",
-                "schema_name": "schema1",
-            },
-            {
-                "table_name": "table2",
-                "catalog_name": "catalog1",
-                "schema_name": "schema2",
-            },
-        ]
-        self.assertEqual(result, expected_tables)
-
-        # Verify API calls
-        self.client.post.assert_called_once()
-        self.client.get.assert_called_once()
-
-    @patch("chuck_data.profiler.time.sleep")
-    def test_list_tables_polling(self, mock_sleep):
-        """Test polling behavior when listing tables."""
-        # Set up mock responses
-        self.client.post.return_value = {"statement_id": "stmt-123"}
-
-        # Set up get to return PENDING then RUNNING then SUCCEEDED
-        self.client.get.side_effect = [
-            {"status": {"state": "PENDING"}},
-            {"status": {"state": "RUNNING"}},
-            {
-                "status": {"state": "SUCCEEDED"},
-                "result": {"data": [["table1", "catalog1", "schema1"]]},
-            },
-        ]
-
-        # Call the function
-        result = list_tables(self.client, self.warehouse_id)
-
-        # Verify polling behavior
-        self.assertEqual(len(self.client.get.call_args_list), 3)
-        self.assertEqual(mock_sleep.call_count, 2)
-
-        # Check result
-        self.assertEqual(len(result), 1)
-        self.assertEqual(result[0]["table_name"], "table1")
-
-    @patch("chuck_data.profiler.time.sleep")
-    def test_list_tables_failed_query(self, mock_sleep):
-        """Test list tables with failed SQL query."""
-        # Set up mock responses
-        self.client.post.return_value = {"statement_id": "stmt-123"}
-        self.client.get.return_value = {"status": {"state": "FAILED"}}
-
-        # Call the function
-        result = list_tables(self.client, self.warehouse_id)
-
-        # Verify it returns empty list on failure
-        self.assertEqual(result, [])
-
-    def test_generate_manifest(self):
-        """Test generating a manifest."""
-        # Test data
-        table_info = {
-            "catalog_name": "catalog1",
-            "schema_name": "schema1",
-            "table_name": "table1",
-        }
-        schema = [{"col_name": "id", "data_type": "integer"}]
-        sample_data = {"columns": ["id"], "rows": [{"id": 1}, {"id": 2}]}
-        pii_tags = ["id"]
-
-        # Call the function
-        result = generate_manifest(table_info, schema, sample_data, pii_tags)
-
-        # Check the result
-        self.assertEqual(result["table"], table_info)
-        self.assertEqual(result["schema"], schema)
-        self.assertEqual(result["pii_tags"], pii_tags)
-        self.assertTrue("profiling_timestamp" in result)
-
-    @patch("chuck_data.profiler.time.sleep")
-    @patch("chuck_data.profiler.base64.b64encode")
-    def test_store_manifest(self, mock_b64encode, mock_sleep):
-        """Test storing a manifest."""
-        # Set up mock responses
-        mock_b64encode.return_value = b"base64_encoded_data"
-        self.client.post.return_value = {"success": True}
-
-        # Test data
-        manifest = {"table": {"name": "table1"}, "pii_tags": ["id"]}
-        manifest_path = "/chuck/manifests/table1_manifest.json"
-
-        # Call the function
-        result = store_manifest(self.client, manifest_path, manifest)
-
-        # Check the result
-        self.assertTrue(result)
-
-        # Verify API call
-        self.client.post.assert_called_once()
-        self.assertEqual(self.client.post.call_args[0][0], "/api/2.0/dbfs/put")
-        # Verify the manifest path was passed correctly
-        self.assertEqual(self.client.post.call_args[0][1]["path"], manifest_path)
-
-    @patch("chuck_data.profiler.store_manifest")
-    @patch("chuck_data.profiler.generate_manifest")
-    @patch("chuck_data.profiler.query_llm")
-    @patch("chuck_data.profiler.get_sample_data")
-    @patch("chuck_data.profiler.get_table_schema")
-    @patch("chuck_data.profiler.list_tables")
-    def test_profile_table_success(
-        self,
-        mock_list_tables,
-        mock_get_schema,
-        mock_get_sample,
-        mock_query_llm,
-        mock_generate_manifest,
-        mock_store_manifest,
-    ):
-        """Test successfully profiling a table."""
-        # Set up mock responses
-        table_info = {
-            "catalog_name": "catalog1",
-            "schema_name": "schema1",
-            "table_name": "table1",
-        }
-        schema = [{"col_name": "id", "data_type": "integer"}]
-        sample_data = {"column_names": ["id"], "rows": [{"id": 1}]}
-        pii_tags = ["id"]
-        manifest = {"table": table_info, "pii_tags": pii_tags}
-        manifest_path = "/chuck/manifests/table1_manifest.json"
-
-        mock_list_tables.return_value = [table_info]
-        mock_get_schema.return_value = schema
-        mock_get_sample.return_value = sample_data
-        mock_query_llm.return_value = {"predictions": [{"pii_tags": pii_tags}]}
-        mock_generate_manifest.return_value = manifest
-        mock_store_manifest.return_value = True
-
-        # Call the function without specific table (should use first table found)
-        result = profile_table(self.client, self.warehouse_id, "test-model")
-
-        # Check the result
-        self.assertEqual(result, manifest_path)
-
-        # Verify the correct functions were called
-        mock_list_tables.assert_called_once_with(self.client, self.warehouse_id)
-        mock_get_schema.assert_called_once()
-        mock_get_sample.assert_called_once()
-        mock_query_llm.assert_called_once()
-        mock_generate_manifest.assert_called_once()
-        mock_store_manifest.assert_called_once()
-
-    def test_query_llm(self):
-        """Test querying the LLM."""
-        # Set up mock response
-        self.client.post.return_value = {"predictions": [{"pii_tags": ["id"]}]}
-
-        # Test data
-        endpoint_name = "test-model"
-        input_data = {
-            "schema": [{"col_name": "id", "data_type": "integer"}],
-            "sample_data": {"column_names": ["id"], "rows": [{"id": 1}]},
-        }
-
-        # Call the function
-        result = query_llm(self.client, endpoint_name, input_data)
-
-        # Check the result
-        self.assertEqual(result, {"predictions": [{"pii_tags": ["id"]}]})
-
-        # Verify API call
-        self.client.post.assert_called_once()
-        self.assertEqual(
-            self.client.post.call_args[0][0],
-            "/api/2.0/serving-endpoints/test-model/invocations",
-        )
diff --git a/tests/test_service.py b/tests/test_service.py
deleted file mode 100644
index 945cc0e..0000000
--- a/tests/test_service.py
+++ /dev/null
@@ -1,179 +0,0 @@
-"""
-Tests for the service layer.
-"""
-
-import unittest
-from unittest.mock import patch, MagicMock
-
-from chuck_data.service import ChuckService
-from chuck_data.command_registry import CommandDefinition
-from chuck_data.commands.base import CommandResult
-
-
-class TestChuckService(unittest.TestCase):
-    """Test cases for ChuckService."""
-
-    def setUp(self):
-        """Set up test fixtures."""
-        self.mock_client = MagicMock()
-        self.service = ChuckService(client=self.mock_client)
-
-    def test_service_initialization(self):
-        """Test service initialization with client."""
-        self.assertEqual(self.service.client, self.mock_client)
-
-    @patch("chuck_data.service.get_command")
-    def test_execute_command_status(self, mock_get_command):
-        """Test execute_command with status command (which now includes auth functionality)."""
-        # Setup mock handler and command definition
-        mock_handle_status = MagicMock()
-        mock_handle_status.return_value = CommandResult(
-            success=True,
-            message="Status checked",
-            data={
-                "connection": {"status": "valid", "message": "Connected"},
-                "permissions": {"unity_catalog": True, "models": True},
-            },
-        )
-
-        # Create mock command definition
-        mock_command_def = MagicMock(spec=CommandDefinition)
-        mock_command_def.handler = mock_handle_status
-        mock_command_def.name = "status"
-        mock_command_def.visible_to_user = True
-        mock_command_def.needs_api_client = True
-        mock_command_def.parameters = {}
-        mock_command_def.required_params = []
-        mock_command_def.supports_interactive_input = False
-
-        # Setup mock to return our command definition
-        mock_get_command.return_value = mock_command_def
-
-        # Execute command
-        result = self.service.execute_command("/status")
-
-        # Verify
-        mock_get_command.assert_called_once_with("/status")
-        mock_handle_status.assert_called_once_with(self.mock_client)
-        self.assertTrue(result.success)
-        self.assertEqual(result.message, "Status checked")
-        self.assertIn("connection", result.data)
-        self.assertIn("permissions", result.data)
-
-    @patch("chuck_data.service.get_command")
-    def test_execute_command_models(self, mock_get_command):
-        """Test execute_command with models command."""
-        # Setup mock handler
-        mock_data = [{"name": "model1"}, {"name": "model2"}]
-        mock_handle_models = MagicMock()
-        mock_handle_models.return_value = CommandResult(success=True, data=mock_data)
-
-        # Create mock command definition
-        mock_command_def = MagicMock(spec=CommandDefinition)
-        mock_command_def.handler = mock_handle_models
-        mock_command_def.name = "models"
-        mock_command_def.visible_to_user = True
-        mock_command_def.needs_api_client = True
-        mock_command_def.parameters = {}
-        mock_command_def.required_params = []
-        mock_command_def.supports_interactive_input = False
-
-        # Setup mock to return our command definition
-        mock_get_command.return_value = mock_command_def
-
-        # Execute command
-        result = self.service.execute_command("/models")
-
-        # Verify
-        mock_get_command.assert_called_once_with("/models")
-        mock_handle_models.assert_called_once_with(self.mock_client)
-        self.assertTrue(result.success)
-        self.assertEqual(result.data, mock_data)
-
-    def test_execute_unknown_command(self):
-        """Test execute_command with unknown command."""
-        result = self.service.execute_command("unknown_command")
-        self.assertFalse(result.success)
-        self.assertIn("Unknown command", result.message)
-
-    @patch("chuck_data.service.get_command")
-    def test_execute_command_with_params(self, mock_get_command):
-        """Test execute_command with parameters."""
-        # Setup mock handler
-        mock_handle_model_selection = MagicMock()
-        mock_handle_model_selection.return_value = CommandResult(
-            success=True, message="Model selected"
-        )
-
-        # Create mock command definition
-        mock_command_def = MagicMock(spec=CommandDefinition)
-        mock_command_def.handler = mock_handle_model_selection
-        mock_command_def.name = "select_model"
-        mock_command_def.visible_to_user = True
-        mock_command_def.needs_api_client = True
-        mock_command_def.parameters = {
-            "model_name": {
-                "type": "string",
-                "description": "The name of the model to make active.",
-            }
-        }
-        mock_command_def.required_params = ["model_name"]
-        mock_command_def.supports_interactive_input = False
-
-        # Setup mock to return our command definition
-        mock_get_command.return_value = mock_command_def
-
-        # Execute command
-        result = self.service.execute_command("/select_model", "test-model")
-
-        # Verify - use keyword arguments instead of positional
-        mock_get_command.assert_called_once_with("/select_model")
-        mock_handle_model_selection.assert_called_once_with(
-            self.mock_client, model_name="test-model"
-        )
-        self.assertTrue(result.success)
-        self.assertEqual(result.message, "Model selected")
-
-    @patch("chuck_data.service.get_command")
-    @patch("chuck_data.service.get_metrics_collector")
-    def test_execute_command_error_handling(
-        self, mock_get_metrics_collector, mock_get_command
-    ):
-        """Test error handling with metrics collection in execute_command."""
-        # Setup mock handler that raises exception
-        mock_handler = MagicMock()
-        mock_handler.side_effect = Exception("Command failed")
-
-        # Setup metrics collector mock
-        mock_metrics_collector = MagicMock()
-        mock_get_metrics_collector.return_value = mock_metrics_collector
-
-        # Create mock command definition
-        mock_command_def = MagicMock(spec=CommandDefinition)
-        mock_command_def.handler = mock_handler
-        mock_command_def.name = "test_command"
-        mock_command_def.visible_to_user = True
-        mock_command_def.needs_api_client = True
-        mock_command_def.parameters = {"param1": {"type": "string"}}
-        mock_command_def.required_params = []
-        mock_command_def.supports_interactive_input = False
-
-        # Setup mock to return our command definition
-        mock_get_command.return_value = mock_command_def
-
-        # Execute command that will raise an exception
-        result = self.service.execute_command("/test_command", "param_value")
-
-        # Verify error handling
-        self.assertFalse(result.success)
-        self.assertIn("Error during command execution", result.message)
-
-        # Verify metrics collection for error reporting
-        mock_metrics_collector.track_event.assert_called_once()
-
-        # Check parameters in the metrics call
-        call_args = mock_metrics_collector.track_event.call_args[1]
-        self.assertIn("prompt", call_args)  # Should have command context as prompt
-        self.assertIn("error", call_args)  # Should have error traceback
-        self.assertEqual(call_args["tools"][0]["name"], "test_command")
-        self.assertEqual(call_args["additional_data"]["event_context"], "error_report")
diff --git a/tests/test_url_utils.py b/tests/test_url_utils.py
deleted file mode 100644
index 1118f0f..0000000
--- a/tests/test_url_utils.py
+++ /dev/null
@@ -1,139 +0,0 @@
-"""Tests for the url_utils module."""
-
-import unittest
-from chuck_data.databricks.url_utils import (
-    normalize_workspace_url,
-    detect_cloud_provider,
-    get_full_workspace_url,
-    validate_workspace_url,
-    DATABRICKS_DOMAIN_MAP,
-)
-
-
-class TestUrlUtils(unittest.TestCase):
-    """Unit tests for the url_utils module."""
-
-    def test_normalize_workspace_url(self):
-        """Test URL normalization function."""
-        test_cases = [
-            # Basic cases
-            ("workspace", "workspace"),
-            ("https://workspace", "workspace"),
-            ("http://workspace", "workspace"),
-            # AWS cases
-            ("workspace.cloud.databricks.com", "workspace"),
-            ("https://workspace.cloud.databricks.com", "workspace"),
-            ("dbc-12345-ab.cloud.databricks.com", "dbc-12345-ab"),
-            # Azure cases - the problematic one from the issue
-            ("adb-3856707039489412.12.azuredatabricks.net", "adb-3856707039489412.12"),
-            (
-                "https://adb-3856707039489412.12.azuredatabricks.net",
-                "adb-3856707039489412.12",
-            ),
-            # Another Azure case from user error
-            (
-                "https://adb-8924977320831502.2.azuredatabricks.net",
-                "adb-8924977320831502.2",
-            ),
-            ("workspace.azuredatabricks.net", "workspace"),
-            ("https://workspace.azuredatabricks.net", "workspace"),
-            # GCP cases
-            ("workspace.gcp.databricks.com", "workspace"),
-            ("https://workspace.gcp.databricks.com", "workspace"),
-            # Generic cases
-            ("workspace.databricks.com", "workspace"),
-            ("https://workspace.databricks.com", "workspace"),
-        ]
-
-        for input_url, expected_url in test_cases:
-            with self.subTest(input_url=input_url):
-                result = normalize_workspace_url(input_url)
-                self.assertEqual(result, expected_url)
-
-    def test_detect_cloud_provider(self):
-        """Test cloud provider detection."""
-        test_cases = [
-            # AWS cases
-            ("workspace.cloud.databricks.com", "AWS"),
-            ("https://workspace.cloud.databricks.com", "AWS"),
-            ("dbc-12345-ab.cloud.databricks.com", "AWS"),
-            # Azure cases
-            ("adb-3856707039489412.12.azuredatabricks.net", "Azure"),
-            ("https://adb-3856707039489412.12.azuredatabricks.net", "Azure"),
-            ("workspace.azuredatabricks.net", "Azure"),
-            # GCP cases
-            ("workspace.gcp.databricks.com", "GCP"),
-            ("https://workspace.gcp.databricks.com", "GCP"),
-            # Generic cases
-            ("workspace.databricks.com", "Generic"),
-            ("https://workspace.databricks.com", "Generic"),
-            # Default to AWS for unknown
-            ("some-workspace", "AWS"),
-            ("unknown.domain.com", "AWS"),
-        ]
-
-        for input_url, expected_provider in test_cases:
-            with self.subTest(input_url=input_url):
-                result = detect_cloud_provider(input_url)
-                self.assertEqual(result, expected_provider)
-
-    def test_get_full_workspace_url(self):
-        """Test full workspace URL generation."""
-        test_cases = [
-            ("workspace", "AWS", "https://workspace.cloud.databricks.com"),
-            ("workspace", "Azure", "https://workspace.azuredatabricks.net"),
-            ("workspace", "GCP", "https://workspace.gcp.databricks.com"),
-            ("workspace", "Generic", "https://workspace.databricks.com"),
-            ("adb-123456789", "Azure", "https://adb-123456789.azuredatabricks.net"),
-            # Default to AWS for unknown provider
-            ("workspace", "Unknown", "https://workspace.cloud.databricks.com"),
-        ]
-
-        for workspace_id, cloud_provider, expected_url in test_cases:
-            with self.subTest(workspace_id=workspace_id, cloud_provider=cloud_provider):
-                result = get_full_workspace_url(workspace_id, cloud_provider)
-                self.assertEqual(result, expected_url)
-
-    def test_validate_workspace_url(self):
-        """Test workspace URL validation."""
-        # Valid cases
-        valid_cases = [
-            "workspace",
-            "dbc-12345-ab",
-            "adb-123456789",
-            "workspace.cloud.databricks.com",
-            "workspace.azuredatabricks.net",
-            "workspace.gcp.databricks.com",
-            "https://workspace.cloud.databricks.com",
-            "https://workspace.azuredatabricks.net",
-        ]
-
-        for url in valid_cases:
-            with self.subTest(url=url):
-                is_valid, error_msg = validate_workspace_url(url)
-                self.assertTrue(
-                    is_valid, f"URL should be valid: {url}, error: {error_msg}"
-                )
-                self.assertIsNone(error_msg)
-
-        # Invalid cases
-        invalid_cases = [
-            ("", "Workspace URL cannot be empty"),
-            (None, "Workspace URL cannot be empty"),
-            (123, "Workspace URL must be a string"),
-        ]
-
-        for url, expected_error_fragment in invalid_cases:
-            with self.subTest(url=url):
-                is_valid, error_msg = validate_workspace_url(url)
-                self.assertFalse(is_valid, f"URL should be invalid: {url}")
-                self.assertIsNotNone(error_msg)
-                if expected_error_fragment:
-                    self.assertIn(expected_error_fragment, error_msg)
-
-    def test_domain_map_consistency(self):
-        """Ensure the shared domain map is used for URL generation."""
-        for provider, domain in DATABRICKS_DOMAIN_MAP.items():
-            with self.subTest(provider=provider):
-                full_url = get_full_workspace_url("myws", provider)
-                self.assertEqual(full_url, f"https://myws.{domain}")
diff --git a/tests/test_utils.py b/tests/test_utils.py
deleted file mode 100644
index f4e9756..0000000
--- a/tests/test_utils.py
+++ /dev/null
@@ -1,182 +0,0 @@
-"""
-Tests for the utils module.
-"""
-
-import unittest
-from unittest.mock import patch, MagicMock
-from chuck_data.utils import build_query_params, execute_sql_statement
-
-
-class TestUtils(unittest.TestCase):
-    """Test cases for utility functions."""
-
-    def test_build_query_params_empty(self):
-        """Test building query params with empty input."""
-        result = build_query_params({})
-        self.assertEqual(result, "")
-
-    def test_build_query_params_none_values(self):
-        """Test building query params with None values."""
-        params = {"key1": "value1", "key2": None, "key3": "value3"}
-        result = build_query_params(params)
-        self.assertEqual(result, "?key1=value1&key3=value3")
-
-    def test_build_query_params_bool_values(self):
-        """Test building query params with boolean values."""
-        params = {"key1": True, "key2": False, "key3": "value3"}
-        result = build_query_params(params)
-        self.assertEqual(result, "?key1=true&key2=false&key3=value3")
-
-    def test_build_query_params_int_values(self):
-        """Test building query params with integer values."""
-        params = {"key1": 123, "key2": "value2"}
-        result = build_query_params(params)
-        self.assertEqual(result, "?key1=123&key2=value2")
-
-    def test_build_query_params_multiple_params(self):
-        """Test building query params with multiple parameters."""
-        params = {"param1": "value1", "param2": "value2", "param3": "value3"}
-        result = build_query_params(params)
-        # Check that all params are included and properly formatted
-        self.assertTrue(result.startswith("?"))
-        self.assertIn("param1=value1", result)
-        self.assertIn("param2=value2", result)
-        self.assertIn("param3=value3", result)
-        self.assertEqual(len(result.split("&")), 3)
-
-    @patch("chuck_data.utils.time.sleep")  # Mock sleep to speed up test
-    def test_execute_sql_statement_success(self, mock_sleep):
-        """Test successful SQL statement execution."""
-        # Create mock client
-        mock_client = MagicMock()
-
-        # Set up mock responses
-        mock_client.post.return_value = {"statement_id": "123"}
-        mock_client.get.return_value = {
-            "status": {"state": "SUCCEEDED"},
-            "result": {"data": [["row1"], ["row2"]]},
-        }
-
-        # Execute the function
-        result = execute_sql_statement(
-            mock_client, "warehouse-123", "SELECT * FROM table"
-        )
-
-        # Verify interactions
-        mock_client.post.assert_called_once()
-        mock_client.get.assert_called_once_with("/api/2.0/sql/statements/123")
-
-        # Verify result
-        self.assertEqual(result, {"data": [["row1"], ["row2"]]})
-
-    @patch("chuck_data.utils.time.sleep")  # Mock sleep to speed up test
-    def test_execute_sql_statement_with_catalog(self, mock_sleep):
-        """Test SQL statement execution with catalog parameter."""
-        # Create mock client
-        mock_client = MagicMock()
-
-        # Set up mock responses
-        mock_client.post.return_value = {"statement_id": "123"}
-        mock_client.get.return_value = {
-            "status": {"state": "SUCCEEDED"},
-            "result": {"data": []},
-        }
-
-        # Execute with catalog parameter
-        execute_sql_statement(
-            mock_client, "warehouse-123", "SELECT * FROM table", catalog="test-catalog"
-        )
-
-        # Verify the catalog was included in the request
-        post_args = mock_client.post.call_args[0][1]
-        self.assertEqual(post_args.get("catalog"), "test-catalog")
-
-    @patch("chuck_data.utils.time.sleep")  # Mock sleep to speed up test
-    def test_execute_sql_statement_with_custom_timeout(self, mock_sleep):
-        """Test SQL statement execution with custom timeout."""
-        # Create mock client
-        mock_client = MagicMock()
-
-        # Set up mock responses
-        mock_client.post.return_value = {"statement_id": "123"}
-        mock_client.get.return_value = {
-            "status": {"state": "SUCCEEDED"},
-            "result": {},
-        }
-
-        # Execute with custom timeout
-        custom_timeout = "60s"
-        execute_sql_statement(
-            mock_client,
-            "warehouse-123",
-            "SELECT * FROM table",
-            wait_timeout=custom_timeout,
-        )
-
-        # Verify the timeout was included in the request
-        post_args = mock_client.post.call_args[0][1]
-        self.assertEqual(post_args.get("wait_timeout"), custom_timeout)
-
-    @patch("chuck_data.utils.time.sleep")  # Mock sleep to speed up test
-    def test_execute_sql_statement_polling(self, mock_sleep):
-        """Test SQL statement execution with polling."""
-        # Create mock client
-        mock_client = MagicMock()
-
-        # Set up mock responses for polling
-        mock_client.post.return_value = {"statement_id": "123"}
-
-        # Configure get to return "RUNNING" twice then "SUCCEEDED"
-        mock_client.get.side_effect = [
-            {"status": {"state": "PENDING"}},
-            {"status": {"state": "RUNNING"}},
-            {"status": {"state": "SUCCEEDED"}, "result": {"data": []}},
-        ]
-
-        # Execute the function
-        execute_sql_statement(mock_client, "warehouse-123", "SELECT * FROM table")
-
-        # Verify that get was called 3 times (polling behavior)
-        self.assertEqual(mock_client.get.call_count, 3)
-
-        # Verify sleep was called twice (once for each non-complete state)
-        mock_sleep.assert_called_with(1)
-        self.assertEqual(mock_sleep.call_count, 2)
-
-    @patch("chuck_data.utils.time.sleep")  # Mock sleep to speed up test
-    def test_execute_sql_statement_failed(self, mock_sleep):
-        """Test SQL statement execution that fails."""
-        # Create mock client
-        mock_client = MagicMock()
-
-        # Set up mock responses
-        mock_client.post.return_value = {"statement_id": "123"}
-        mock_client.get.return_value = {
-            "status": {"state": "FAILED", "error": {"message": "SQL syntax error"}},
-        }
-
-        # Execute the function and check for exception
-        with self.assertRaises(ValueError) as context:
-            execute_sql_statement(mock_client, "warehouse-123", "SELECT * INVALID SQL")
-
-        # Verify error message
-        self.assertIn("SQL statement failed: SQL syntax error", str(context.exception))
-
-    @patch("chuck_data.utils.time.sleep")  # Mock sleep to speed up test
-    def test_execute_sql_statement_error_without_message(self, mock_sleep):
-        """Test SQL statement execution that fails without specific message."""
-        # Create mock client
-        mock_client = MagicMock()
-
-        # Set up mock responses
-        mock_client.post.return_value = {"statement_id": "123"}
-        mock_client.get.return_value = {
-            "status": {"state": "FAILED", "error": {}},
-        }
-
-        # Execute the function and check for exception
-        with self.assertRaises(ValueError) as context:
-            execute_sql_statement(mock_client, "warehouse-123", "SELECT * INVALID SQL")
-
-        # Verify default error message
-        self.assertIn("SQL statement failed: Unknown error", str(context.exception))
diff --git a/tests/test_warehouses.py b/tests/test_warehouses.py
deleted file mode 100644
index e19a15e..0000000
--- a/tests/test_warehouses.py
+++ /dev/null
@@ -1,84 +0,0 @@
-"""
-Tests for the warehouses module.
-"""
-
-import unittest
-from unittest.mock import MagicMock
-from chuck_data.warehouses import list_warehouses, get_warehouse, create_warehouse
-
-
-class TestWarehouses(unittest.TestCase):
-    """Test cases for the warehouse-related functions."""
-
-    def setUp(self):
-        """Set up common test fixtures."""
-        self.client = MagicMock()
-        self.sample_warehouses = [
-            {"id": "warehouse-123", "name": "Test Warehouse 1", "state": "RUNNING"},
-            {"id": "warehouse-456", "name": "Test Warehouse 2", "state": "STOPPED"},
-        ]
-
-    def test_list_warehouses(self):
-        """Test listing warehouses."""
-        # Set up mock response
-        self.client.list_warehouses.return_value = self.sample_warehouses
-
-        # Call the function
-        result = list_warehouses(self.client)
-
-        # Verify the result
-        self.assertEqual(result, self.sample_warehouses)
-        self.client.list_warehouses.assert_called_once()
-
-    def test_list_warehouses_empty_response(self):
-        """Test listing warehouses with empty response."""
-        # Set up mock response
-        self.client.list_warehouses.return_value = []
-
-        # Call the function
-        result = list_warehouses(self.client)
-
-        # Verify the result is an empty list
-        self.assertEqual(result, [])
-        self.client.list_warehouses.assert_called_once()
-
-    def test_get_warehouse(self):
-        """Test getting a specific warehouse."""
-        # Set up mock response
-        warehouse_detail = {
-            "id": "warehouse-123",
-            "name": "Test Warehouse",
-            "state": "RUNNING",
-        }
-        self.client.get_warehouse.return_value = warehouse_detail
-
-        # Call the function
-        result = get_warehouse(self.client, "warehouse-123")
-
-        # Verify the result
-        self.assertEqual(result, warehouse_detail)
-        self.client.get_warehouse.assert_called_once_with("warehouse-123")
-
-    def test_create_warehouse(self):
-        """Test creating a warehouse."""
-        # Set up mock response
-        new_warehouse = {
-            "id": "warehouse-789",
-            "name": "New Warehouse",
-            "state": "CREATING",
-        }
-        self.client.create_warehouse.return_value = new_warehouse
-
-        # Create options for new warehouse
-        warehouse_options = {
-            "name": "New Warehouse",
-            "cluster_size": "Small",
-            "auto_stop_mins": 120,
-        }
-
-        # Call the function
-        result = create_warehouse(self.client, warehouse_options)
-
-        # Verify the result
-        self.assertEqual(result, new_warehouse)
-        self.client.create_warehouse.assert_called_once_with(warehouse_options)
diff --git a/tests/unit/__init__.py b/tests/unit/__init__.py
new file mode 100644
index 0000000..e69de29
diff --git a/tests/clients/__init__.py b/tests/unit/clients/__init__.py
similarity index 100%
rename from tests/clients/__init__.py
rename to tests/unit/clients/__init__.py
diff --git a/tests/clients/test_amperity.py b/tests/unit/clients/test_amperity.py
similarity index 100%
rename from tests/clients/test_amperity.py
rename to tests/unit/clients/test_amperity.py
diff --git a/tests/commands/__init__.py b/tests/unit/commands/__init__.py
similarity index 100%
rename from tests/commands/__init__.py
rename to tests/unit/commands/__init__.py
diff --git a/tests/unit/commands/test_add_stitch_report.py b/tests/unit/commands/test_add_stitch_report.py
new file mode 100644
index 0000000..5080f39
--- /dev/null
+++ b/tests/unit/commands/test_add_stitch_report.py
@@ -0,0 +1,149 @@
+"""
+Tests for add_stitch_report command handler.
+
+This module contains tests for the add_stitch_report command handler.
+"""
+
+from unittest.mock import patch
+
+from chuck_data.commands.add_stitch_report import handle_command
+
+
+def test_missing_client():
+    """Test handling when client is not provided."""
+    result = handle_command(None, table_path="catalog.schema.table")
+    assert not result.success
+    assert "Client is required" in result.message
+
+
+def test_missing_table_path(databricks_client_stub):
+    """Test handling when table_path is missing."""
+    result = handle_command(databricks_client_stub)
+    assert not result.success
+    assert "Table path must be provided" in result.message
+
+
+def test_invalid_table_path_format(databricks_client_stub):
+    """Test handling when table_path format is invalid."""
+    result = handle_command(databricks_client_stub, table_path="invalid_format")
+    assert not result.success
+    assert "must be in the format" in result.message
+
+
+@patch("chuck_data.commands.add_stitch_report.get_metrics_collector")
+def test_successful_report_creation(
+    mock_get_metrics_collector, databricks_client_stub, metrics_collector_stub
+):
+    """Test successful stitch report notebook creation."""
+    # Setup mocks
+    mock_get_metrics_collector.return_value = metrics_collector_stub
+
+    databricks_client_stub.set_create_stitch_notebook_result(
+        {
+            "path": "/Workspace/Users/user@example.com/Stitch Results",
+            "status": "success",
+        }
+    )
+
+    # Call function
+    result = handle_command(databricks_client_stub, table_path="catalog.schema.table")
+
+    # Verify results
+    assert result.success
+    assert "Successfully created" in result.message
+    # Verify the call was made with correct arguments
+    assert len(databricks_client_stub.create_stitch_notebook_calls) == 1
+    args, kwargs = databricks_client_stub.create_stitch_notebook_calls[0]
+    assert args == ("catalog.schema.table", None)
+
+    # Verify metrics collection
+    assert len(metrics_collector_stub.track_event_calls) == 1
+    call = metrics_collector_stub.track_event_calls[0]
+    assert call["prompt"] == "add-stitch-report command"
+    assert call["additional_data"]["status"] == "success"
+
+
+@patch("chuck_data.commands.add_stitch_report.get_metrics_collector")
+def test_report_creation_with_custom_name(
+    mock_get_metrics_collector, databricks_client_stub, metrics_collector_stub
+):
+    """Test stitch report creation with custom notebook name."""
+    # Setup mocks
+    mock_get_metrics_collector.return_value = metrics_collector_stub
+
+    databricks_client_stub.set_create_stitch_notebook_result(
+        {
+            "path": "/Workspace/Users/user@example.com/My Custom Report",
+            "status": "success",
+        }
+    )
+
+    # Call function
+    result = handle_command(
+        databricks_client_stub,
+        table_path="catalog.schema.table",
+        name="My Custom Report",
+    )
+
+    # Verify results
+    assert result.success
+    assert "Successfully created" in result.message
+    # Verify the call was made with correct arguments
+    assert len(databricks_client_stub.create_stitch_notebook_calls) == 1
+    args, kwargs = databricks_client_stub.create_stitch_notebook_calls[0]
+    assert args == ("catalog.schema.table", "My Custom Report")
+
+
+@patch("chuck_data.commands.add_stitch_report.get_metrics_collector")
+def test_report_creation_with_rest_args(
+    mock_get_metrics_collector, databricks_client_stub, metrics_collector_stub
+):
+    """Test stitch report creation with rest arguments as notebook name."""
+    # Setup mocks
+    mock_get_metrics_collector.return_value = metrics_collector_stub
+
+    databricks_client_stub.set_create_stitch_notebook_result(
+        {
+            "path": "/Workspace/Users/user@example.com/Multi Word Name",
+            "status": "success",
+        }
+    )
+
+    # Call function with rest parameter
+    result = handle_command(
+        databricks_client_stub,
+        table_path="catalog.schema.table",
+        rest="Multi Word Name",
+    )
+
+    # Verify results
+    assert result.success
+    assert "Successfully created" in result.message
+    # Verify the call was made with correct arguments
+    assert len(databricks_client_stub.create_stitch_notebook_calls) == 1
+    args, kwargs = databricks_client_stub.create_stitch_notebook_calls[0]
+    assert args == ("catalog.schema.table", "Multi Word Name")
+
+
+@patch("chuck_data.commands.add_stitch_report.get_metrics_collector")
+def test_report_creation_api_error(
+    mock_get_metrics_collector, databricks_client_stub, metrics_collector_stub
+):
+    """Test handling when API call to create notebook fails."""
+    # Setup mocks
+    mock_get_metrics_collector.return_value = metrics_collector_stub
+
+    databricks_client_stub.set_create_stitch_notebook_error(ValueError("API Error"))
+
+    # Call function
+    result = handle_command(databricks_client_stub, table_path="catalog.schema.table")
+
+    # Verify results
+    assert not result.success
+    assert "Error creating Stitch report" in result.message
+
+    # Verify metrics collection for error
+    assert len(metrics_collector_stub.track_event_calls) == 1
+    call = metrics_collector_stub.track_event_calls[0]
+    assert call["prompt"] == "add-stitch-report command"
+    assert call["error"] == "API Error"
diff --git a/tests/unit/commands/test_agent.py b/tests/unit/commands/test_agent.py
new file mode 100644
index 0000000..de4724f
--- /dev/null
+++ b/tests/unit/commands/test_agent.py
@@ -0,0 +1,295 @@
+"""
+Tests for agent command handler.
+
+Following improved testing patterns:
+- Direct dependency injection of stubs (no mocking needed!)
+- Use real agent manager logic and real config system
+- Test end-to-end agent command behavior with injected external dependencies
+"""
+
+import tempfile
+from unittest.mock import patch
+
+from chuck_data.commands.agent import handle_command
+from chuck_data.config import ConfigManager
+
+
+def test_missing_query_real_logic():
+    """Test handling when query parameter is not provided."""
+    result = handle_command(None)
+    assert not result.success
+    assert "Please provide a query" in result.message
+
+
+def test_general_query_mode_real_logic(databricks_client_stub, llm_client_stub):
+    """Test general query mode with real agent logic and direct dependency injection."""
+    # Configure LLM stub for expected behavior
+    llm_client_stub.set_response_content("This is a test response from the agent.")
+
+    # Use real config with temp file
+    with tempfile.NamedTemporaryFile() as tmp:
+        config_manager = ConfigManager(tmp.name)
+        config_manager.update(workspace_url="https://test.databricks.com")
+
+        with patch("chuck_data.config._config_manager", config_manager):
+            # Direct dependency injection - no mocking needed!
+            result = handle_command(
+                databricks_client_stub,
+                llm_client=llm_client_stub,  # Inject LLM stub directly
+                mode="general",
+                query="What is the status of my workspace?",
+            )
+
+    # Verify real command execution with injected dependencies
+    assert result.success
+    assert result.data is not None
+    assert "response" in result.data
+
+
+def test_pii_mode_real_logic(databricks_client_stub_with_data, llm_client_stub):
+    """Test PII detection mode with real agent logic."""
+    # Configure LLM stub for PII analysis
+    llm_client_stub.set_response_content(
+        "This table contains potential PII in the email column."
+    )
+
+    with tempfile.NamedTemporaryFile() as tmp:
+        config_manager = ConfigManager(tmp.name)
+        config_manager.update(workspace_url="https://test.databricks.com")
+
+        with patch("chuck_data.config._config_manager", config_manager):
+            # Direct dependency injection - query becomes table_name for PII mode
+            result = handle_command(
+                databricks_client_stub_with_data,
+                llm_client=llm_client_stub,
+                mode="pii",
+                query="test_table",  # This is passed as table_name to process_pii_detection
+            )
+
+    # Verify real PII detection execution
+    assert result.success
+    assert result.data is not None
+    assert "response" in result.data
+
+
+def test_bulk_pii_mode_real_logic(databricks_client_stub_with_data, llm_client_stub):
+    """Test bulk PII scanning mode with real agent logic."""
+    # Configure LLM stub for bulk analysis
+    llm_client_stub.set_response_content(
+        "Completed bulk PII scan. Found 3 tables with potential PII."
+    )
+
+    with tempfile.NamedTemporaryFile() as tmp:
+        config_manager = ConfigManager(tmp.name)
+        config_manager.update(workspace_url="https://test.databricks.com")
+
+        with patch("chuck_data.config._config_manager", config_manager):
+            # Direct dependency injection
+            result = handle_command(
+                databricks_client_stub_with_data,
+                llm_client=llm_client_stub,
+                mode="bulk_pii",
+                catalog_name="test_catalog",
+                schema_name="test_schema",
+            )
+
+    # Verify real bulk scanning execution
+    assert result.success
+    assert result.data is not None
+    assert "response" in result.data
+
+
+def test_stitch_mode_real_logic(databricks_client_stub_with_data, llm_client_stub):
+    """Test Stitch setup mode with real agent logic."""
+    # Configure LLM stub for Stitch setup
+    llm_client_stub.set_response_content(
+        "Stitch integration setup completed successfully."
+    )
+
+    with tempfile.NamedTemporaryFile() as tmp:
+        config_manager = ConfigManager(tmp.name)
+        config_manager.update(workspace_url="https://test.databricks.com")
+
+        with patch("chuck_data.config._config_manager", config_manager):
+            # Direct dependency injection
+            result = handle_command(
+                databricks_client_stub_with_data,
+                llm_client=llm_client_stub,
+                mode="stitch",
+                catalog_name="test_catalog",
+                schema_name="test_schema",
+            )
+
+    # Verify real Stitch setup execution
+    assert result.success
+    assert result.data is not None
+    assert "response" in result.data
+
+
+def test_agent_error_handling_real_logic(databricks_client_stub, llm_client_stub):
+    """Test agent error handling with real business logic."""
+    # Configure LLM stub to simulate error
+    llm_client_stub.set_exception(True)
+
+    with tempfile.NamedTemporaryFile() as tmp:
+        config_manager = ConfigManager(tmp.name)
+        config_manager.update(workspace_url="https://test.databricks.com")
+
+        with patch("chuck_data.config._config_manager", config_manager):
+            # Direct dependency injection
+            result = handle_command(
+                databricks_client_stub,
+                llm_client=llm_client_stub,
+                mode="general",
+                query="Test query",
+            )
+
+    # Should handle LLM errors gracefully with real error handling logic
+    assert isinstance(result.success, bool)
+    assert result.data is not None or result.error is not None
+
+
+def test_agent_history_integration_real_logic(databricks_client_stub, llm_client_stub):
+    """Test agent history integration with real config system."""
+    # Configure LLM stub
+    llm_client_stub.set_response_content("Response with history context.")
+
+    with tempfile.NamedTemporaryFile() as tmp:
+        config_manager = ConfigManager(tmp.name)
+        config_manager.update(workspace_url="https://test.databricks.com")
+
+        with patch("chuck_data.config._config_manager", config_manager):
+            # Direct dependency injection for both queries
+            result1 = handle_command(
+                databricks_client_stub,
+                llm_client=llm_client_stub,
+                mode="general",
+                query="First question",
+            )
+
+            result2 = handle_command(
+                databricks_client_stub,
+                llm_client=llm_client_stub,
+                mode="general",
+                query="Follow up question",
+            )
+
+    # Both queries should work with real history management
+    assert result1.success
+    assert result2.success
+
+
+def test_agent_with_tool_output_callback_real_logic(
+    databricks_client_stub_with_data, llm_client_stub
+):
+    """Test agent with tool output callback using real logic."""
+    # Configure LLM stub to use tools
+    llm_client_stub.set_response_content("I'll check your catalogs.")
+
+    # Create a mock callback to test tool output integration
+    tool_outputs = []
+
+    def mock_callback(output):
+        tool_outputs.append(output)
+
+    with tempfile.NamedTemporaryFile() as tmp:
+        config_manager = ConfigManager(tmp.name)
+        config_manager.update(workspace_url="https://test.databricks.com")
+
+        with patch("chuck_data.config._config_manager", config_manager):
+            # Direct dependency injection with callback
+            result = handle_command(
+                databricks_client_stub_with_data,
+                llm_client=llm_client_stub,
+                mode="general",
+                query="What catalogs do I have?",
+                tool_output_callback=mock_callback,
+            )
+
+    # Verify real tool integration
+    assert result.success
+    assert result.data is not None
+
+
+def test_agent_config_integration_real_logic(databricks_client_stub, llm_client_stub):
+    """Test agent integration with real config system."""
+    # Configure LLM stub
+    llm_client_stub.set_response_content("Configuration-aware response.")
+
+    with tempfile.NamedTemporaryFile() as tmp:
+        config_manager = ConfigManager(tmp.name)
+
+        # Set up config state to test real config integration
+        config_manager.update(
+            workspace_url="https://test.databricks.com",
+            active_catalog="test_catalog",
+            active_schema="test_schema",
+        )
+
+        with patch("chuck_data.config._config_manager", config_manager):
+            # Direct dependency injection
+            result = handle_command(
+                databricks_client_stub,
+                llm_client=llm_client_stub,
+                mode="general",
+                query="What is my current workspace setup?",
+            )
+
+    # Verify real config integration
+    assert result.success
+    assert result.data is not None
+
+
+def test_agent_with_missing_client_real_logic(llm_client_stub):
+    """Test agent behavior with missing databricks client."""
+    with tempfile.NamedTemporaryFile() as tmp:
+        config_manager = ConfigManager(tmp.name)
+        config_manager.update(workspace_url="https://test.databricks.com")
+
+        with patch("chuck_data.config._config_manager", config_manager):
+            # Direct dependency injection even with missing databricks client
+            result = handle_command(
+                None,  # No databricks client
+                llm_client=llm_client_stub,
+                query="Test query",
+            )
+
+    # Should handle missing client gracefully
+    assert isinstance(result.success, bool)
+    assert result.data is not None or result.error is not None
+
+
+def test_agent_parameter_handling_real_logic(databricks_client_stub, llm_client_stub):
+    """Test agent parameter handling with different input methods."""
+    llm_client_stub.set_response_content("Parameter handling test response.")
+
+    with tempfile.NamedTemporaryFile() as tmp:
+        config_manager = ConfigManager(tmp.name)
+        config_manager.update(workspace_url="https://test.databricks.com")
+
+        with patch("chuck_data.config._config_manager", config_manager):
+            # Test with query parameter
+            result1 = handle_command(
+                databricks_client_stub,
+                llm_client=llm_client_stub,
+                query="Direct query test",
+            )
+
+            # Test with rest parameter (if supported)
+            result2 = handle_command(
+                databricks_client_stub,
+                llm_client=llm_client_stub,
+                rest="Rest parameter test",
+            )
+
+            # Test with raw_args parameter (if supported)
+            result3 = handle_command(
+                databricks_client_stub,
+                llm_client=llm_client_stub,
+                raw_args=["Raw", "args", "test"],
+            )
+
+    # All should be handled by real parameter processing logic
+    for result in [result1, result2, result3]:
+        assert isinstance(result.success, bool)
+        assert result.data is not None or result.error is not None
diff --git a/tests/unit/commands/test_auth.py b/tests/unit/commands/test_auth.py
new file mode 100644
index 0000000..93e70db
--- /dev/null
+++ b/tests/unit/commands/test_auth.py
@@ -0,0 +1,146 @@
+"""Unit tests for the auth commands module."""
+
+from unittest.mock import patch
+
+from chuck_data.commands.auth import (
+    handle_amperity_login,
+    handle_databricks_login,
+    handle_logout,
+)
+
+
+@patch("chuck_data.commands.auth.AmperityAPIClient")
+def test_amperity_login_success(mock_auth_client_class, amperity_client_stub):
+    """Test successful Amperity login flow."""
+    # Use AmperityClientStub instead of MagicMock
+    mock_auth_client_class.return_value = amperity_client_stub
+
+    # Execute
+    result = handle_amperity_login(None)
+
+    # Verify
+    assert result.success
+    assert result.message == "Authentication completed successfully."
+
+
+@patch("chuck_data.commands.auth.AmperityAPIClient")
+def test_amperity_login_start_failure(mock_auth_client_class, amperity_client_stub):
+    """Test failure during start of Amperity login flow."""
+    # Use AmperityClientStub configured to fail at start
+    amperity_client_stub.set_auth_start_failure(True)
+    mock_auth_client_class.return_value = amperity_client_stub
+
+    # Execute
+    result = handle_amperity_login(None)
+
+    # Verify
+    assert not result.success
+    assert result.message == "Login failed: Failed to start auth: 500 - Server Error"
+
+
+@patch("chuck_data.commands.auth.AmperityAPIClient")
+def test_amperity_login_completion_failure(
+    mock_auth_client_class, amperity_client_stub
+):
+    """Test failure during completion of Amperity login flow."""
+    # Use AmperityClientStub configured to fail at completion
+    amperity_client_stub.set_auth_completion_failure(True)
+    mock_auth_client_class.return_value = amperity_client_stub
+
+    # Execute
+    result = handle_amperity_login(None)
+
+    # Verify
+    assert not result.success
+    assert result.message == "Login failed: Authentication failed: error"
+
+
+@patch("chuck_data.commands.auth.set_databricks_token")
+def test_databricks_login_success(mock_set_token):
+    """Test setting the Databricks token."""
+    # Setup
+    mock_set_token.return_value = True
+    test_token = "test-token-123"
+
+    # Execute
+    result = handle_databricks_login(None, token=test_token)
+
+    # Verify
+    assert result.success
+    assert result.message == "Databricks token set successfully"
+    mock_set_token.assert_called_with(test_token)
+
+
+def test_databricks_login_missing_token():
+    """Test error when token is missing."""
+    # Execute
+    result = handle_databricks_login(None)
+
+    # Verify
+    assert not result.success
+    assert result.message == "Token parameter is required"
+
+
+@patch("chuck_data.commands.auth.set_databricks_token")
+def test_logout_databricks(mock_set_db_token):
+    """Test logout from Databricks."""
+    # Setup
+    mock_set_db_token.return_value = True
+
+    # Execute
+    result = handle_logout(None, service="databricks")
+
+    # Verify
+    assert result.success
+    assert result.message == "Successfully logged out from databricks"
+    mock_set_db_token.assert_called_with("")
+
+
+@patch("chuck_data.config.set_amperity_token")
+def test_logout_amperity(mock_set_amp_token):
+    """Test logout from Amperity."""
+    # Setup
+    mock_set_amp_token.return_value = True
+
+    # Execute
+    result = handle_logout(None, service="amperity")
+
+    # Verify
+    assert result.success
+    assert result.message == "Successfully logged out from amperity"
+    mock_set_amp_token.assert_called_with("")
+
+
+@patch("chuck_data.config.set_amperity_token")
+@patch("chuck_data.commands.auth.set_databricks_token")
+def test_logout_default(mock_set_db_token, mock_set_amp_token):
+    """Test default logout behavior (only Amperity)."""
+    # Setup
+    mock_set_amp_token.return_value = True
+
+    # Execute
+    result = handle_logout(None)  # No service specified
+
+    # Verify
+    assert result.success
+    assert result.message == "Successfully logged out from amperity"
+    mock_set_amp_token.assert_called_with("")
+    mock_set_db_token.assert_not_called()
+
+
+@patch("chuck_data.commands.auth.set_databricks_token")
+@patch("chuck_data.config.set_amperity_token")
+def test_logout_all(mock_set_amp_token, mock_set_db_token):
+    """Test logout from all services."""
+    # Setup
+    mock_set_db_token.return_value = True
+    mock_set_amp_token.return_value = True
+
+    # Execute
+    result = handle_logout(None, service="all")
+
+    # Verify
+    assert result.success
+    assert result.message == "Successfully logged out from all"
+    mock_set_db_token.assert_called_with("")
+    mock_set_amp_token.assert_called_with("")
diff --git a/tests/unit/commands/test_base.py b/tests/unit/commands/test_base.py
new file mode 100644
index 0000000..458da4f
--- /dev/null
+++ b/tests/unit/commands/test_base.py
@@ -0,0 +1,33 @@
+"""
+Tests for the base module in the commands package.
+"""
+
+from chuck_data.commands.base import CommandResult
+
+
+def test_command_result_success():
+    """Test creating a successful CommandResult."""
+    result = CommandResult(True, data="test data", message="test message")
+    assert result.success
+    assert result.data == "test data"
+    assert result.message == "test message"
+    assert result.error is None
+
+
+def test_command_result_failure():
+    """Test creating a failure CommandResult."""
+    error = ValueError("test error")
+    result = CommandResult(False, error=error, message="test error message")
+    assert not result.success
+    assert result.data is None
+    assert result.message == "test error message"
+    assert result.error == error
+
+
+def test_command_result_defaults():
+    """Test CommandResult with default values."""
+    result = CommandResult(True)
+    assert result.success
+    assert result.data is None
+    assert result.message is None
+    assert result.error is None
diff --git a/tests/commands/test_bug.py b/tests/unit/commands/test_bug.py
similarity index 99%
rename from tests/commands/test_bug.py
rename to tests/unit/commands/test_bug.py
index ef77bb8..868e41f 100644
--- a/tests/commands/test_bug.py
+++ b/tests/unit/commands/test_bug.py
@@ -14,7 +14,7 @@
     _get_session_log,
 )
 from chuck_data.config import ConfigManager
-from tests.fixtures import AmperityClientStub
+from tests.fixtures.amperity import AmperityClientStub
 
 
 class TestBugCommand:
diff --git a/tests/unit/commands/test_catalog_selection.py b/tests/unit/commands/test_catalog_selection.py
new file mode 100644
index 0000000..e8c616f
--- /dev/null
+++ b/tests/unit/commands/test_catalog_selection.py
@@ -0,0 +1,98 @@
+"""
+Tests for catalog_selection command handler.
+
+This module contains tests for the catalog selection command handler.
+"""
+
+from unittest.mock import patch
+
+from chuck_data.commands.catalog_selection import handle_command
+from chuck_data.config import get_active_catalog
+
+
+def test_missing_catalog_name(databricks_client_stub, temp_config):
+    """Test handling when catalog parameter is not provided."""
+    with patch("chuck_data.config._config_manager", temp_config):
+        result = handle_command(databricks_client_stub)
+        assert not result.success
+        assert "catalog parameter is required" in result.message
+
+
+def test_successful_catalog_selection(databricks_client_stub, temp_config):
+    """Test successful catalog selection."""
+    with patch("chuck_data.config._config_manager", temp_config):
+        # Set up catalog in stub
+        databricks_client_stub.add_catalog("test_catalog", catalog_type="MANAGED")
+
+        # Call function
+        result = handle_command(databricks_client_stub, catalog="test_catalog")
+
+        # Verify results
+        assert result.success
+        assert "Active catalog is now set to 'test_catalog'" in result.message
+        assert "Type: MANAGED" in result.message
+        assert result.data["catalog_name"] == "test_catalog"
+        assert result.data["catalog_type"] == "MANAGED"
+
+        # Verify config was updated
+        assert get_active_catalog() == "test_catalog"
+
+
+def test_catalog_selection_with_verification_failure(
+    databricks_client_stub, temp_config
+):
+    """Test catalog selection when verification fails."""
+    with patch("chuck_data.config._config_manager", temp_config):
+        # Add some catalogs but not the one we're looking for (make sure names are very different)
+        databricks_client_stub.add_catalog("xyz", catalog_type="MANAGED")
+
+        # Call function with nonexistent catalog that won't fuzzy match
+        result = handle_command(
+            databricks_client_stub, catalog="completely_different_name"
+        )
+
+        # Verify results - should fail since catalog doesn't exist and no fuzzy match
+        assert not result.success
+        assert "No catalog found matching 'completely_different_name'" in result.message
+        assert "Available catalogs: xyz" in result.message
+
+
+def test_catalog_selection_exception(databricks_client_stub, temp_config):
+    """Test catalog selection with unexpected exception."""
+    with patch("chuck_data.config._config_manager", temp_config):
+        # Configure stub to fail on get_catalog
+        def get_catalog_failing(catalog_name):
+            raise Exception("Failed to set catalog")
+
+        databricks_client_stub.get_catalog = get_catalog_failing
+
+        # This should trigger the exception in the catalog verification
+        result = handle_command(databricks_client_stub, catalog="test_catalog")
+
+        # Should fail since get_catalog fails and no catalogs in list
+        assert not result.success
+        assert "No catalogs found in workspace" in result.message
+
+
+def test_select_catalog_by_name(databricks_client_stub, temp_config):
+    """Test catalog selection by name."""
+    with patch("chuck_data.config._config_manager", temp_config):
+        databricks_client_stub.add_catalog("Test Catalog", catalog_type="MANAGED")
+
+        result = handle_command(databricks_client_stub, catalog="Test Catalog")
+
+        assert result.success
+        assert "Active catalog is now set to 'Test Catalog'" in result.message
+
+
+def test_select_catalog_fuzzy_matching(databricks_client_stub, temp_config):
+    """Test catalog selection with fuzzy matching."""
+    with patch("chuck_data.config._config_manager", temp_config):
+        databricks_client_stub.add_catalog(
+            "Test Catalog Long Name", catalog_type="MANAGED"
+        )
+
+        result = handle_command(databricks_client_stub, catalog="Test")
+
+        assert result.success
+        assert "Test Catalog Long Name" in result.message
diff --git a/tests/commands/test_cluster_init_tools.py b/tests/unit/commands/test_cluster_init_tools.py
similarity index 100%
rename from tests/commands/test_cluster_init_tools.py
rename to tests/unit/commands/test_cluster_init_tools.py
diff --git a/tests/unit/commands/test_help.py b/tests/unit/commands/test_help.py
new file mode 100644
index 0000000..1d3882a
--- /dev/null
+++ b/tests/unit/commands/test_help.py
@@ -0,0 +1,117 @@
+"""
+Tests for help command handler.
+
+Following approved testing patterns:
+- Use real internal business logic (get_user_commands, format_help_text)
+- No external boundaries to mock in this simple command
+- Test end-to-end help command behavior
+"""
+
+from chuck_data.commands.help import handle_command
+
+
+def test_help_command_success_real_logic():
+    """Test successful help command execution with real logic."""
+    # Test real help command with no mocking - it should work end-to-end
+    result = handle_command(None)
+
+    # Verify real command execution
+    assert result.success
+    assert result.data is not None
+    assert "help_text" in result.data
+    assert isinstance(result.data["help_text"], str)
+    assert len(result.data["help_text"]) > 0
+
+    # Real help text should contain expected command information
+    help_text = result.data["help_text"]
+    assert "Commands" in help_text or "help" in help_text.lower()
+
+
+def test_help_command_with_client_real_logic(databricks_client_stub):
+    """Test help command with client provided (should work the same)."""
+    # Help command doesn't use the client, should work the same
+    result = handle_command(databricks_client_stub)
+
+    # Should succeed with real logic regardless of client
+    assert result.success
+    assert result.data is not None
+    assert "help_text" in result.data
+    assert isinstance(result.data["help_text"], str)
+    assert len(result.data["help_text"]) > 0
+
+
+def test_help_command_content_real_logic():
+    """Test that help command returns real content from the command registry."""
+    result = handle_command(None)
+
+    assert result.success
+    help_text = result.data["help_text"]
+
+    # Real help should contain information about actual commands
+    # These are commands we know exist in the system
+    expected_content_indicators = [
+        "help",  # Help command itself
+        "status",  # Status command
+        "Commands",  # Section header
+        "/",  # TUI command indicators
+    ]
+
+    # At least some of these should be present in real help text
+    found_indicators = [
+        indicator
+        for indicator in expected_content_indicators
+        if indicator.lower() in help_text.lower()
+    ]
+
+    assert (
+        len(found_indicators) > 0
+    ), f"Expected to find command indicators in help text: {help_text[:200]}..."
+
+
+def test_help_command_real_formatting():
+    """Test that help command uses real formatting logic."""
+    result = handle_command(None)
+
+    assert result.success
+    help_text = result.data["help_text"]
+
+    # Real formatting should produce structured text
+    assert isinstance(help_text, str)
+    assert len(help_text.strip()) > 10  # Should be substantial content
+
+    # Real help formatting should include some structure
+    # (exact structure depends on implementation, but should be non-trivial)
+    lines = help_text.split("\n")
+    assert len(lines) > 1, "Help text should be multi-line"
+
+
+def test_help_command_idempotent_real_logic():
+    """Test that help command produces consistent results."""
+    # Call multiple times and verify consistency
+    result1 = handle_command(None)
+    result2 = handle_command(None)
+
+    assert result1.success
+    assert result2.success
+
+    # Real logic should produce identical results
+    assert result1.data["help_text"] == result2.data["help_text"]
+
+
+def test_help_command_no_side_effects_real_logic():
+    """Test that help command has no side effects with real logic."""
+    # Store initial state (this is a read-only command)
+    result_before = handle_command(None)
+
+    # Call help command
+    result = handle_command(None)
+
+    # Call again to verify no state changes
+    result_after = handle_command(None)
+
+    # All should succeed and produce identical results
+    assert result_before.success
+    assert result.success
+    assert result_after.success
+
+    assert result_before.data["help_text"] == result_after.data["help_text"]
diff --git a/tests/unit/commands/test_jobs.py b/tests/unit/commands/test_jobs.py
new file mode 100644
index 0000000..941026e
--- /dev/null
+++ b/tests/unit/commands/test_jobs.py
@@ -0,0 +1,128 @@
+from unittest.mock import patch
+
+from chuck_data.commands.jobs import handle_launch_job, handle_job_status
+from chuck_data.commands.base import CommandResult
+
+
+def test_handle_launch_job_success(databricks_client_stub, temp_config):
+    """Test launching a job with all required parameters."""
+    with patch("chuck_data.config._config_manager", temp_config):
+        # Use kwargs format instead of positional arguments
+        result: CommandResult = handle_launch_job(
+            databricks_client_stub,
+            config_path="/Volumes/test/config.json",
+            init_script_path="/init/script.sh",
+            run_name="MyTestJob",
+        )
+        assert result.success is True
+        assert "123456" in result.message
+        assert result.data["run_id"] == "123456"
+
+
+def test_handle_launch_job_no_run_id(databricks_client_stub, temp_config):
+    """Test launching a job where response doesn't include run_id."""
+    with patch("chuck_data.config._config_manager", temp_config):
+        # Configure stub to return response without run_id
+        def submit_no_run_id(config_path, init_script_path, run_name=None):
+            return {}  # No run_id in response
+
+        databricks_client_stub.submit_job_run = submit_no_run_id
+
+        # Use kwargs format
+        result = handle_launch_job(
+            databricks_client_stub,
+            config_path="/Volumes/test/config.json",
+            init_script_path="/init/script.sh",
+            run_name="NoRunId",
+        )
+        assert not result.success
+        # Now we're looking for more generic failed/failure message
+        assert "Failed" in result.message or "No run_id" in result.message
+
+
+def test_handle_launch_job_http_error(databricks_client_stub, temp_config):
+    """Test launching a job with HTTP error response."""
+    with patch("chuck_data.config._config_manager", temp_config):
+        # Configure stub to raise an HTTP error
+        def submit_failing(config_path, init_script_path, run_name=None):
+            raise Exception("Bad Request")
+
+        databricks_client_stub.submit_job_run = submit_failing
+
+        # Use kwargs format
+        result = handle_launch_job(
+            databricks_client_stub,
+            config_path="/Volumes/test/config.json",
+            init_script_path="/init/script.sh",
+        )
+        assert not result.success
+        assert "Bad Request" in result.message
+
+
+def test_handle_launch_job_missing_token(temp_config):
+    """Test launching a job with missing API token."""
+    with patch("chuck_data.config._config_manager", temp_config):
+        # Use kwargs format
+        result = handle_launch_job(
+            None,
+            config_path="/Volumes/test/config.json",
+            init_script_path="/init/script.sh",
+        )
+        assert not result.success
+        assert "Client required" in result.message
+
+
+def test_handle_launch_job_missing_url(temp_config):
+    """Test launching a job with missing workspace URL."""
+    with patch("chuck_data.config._config_manager", temp_config):
+        # Use kwargs format
+        result = handle_launch_job(
+            None,
+            config_path="/Volumes/test/config.json",
+            init_script_path="/init/script.sh",
+        )
+        assert not result.success
+        assert "Client required" in result.message
+
+
+def test_handle_job_status_basic_success(databricks_client_stub, temp_config):
+    """Test getting job status with successful response."""
+    with patch("chuck_data.config._config_manager", temp_config):
+        # Use kwargs format
+        result = handle_job_status(databricks_client_stub, run_id="123456")
+        assert result.success
+        assert result.data["state"]["life_cycle_state"] == "RUNNING"
+        assert result.data["run_id"] == 123456
+
+
+def test_handle_job_status_http_error(databricks_client_stub, temp_config):
+    """Test getting job status with HTTP error response."""
+    with patch("chuck_data.config._config_manager", temp_config):
+        # Configure stub to raise an HTTP error
+        def get_status_failing(run_id):
+            raise Exception("Not Found")
+
+        databricks_client_stub.get_job_run_status = get_status_failing
+
+        # Use kwargs format
+        result = handle_job_status(databricks_client_stub, run_id="999999")
+        assert not result.success
+        assert "Not Found" in result.message
+
+
+def test_handle_job_status_missing_token(temp_config):
+    """Test getting job status with missing API token."""
+    with patch("chuck_data.config._config_manager", temp_config):
+        # Use kwargs format
+        result = handle_job_status(None, run_id="123456")
+        assert not result.success
+        assert "Client required" in result.message
+
+
+def test_handle_job_status_missing_url(temp_config):
+    """Test getting job status with missing workspace URL."""
+    with patch("chuck_data.config._config_manager", temp_config):
+        # Use kwargs format
+        result = handle_job_status(None, run_id="123456")
+        assert not result.success
+        assert "Client required" in result.message
diff --git a/tests/unit/commands/test_list_catalogs.py b/tests/unit/commands/test_list_catalogs.py
new file mode 100644
index 0000000..30fee24
--- /dev/null
+++ b/tests/unit/commands/test_list_catalogs.py
@@ -0,0 +1,146 @@
+"""
+Tests for list_catalogs command handler.
+
+This module contains tests for the list_catalogs command handler.
+"""
+
+from unittest.mock import patch
+
+from chuck_data.commands.list_catalogs import handle_command
+
+
+def test_no_client():
+    """Test handling when no client is provided."""
+    result = handle_command(None)
+    assert not result.success
+    assert "No Databricks client available" in result.message
+
+
+def test_successful_list_catalogs(databricks_client_stub, temp_config):
+    """Test successful list catalogs."""
+    client_stub = databricks_client_stub
+    config_manager = temp_config
+
+    # Set up test data using stub - this simulates external API
+    client_stub.add_catalog(
+        "catalog1",
+        catalog_type="MANAGED",
+        comment="Test catalog 1",
+        provider={"name": "provider1"},
+        created_at="2023-01-01",
+    )
+    client_stub.add_catalog(
+        "catalog2",
+        catalog_type="EXTERNAL",
+        comment="Test catalog 2",
+        provider={"name": "provider2"},
+        created_at="2023-01-02",
+    )
+
+    # Call function with parameters - tests real command logic
+    with patch("chuck_data.config._config_manager", config_manager):
+        result = handle_command(client_stub, include_browse=True, max_results=50)
+
+    # Verify results
+    assert result.success
+    assert len(result.data["catalogs"]) == 2
+    assert result.data["total_count"] == 2
+    assert "Found 2 catalog(s)." in result.message
+    assert not result.data.get("display", True)  # Should default to False
+    assert "current_catalog" in result.data
+
+    # Verify catalog data
+    catalog_names = [c["name"] for c in result.data["catalogs"]]
+    assert "catalog1" in catalog_names
+    assert "catalog2" in catalog_names
+
+
+def test_successful_list_catalogs_with_pagination(databricks_client_stub):
+    """Test successful list catalogs with pagination."""
+    from tests.fixtures.databricks.client import DatabricksClientStub
+
+    # For pagination testing, we need to modify the stub to return pagination token
+    class PaginatingClientStub(DatabricksClientStub):
+        def list_catalogs(
+            self, include_browse=False, max_results=None, page_token=None
+        ):
+            result = super().list_catalogs(include_browse, max_results, page_token)
+            # Add pagination token if page_token was provided
+            if page_token:
+                result["next_page_token"] = "abc123"
+            return result
+
+    paginating_stub = PaginatingClientStub()
+    paginating_stub.add_catalog("catalog1", catalog_type="MANAGED")
+    paginating_stub.add_catalog("catalog2", catalog_type="EXTERNAL")
+
+    # Call function with page token
+    result = handle_command(paginating_stub, page_token="xyz789")
+
+    # Verify results
+    assert result.success
+    assert result.data["next_page_token"] == "abc123"
+    assert "More catalogs available with page token: abc123" in result.message
+
+
+def test_empty_catalog_list(databricks_client_stub):
+    """Test handling when no catalogs are found."""
+    # Use empty client stub (no catalogs added)
+    client_stub = databricks_client_stub
+    client_stub.catalogs.clear()  # Ensure it's empty
+
+    # Call function
+    result = handle_command(client_stub)
+
+    # Verify results
+    assert result.success
+    assert "No catalogs found in this workspace." in result.message
+    assert result.data["total_count"] == 0
+    assert not result.data.get("display", True)
+    assert "current_catalog" in result.data
+
+
+def test_list_catalogs_exception():
+    """Test list_catalogs with unexpected exception."""
+    from tests.fixtures.databricks.client import DatabricksClientStub
+
+    # Create a stub that raises an exception for list_catalogs
+    class FailingClientStub(DatabricksClientStub):
+        def list_catalogs(
+            self, include_browse=False, max_results=None, page_token=None
+        ):
+            raise Exception("API error")
+
+    failing_client = FailingClientStub()
+
+    # Call function
+    result = handle_command(failing_client)
+
+    # Verify results
+    assert not result.success
+    assert "Failed to list catalogs" in result.message
+    assert str(result.error) == "API error"
+
+
+def test_list_catalogs_with_display_true(databricks_client_stub):
+    """Test list catalogs with display=true shows table."""
+    # Set up test data
+    databricks_client_stub.add_catalog("Test Catalog", catalog_type="MANAGED")
+
+    result = handle_command(databricks_client_stub, display=True)
+
+    assert result.success
+    assert result.data.get("display")
+    assert len(result.data.get("catalogs", [])) == 1
+
+
+def test_list_catalogs_with_display_false(databricks_client_stub):
+    """Test list catalogs with display=false returns data without display."""
+    # Set up test data
+    databricks_client_stub.add_catalog("Test Catalog", catalog_type="MANAGED")
+
+    result = handle_command(databricks_client_stub, display=False)
+
+    assert result.success
+    assert not result.data.get("display")
+    assert len(result.data.get("catalogs", [])) == 1
diff --git a/tests/unit/commands/test_list_models.py b/tests/unit/commands/test_list_models.py
new file mode 100644
index 0000000..5035619
--- /dev/null
+++ b/tests/unit/commands/test_list_models.py
@@ -0,0 +1,103 @@
+"""
+Tests for list_models command handler.
+
+This module contains tests for the list_models command handler.
+"""
+
+from unittest.mock import patch
+
+from chuck_data.commands.list_models import handle_command
+from chuck_data.config import set_active_model
+
+
+def test_basic_list_models(databricks_client_stub, temp_config):
+    """Test listing models without detailed information."""
+    with patch("chuck_data.config._config_manager", temp_config):
+        # Set up test data using stub
+        databricks_client_stub.add_model("model1", created_timestamp=123456789)
+        databricks_client_stub.add_model("model2", created_timestamp=987654321)
+        set_active_model("model1")
+
+        # Call function
+        result = handle_command(databricks_client_stub)
+
+        # Verify results
+        assert result.success
+        assert len(result.data["models"]) == 2
+        assert result.data["active_model"] == "model1"
+        assert not result.data["detailed"]
+        assert result.data["filter"] is None
+        assert result.message is None
+
+
+def test_detailed_list_models(databricks_client_stub, temp_config):
+    """Test listing models with detailed information."""
+    with patch("chuck_data.config._config_manager", temp_config):
+        # Set up test data using stub
+        databricks_client_stub.add_model(
+            "model1", created_timestamp=123456789, details="model1 details"
+        )
+        databricks_client_stub.add_model(
+            "model2", created_timestamp=987654321, details="model2 details"
+        )
+        set_active_model("model1")
+
+        # Call function
+        result = handle_command(databricks_client_stub, detailed=True)
+
+        # Verify results
+        assert result.success
+        assert len(result.data["models"]) == 2
+        assert result.data["detailed"]
+        assert result.data["models"][0]["details"]["name"] == "model1"
+        assert result.data["models"][1]["details"]["name"] == "model2"
+
+
+def test_filtered_list_models(databricks_client_stub, temp_config):
+    """Test listing models with filtering."""
+    with patch("chuck_data.config._config_manager", temp_config):
+        # Set up test data using stub
+        databricks_client_stub.add_model("claude-v1", created_timestamp=123456789)
+        databricks_client_stub.add_model("gpt-4", created_timestamp=987654321)
+        databricks_client_stub.add_model("claude-instant", created_timestamp=456789123)
+        set_active_model("claude-v1")
+
+        # Call function
+        result = handle_command(databricks_client_stub, filter="claude")
+
+        # Verify results
+        assert result.success
+        assert len(result.data["models"]) == 2
+        assert result.data["models"][0]["name"] == "claude-v1"
+        assert result.data["models"][1]["name"] == "claude-instant"
+        assert result.data["filter"] == "claude"
+
+
+def test_empty_list_models(databricks_client_stub, temp_config):
+    """Test listing models when no models are found."""
+    with patch("chuck_data.config._config_manager", temp_config):
+        # Don't add any models to stub
+        # Don't set active model
+
+        # Call function
+        result = handle_command(databricks_client_stub)
+
+        # Verify results
+        assert result.success
+        assert len(result.data["models"]) == 0
+        assert result.message is not None
+        assert "No models found" in result.message
+
+
+def test_list_models_exception(databricks_client_stub, temp_config):
+    """Test listing models with exception."""
+    with patch("chuck_data.config._config_manager", temp_config):
+        # Configure the stub to raise an exception for list_models
+        databricks_client_stub.set_list_models_error(Exception("API error"))
+
+        # Call function
+        result = handle_command(databricks_client_stub)
+
+        # Verify results
+        assert not result.success
+        assert str(result.error) == "API error"
diff --git a/tests/unit/commands/test_list_schemas.py b/tests/unit/commands/test_list_schemas.py
new file mode 100644
index 0000000..152e69b
--- /dev/null
+++ b/tests/unit/commands/test_list_schemas.py
@@ -0,0 +1,149 @@
+"""
+Tests for schema commands including list-schemas and select-schema.
+"""
+
+from unittest.mock import patch
+
+from chuck_data.commands.list_schemas import handle_command as list_schemas_handler
+from chuck_data.commands.schema_selection import handle_command as select_schema_handler
+from chuck_data.config import get_active_schema, set_active_catalog
+
+
+# Tests for list-schemas command
+def test_list_schemas_with_display_true(databricks_client_stub, temp_config):
+    """Test list schemas with display=true shows table."""
+    with patch("chuck_data.config._config_manager", temp_config):
+        # Set up test data
+        set_active_catalog("test_catalog")
+        databricks_client_stub.add_catalog("test_catalog")
+        databricks_client_stub.add_schema("test_catalog", "test_schema")
+
+        result = list_schemas_handler(databricks_client_stub, display=True)
+
+        assert result.success
+        assert result.data.get("display")
+        assert len(result.data.get("schemas", [])) == 1
+        assert result.data["schemas"][0]["name"] == "test_schema"
+
+
+def test_list_schemas_with_display_false(databricks_client_stub, temp_config):
+    """Test list schemas with display=false returns data without display."""
+    with patch("chuck_data.config._config_manager", temp_config):
+        # Set up test data
+        set_active_catalog("test_catalog")
+        databricks_client_stub.add_catalog("test_catalog")
+        databricks_client_stub.add_schema("test_catalog", "test_schema")
+
+        result = list_schemas_handler(databricks_client_stub, display=False)
+
+        assert result.success
+        assert not result.data.get("display")
+        assert len(result.data.get("schemas", [])) == 1
+
+
+def test_list_schemas_no_active_catalog(databricks_client_stub, temp_config):
+    """Test list schemas when no active catalog is set."""
+    with patch("chuck_data.config._config_manager", temp_config):
+        result = list_schemas_handler(databricks_client_stub)
+
+        assert not result.success
+        assert "No catalog specified and no active catalog selected" in result.message
+
+
+def test_list_schemas_empty_catalog(databricks_client_stub, temp_config):
+    """Test list schemas with empty catalog."""
+    with patch("chuck_data.config._config_manager", temp_config):
+        set_active_catalog("empty_catalog")
+        databricks_client_stub.add_catalog("empty_catalog")
+
+        result = list_schemas_handler(databricks_client_stub, display=True)
+
+        assert result.success
+        assert len(result.data.get("schemas", [])) == 0
+        assert result.data.get("display")
+
+
+# Tests for select-schema command
+def test_select_schema_by_name(databricks_client_stub, temp_config):
+    """Test schema selection by name."""
+    with patch("chuck_data.config._config_manager", temp_config):
+        set_active_catalog("test_catalog")
+        databricks_client_stub.add_catalog("test_catalog")
+        databricks_client_stub.add_schema("test_catalog", "test_schema")
+
+        result = select_schema_handler(databricks_client_stub, schema="test_schema")
+
+        assert result.success
+        assert "Active schema is now set to 'test_schema'" in result.message
+        assert get_active_schema() == "test_schema"
+
+
+def test_select_schema_fuzzy_matching(databricks_client_stub, temp_config):
+    """Test schema selection with fuzzy matching."""
+    with patch("chuck_data.config._config_manager", temp_config):
+        set_active_catalog("test_catalog")
+        databricks_client_stub.add_catalog("test_catalog")
+        databricks_client_stub.add_schema("test_catalog", "test_schema_long_name")
+
+        result = select_schema_handler(databricks_client_stub, schema="test")
+
+        assert result.success
+        assert "test_schema_long_name" in result.message
+        assert get_active_schema() == "test_schema_long_name"
+
+
+def test_select_schema_no_match(databricks_client_stub, temp_config):
+    """Test schema selection with no matching schema."""
+    with patch("chuck_data.config._config_manager", temp_config):
+        set_active_catalog("test_catalog")
+        databricks_client_stub.add_catalog("test_catalog")
+        databricks_client_stub.add_schema("test_catalog", "different_schema")
+
+        result = select_schema_handler(databricks_client_stub, schema="nonexistent")
+
+        assert not result.success
+        assert "No schema found matching 'nonexistent'" in result.message
+        assert "Available schemas:" in result.message
+
+
+def test_select_schema_missing_parameter(databricks_client_stub, temp_config):
+    """Test schema selection with missing schema parameter."""
+    with patch("chuck_data.config._config_manager", temp_config):
+        result = select_schema_handler(databricks_client_stub)
+
+        assert not result.success
+        assert "schema parameter is required" in result.message
+
+
+def test_select_schema_no_active_catalog(databricks_client_stub, temp_config):
+    """Test schema selection with no active catalog."""
+    with patch("chuck_data.config._config_manager", temp_config):
+        result = select_schema_handler(databricks_client_stub, schema="test_schema")
+
+        assert not result.success
+        assert "No active catalog selected" in result.message
+
+
+def test_select_schema_tool_output_callback(databricks_client_stub, temp_config):
+    """Test schema selection with tool output callback."""
+    with patch("chuck_data.config._config_manager", temp_config):
+        set_active_catalog("test_catalog")
+        databricks_client_stub.add_catalog("test_catalog")
+        databricks_client_stub.add_schema("test_catalog", "test_schema_with_callback")
+
+        # Mock callback to capture output
+        callback_calls = []
+
+        def mock_callback(tool_name, data):
+            callback_calls.append((tool_name, data))
+
+        result = select_schema_handler(
+            databricks_client_stub,
+            schema="callback",
+            tool_output_callback=mock_callback,
+        )
+
+        assert result.success
+        # Should have called the callback with step information
+        assert len(callback_calls) > 0
+        assert callback_calls[0][0] == "select-schema"
diff --git a/tests/unit/commands/test_list_tables.py b/tests/unit/commands/test_list_tables.py
new file mode 100644
index 0000000..a36e084
--- /dev/null
+++ b/tests/unit/commands/test_list_tables.py
@@ -0,0 +1,197 @@
+"""
+Tests for list_tables command handler.
+
+This module contains tests for the list_tables command handler.
+"""
+
+from unittest.mock import patch
+
+from chuck_data.commands.list_tables import handle_command
+from tests.fixtures.databricks.client import DatabricksClientStub
+
+
+def test_no_client():
+    """Test handling when no client is provided."""
+    result = handle_command(None)
+    assert not result.success
+    assert "No Databricks client available" in result.message
+
+
+def test_no_active_catalog(temp_config):
+    """Test handling when no catalog is provided and no active catalog is set."""
+    with patch("chuck_data.config._config_manager", temp_config):
+        client_stub = DatabricksClientStub()
+        # Don't set any active catalog in config
+
+        result = handle_command(client_stub)
+        assert not result.success
+        assert "No catalog specified and no active catalog selected" in result.message
+
+
+def test_no_active_schema(temp_config):
+    """Test handling when no schema is provided and no active schema is set."""
+    with patch("chuck_data.config._config_manager", temp_config):
+        from chuck_data.config import set_active_catalog
+
+        client_stub = DatabricksClientStub()
+        # Set active catalog but not schema
+        set_active_catalog("test_catalog")
+
+        result = handle_command(client_stub)
+        assert not result.success
+        assert "No schema specified and no active schema selected" in result.message
+
+
+def test_successful_list_tables_with_parameters(temp_config):
+    """Test successful list tables with all parameters specified."""
+    with patch("chuck_data.config._config_manager", temp_config):
+        client_stub = DatabricksClientStub()
+        # Set up test data using stub
+        client_stub.add_catalog("test_catalog")
+        client_stub.add_schema("test_catalog", "test_schema")
+        client_stub.add_table(
+            "test_catalog",
+            "test_schema",
+            "table1",
+            table_type="MANAGED",
+            comment="Test table 1",
+            created_at="2023-01-01",
+        )
+        client_stub.add_table(
+            "test_catalog",
+            "test_schema",
+            "table2",
+            table_type="VIEW",
+            comment="Test table 2",
+            created_at="2023-01-02",
+        )
+
+        # Call function
+        result = handle_command(
+            client_stub,
+            catalog_name="test_catalog",
+            schema_name="test_schema",
+            include_delta_metadata=True,
+            omit_columns=False,
+        )
+
+        # Verify results
+        assert result.success
+        assert len(result.data["tables"]) == 2
+        assert result.data["total_count"] == 2
+        assert result.data["catalog_name"] == "test_catalog"
+        assert result.data["schema_name"] == "test_schema"
+        assert "Found 2 table(s) in 'test_catalog.test_schema'" in result.message
+
+        # Verify table data
+        table_names = [t["name"] for t in result.data["tables"]]
+        assert "table1" in table_names
+        assert "table2" in table_names
+
+
+def test_successful_list_tables_with_defaults(temp_config):
+    """Test successful list tables using default active catalog and schema."""
+    with patch("chuck_data.config._config_manager", temp_config):
+        from chuck_data.config import set_active_catalog, set_active_schema
+
+        client_stub = DatabricksClientStub()
+        # Set up active catalog and schema
+        set_active_catalog("active_catalog")
+        set_active_schema("active_schema")
+
+        # Set up test data
+        client_stub.add_catalog("active_catalog")
+        client_stub.add_schema("active_catalog", "active_schema")
+        client_stub.add_table("active_catalog", "active_schema", "table1")
+
+        # Call function with no catalog or schema parameters
+        result = handle_command(client_stub)
+
+        # Verify results
+        assert result.success
+        assert len(result.data["tables"]) == 1
+        assert result.data["catalog_name"] == "active_catalog"
+        assert result.data["schema_name"] == "active_schema"
+        assert result.data["tables"][0]["name"] == "table1"
+
+
+def test_empty_table_list(temp_config):
+    """Test handling when no tables are found."""
+    with patch("chuck_data.config._config_manager", temp_config):
+        client_stub = DatabricksClientStub()
+        # Set up catalog and schema but no tables
+        client_stub.add_catalog("test_catalog")
+        client_stub.add_schema("test_catalog", "test_schema")
+        # Don't add any tables
+
+        # Call function
+        result = handle_command(
+            client_stub, catalog_name="test_catalog", schema_name="test_schema"
+        )
+
+        # Verify results
+        assert result.success
+        assert "No tables found in schema 'test_catalog.test_schema'" in result.message
+
+
+def test_list_tables_exception(temp_config):
+    """Test list_tables with unexpected exception."""
+    with patch("chuck_data.config._config_manager", temp_config):
+        # Create a stub that raises an exception for list_tables
+        class FailingClientStub(DatabricksClientStub):
+            def list_tables(self, *args, **kwargs):
+                raise Exception("API error")
+
+        failing_client = FailingClientStub()
+
+        # Call function
+        result = handle_command(
+            failing_client, catalog_name="test_catalog", schema_name="test_schema"
+        )
+
+        # Verify results
+        assert not result.success
+        assert "Failed to list tables" in result.message
+        assert str(result.error) == "API error"
+
+
+def test_list_tables_with_display_true(temp_config):
+    """Test list tables with display=true shows table."""
+    with patch("chuck_data.config._config_manager", temp_config):
+        client_stub = DatabricksClientStub()
+        # Set up test data
+        client_stub.add_catalog("test_catalog")
+        client_stub.add_schema("test_catalog", "test_schema")
+        client_stub.add_table("test_catalog", "test_schema", "test_table")
+
+        result = handle_command(
+            client_stub,
+            catalog_name="test_catalog",
+            schema_name="test_schema",
+            display=True,
+        )
+
+        assert result.success
+        assert result.data.get("display")
+        assert len(result.data.get("tables", [])) == 1
+
+
+def test_list_tables_with_display_false(temp_config):
+    """Test list tables with display=false returns data without display."""
+    with patch("chuck_data.config._config_manager", temp_config):
+        client_stub = DatabricksClientStub()
+        # Set up test data
+        client_stub.add_catalog("test_catalog")
+        client_stub.add_schema("test_catalog", "test_schema")
+        client_stub.add_table("test_catalog", "test_schema", "test_table")
+
+        result = handle_command(
+            client_stub,
+            catalog_name="test_catalog",
+            schema_name="test_schema",
+            display=False,
+        )
+
+        assert result.success
+        assert not result.data.get("display")
+        assert len(result.data.get("tables", [])) == 1
diff --git a/tests/unit/commands/test_list_warehouses.py b/tests/unit/commands/test_list_warehouses.py
new file mode 100644
index 0000000..b516b17
--- /dev/null
+++ b/tests/unit/commands/test_list_warehouses.py
@@ -0,0 +1,323 @@
+"""
+Tests for list_warehouses command handler.
+
+This module contains tests for the list_warehouses command handler.
+"""
+
+from chuck_data.commands.list_warehouses import handle_command
+
+
+def test_no_client():
+    """Test handling when no client is provided."""
+    result = handle_command(None)
+    assert not result.success
+    assert "No Databricks client available" in result.message
+
+
+def test_successful_list_warehouses(databricks_client_stub):
+    """Test successful warehouse listing with various warehouse types."""
+    # Add test warehouses with different configurations
+    databricks_client_stub.add_warehouse(
+        warehouse_id="warehouse-123",
+        name="Test Serverless Warehouse",
+        size="XLARGE",
+        state="STOPPED",
+        enable_serverless_compute=True,
+        warehouse_type="PRO",
+        creator_name="test.user@example.com",
+        auto_stop_mins=10,
+    )
+    databricks_client_stub.add_warehouse(
+        warehouse_id="warehouse-456",
+        name="Test Regular Warehouse",
+        size="SMALL",
+        state="RUNNING",
+        enable_serverless_compute=False,
+        warehouse_type="CLASSIC",
+        creator_name="another.user@example.com",
+        auto_stop_mins=60,
+    )
+    databricks_client_stub.add_warehouse(
+        warehouse_id="warehouse-789",
+        name="Test XXSMALL Warehouse",
+        size="XXSMALL",
+        state="STARTING",
+        enable_serverless_compute=True,
+        warehouse_type="PRO",
+        creator_name="third.user@example.com",
+        auto_stop_mins=15,
+    )
+
+    # Call the function
+    result = handle_command(databricks_client_stub)
+
+    # Verify results
+    assert result.success
+    assert len(result.data["warehouses"]) == 3
+    assert result.data["total_count"] == 3
+    assert "Found 3 SQL warehouse(s)" in result.message
+
+    # Verify warehouse data structure and content
+    warehouses = result.data["warehouses"]
+    warehouse_names = [w["name"] for w in warehouses]
+    assert "Test Serverless Warehouse" in warehouse_names
+    assert "Test Regular Warehouse" in warehouse_names
+    assert "Test XXSMALL Warehouse" in warehouse_names
+
+    # Verify specific warehouse details
+    serverless_warehouse = next(
+        w for w in warehouses if w["name"] == "Test Serverless Warehouse"
+    )
+    assert serverless_warehouse["id"] == "warehouse-123"
+    assert serverless_warehouse["size"] == "XLARGE"
+    assert serverless_warehouse["state"] == "STOPPED"
+    assert serverless_warehouse["enable_serverless_compute"]
+    assert serverless_warehouse["warehouse_type"] == "PRO"
+    assert serverless_warehouse["creator_name"] == "test.user@example.com"
+    assert serverless_warehouse["auto_stop_mins"] == 10
+
+    regular_warehouse = next(
+        w for w in warehouses if w["name"] == "Test Regular Warehouse"
+    )
+    assert regular_warehouse["id"] == "warehouse-456"
+    assert regular_warehouse["size"] == "SMALL"
+    assert regular_warehouse["state"] == "RUNNING"
+    assert not regular_warehouse["enable_serverless_compute"]
+    assert regular_warehouse["warehouse_type"] == "CLASSIC"
+    assert regular_warehouse["creator_name"] == "another.user@example.com"
+    assert regular_warehouse["auto_stop_mins"] == 60
+
+
+def test_empty_warehouse_list(databricks_client_stub):
+    """Test handling when no warehouses are found."""
+    # Don't add any warehouses to the stub
+
+    # Call the function
+    result = handle_command(databricks_client_stub)
+
+    # Verify results
+    assert result.success
+    assert "No SQL warehouses found" in result.message
+
+
+def test_list_warehouses_exception(databricks_client_stub):
+    """Test list_warehouses with unexpected exception."""
+
+    # Configure stub to raise an exception for list_warehouses
+    def list_warehouses_failing(**kwargs):
+        raise Exception("API connection error")
+
+    databricks_client_stub.list_warehouses = list_warehouses_failing
+
+    # Call the function
+    result = handle_command(databricks_client_stub)
+
+    # Verify results
+    assert not result.success
+    assert "Failed to fetch warehouses" in result.message
+    assert str(result.error) == "API connection error"
+
+
+def test_warehouse_data_integrity(databricks_client_stub):
+    """Test that all required warehouse fields are preserved."""
+    # Add a warehouse with all possible fields
+    databricks_client_stub.add_warehouse(
+        warehouse_id="warehouse-complete",
+        name="Complete Test Warehouse",
+        size="MEDIUM",
+        state="STOPPED",
+        enable_serverless_compute=True,
+        creator_name="complete.user@example.com",
+        auto_stop_mins=30,
+        # Additional fields that might be present
+        cluster_size="Medium",
+        min_num_clusters=1,
+        max_num_clusters=5,
+        warehouse_type="PRO",
+        enable_photon=True,
+    )
+
+    # Call the function
+    result = handle_command(databricks_client_stub)
+
+    # Verify results
+    assert result.success
+    warehouses = result.data["warehouses"]
+    assert len(warehouses) == 1
+
+    warehouse = warehouses[0]
+    # Verify all required fields are present
+    required_fields = [
+        "id",
+        "name",
+        "size",
+        "state",
+        "creator_name",
+        "auto_stop_mins",
+        "enable_serverless_compute",
+    ]
+    for field in required_fields:
+        assert (
+            field in warehouse
+        ), f"Required field '{field}' missing from warehouse data"
+
+    # Verify field values
+    assert warehouse["id"] == "warehouse-complete"
+    assert warehouse["name"] == "Complete Test Warehouse"
+    assert warehouse["size"] == "MEDIUM"
+    assert warehouse["state"] == "STOPPED"
+    assert warehouse["enable_serverless_compute"]
+    assert warehouse["creator_name"] == "complete.user@example.com"
+    assert warehouse["auto_stop_mins"] == 30
+
+
+def test_various_warehouse_sizes(databricks_client_stub):
+    """Test that different warehouse sizes are handled correctly."""
+    sizes = [
+        "XXSMALL",
+        "XSMALL",
+        "SMALL",
+        "MEDIUM",
+        "LARGE",
+        "XLARGE",
+        "2XLARGE",
+        "3XLARGE",
+        "4XLARGE",
+    ]
+
+    # Add warehouses with different sizes
+    for i, size in enumerate(sizes):
+        databricks_client_stub.add_warehouse(
+            warehouse_id=f"warehouse-{i}",
+            name=f"Test {size} Warehouse",
+            size=size,
+            state="STOPPED",
+            enable_serverless_compute=True,
+        )
+
+    # Call the function
+    result = handle_command(databricks_client_stub)
+
+    # Verify results
+    assert result.success
+    assert len(result.data["warehouses"]) == len(sizes)
+
+    # Verify all sizes are preserved correctly
+    warehouses = result.data["warehouses"]
+    returned_sizes = [w["size"] for w in warehouses]
+    for size in sizes:
+        assert size in returned_sizes, f"Size {size} not found in returned warehouses"
+
+
+def test_various_warehouse_states(databricks_client_stub):
+    """Test that different warehouse states are handled correctly."""
+    states = ["RUNNING", "STOPPED", "STARTING", "STOPPING", "DELETING", "DELETED"]
+
+    # Add warehouses with different states
+    for i, state in enumerate(states):
+        databricks_client_stub.add_warehouse(
+            warehouse_id=f"warehouse-{i}",
+            name=f"Test {state} Warehouse",
+            size="SMALL",
+            state=state,
+            enable_serverless_compute=False,
+        )
+
+    # Call the function
+    result = handle_command(databricks_client_stub)
+
+    # Verify results
+    assert result.success
+    assert len(result.data["warehouses"]) == len(states)
+
+    # Verify all states are preserved correctly
+    warehouses = result.data["warehouses"]
+    returned_states = [w["state"] for w in warehouses]
+    for state in states:
+        assert (
+            state in returned_states
+        ), f"State {state} not found in returned warehouses"
+
+
+def test_serverless_compute_boolean_handling(databricks_client_stub):
+    """Test that serverless compute boolean values are handled correctly."""
+    # Add warehouses with different serverless settings
+    databricks_client_stub.add_warehouse(
+        warehouse_id="warehouse-serverless-true",
+        name="Serverless True Warehouse",
+        size="SMALL",
+        state="STOPPED",
+        enable_serverless_compute=True,
+    )
+    databricks_client_stub.add_warehouse(
+        warehouse_id="warehouse-serverless-false",
+        name="Serverless False Warehouse",
+        size="SMALL",
+        state="STOPPED",
+        enable_serverless_compute=False,
+    )
+
+    # Call the function
+    result = handle_command(databricks_client_stub)
+
+    # Verify results
+    assert result.success
+    warehouses = result.data["warehouses"]
+    assert len(warehouses) == 2
+
+    # Find warehouses by name and verify serverless settings
+    serverless_true = next(
+        w for w in warehouses if w["name"] == "Serverless True Warehouse"
+    )
+    serverless_false = next(
+        w for w in warehouses if w["name"] == "Serverless False Warehouse"
+    )
+
+    assert serverless_true["enable_serverless_compute"]
+    assert not serverless_false["enable_serverless_compute"]
+
+    # Ensure they're proper boolean values, not strings
+    assert isinstance(serverless_true["enable_serverless_compute"], bool)
+    assert isinstance(serverless_false["enable_serverless_compute"], bool)
+
+
+def test_display_parameter_false(databricks_client_stub):
+    """Test that display=False parameter works correctly."""
+    # Add test warehouse
+    databricks_client_stub.add_warehouse(
+        warehouse_id="warehouse-test",
+        name="Test Warehouse",
+        size="SMALL",
+        state="RUNNING",
+    )
+
+    # Call function with display=False
+    result = handle_command(databricks_client_stub, display=False)
+
+    # Verify results
+    assert result.success
+    assert len(result.data["warehouses"]) == 1
+    # Should still include current_warehouse_id for highlighting
+    assert "current_warehouse_id" in result.data
+
+
+def test_display_parameter_false_default(databricks_client_stub):
+    """Test that display parameter defaults to False."""
+    # Add test warehouse
+    databricks_client_stub.add_warehouse(
+        warehouse_id="warehouse-test",
+        name="Test Warehouse",
+        size="SMALL",
+        state="RUNNING",
+    )
+
+    # Call function without display parameter
+    result = handle_command(databricks_client_stub)
+
+    # Verify results
+    assert result.success
+    assert len(result.data["warehouses"]) == 1
+    # Should include current_warehouse_id for highlighting
+    assert "current_warehouse_id" in result.data
+    # Should default to display=False
+    assert not result.data["display"]
diff --git a/tests/unit/commands/test_model_selection.py b/tests/unit/commands/test_model_selection.py
new file mode 100644
index 0000000..90a453a
--- /dev/null
+++ b/tests/unit/commands/test_model_selection.py
@@ -0,0 +1,68 @@
+"""
+Tests for model_selection command handler.
+
+This module contains tests for the model_selection command handler.
+"""
+
+from unittest.mock import patch
+
+from chuck_data.commands.model_selection import handle_command
+from chuck_data.config import get_active_model
+
+
+def test_missing_model_name(databricks_client_stub, temp_config):
+    """Test handling when model_name is not provided."""
+    with patch("chuck_data.config._config_manager", temp_config):
+        result = handle_command(databricks_client_stub)
+        assert not result.success
+        assert "model_name parameter is required" in result.message
+
+
+def test_successful_model_selection(databricks_client_stub, temp_config):
+    """Test successful model selection."""
+    with patch("chuck_data.config._config_manager", temp_config):
+        # Set up test data using stub
+        databricks_client_stub.add_model("claude-v1", created_timestamp=123456789)
+        databricks_client_stub.add_model("gpt-4", created_timestamp=987654321)
+
+        # Call function
+        result = handle_command(databricks_client_stub, model_name="claude-v1")
+
+        # Verify results
+        assert result.success
+        assert "Active model is now set to 'claude-v1'" in result.message
+
+        # Verify config was updated
+        assert get_active_model() == "claude-v1"
+
+
+def test_model_not_found(databricks_client_stub, temp_config):
+    """Test model selection when model is not found."""
+    with patch("chuck_data.config._config_manager", temp_config):
+        # Set up test data using stub - but don't include the requested model
+        databricks_client_stub.add_model("claude-v1", created_timestamp=123456789)
+        databricks_client_stub.add_model("gpt-4", created_timestamp=987654321)
+
+        # Call function with nonexistent model
+        result = handle_command(databricks_client_stub, model_name="nonexistent-model")
+
+        # Verify results
+        assert not result.success
+        assert "Model 'nonexistent-model' not found" in result.message
+
+        # Verify config was not updated
+        assert get_active_model() is None
+
+
+def test_model_selection_api_exception(databricks_client_stub, temp_config):
+    """Test model selection when API call throws an exception."""
+    with patch("chuck_data.config._config_manager", temp_config):
+        # Configure stub to raise an exception for list_models
+        databricks_client_stub.set_list_models_error(Exception("API error"))
+
+        # Call function
+        result = handle_command(databricks_client_stub, model_name="claude-v1")
+
+        # Verify results
+        assert not result.success
+        assert str(result.error) == "API error"
diff --git a/tests/unit/commands/test_models.py b/tests/unit/commands/test_models.py
new file mode 100644
index 0000000..0a87a59
--- /dev/null
+++ b/tests/unit/commands/test_models.py
@@ -0,0 +1,135 @@
+"""
+Tests for the model-related command modules.
+"""
+
+import pytest
+from unittest.mock import patch
+
+from chuck_data.config import set_active_model, get_active_model
+from chuck_data.commands.models import handle_command as handle_models
+from chuck_data.commands.list_models import handle_command as handle_list_models
+from chuck_data.commands.model_selection import handle_command as handle_model_selection
+
+
+class StubClient:
+    """Simple client stub for model commands."""
+
+    def __init__(self, models=None, active_model=None):
+        self.models = models or []
+        self.active_model = active_model
+
+    def list_models(self):
+        return self.models
+
+    def get_active_model(self):
+        return self.active_model
+
+
+@pytest.fixture
+def stub_client():
+    """Create a basic stub client."""
+    return StubClient()
+
+
+def test_handle_models_with_models(temp_config):
+    """Test handling models command with available models."""
+    with patch("chuck_data.config._config_manager", temp_config):
+        client = StubClient(
+            models=[
+                {"name": "model1", "status": "READY"},
+                {"name": "model2", "status": "READY"},
+            ]
+        )
+
+        result = handle_models(client)
+
+        assert result.success
+        assert result.data == client.list_models()
+
+
+def test_handle_models_empty(temp_config):
+    """Test handling models command with no available models."""
+    with patch("chuck_data.config._config_manager", temp_config):
+        client = StubClient(models=[])
+
+        result = handle_models(client)
+
+        assert result.success
+        assert result.data == []
+        assert "No models found" in result.message
+
+
+def test_handle_list_models_basic(temp_config):
+    """Test list models command (basic)."""
+    with patch("chuck_data.config._config_manager", temp_config):
+        client = StubClient(
+            models=[
+                {"name": "model1", "status": "READY"},
+                {"name": "model2", "status": "READY"},
+            ],
+            active_model="model1",
+        )
+        set_active_model(client.active_model)
+
+        result = handle_list_models(client)
+
+        assert result.success
+        assert result.data["models"] == client.list_models()
+        assert result.data["active_model"] == client.active_model
+        assert not result.data["detailed"]
+        assert result.data["filter"] is None
+
+
+def test_handle_list_models_filter(temp_config):
+    """Test list models command with filter."""
+    with patch("chuck_data.config._config_manager", temp_config):
+        client = StubClient(
+            models=[
+                {"name": "model1", "status": "READY"},
+                {"name": "model2", "status": "READY"},
+            ],
+            active_model="model1",
+        )
+        set_active_model(client.active_model)
+
+        result = handle_list_models(client, filter="model1")
+
+        assert result.success
+        assert len(result.data["models"]) == 1
+        assert result.data["models"][0]["name"] == "model1"
+        assert result.data["filter"] == "model1"
+
+
+def test_handle_model_selection_success(temp_config):
+    """Test successful model selection."""
+    with patch("chuck_data.config._config_manager", temp_config):
+        client = StubClient(models=[{"name": "model1"}, {"name": "valid-model"}])
+
+        result = handle_model_selection(client, model_name="valid-model")
+
+        assert result.success
+        assert get_active_model() == "valid-model"
+        assert "Active model is now set to 'valid-model'" in result.message
+
+
+def test_handle_model_selection_invalid(temp_config):
+    """Test selecting an invalid model."""
+    with patch("chuck_data.config._config_manager", temp_config):
+        client = StubClient(models=[{"name": "model1"}, {"name": "model2"}])
+
+        result = handle_model_selection(client, model_name="nonexistent-model")
+
+        assert not result.success
+        assert "not found" in result.message
+
+
+def test_handle_model_selection_no_name(temp_config):
+    """Test model selection with no model name provided."""
+    with patch("chuck_data.config._config_manager", temp_config):
+        client = StubClient(models=[])  # models unused
+
+        result = handle_model_selection(client)
+
+        # Verify the result
+        assert not result.success
+        assert "model_name parameter is required" in result.message
diff --git a/tests/unit/commands/test_pii_tools.py b/tests/unit/commands/test_pii_tools.py
new file mode 100644
index 0000000..f6c7e1c
--- /dev/null
+++ b/tests/unit/commands/test_pii_tools.py
@@ -0,0 +1,118 @@
+"""
+Tests for the PII tools helper module.
+"""
+
+from unittest.mock import patch, MagicMock
+import pytest
+
+from chuck_data.commands.pii_tools import (
+    _helper_tag_pii_columns_logic,
+    _helper_scan_schema_for_pii_logic,
+)
+
+
+@pytest.fixture
+def mock_columns():
+    """Mock columns from database."""
+    return [
+        {"name": "first_name", "type_name": "string"},
+        {"name": "email", "type_name": "string"},
+        {"name": "signup_date", "type_name": "date"},
+    ]
+
+
+@pytest.fixture
+def configured_llm_client(llm_client_stub):
+    """LLM client configured for PII detection response."""
+    pii_response_content = '[{"name":"first_name","semantic":"given-name"},{"name":"email","semantic":"email"},{"name":"signup_date","semantic":null}]'
+    llm_client_stub.set_response_content(pii_response_content)
+    return llm_client_stub
+
+
+@patch("chuck_data.commands.pii_tools.json.loads")
+def test_tag_pii_columns_logic_success(
+    mock_json_loads,
+    databricks_client_stub,
+    configured_llm_client,
+    mock_columns,
+    temp_config,
+):
+    """Test successful tagging of PII columns."""
+    with patch("chuck_data.config._config_manager", temp_config):
+        # Set up test data using stub
+        databricks_client_stub.add_catalog("mycat")
+        databricks_client_stub.add_schema("mycat", "myschema")
+        databricks_client_stub.add_table(
+            "mycat", "myschema", "users", columns=mock_columns
+        )
+
+        # Mock the JSON parsing instead of relying on actual JSON parsing
+        mock_json_loads.return_value = [
+            {"name": "first_name", "semantic": "given-name"},
+            {"name": "email", "semantic": "email"},
+            {"name": "signup_date", "semantic": None},
+        ]
+
+        # Call the function
+        result = _helper_tag_pii_columns_logic(
+            databricks_client_stub,
+            configured_llm_client,
+            "users",
+            catalog_name_context="mycat",
+            schema_name_context="myschema",
+        )
+
+        # Verify the result
+        assert result["full_name"] == "mycat.myschema.users"
+        assert result["table_name"] == "users"
+        assert result["column_count"] == 3
+        assert result["pii_column_count"] == 2
+        assert result["has_pii"]
+        assert not result["skipped"]
+        assert result["columns"][0]["semantic"] == "given-name"
+        assert result["columns"][1]["semantic"] == "email"
+        assert result["columns"][2]["semantic"] is None
+
+
+@patch("concurrent.futures.ThreadPoolExecutor")
+def test_scan_schema_for_pii_logic(
+    mock_executor, databricks_client_stub, configured_llm_client, temp_config
+):
+    """Test scanning a schema for PII."""
+    with patch("chuck_data.config._config_manager", temp_config):
+        # Set up test data using stub
+        databricks_client_stub.add_catalog("test_cat")
+        databricks_client_stub.add_schema("test_cat", "test_schema")
+        databricks_client_stub.add_table("test_cat", "test_schema", "users")
+        databricks_client_stub.add_table("test_cat", "test_schema", "orders")
+        databricks_client_stub.add_table("test_cat", "test_schema", "_stitch_temp")
+
+        # Mock the ThreadPoolExecutor
+        mock_future = MagicMock()
+        mock_future.result.return_value = {
+            "table_name": "users",
+            "full_name": "test_cat.test_schema.users",
+            "pii_column_count": 2,
+            "has_pii": True,
+            "skipped": False,
+        }
+
+        mock_context = MagicMock()
+        mock_context.__enter__.return_value = mock_context
+        mock_context.submit.return_value = mock_future
+        mock_executor.return_value = mock_context
+
+        # Mock concurrent.futures.as_completed to return mock_future
+        with patch("concurrent.futures.as_completed", return_value=[mock_future]):
+            # Call the function
+            result = _helper_scan_schema_for_pii_logic(
+                databricks_client_stub, configured_llm_client, "test_cat", "test_schema"
+            )
+
+            # Verify the result
+            assert result["catalog"] == "test_cat"
+            assert result["schema"] == "test_schema"
+            assert result["tables_scanned_attempted"] == 2  # Excluding _stitch_temp
+            assert result["tables_successfully_processed"] == 1
+            assert result["tables_with_pii"] == 1
+            assert result["total_pii_columns"] == 2
diff --git a/tests/unit/commands/test_scan_pii.py b/tests/unit/commands/test_scan_pii.py
new file mode 100644
index 0000000..390e6ef
--- /dev/null
+++ b/tests/unit/commands/test_scan_pii.py
@@ -0,0 +1,237 @@
+"""
+Tests for scan_pii command handler.
+
+Following approved testing patterns:
+- Mock external boundaries only (LLM client)
+- Use real config system with temporary files
+- Use real internal business logic (_helper_scan_schema_for_pii_logic)
+- Test end-to-end PII scanning behavior
+"""
+
+import tempfile
+from unittest.mock import patch
+
+from chuck_data.commands.scan_pii import handle_command
+from chuck_data.config import ConfigManager
+
+
+def test_missing_client():
+    """Test handling when client is not provided."""
+    result = handle_command(None)
+    assert not result.success
+    assert "Client is required" in result.message
+
+
+def test_missing_context_real_config(databricks_client_stub):
+    """Test handling when catalog or schema is missing in real config."""
+    with tempfile.NamedTemporaryFile() as tmp:
+        config_manager = ConfigManager(tmp.name)
+        # Don't set active_catalog or active_schema in config
+
+        with patch("chuck_data.config._config_manager", config_manager):
+            # Test real config validation with missing values
+            result = handle_command(databricks_client_stub)
+
+            assert not result.success
+            assert "Catalog and schema must be specified" in result.message
+
+
+def test_successful_scan_with_explicit_params_real_logic(
+    databricks_client_stub_with_data, llm_client_stub
+):
+    """Test successful schema scan with explicit catalog/schema parameters."""
+    # Configure LLM stub for PII detection
+    llm_client_stub.set_response_content(
+        '[{"name":"email","semantic":"email"},{"name":"phone","semantic":"phone"}]'
+    )
+
+    with tempfile.NamedTemporaryFile() as tmp:
+        config_manager = ConfigManager(tmp.name)
+
+        with patch("chuck_data.config._config_manager", config_manager):
+            with patch(
+                "chuck_data.commands.scan_pii.LLMClient", return_value=llm_client_stub
+            ):
+                # Test real PII scanning logic with explicit parameters
+                result = handle_command(
+                    databricks_client_stub_with_data,
+                    catalog_name="test_catalog",
+                    schema_name="test_schema",
+                )
+
+    # Verify real PII scanning execution
+    assert result.success
+    assert "Scanned" in result.message
+    assert "tables" in result.message
+    assert result.data is not None
+    # Real logic should return scan summary data
+    assert (
+        "tables_successfully_processed" in result.data
+        or "tables_scanned_attempted" in result.data
+    )
+
+
+def test_scan_with_active_context_real_logic(
+    databricks_client_stub_with_data, llm_client_stub
+):
+    """Test schema scan using real active catalog and schema from config."""
+    # Configure LLM stub
+    llm_client_stub.set_response_content(
+        '[{"name":"user_id","semantic":"customer-id"}]'
+    )
+
+    with tempfile.NamedTemporaryFile() as tmp:
+        config_manager = ConfigManager(tmp.name)
+
+        # Set up real config with active catalog/schema
+        config_manager.update(
+            active_catalog="active_catalog", active_schema="active_schema"
+        )
+
+        with patch("chuck_data.config._config_manager", config_manager):
+            with patch(
+                "chuck_data.commands.scan_pii.LLMClient", return_value=llm_client_stub
+            ):
+                # Test real config integration - should use active values
+                result = handle_command(databricks_client_stub_with_data)
+
+    # Should succeed using real active catalog/schema from config
+    assert result.success
+    assert result.data is not None
+
+
+def test_scan_with_llm_error_real_logic(
+    databricks_client_stub_with_data, llm_client_stub
+):
+    """Test handling when LLM client encounters error with real business logic."""
+    # Configure LLM stub to simulate error
+    llm_client_stub.set_exception(True)
+
+    with tempfile.NamedTemporaryFile() as tmp:
+        config_manager = ConfigManager(tmp.name)
+
+        with patch("chuck_data.config._config_manager", config_manager):
+            with patch(
+                "chuck_data.commands.scan_pii.LLMClient", return_value=llm_client_stub
+            ):
+                # Test real error handling with LLM failure
+                result = handle_command(
+                    databricks_client_stub_with_data,
+                    catalog_name="test_catalog",
+                    schema_name="test_schema",
+                )
+
+    # Real error handling should handle LLM errors gracefully
+    assert isinstance(result.success, bool)
+    assert result.error is not None or result.message is not None
+
+
+def test_scan_with_databricks_client_stub_integration(
+    databricks_client_stub_with_data, llm_client_stub
+):
+    """Test PII scanning with Databricks client stub integration."""
+    # Configure LLM stub for realistic PII response
+    llm_client_stub.set_response_content(
+        '[{"name":"first_name","semantic":"given-name"},{"name":"last_name","semantic":"family-name"}]'
+    )
+
+    # Set up Databricks stub with test data
+    databricks_client_stub_with_data.add_catalog("test_catalog")
+    databricks_client_stub_with_data.add_schema("test_catalog", "test_schema")
+    databricks_client_stub_with_data.add_table("test_catalog", "test_schema", "users")
+    databricks_client_stub_with_data.add_table("test_catalog", "test_schema", "orders")
+
+    with tempfile.NamedTemporaryFile() as tmp:
+        config_manager = ConfigManager(tmp.name)
+
+        with patch("chuck_data.config._config_manager", config_manager):
+            with patch(
+                "chuck_data.commands.scan_pii.LLMClient", return_value=llm_client_stub
+            ):
+                # Test real PII scanning with stubbed external boundaries
+                result = handle_command(
+                    databricks_client_stub_with_data,
+                    catalog_name="test_catalog",
+                    schema_name="test_schema",
+                )
+
+    # Should work with real business logic + external stubs
+    assert result.success
+    assert result.data is not None
+    assert "test_catalog.test_schema" in result.message
+
+
+def test_scan_parameter_priority_real_logic(
+    databricks_client_stub_with_data, llm_client_stub
+):
+    """Test that explicit parameters take priority over active config."""
+    llm_client_stub.set_response_content("[]")  # No PII found
+
+    with tempfile.NamedTemporaryFile() as tmp:
+        config_manager = ConfigManager(tmp.name)
+
+        # Set up active config values
+        config_manager.update(
+            active_catalog="config_catalog", active_schema="config_schema"
+        )
+
+        with patch("chuck_data.config._config_manager", config_manager):
+            with patch(
+                "chuck_data.commands.scan_pii.LLMClient", return_value=llm_client_stub
+            ):
+                # Test real parameter priority logic: explicit should override config
+                result = handle_command(
+                    databricks_client_stub_with_data,
+                    catalog_name="explicit_catalog",
+                    schema_name="explicit_schema",
+                )
+
+    # Should use explicit parameters, not config values (real priority logic)
+    assert result.success
+    assert "explicit_catalog.explicit_schema" in result.message
+
+
+def test_scan_with_partial_config_real_logic(
+    databricks_client_stub_with_data, llm_client_stub
+):
+    """Test scan with partially configured active context."""
+    llm_client_stub.set_response_content("[]")
+
+    with tempfile.NamedTemporaryFile() as tmp:
+        config_manager = ConfigManager(tmp.name)
+
+        # Set only catalog, not schema - should fail validation
+        config_manager.update(active_catalog="test_catalog")
+        # active_schema is None/missing
+
+        with patch("chuck_data.config._config_manager", config_manager):
+            with patch(
+                "chuck_data.commands.scan_pii.LLMClient", return_value=llm_client_stub
+            ):
+                # Test real validation logic with partial config
+                result = handle_command(databricks_client_stub_with_data)
+
+    # Should fail with real validation logic
+    assert not result.success
+    assert "Catalog and schema must be specified" in result.message
+
+
+def test_scan_real_config_integration():
+    """Test scan command integration with real config system."""
+    with tempfile.NamedTemporaryFile() as tmp:
+        config_manager = ConfigManager(tmp.name)
+
+        # Test config updates and retrieval
+        config_manager.update(active_catalog="first_catalog")
+        config_manager.update(active_schema="first_schema")
+        config_manager.update(active_catalog="updated_catalog")  # Update catalog
+
+        with patch("chuck_data.config._config_manager", config_manager):
+            # Test real config state - should have updated catalog, original schema
+            result = handle_command(
+                None
+            )  # No client - should fail but with real config access
+
+    # Should fail due to missing client, but real config should be accessible
+    assert not result.success
+    assert "Client is required" in result.message
diff --git a/tests/unit/commands/test_schema_selection.py b/tests/unit/commands/test_schema_selection.py
new file mode 100644
index 0000000..48a6fc5
--- /dev/null
+++ b/tests/unit/commands/test_schema_selection.py
@@ -0,0 +1,105 @@
+"""
+Tests for schema_selection command handler.
+
+This module contains tests for the schema selection command handler.
+"""
+
+from unittest.mock import patch
+
+from chuck_data.commands.schema_selection import handle_command
+from chuck_data.config import get_active_schema, set_active_catalog
+
+
+def test_missing_schema_name(databricks_client_stub, temp_config):
+    """Test handling when schema parameter is not provided."""
+    with patch("chuck_data.config._config_manager", temp_config):
+        result = handle_command(databricks_client_stub)
+        assert not result.success
+        assert "schema parameter is required" in result.message
+
+
+def test_no_active_catalog(databricks_client_stub, temp_config):
+    """Test handling when no active catalog is selected."""
+    with patch("chuck_data.config._config_manager", temp_config):
+        # Don't set any active catalog in config
+
+        # Call function
+        result = handle_command(databricks_client_stub, schema="test_schema")
+
+        # Verify results
+        assert not result.success
+        assert "No active catalog selected" in result.message
+
+
+def test_successful_schema_selection(databricks_client_stub, temp_config):
+    """Test successful schema selection."""
+    with patch("chuck_data.config._config_manager", temp_config):
+        # Set up active catalog and test data
+        set_active_catalog("test_catalog")
+        databricks_client_stub.add_catalog("test_catalog")
+        databricks_client_stub.add_schema("test_catalog", "test_schema")
+
+        # Call function
+        result = handle_command(databricks_client_stub, schema="test_schema")
+
+        # Verify results
+        assert result.success
+        assert "Active schema is now set to 'test_schema'" in result.message
+        assert "in catalog 'test_catalog'" in result.message
+        assert result.data["schema_name"] == "test_schema"
+        assert result.data["catalog_name"] == "test_catalog"
+
+        # Verify config was updated
+        assert get_active_schema() == "test_schema"
+
+
+def test_schema_selection_with_verification_failure(
+    databricks_client_stub, temp_config
+):
+    """Test schema selection when no matching schema exists."""
+    with patch("chuck_data.config._config_manager", temp_config):
+        # Set up active catalog but don't add the schema to stub
+        set_active_catalog("test_catalog")
+        databricks_client_stub.add_catalog("test_catalog")
+        databricks_client_stub.add_schema(
+            "test_catalog", "completely_different_schema_name"
+        )
+
+        # Call function with non-existent schema that won't match via fuzzy matching
+        result = handle_command(databricks_client_stub, schema="xyz_nonexistent_abc")
+
+        # Verify results - should fail cleanly
+        assert not result.success
+        assert "No schema found matching 'xyz_nonexistent_abc'" in result.message
+        assert "Available schemas:" in result.message
+
+
+def test_schema_selection_exception(temp_config):
+    """Test schema selection with list_schemas exception."""
+    from tests.fixtures.databricks.client import DatabricksClientStub
+
+    with patch("chuck_data.config._config_manager", temp_config):
+        # Set up active catalog
+        set_active_catalog("test_catalog")
+
+        # Create a stub that raises an exception during list_schemas
+        class FailingStub(DatabricksClientStub):
+            def list_schemas(
+                self,
+                catalog_name,
+                include_browse=False,
+                max_results=None,
+                page_token=None,
+                **kwargs,
+            ):
+                raise Exception("Failed to list schemas")
+
+        failing_stub = FailingStub()
+        failing_stub.add_catalog("test_catalog")
+
+        # Call function
+        result = handle_command(failing_stub, schema="test_schema")
+
+        # Should fail due to the exception
+        assert not result.success
+        assert "Failed to list schemas" in result.message
diff --git a/tests/unit/commands/test_setup_stitch.py b/tests/unit/commands/test_setup_stitch.py
new file mode 100644
index 0000000..60fb54f
--- /dev/null
+++ b/tests/unit/commands/test_setup_stitch.py
@@ -0,0 +1,235 @@
+"""
+Tests for setup_stitch command handler.
+
+This module contains tests for the setup_stitch command handler.
+"""
+
+import tempfile
+from unittest.mock import patch, MagicMock
+
+from chuck_data.commands.setup_stitch import handle_command
+from chuck_data.config import ConfigManager
+
+
+def test_missing_client():
+    """Test handling when client is not provided."""
+    result = handle_command(None)
+    assert not result.success
+    assert "Client is required" in result.message
+
+
+def test_missing_context(databricks_client_stub):
+    """Test handling when catalog or schema is missing."""
+    # Use real config system with no active catalog/schema
+    with tempfile.NamedTemporaryFile() as tmp:
+        config_manager = ConfigManager(tmp.name)
+        # Don't set active catalog or schema
+
+        with patch("chuck_data.config._config_manager", config_manager):
+            result = handle_command(databricks_client_stub)
+
+    # Verify results
+    assert not result.success
+    assert "Target catalog and schema must be specified" in result.message
+
+
+@patch("chuck_data.commands.setup_stitch._helper_launch_stitch_job")
+@patch("chuck_data.commands.setup_stitch._helper_setup_stitch_logic")
+@patch("chuck_data.commands.setup_stitch.get_metrics_collector")
+def test_successful_setup(
+    mock_get_metrics_collector,
+    mock_helper_setup,
+    mock_launch_job,
+    databricks_client_stub,
+    llm_client_stub,
+):
+    """Test successful Stitch setup."""
+    # Setup metrics collector mock (external boundary)
+    mock_metrics_collector = MagicMock()
+    mock_get_metrics_collector.return_value = mock_metrics_collector
+
+    # Use LLMClient fixture directly via patching LLMClient constructor
+    with patch(
+        "chuck_data.commands.setup_stitch.LLMClient", return_value=llm_client_stub
+    ):
+        mock_helper_setup.return_value = {
+            "stitch_config": {},
+            "metadata": {
+                "target_catalog": "test_catalog",
+                "target_schema": "test_schema",
+            },
+        }
+        mock_launch_job.return_value = {
+            "message": "Stitch setup completed successfully.",
+            "tables_processed": 5,
+            "pii_columns_tagged": 8,
+            "config_created": True,
+            "config_path": "/Volumes/test_catalog/test_schema/_stitch/config.json",
+        }
+
+        # Call function with auto_confirm to use legacy behavior
+        result = handle_command(
+            databricks_client_stub,
+            **{
+                "catalog_name": "test_catalog",
+                "schema_name": "test_schema",
+                "auto_confirm": True,
+            },
+        )
+
+        # Verify results
+        assert result.success
+        assert result.message == "Stitch setup completed successfully."
+        assert result.data["tables_processed"] == 5
+        assert result.data["pii_columns_tagged"] == 8
+        assert result.data["config_created"]
+        mock_helper_setup.assert_called_once_with(
+            databricks_client_stub, llm_client_stub, "test_catalog", "test_schema"
+        )
+        mock_launch_job.assert_called_once_with(
+            databricks_client_stub,
+            {},
+            {"target_catalog": "test_catalog", "target_schema": "test_schema"},
+        )
+
+        # Verify metrics collection
+        mock_metrics_collector.track_event.assert_called_once_with(
+            prompt="setup-stitch command",
+            tools=[
+                {
+                    "name": "setup_stitch",
+                    "arguments": {"catalog": "test_catalog", "schema": "test_schema"},
+                }
+            ],
+            additional_data={
+                "event_context": "direct_stitch_command",
+                "status": "success",
+                "tables_processed": 5,
+                "pii_columns_tagged": 8,
+                "config_created": True,
+                "config_path": "/Volumes/test_catalog/test_schema/_stitch/config.json",
+            },
+        )
+
+
+@patch("chuck_data.commands.setup_stitch._helper_launch_stitch_job")
+@patch("chuck_data.commands.setup_stitch._helper_setup_stitch_logic")
+def test_setup_with_active_context(
+    mock_helper_setup,
+    mock_launch_job,
+    databricks_client_stub,
+    llm_client_stub,
+):
+    """Test Stitch setup using active catalog and schema."""
+    # Use real config system with active catalog and schema
+    with tempfile.NamedTemporaryFile() as tmp:
+        config_manager = ConfigManager(tmp.name)
+        config_manager.update(
+            active_catalog="active_catalog", active_schema="active_schema"
+        )
+
+        with patch("chuck_data.config._config_manager", config_manager):
+            with patch(
+                "chuck_data.commands.setup_stitch.LLMClient",
+                return_value=llm_client_stub,
+            ):
+                mock_helper_setup.return_value = {
+                    "stitch_config": {},
+                    "metadata": {
+                        "target_catalog": "active_catalog",
+                        "target_schema": "active_schema",
+                    },
+                }
+                mock_launch_job.return_value = {
+                    "message": "Stitch setup completed.",
+                    "tables_processed": 3,
+                    "config_created": True,
+                }
+
+                # Call function without catalog/schema args, with auto_confirm
+                result = handle_command(
+                    databricks_client_stub, **{"auto_confirm": True}
+                )
+
+                # Verify results
+                assert result.success
+                mock_helper_setup.assert_called_once_with(
+                    databricks_client_stub,
+                    llm_client_stub,
+                    "active_catalog",
+                    "active_schema",
+                )
+                mock_launch_job.assert_called_once_with(
+                    databricks_client_stub,
+                    {},
+                    {
+                        "target_catalog": "active_catalog",
+                        "target_schema": "active_schema",
+                    },
+                )
+
+
+@patch("chuck_data.commands.setup_stitch._helper_setup_stitch_logic")
+@patch("chuck_data.commands.setup_stitch.get_metrics_collector")
+def test_setup_with_helper_error(
+    mock_get_metrics_collector,
+    mock_helper_setup,
+    databricks_client_stub,
+    llm_client_stub,
+):
+    """Test handling when helper returns an error."""
+    # Setup metrics collector mock (external boundary)
+    mock_metrics_collector = MagicMock()
+    mock_get_metrics_collector.return_value = mock_metrics_collector
+
+    with patch(
+        "chuck_data.commands.setup_stitch.LLMClient", return_value=llm_client_stub
+    ):
+        mock_helper_setup.return_value = {"error": "Failed to scan tables for PII"}
+
+        # Call function with auto_confirm
+        result = handle_command(
+            databricks_client_stub,
+            **{
+                "catalog_name": "test_catalog",
+                "schema_name": "test_schema",
+                "auto_confirm": True,
+            },
+        )
+
+        # Verify results
+        assert not result.success
+        assert result.message == "Failed to scan tables for PII"
+
+        # Verify metrics collection for error
+        mock_metrics_collector.track_event.assert_called_once_with(
+            prompt="setup-stitch command",
+            tools=[
+                {
+                    "name": "setup_stitch",
+                    "arguments": {"catalog": "test_catalog", "schema": "test_schema"},
+                }
+            ],
+            error="Failed to scan tables for PII",
+            additional_data={
+                "event_context": "direct_stitch_command",
+                "status": "error",
+            },
+        )
+
+
+@patch("chuck_data.commands.setup_stitch.LLMClient")
+def test_setup_with_exception(mock_llm_client, databricks_client_stub):
+    """Test handling when an exception occurs."""
+    # Setup mocks
+    mock_llm_client.side_effect = Exception("LLM client error")
+
+    # Call function
+    result = handle_command(
+        databricks_client_stub, catalog_name="test_catalog", schema_name="test_schema"
+    )
+
+    # Verify results
+    assert not result.success
+    assert "Error setting up Stitch" in result.message
+    assert str(result.error) == "LLM client error"
diff --git a/tests/commands/test_setup_wizard.py b/tests/unit/commands/test_setup_wizard.py
similarity index 80%
rename from tests/commands/test_setup_wizard.py
rename to tests/unit/commands/test_setup_wizard.py
index 41ce9e5..94a2631 100644
--- a/tests/commands/test_setup_wizard.py
+++ b/tests/unit/commands/test_setup_wizard.py
@@ -6,9 +6,9 @@
 """
 
 import pytest
-from unittest.mock import patch, MagicMock
+from unittest.mock import patch
 from io import StringIO
-from tests.fixtures import AmperityClientStub
+from tests.fixtures.amperity import AmperityClientStub
 
 from chuck_data.commands.setup_wizard import (
     DEFINITION,
@@ -150,9 +150,17 @@ def test_input_validator_usage_consent(self):
             assert not result.is_valid, f"Input '{invalid_input}' should be invalid"
             assert "Please enter 'yes' or 'no'" in result.message
 
-    def test_input_validator_edge_cases(self):
+    def test_input_validator_edge_cases(self, databricks_client_stub):
         """Test input validator edge cases."""
-        validator = InputValidator()
+        # Create client factory that returns our stub configured for failure
+        databricks_client_stub.set_token_validation_result(
+            Exception("Connection failed")
+        )
+
+        def client_factory(workspace_url, token):
+            return databricks_client_stub
+
+        validator = InputValidator(databricks_client_factory=client_factory)
 
         # Test whitespace handling in usage consent
         result = validator.validate_usage_consent("  yes  ")
@@ -165,14 +173,10 @@ def test_input_validator_edge_cases(self):
         assert result.is_valid
         assert result.processed_value == "Test-Model"
 
-        # Test token validation with invalid workspace
-        with patch("chuck_data.clients.databricks.DatabricksAPIClient") as mock_client:
-            mock_client.side_effect = Exception("Connection failed")
-            result = validator.validate_token(
-                "some-token", "https://invalid-workspace.com"
-            )
-            assert not result.is_valid
-            assert "Error validating token" in result.message
+        # Test token validation with invalid workspace - uses injected stub
+        result = validator.validate_token("some-token", "https://invalid-workspace.com")
+        assert not result.is_valid
+        assert "Error validating token" in result.message
 
 
 class TestStepHandlers:
@@ -528,46 +532,60 @@ class TestErrorFlowIntegration:
     """Test complete error flows end-to-end."""
 
     @patch("chuck_data.commands.wizard.steps.get_amperity_token")
-    @patch("chuck_data.commands.wizard.steps.AmperityAPIClient")
-    @patch("chuck_data.clients.databricks.DatabricksAPIClient")
     def test_complete_error_recovery_flow(
-        self, mock_databricks_client, mock_amperity_client, mock_get_token
+        self, mock_get_token, databricks_client_stub, amperity_client_stub
     ):
         """Test a complete error recovery flow."""
-        # Setup stub for external dependencies only
+        # Setup external dependencies with stubs
         mock_get_token.return_value = None
-        amperity_stub = AmperityClientStub()
-        mock_amperity_client.return_value = amperity_stub
 
-        # Mock token validation failure
-        mock_db_client = MagicMock()
-        mock_db_client.validate_token.return_value = False
-        mock_databricks_client.return_value = mock_db_client
+        # Configure databricks stub for token validation failure
+        databricks_client_stub.set_token_validation_result(False)
 
-        orchestrator = SetupWizardOrchestrator()
+        # Setup client factory for dependency injection
+        def client_factory(workspace_url, token):
+            return databricks_client_stub
 
-        # 1. Start wizard - should succeed
-        result = orchestrator.start_wizard()
-        assert result.success
+        # Mock AmperityAPIClient to return our stub
+        with patch(
+            "chuck_data.commands.wizard.steps.AmperityAPIClient",
+            return_value=amperity_client_stub,
+        ):
 
-        # 2. Enter valid workspace URL - should succeed
-        result = orchestrator.handle_interactive_input("workspace123")
-        assert result.success
+            # Inject client factory into validator - need to patch the orchestrator creation
+            with patch(
+                "chuck_data.commands.setup_wizard.InputValidator"
+            ) as mock_validator_class:
+                # Create real validator with our client factory
+                real_validator = InputValidator(
+                    databricks_client_factory=client_factory
+                )
+                mock_validator_class.return_value = real_validator
 
-        # 3. Enter invalid token - token validation will fail and go back to URL step
-        # The wizard handles this gracefully by returning success=False and transitioning back
-        result = orchestrator.handle_interactive_input("invalid-token")
-        # The result might be success=True because it successfully transitioned back to URL step
-        # but the error flow worked correctly as evidenced by the output showing step 2
+                orchestrator = SetupWizardOrchestrator()
 
-        # The orchestrator should now be back at workspace URL step
-        # We can verify this by checking that the next input is treated as a URL
+                # 1. Start wizard - should succeed
+                result = orchestrator.start_wizard()
+                assert result.success
 
-        # 4. Re-enter workspace URL
-        result = orchestrator.handle_interactive_input("workspace456")
-        assert result.success
+                # 2. Enter valid workspace URL - should succeed
+                result = orchestrator.handle_interactive_input("workspace123")
+                assert result.success
 
-        # This flow tests the real error recovery behavior without over-mocking
+                # 3. Enter invalid token - token validation will fail and go back to URL step
+                # The wizard handles this gracefully by returning success=False and transitioning back
+                result = orchestrator.handle_interactive_input("invalid-token")
+                # The result might be success=True because it successfully transitioned back to URL step
+                # but the error flow worked correctly as evidenced by the output showing step 2
+
+                # The orchestrator should now be back at workspace URL step
+                # We can verify this by checking that the next input is treated as a URL
+
+                # 4. Re-enter workspace URL
+                result = orchestrator.handle_interactive_input("workspace456")
+                assert result.success
+
+                # This flow tests the real error recovery behavior without over-mocking
 
     def test_validation_error_messages_preserved(self):
         """Test that validation error messages are properly preserved and displayed."""
@@ -708,22 +726,28 @@ def test_token_validation_error_flow(self):
         # Should have the error message
         assert "Please re-enter your workspace URL and token" in result.message
 
-    def test_token_not_stored_in_processed_value_on_failure(self):
+    def test_token_not_stored_in_processed_value_on_failure(
+        self, databricks_client_stub
+    ):
         """Test that tokens are not stored in processed_value when validation fails."""
         from chuck_data.commands.wizard.validator import InputValidator
 
-        validator = InputValidator()
+        # Configure stub to raise exception for token validation
+        databricks_client_stub.set_token_validation_result(
+            Exception("Validation failed")
+        )
+
+        def client_factory(workspace_url, token):
+            return databricks_client_stub
 
-        # Mock token validation to fail
-        with patch("chuck_data.clients.databricks.DatabricksAPIClient") as mock_client:
-            mock_client.side_effect = Exception("Validation failed")
+        validator = InputValidator(databricks_client_factory=client_factory)
 
-            result = validator.validate_token("secret-token-123", "https://test.com")
+        result = validator.validate_token("secret-token-123", "https://test.com")
 
-            # Should fail validation
-            assert not result.is_valid
-            # Should not store the token in processed_value
-            assert result.processed_value is None
+        # Should fail validation
+        assert not result.is_valid
+        # Should not store the token in processed_value
+        assert result.processed_value is None
 
     def test_step_detection_for_password_mode_after_error(self):
         """Test that step detection works correctly after token validation error."""
@@ -755,7 +779,9 @@ def test_step_detection_for_password_mode_after_error(self):
         ), "Should NOT hide input on workspace step (even with workspace_url present)"
 
     @patch("chuck_data.commands.wizard.steps.get_amperity_token")
-    def test_context_step_update_on_token_failure(self, mock_get_token):
+    def test_context_step_update_on_token_failure(
+        self, mock_get_token, databricks_client_stub
+    ):
         """Test that context step is updated correctly when token validation fails."""
         from chuck_data.commands.setup_wizard import SetupWizardOrchestrator
         from chuck_data.interactive_context import InteractiveContext
@@ -780,13 +806,29 @@ def test_context_step_update_on_token_failure(self, mock_get_token):
             context_data = context.get_context_data("/setup")
             assert context_data.get("current_step") == "token_input"
 
-            # Mock a validation failure that should go back to workspace URL
+            # Configure databricks stub for validation failure and inject it
+            databricks_client_stub.set_token_validation_result(False)
+
+            def client_factory(workspace_url, token):
+                return databricks_client_stub
+
+            # Mock the validator creation to use our client factory
             with patch(
-                "chuck_data.clients.databricks.DatabricksAPIClient"
-            ) as mock_client:
-                mock_db_client = MagicMock()
-                mock_db_client.validate_token.return_value = False
-                mock_client.return_value = mock_db_client
+                "chuck_data.commands.setup_wizard.InputValidator"
+            ) as mock_validator_class:
+                real_validator = InputValidator(
+                    databricks_client_factory=client_factory
+                )
+                mock_validator_class.return_value = real_validator
+
+                # Create new orchestrator with our validator
+                orchestrator = SetupWizardOrchestrator()
+
+                # Re-do the setup since we created a new orchestrator
+                result = orchestrator.start_wizard()
+                assert result.success
+                result = orchestrator.handle_interactive_input("workspace123")
+                assert result.success
 
                 # Process token that should fail
                 result = orchestrator.handle_interactive_input("invalid-token")
@@ -910,34 +952,41 @@ def teardown_method(self):
         self.context.clear_active_context("/setup")
 
     @patch("chuck_data.commands.wizard.steps.get_amperity_token")
-    @patch("chuck_data.clients.databricks.DatabricksAPIClient")
     def test_token_not_stored_in_history_on_failure(
-        self, mock_databricks_client, mock_get_token
+        self, mock_get_token, databricks_client_stub
     ):
         """Test that tokens are not stored in command history when validation fails."""
         mock_get_token.return_value = "existing-token"
 
-        # Mock token validation failure
-        mock_db_client = MagicMock()
-        mock_db_client.validate_token.return_value = False
-        mock_databricks_client.return_value = mock_db_client
+        # Configure stub for token validation failure
+        databricks_client_stub.set_token_validation_result(False)
 
-        orchestrator = SetupWizardOrchestrator()
+        def client_factory(workspace_url, token):
+            return databricks_client_stub
 
-        # Start wizard and get to token input step
-        result = orchestrator.start_wizard()
-        assert result.success
+        # Mock the validator creation to use our client factory
+        with patch(
+            "chuck_data.commands.setup_wizard.InputValidator"
+        ) as mock_validator_class:
+            real_validator = InputValidator(databricks_client_factory=client_factory)
+            mock_validator_class.return_value = real_validator
 
-        result = orchestrator.handle_interactive_input("workspace123")
-        assert result.success
+            orchestrator = SetupWizardOrchestrator()
 
-        # Now we should be on token input step
-        context_data = self.context.get_context_data("/setup")
-        assert context_data.get("current_step") == "token_input"
+            # Start wizard and get to token input step
+            result = orchestrator.start_wizard()
+            assert result.success
 
-        # Simulate token input that fails validation - should go back to workspace URL
-        result = orchestrator.handle_interactive_input("fake-token-123")
-        # The result is success=True because it successfully transitions back to workspace step
+            result = orchestrator.handle_interactive_input("workspace123")
+            assert result.success
+
+            # Now we should be on token input step
+            context_data = self.context.get_context_data("/setup")
+            assert context_data.get("current_step") == "token_input"
+
+            # Simulate token input that fails validation - should go back to workspace URL
+            result = orchestrator.handle_interactive_input("fake-token-123")
+            # The result is success=True because it successfully transitions back to workspace step
 
         # Verify we're back at workspace URL step
         context_data = self.context.get_context_data("/setup")
@@ -996,7 +1045,9 @@ def test_input_mode_detection_logic(self):
             ), f"{description}. Got hide_input={hide_input}"
 
     @patch("chuck_data.commands.wizard.steps.get_amperity_token")
-    def test_step_context_updates_correctly_on_token_failure(self, mock_get_token):
+    def test_step_context_updates_correctly_on_token_failure(
+        self, mock_get_token, databricks_client_stub
+    ):
         """Test that step context is correctly updated when token validation fails."""
         mock_get_token.return_value = "existing-token"
 
@@ -1013,11 +1064,27 @@ def test_step_context_updates_correctly_on_token_failure(self, mock_get_token):
         context_data = self.context.get_context_data("/setup")
         assert context_data.get("current_step") == "token_input"
 
-        # Mock token validation failure and process token input
-        with patch("chuck_data.clients.databricks.DatabricksAPIClient") as mock_client:
-            mock_db_client = MagicMock()
-            mock_db_client.validate_token.return_value = False
-            mock_client.return_value = mock_db_client
+        # Configure stub and inject it for token validation failure
+        databricks_client_stub.set_token_validation_result(False)
+
+        def client_factory(workspace_url, token):
+            return databricks_client_stub
+
+        # Mock the validator creation to use our client factory
+        with patch(
+            "chuck_data.commands.setup_wizard.InputValidator"
+        ) as mock_validator_class:
+            real_validator = InputValidator(databricks_client_factory=client_factory)
+            mock_validator_class.return_value = real_validator
+
+            # Create new orchestrator with our validator
+            orchestrator = SetupWizardOrchestrator()
+
+            # Re-do the setup since we created a new orchestrator
+            result = orchestrator.start_wizard()
+            assert result.success
+            result = orchestrator.handle_interactive_input("workspace123")
+            assert result.success
 
             result = orchestrator.handle_interactive_input("invalid-token")
 
@@ -1030,28 +1097,34 @@ def test_step_context_updates_correctly_on_token_failure(self, mock_get_token):
                 context_data.get("current_step") == "workspace_url"
             ), f"Expected workspace_url step, got {context_data.get('current_step')}"
 
-    def test_token_not_in_wizard_state_after_failure(self):
+    def test_token_not_in_wizard_state_after_failure(self, databricks_client_stub):
         """Test that failed tokens are not stored in wizard state."""
         from chuck_data.commands.wizard.validator import InputValidator
 
-        validator = InputValidator()
+        # Configure stub to raise exception for token validation
+        databricks_client_stub.set_token_validation_result(
+            Exception("Connection failed")
+        )
 
-        # Mock token validation failure
-        with patch("chuck_data.clients.databricks.DatabricksAPIClient") as mock_client:
-            mock_client.side_effect = Exception("Connection failed")
+        def client_factory(workspace_url, token):
+            return databricks_client_stub
 
-            result = validator.validate_token(
-                "secret-token-456", "https://test.databricks.com"
-            )
-            assert not result.is_valid
+        validator = InputValidator(databricks_client_factory=client_factory)
 
-            # The token should not be in the processed_value when validation fails
-            assert result.processed_value is None or "secret-token-456" not in str(
-                result.processed_value
-            )
+        result = validator.validate_token(
+            "secret-token-456", "https://test.databricks.com"
+        )
+        assert not result.is_valid
+
+        # The token should not be in the processed_value when validation fails
+        assert result.processed_value is None or "secret-token-456" not in str(
+            result.processed_value
+        )
 
     @patch("chuck_data.commands.wizard.steps.get_amperity_token")
-    def test_no_token_leakage_in_error_messages(self, mock_get_token):
+    def test_no_token_leakage_in_error_messages(
+        self, mock_get_token, databricks_client_stub
+    ):
         """Test that tokens don't leak into error messages."""
         mock_get_token.return_value = "existing-token"
 
@@ -1061,9 +1134,27 @@ def test_no_token_leakage_in_error_messages(self, mock_get_token):
         result = orchestrator.start_wizard()
         result = orchestrator.handle_interactive_input("workspace123")
 
-        # Mock token validation failure
-        with patch("chuck_data.clients.databricks.DatabricksAPIClient") as mock_client:
-            mock_client.side_effect = Exception("Network error with secret details")
+        # Configure stub and inject it for network error
+        databricks_client_stub.set_token_validation_result(
+            Exception("Network error with secret details")
+        )
+
+        def client_factory(workspace_url, token):
+            return databricks_client_stub
+
+        # Mock the validator creation to use our client factory
+        with patch(
+            "chuck_data.commands.setup_wizard.InputValidator"
+        ) as mock_validator_class:
+            real_validator = InputValidator(databricks_client_factory=client_factory)
+            mock_validator_class.return_value = real_validator
+
+            # Create new orchestrator with our validator
+            orchestrator = SetupWizardOrchestrator()
+
+            # Re-do the setup since we created a new orchestrator
+            result = orchestrator.start_wizard()
+            result = orchestrator.handle_interactive_input("workspace123")
 
             result = orchestrator.handle_interactive_input("super-secret-token")
 
@@ -1127,7 +1218,9 @@ def test_prompt_parameters_logic(self):
             ), f"{description}. Got enable_history={enable_history}"
 
     @patch("chuck_data.commands.wizard.steps.get_amperity_token")
-    def test_api_error_message_displayed_to_user(self, mock_get_token):
+    def test_api_error_message_displayed_to_user(
+        self, mock_get_token, databricks_client_stub
+    ):
         """Test that API errors from token validation are displayed to the user."""
         from chuck_data.commands.setup_wizard import SetupWizardOrchestrator
         from chuck_data.interactive_context import InteractiveContext
@@ -1147,13 +1240,33 @@ def test_api_error_message_displayed_to_user(self, mock_get_token):
             result = orchestrator.handle_interactive_input("workspace123")
             assert result.success
 
-            # Mock API error when validating token
-            with patch(
-                "chuck_data.clients.databricks.DatabricksAPIClient"
-            ) as mock_client:
-                mock_client.side_effect = Exception(
+            # Configure stub and inject it for connection error
+            databricks_client_stub.set_token_validation_result(
+                Exception(
                     "Connection error: Failed to resolve 'workspace123.cloud.databricks.com'"
                 )
+            )
+
+            def client_factory(workspace_url, token):
+                return databricks_client_stub
+
+            # Mock the validator creation to use our client factory
+            with patch(
+                "chuck_data.commands.setup_wizard.InputValidator"
+            ) as mock_validator_class:
+                real_validator = InputValidator(
+                    databricks_client_factory=client_factory
+                )
+                mock_validator_class.return_value = real_validator
+
+                # Create new orchestrator with our validator
+                orchestrator = SetupWizardOrchestrator()
+
+                # Re-do the setup since we created a new orchestrator
+                result = orchestrator.start_wizard()
+                assert result.success
+                result = orchestrator.handle_interactive_input("workspace123")
+                assert result.success
 
                 # Process token that should fail with API error
                 result = orchestrator.handle_interactive_input("some-token")
diff --git a/tests/unit/commands/test_status.py b/tests/unit/commands/test_status.py
new file mode 100644
index 0000000..1f1c11c
--- /dev/null
+++ b/tests/unit/commands/test_status.py
@@ -0,0 +1,179 @@
+"""
+Tests for the status command module.
+
+Following approved testing patterns:
+- Mock external boundaries only (Databricks API calls)
+- Use real config system with temporary files
+- Test end-to-end command behavior with real business logic
+"""
+
+import tempfile
+from unittest.mock import patch
+
+from chuck_data.commands.status import handle_command
+from chuck_data.config import ConfigManager
+
+
+def test_handle_status_with_valid_connection_real_logic(databricks_client_stub):
+    """Test status command with valid connection using real config system."""
+    with tempfile.NamedTemporaryFile() as tmp:
+        config_manager = ConfigManager(tmp.name)
+
+        # Set up real config state
+        config_manager.update(
+            workspace_url="https://test.databricks.com",
+            active_catalog="test_catalog",
+            active_schema="test_schema",
+            active_model="test_model",
+            warehouse_id="test_warehouse",
+        )
+
+        # Mock only external boundary (Databricks API permission validation)
+        with patch("chuck_data.config._config_manager", config_manager):
+            with patch(
+                "chuck_data.commands.status.validate_all_permissions"
+            ) as mock_permissions:
+                mock_permissions.return_value = {"test_resource": {"authorized": True}}
+
+                # Call function with real config and external API mock
+                result = handle_command(databricks_client_stub)
+
+    # Verify real command execution with real config values
+    assert result.success
+    assert result.data["workspace_url"] == "https://test.databricks.com"
+    assert result.data["active_catalog"] == "test_catalog"
+    assert result.data["active_schema"] == "test_schema"
+    assert result.data["active_model"] == "test_model"
+    assert result.data["warehouse_id"] == "test_warehouse"
+    assert result.data["connection_status"] == "Connected (client present)."
+    assert result.data["permissions"] == {"test_resource": {"authorized": True}}
+
+
+def test_handle_status_with_no_client_real_logic():
+    """Test status command with no client using real config system."""
+    with tempfile.NamedTemporaryFile() as tmp:
+        config_manager = ConfigManager(tmp.name)
+
+        # Set up real config state
+        config_manager.update(
+            workspace_url="https://test.databricks.com",
+            active_catalog="test_catalog",
+            active_schema="test_schema",
+            active_model="test_model",
+            warehouse_id="test_warehouse",
+        )
+
+        with patch("chuck_data.config._config_manager", config_manager):
+            # Call function with no client - should use real config
+            result = handle_command(None)
+
+    # Verify real command execution with real config values
+    assert result.success
+    assert result.data["workspace_url"] == "https://test.databricks.com"
+    assert result.data["active_catalog"] == "test_catalog"
+    assert result.data["active_schema"] == "test_schema"
+    assert result.data["active_model"] == "test_model"
+    assert result.data["warehouse_id"] == "test_warehouse"
+    assert (
+        result.data["connection_status"] == "Client not available or not initialized."
+    )
+    assert result.data["permissions"] == {}  # No permissions check without client
+
+
+def test_handle_status_with_permission_error_real_logic(databricks_client_stub):
+    """Test status command when permission validation fails."""
+    with tempfile.NamedTemporaryFile() as tmp:
+        config_manager = ConfigManager(tmp.name)
+
+        # Set up real config state
+        config_manager.update(
+            workspace_url="https://test.databricks.com", active_catalog="test_catalog"
+        )
+
+        # Mock external API to simulate permission error
+        with patch("chuck_data.config._config_manager", config_manager):
+            with patch(
+                "chuck_data.commands.status.validate_all_permissions"
+            ) as mock_permissions:
+                mock_permissions.side_effect = Exception("Permission denied")
+
+                # Test real error handling with external API failure
+                result = handle_command(databricks_client_stub)
+
+    # Verify real error handling - should still succeed but with error message
+    assert result.success
+    assert (
+        "Permission denied" in result.data["connection_status"]
+        or "error" in result.data["connection_status"]
+    )
+    # Real config values should still be present
+    assert result.data["workspace_url"] == "https://test.databricks.com"
+    assert result.data["active_catalog"] == "test_catalog"
+
+
+def test_handle_status_with_config_error_real_logic():
+    """Test status command when config system encounters error."""
+    with tempfile.NamedTemporaryFile() as tmp:
+        config_manager = ConfigManager(tmp.name)
+        # Don't initialize config - should handle missing config gracefully
+
+        with patch("chuck_data.config._config_manager", config_manager):
+            # Test real error handling with uninitialized config
+            result = handle_command(None)
+
+    # Should handle config errors gracefully - exact behavior depends on real implementation
+    assert isinstance(result.success, bool)
+    assert result.data is not None or result.error is not None
+
+
+def test_handle_status_with_partial_config_real_logic(databricks_client_stub):
+    """Test status command with partially configured system."""
+    with tempfile.NamedTemporaryFile() as tmp:
+        config_manager = ConfigManager(tmp.name)
+
+        # Set up partial config state (missing some values)
+        config_manager.update(
+            workspace_url="https://test.databricks.com",
+            # Missing catalog, schema, model - should handle gracefully
+        )
+
+        with patch("chuck_data.config._config_manager", config_manager):
+            with patch(
+                "chuck_data.commands.status.validate_all_permissions"
+            ) as mock_permissions:
+                mock_permissions.return_value = {}
+
+                # Test real handling of partial configuration
+                result = handle_command(databricks_client_stub)
+
+    # Should succeed with real config handling of missing values
+    assert result.success
+    assert result.data["workspace_url"] == "https://test.databricks.com"
+    # Other values should be None or default values from real config system
+    assert result.data["active_catalog"] is None or isinstance(
+        result.data["active_catalog"], str
+    )
+    assert result.data["connection_status"] == "Connected (client present)."
+
+
+def test_handle_status_real_config_integration():
+    """Test status command integration with real config system."""
+    with tempfile.NamedTemporaryFile() as tmp:
+        config_manager = ConfigManager(tmp.name)
+
+        # Test multiple config updates to verify real config behavior
+        config_manager.update(workspace_url="https://first.databricks.com")
+        config_manager.update(active_catalog="first_catalog")
+        config_manager.update(
+            workspace_url="https://second.databricks.com"
+        )  # Update workspace
+
+        with patch("chuck_data.config._config_manager", config_manager):
+            result = handle_command(None)
+
+    # Verify real config system behavior with updates
+    assert result.success
+    assert (
+        result.data["workspace_url"] == "https://second.databricks.com"
+    )  # Latest update
+    assert result.data["active_catalog"] == "first_catalog"  # Preserved from earlier
diff --git a/tests/unit/commands/test_stitch_tools.py b/tests/unit/commands/test_stitch_tools.py
new file mode 100644
index 0000000..f95eacb
--- /dev/null
+++ b/tests/unit/commands/test_stitch_tools.py
@@ -0,0 +1,463 @@
+"""
+Tests for stitch_tools command handler utilities.
+
+This module contains tests for the Stitch integration utilities.
+"""
+
+import pytest
+from unittest.mock import patch, MagicMock
+
+from chuck_data.commands.stitch_tools import _helper_setup_stitch_logic
+from tests.fixtures.llm import LLMClientStub
+
+
+@pytest.fixture
+def client():
+    """Mock client fixture."""
+    return MagicMock()
+
+
+@pytest.fixture
+def llm_client():
+    """LLM client stub fixture."""
+    return LLMClientStub()
+
+
+@pytest.fixture
+def mock_pii_scan_results():
+    """Mock successful PII scan result fixture."""
+    return {
+        "tables_successfully_processed": 5,
+        "tables_with_pii": 3,
+        "total_pii_columns": 8,
+        "results_detail": [
+            {
+                "full_name": "test_catalog.test_schema.customers",
+                "has_pii": True,
+                "skipped": False,
+                "columns": [
+                    {"name": "id", "type": "int", "semantic": None},
+                    {"name": "name", "type": "string", "semantic": "full-name"},
+                    {"name": "email", "type": "string", "semantic": "email"},
+                ],
+            },
+            {
+                "full_name": "test_catalog.test_schema.orders",
+                "has_pii": True,
+                "skipped": False,
+                "columns": [
+                    {"name": "id", "type": "int", "semantic": None},
+                    {"name": "customer_id", "type": "int", "semantic": None},
+                    {
+                        "name": "shipping_address",
+                        "type": "string",
+                        "semantic": "address",
+                    },
+                ],
+            },
+            {
+                "full_name": "test_catalog.test_schema.metrics",
+                "has_pii": False,
+                "skipped": False,
+                "columns": [
+                    {"name": "id", "type": "int", "semantic": None},
+                    {"name": "date", "type": "date", "semantic": None},
+                ],
+            },
+        ],
+    }
+
+
+@pytest.fixture
+def mock_pii_scan_results_with_unsupported():
+    """Mock PII scan results with unsupported types fixture."""
+    return {
+        "tables_successfully_processed": 2,
+        "tables_with_pii": 2,
+        "total_pii_columns": 4,
+        "results_detail": [
+            {
+                "full_name": "test_catalog.test_schema.customers",
+                "has_pii": True,
+                "skipped": False,
+                "columns": [
+                    {"name": "id", "type": "int", "semantic": None},
+                    {"name": "name", "type": "string", "semantic": "full-name"},
+                    {
+                        "name": "metadata",
+                        "type": "STRUCT",
+                        "semantic": None,
+                    },  # Unsupported
+                    {
+                        "name": "tags",
+                        "type": "ARRAY",
+                        "semantic": None,
+                    },  # Unsupported
+                ],
+            },
+            {
+                "full_name": "test_catalog.test_schema.geo_data",
+                "has_pii": True,
+                "skipped": False,
+                "columns": [
+                    {
+                        "name": "location",
+                        "type": "GEOGRAPHY",
+                        "semantic": "address",
+                    },  # Unsupported
+                    {
+                        "name": "geometry",
+                        "type": "GEOMETRY",
+                        "semantic": None,
+                    },  # Unsupported
+                    {
+                        "name": "properties",
+                        "type": "MAP",
+                        "semantic": None,
+                    },  # Unsupported
+                    {
+                        "name": "description",
+                        "type": "string",
+                        "semantic": "full-name",
+                    },
+                ],
+            },
+        ],
+    }
+
+
+def test_missing_params(client, llm_client):
+    """Test handling when parameters are missing."""
+    result = _helper_setup_stitch_logic(client, llm_client, "", "test_schema")
+    assert "error" in result
+    assert "Target catalog and schema are required" in result["error"]
+
+
+@patch("chuck_data.commands.stitch_tools._helper_scan_schema_for_pii_logic")
+def test_pii_scan_error(mock_scan_pii, client, llm_client):
+    """Test handling when PII scan returns an error."""
+    # Setup mock
+    mock_scan_pii.return_value = {"error": "Failed to access tables"}
+
+    # Call function
+    result = _helper_setup_stitch_logic(
+        client, llm_client, "test_catalog", "test_schema"
+    )
+
+    # Verify results
+    assert "error" in result
+    assert "PII Scan failed during Stitch setup" in result["error"]
+
+
+@patch("chuck_data.commands.stitch_tools._helper_scan_schema_for_pii_logic")
+def test_volume_list_error(mock_scan_pii, client, llm_client, mock_pii_scan_results):
+    """Test handling when listing volumes fails."""
+    # Setup mocks
+    mock_scan_pii.return_value = mock_pii_scan_results
+    client.list_volumes.side_effect = Exception("API Error")
+
+    # Call function
+    result = _helper_setup_stitch_logic(
+        client, llm_client, "test_catalog", "test_schema"
+    )
+
+    # Verify results
+    assert "error" in result
+    assert "Failed to list volumes" in result["error"]
+
+
+@patch("chuck_data.commands.stitch_tools._helper_scan_schema_for_pii_logic")
+def test_volume_create_error(mock_scan_pii, client, llm_client, mock_pii_scan_results):
+    """Test handling when creating volume fails."""
+    # Setup mocks
+    mock_scan_pii.return_value = mock_pii_scan_results
+    client.list_volumes.return_value = {
+        "volumes": []
+    }  # Empty list, volume doesn't exist
+    client.create_volume.return_value = None  # Creation failed
+
+    # Call function
+    result = _helper_setup_stitch_logic(
+        client, llm_client, "test_catalog", "test_schema"
+    )
+
+    # Verify results
+    assert "error" in result
+    assert "Failed to create volume 'chuck'" in result["error"]
+
+
+@patch("chuck_data.commands.stitch_tools._helper_scan_schema_for_pii_logic")
+def test_no_tables_with_pii(mock_scan_pii, client, llm_client, mock_pii_scan_results):
+    """Test handling when no tables with PII are found."""
+    # Setup mocks
+    no_pii_results = mock_pii_scan_results.copy()
+    # Override results_detail with no tables that have PII
+    no_pii_results["results_detail"] = [
+        {
+            "full_name": "test_catalog.test_schema.metrics",
+            "has_pii": False,
+            "skipped": False,
+            "columns": [{"name": "id", "type": "int", "semantic": None}],
+        }
+    ]
+    mock_scan_pii.return_value = no_pii_results
+    client.list_volumes.return_value = {"volumes": [{"name": "chuck"}]}  # Volume exists
+
+    # Call function
+    result = _helper_setup_stitch_logic(
+        client, llm_client, "test_catalog", "test_schema"
+    )
+
+    # Verify results
+    assert "error" in result
+    assert "No tables with PII found" in result["error"]
+
+
+@patch("chuck_data.commands.stitch_tools._helper_scan_schema_for_pii_logic")
+@patch("chuck_data.commands.stitch_tools.get_amperity_token")
+def test_missing_amperity_token(
+    mock_get_amperity_token, mock_scan_pii, client, llm_client, mock_pii_scan_results
+):
+    """Test handling when Amperity token is missing."""
+    # Setup mocks
+    mock_scan_pii.return_value = mock_pii_scan_results
+    client.list_volumes.return_value = {"volumes": [{"name": "chuck"}]}  # Volume exists
+    client.upload_file.return_value = True  # Config file upload successful
+    mock_get_amperity_token.return_value = None  # No token
+
+    # Call function
+    result = _helper_setup_stitch_logic(
+        client, llm_client, "test_catalog", "test_schema"
+    )
+
+    # Verify results
+    assert "error" in result
+    assert "Amperity token not found" in result["error"]
+
+
+@patch("chuck_data.commands.stitch_tools._helper_scan_schema_for_pii_logic")
+@patch("chuck_data.commands.stitch_tools.get_amperity_token")
+def test_amperity_init_script_error(
+    mock_get_amperity_token, mock_scan_pii, client, llm_client, mock_pii_scan_results
+):
+    """Test handling when fetching Amperity init script fails."""
+    # Setup mocks
+    mock_scan_pii.return_value = mock_pii_scan_results
+    client.list_volumes.return_value = {"volumes": [{"name": "chuck"}]}  # Volume exists
+    client.upload_file.return_value = True  # Config file upload successful
+    mock_get_amperity_token.return_value = "fake_token"
+    client.fetch_amperity_job_init.side_effect = Exception("API Error")
+
+    # Call function
+    result = _helper_setup_stitch_logic(
+        client, llm_client, "test_catalog", "test_schema"
+    )
+
+    # Verify results
+    assert "error" in result
+    assert "Error fetching Amperity init script" in result["error"]
+
+
+@patch("chuck_data.commands.stitch_tools._helper_scan_schema_for_pii_logic")
+@patch("chuck_data.commands.stitch_tools.get_amperity_token")
+@patch("chuck_data.commands.stitch_tools._helper_upload_cluster_init_logic")
+def test_versioned_init_script_upload_error(
+    mock_upload_init,
+    mock_get_amperity_token,
+    mock_scan_pii,
+    client,
+    llm_client,
+    mock_pii_scan_results,
+):
+    """Test handling when versioned init script upload fails."""
+    # Setup mocks
+    mock_scan_pii.return_value = mock_pii_scan_results
+    client.list_volumes.return_value = {"volumes": [{"name": "chuck"}]}  # Volume exists
+    mock_get_amperity_token.return_value = "fake_token"
+    client.fetch_amperity_job_init.return_value = {"cluster-init": "echo 'init script'"}
+    # Mock versioned init script upload failure
+    mock_upload_init.return_value = {"error": "Failed to upload versioned init script"}
+
+    # Call function
+    result = _helper_setup_stitch_logic(
+        client, llm_client, "test_catalog", "test_schema"
+    )
+
+    # Verify results
+    assert "error" in result
+    assert result["error"] == "Failed to upload versioned init script"
+
+
+@patch("chuck_data.commands.stitch_tools._helper_scan_schema_for_pii_logic")
+@patch("chuck_data.commands.stitch_tools.get_amperity_token")
+@patch("chuck_data.commands.stitch_tools._helper_upload_cluster_init_logic")
+def test_successful_setup(
+    mock_upload_init,
+    mock_get_amperity_token,
+    mock_scan_pii,
+    client,
+    llm_client,
+    mock_pii_scan_results,
+):
+    """Test successful Stitch integration setup with versioned init script."""
+    # Setup mocks
+    mock_scan_pii.return_value = mock_pii_scan_results
+    client.list_volumes.return_value = {"volumes": [{"name": "chuck"}]}  # Volume exists
+    client.upload_file.return_value = True  # File uploads successful
+    mock_get_amperity_token.return_value = "fake_token"
+    client.fetch_amperity_job_init.return_value = {"cluster-init": "echo 'init script'"}
+    # Mock versioned init script upload
+    mock_upload_init.return_value = {
+        "success": True,
+        "volume_path": "/Volumes/test_catalog/test_schema/chuck/cluster_init-2025-06-02_14-30.sh",
+        "filename": "cluster_init-2025-06-02_14-30.sh",
+        "timestamp": "2025-06-02_14-30",
+    }
+    client.submit_job_run.return_value = {"run_id": "12345"}
+
+    # Call function
+    result = _helper_setup_stitch_logic(
+        client, llm_client, "test_catalog", "test_schema"
+    )
+
+    # Verify results
+    assert result.get("success")
+    assert "stitch_config" in result
+    assert "metadata" in result
+    metadata = result["metadata"]
+    assert "config_file_path" in metadata
+    assert "init_script_path" in metadata
+    assert (
+        metadata["init_script_path"]
+        == "/Volumes/test_catalog/test_schema/chuck/cluster_init-2025-06-02_14-30.sh"
+    )
+
+    # Verify versioned init script upload was called
+    mock_upload_init.assert_called_once_with(
+        client=client,
+        target_catalog="test_catalog",
+        target_schema="test_schema",
+        init_script_content="echo 'init script'",
+    )
+
+    # Verify no unsupported columns warning when all columns are supported
+    assert "unsupported_columns" in metadata
+    assert len(metadata["unsupported_columns"]) == 0
+    assert "Note: Some columns were excluded" not in result.get("message", "")
+
+
+@patch("chuck_data.commands.stitch_tools._helper_scan_schema_for_pii_logic")
+@patch("chuck_data.commands.stitch_tools.get_amperity_token")
+@patch("chuck_data.commands.stitch_tools._helper_upload_cluster_init_logic")
+def test_unsupported_types_filtered(
+    mock_upload_init,
+    mock_get_amperity_token,
+    mock_scan_pii,
+    client,
+    llm_client,
+    mock_pii_scan_results_with_unsupported,
+):
+    """Test that unsupported column types are filtered out from Stitch config."""
+    # Setup mocks
+    mock_scan_pii.return_value = mock_pii_scan_results_with_unsupported
+    client.list_volumes.return_value = {"volumes": [{"name": "chuck"}]}  # Volume exists
+    client.upload_file.return_value = True  # File uploads successful
+    mock_get_amperity_token.return_value = "fake_token"
+    client.fetch_amperity_job_init.return_value = {"cluster-init": "echo 'init script'"}
+    # Mock versioned init script upload
+    mock_upload_init.return_value = {
+        "success": True,
+        "volume_path": "/Volumes/test_catalog/test_schema/chuck/cluster_init-2025-06-02_14-30.sh",
+        "filename": "cluster_init-2025-06-02_14-30.sh",
+        "timestamp": "2025-06-02_14-30",
+    }
+    client.submit_job_run.return_value = {"run_id": "12345"}
+
+    # Call function
+    result = _helper_setup_stitch_logic(
+        client, llm_client, "test_catalog", "test_schema"
+    )
+
+    # Verify results
+    assert result.get("success")
+
+    # Get the generated config content
+    import json
+
+    config_content = json.dumps(result["stitch_config"])
+
+    # Verify unsupported types are not in the config
+    unsupported_types = ["STRUCT", "ARRAY", "GEOGRAPHY", "GEOMETRY", "MAP"]
+    for unsupported_type in unsupported_types:
+        assert (
+            unsupported_type not in config_content
+        ), f"Config should not contain unsupported type: {unsupported_type}"
+
+    # Verify supported types are still included
+    assert "int" in config_content, "Config should contain supported type: int"
+    assert "string" in config_content, "Config should contain supported type: string"
+
+    # Verify unsupported columns are reported to user
+    assert "metadata" in result
+    metadata = result["metadata"]
+    assert "unsupported_columns" in metadata
+    unsupported_info = metadata["unsupported_columns"]
+    assert len(unsupported_info) == 2  # Two tables have unsupported columns
+
+    # Check first table (customers)
+    customers_unsupported = next(
+        t for t in unsupported_info if "customers" in t["table"]
+    )
+    assert len(customers_unsupported["columns"]) == 2  # metadata and tags
+    column_types = [col["type"] for col in customers_unsupported["columns"]]
+    assert "STRUCT" in column_types
+    assert "ARRAY" in column_types
+
+    # Check second table (geo_data)
+    geo_unsupported = next(t for t in unsupported_info if "geo_data" in t["table"])
+    assert len(geo_unsupported["columns"]) == 3  # location, geometry, properties
+    geo_column_types = [col["type"] for col in geo_unsupported["columns"]]
+    assert "GEOGRAPHY" in geo_column_types
+    assert "GEOMETRY" in geo_column_types
+    assert "MAP" in geo_column_types
+
+    # Verify warning message includes unsupported columns info in metadata
+    assert "unsupported_columns" in metadata
+
+
+@patch("chuck_data.commands.stitch_tools._helper_scan_schema_for_pii_logic")
+@patch("chuck_data.commands.stitch_tools.get_amperity_token")
+def test_all_columns_unsupported_types(
+    mock_get_amperity_token, mock_scan_pii, client, llm_client
+):
+    """Test handling when all columns have unsupported types."""
+    # Setup mocks with all unsupported types
+    all_unsupported_results = {
+        "tables_successfully_processed": 1,
+        "tables_with_pii": 1,
+        "total_pii_columns": 2,
+        "results_detail": [
+            {
+                "full_name": "test_catalog.test_schema.complex_data",
+                "has_pii": True,
+                "skipped": False,
+                "columns": [
+                    {"name": "metadata", "type": "STRUCT", "semantic": "full-name"},
+                    {"name": "tags", "type": "ARRAY", "semantic": "address"},
+                    {"name": "location", "type": "GEOGRAPHY", "semantic": None},
+                ],
+            },
+        ],
+    }
+    mock_scan_pii.return_value = all_unsupported_results
+    client.list_volumes.return_value = {"volumes": [{"name": "chuck"}]}  # Volume exists
+    mock_get_amperity_token.return_value = "fake_token"  # Add token mock
+
+    # Call function
+    result = _helper_setup_stitch_logic(
+        client, llm_client, "test_catalog", "test_schema"
+    )
+
+    # Verify results - should fail because no supported columns remain
+    assert "error" in result
+    assert "No tables with PII found" in result["error"]
diff --git a/tests/unit/commands/test_tag_pii.py b/tests/unit/commands/test_tag_pii.py
new file mode 100644
index 0000000..8631825
--- /dev/null
+++ b/tests/unit/commands/test_tag_pii.py
@@ -0,0 +1,189 @@
+"""Unit tests for tag_pii command."""
+
+from unittest.mock import MagicMock, patch
+
+from chuck_data.commands.tag_pii import handle_command, apply_semantic_tags
+from chuck_data.commands.base import CommandResult
+from chuck_data.config import (
+    set_warehouse_id,
+    set_active_catalog,
+    set_active_schema,
+)
+
+
+def test_missing_table_name():
+    """Test that missing table_name parameter is handled correctly."""
+    result = handle_command(None, pii_columns=[{"name": "test", "semantic": "email"}])
+
+    assert isinstance(result, CommandResult)
+    assert not result.success
+    assert "table_name parameter is required" in result.message
+
+
+def test_missing_pii_columns():
+    """Test that missing pii_columns parameter is handled correctly."""
+    result = handle_command(None, table_name="test_table")
+
+    assert isinstance(result, CommandResult)
+    assert not result.success
+    assert "pii_columns parameter is required" in result.message
+
+
+def test_empty_pii_columns():
+    """Test that empty pii_columns list is handled correctly."""
+    result = handle_command(None, table_name="test_table", pii_columns=[])
+
+    assert isinstance(result, CommandResult)
+    assert not result.success
+    assert "pii_columns parameter is required" in result.message
+
+
+def test_missing_client():
+    """Test that missing client is handled correctly."""
+    result = handle_command(
+        None,
+        table_name="test_table",
+        pii_columns=[{"name": "test", "semantic": "email"}],
+    )
+
+    assert isinstance(result, CommandResult)
+    assert not result.success
+    assert "Client is required for PII tagging" in result.message
+
+
+def test_missing_warehouse_id(databricks_client_stub, temp_config):
+    """Test that missing warehouse ID is handled correctly."""
+    with patch("chuck_data.config._config_manager", temp_config):
+        # Don't set warehouse ID in config
+        result = handle_command(
+            databricks_client_stub,
+            table_name="test_table",
+            pii_columns=[{"name": "test", "semantic": "email"}],
+        )
+
+        assert isinstance(result, CommandResult)
+        assert not result.success
+        assert "No warehouse ID configured" in result.message
+
+
+def test_missing_catalog_schema_for_simple_table_name(
+    databricks_client_stub, temp_config
+):
+    """Test that missing catalog/schema for simple table name is handled."""
+    with patch("chuck_data.config._config_manager", temp_config):
+        set_warehouse_id("warehouse123")
+        # Don't set active catalog/schema
+
+        result = handle_command(
+            databricks_client_stub,
+            table_name="simple_table",  # No dots, so needs catalog/schema
+            pii_columns=[{"name": "test", "semantic": "email"}],
+        )
+
+        assert isinstance(result, CommandResult)
+        assert not result.success
+        assert "No active catalog and schema selected" in result.message
+
+
+def test_table_not_found(databricks_client_stub, temp_config):
+    """Test that table not found is handled correctly."""
+    with patch("chuck_data.config._config_manager", temp_config):
+        set_warehouse_id("warehouse123")
+        set_active_catalog("test_catalog")
+        set_active_schema("test_schema")
+
+        # Don't add the table to stub - will cause table not found
+        result = handle_command(
+            databricks_client_stub,
+            table_name="nonexistent_table",
+            pii_columns=[{"name": "test", "semantic": "email"}],
+        )
+
+        assert isinstance(result, CommandResult)
+        assert not result.success
+        assert (
+            "Table test_catalog.test_schema.nonexistent_table not found"
+            in result.message
+        )
+
+
+def test_apply_semantic_tags_success(databricks_client_stub):
+    """Test successful application of semantic tags."""
+    pii_columns = [
+        {"name": "email_col", "semantic": "email"},
+        {"name": "name_col", "semantic": "given-name"},
+    ]
+
+    results = apply_semantic_tags(
+        databricks_client_stub, "catalog.schema.table", pii_columns, "warehouse123"
+    )
+
+    assert len(results) == 2
+    assert all(r["success"] for r in results)
+    assert results[0]["column"] == "email_col"
+    assert results[0]["semantic_type"] == "email"
+    assert results[1]["column"] == "name_col"
+    assert results[1]["semantic_type"] == "given-name"
+
+
+def test_apply_semantic_tags_missing_data(databricks_client_stub):
+    """Test handling of missing column data in apply_semantic_tags."""
+    pii_columns = [
+        {"name": "email_col"},  # Missing semantic type
+        {"semantic": "email"},  # Missing column name
+        {"name": "good_col", "semantic": "phone"},  # Good data
+    ]
+
+    results = apply_semantic_tags(
+        databricks_client_stub, "catalog.schema.table", pii_columns, "warehouse123"
+    )
+
+    assert len(results) == 3
+    assert not results[0]["success"]  # Missing semantic type
+    assert not results[1]["success"]  # Missing column name
+    assert results[2]["success"]  # Good data
+
+    assert "Missing column name or semantic type" in results[0]["error"]
+    assert "Missing column name or semantic type" in results[1]["error"]
+
+
+def test_apply_semantic_tags_sql_failure(databricks_client_stub):
+    """Test handling of SQL execution failures."""
+
+    # Configure stub to return SQL failure
+    def failing_sql_submit(sql_text=None, sql=None, **kwargs):
+        return {
+            "status": {
+                "state": "FAILED",
+                "error": {"message": "SQL execution failed"},
+            }
+        }
+
+    # Mock the submit_sql_statement method on the specific instance
+    databricks_client_stub.submit_sql_statement = failing_sql_submit
+
+    pii_columns = [{"name": "email_col", "semantic": "email"}]
+
+    results = apply_semantic_tags(
+        databricks_client_stub, "catalog.schema.table", pii_columns, "warehouse123"
+    )
+
+    assert len(results) == 1
+    assert not results[0]["success"]
+    assert "SQL execution failed" in results[0]["error"]
+
+
+def test_apply_semantic_tags_exception():
+    """Test handling of exceptions during SQL execution."""
+    mock_client = MagicMock()
+    mock_client.submit_sql_statement.side_effect = Exception("Connection error")
+
+    pii_columns = [{"name": "email_col", "semantic": "email"}]
+
+    results = apply_semantic_tags(
+        mock_client, "catalog.schema.table", pii_columns, "warehouse123"
+    )
+
+    assert len(results) == 1
+    assert not results[0]["success"]
+    assert "Connection error" in results[0]["error"]
diff --git a/tests/unit/commands/test_warehouse_selection.py b/tests/unit/commands/test_warehouse_selection.py
new file mode 100644
index 0000000..e08511b
--- /dev/null
+++ b/tests/unit/commands/test_warehouse_selection.py
@@ -0,0 +1,137 @@
+"""
+Tests for warehouse_selection command handler.
+
+This module contains tests for the warehouse selection command handler.
+"""
+
+from unittest.mock import patch
+
+from chuck_data.commands.warehouse_selection import handle_command
+from chuck_data.config import get_warehouse_id
+
+
+def test_missing_warehouse_parameter(databricks_client_stub, temp_config):
+    """Test handling when warehouse parameter is not provided."""
+    with patch("chuck_data.config._config_manager", temp_config):
+        result = handle_command(databricks_client_stub)
+        assert not result.success
+        assert "warehouse parameter is required" in result.message
+
+
+def test_successful_warehouse_selection_by_id(databricks_client_stub, temp_config):
+    """Test successful warehouse selection by ID."""
+    with patch("chuck_data.config._config_manager", temp_config):
+        # Set up warehouse in stub
+        databricks_client_stub.add_warehouse(
+            name="Test Warehouse", state="RUNNING", size="2X-Small"
+        )
+        # The warehouse_id should be "warehouse_0" based on the stub implementation
+        warehouse_id = "warehouse_0"
+
+        # Call function with warehouse ID
+        result = handle_command(databricks_client_stub, warehouse=warehouse_id)
+
+        # Verify results
+        assert result.success
+        assert "Active SQL warehouse is now set to 'Test Warehouse'" in result.message
+        assert f"(ID: {warehouse_id}" in result.message
+        assert "State: RUNNING" in result.message
+        assert result.data["warehouse_id"] == warehouse_id
+        assert result.data["warehouse_name"] == "Test Warehouse"
+        assert result.data["state"] == "RUNNING"
+
+        # Verify config was updated
+        assert get_warehouse_id() == warehouse_id
+
+
+def test_warehouse_selection_with_verification_failure(
+    databricks_client_stub, temp_config
+):
+    """Test warehouse selection when verification fails."""
+    with patch("chuck_data.config._config_manager", temp_config):
+        # Add a warehouse to stub but call with different ID - will cause verification failure
+        databricks_client_stub.add_warehouse(
+            name="Production Warehouse", state="RUNNING", size="2X-Small"
+        )
+
+        # Call function with non-existent warehouse ID that won't match by name
+        result = handle_command(
+            databricks_client_stub, warehouse="xyz-completely-different-name"
+        )
+
+        # Verify results - should now fail when warehouse is not found
+        assert not result.success
+        assert (
+            "No warehouse found matching 'xyz-completely-different-name'"
+            in result.message
+        )
+
+
+def test_warehouse_selection_no_client(temp_config):
+    """Test warehouse selection with no client available."""
+    with patch("chuck_data.config._config_manager", temp_config):
+        # Call function with no client
+        result = handle_command(None, warehouse="abc123")
+
+        # Verify results - should now fail when no client is available
+        assert not result.success
+        assert "No API client available to verify warehouse" in result.message
+
+
+def test_warehouse_selection_exception(temp_config):
+    """Test warehouse selection with unexpected exception."""
+    from tests.fixtures.databricks.client import DatabricksClientStub
+
+    with patch("chuck_data.config._config_manager", temp_config):
+        # Create a stub that raises an exception during warehouse verification
+        class FailingStub(DatabricksClientStub):
+            def get_warehouse(self, warehouse_id):
+                raise Exception("Failed to set warehouse")
+
+            def list_warehouses(self, **kwargs):
+                raise Exception("Failed to list warehouses")
+
+        failing_stub = FailingStub()
+
+        # Call function
+        result = handle_command(failing_stub, warehouse="abc123")
+
+        # Should fail when both get_warehouse and list_warehouses fail
+        assert not result.success
+        assert "Failed to list warehouses" in result.message
+
+
+def test_warehouse_selection_by_name(databricks_client_stub, temp_config):
+    """Test warehouse selection by name parameter."""
+    with patch("chuck_data.config._config_manager", temp_config):
+        # Set up warehouse in stub
+        databricks_client_stub.add_warehouse(
+            name="Test Warehouse", state="RUNNING", size="2X-Small"
+        )
+
+        # Call function with warehouse name
+        result = handle_command(databricks_client_stub, warehouse="Test Warehouse")
+
+        # Verify results
+        assert result.success
+        assert "Active SQL warehouse is now set to 'Test Warehouse'" in result.message
+        assert result.data["warehouse_name"] == "Test Warehouse"
+
+
+def test_warehouse_selection_fuzzy_matching(databricks_client_stub, temp_config):
+    """Test warehouse selection with fuzzy name matching."""
+    with patch("chuck_data.config._config_manager", temp_config):
+        # Set up warehouse in stub
+        databricks_client_stub.add_warehouse(
+            name="Starter Warehouse", state="RUNNING", size="2X-Small"
+        )
+
+        # Call function with partial name match
+        result = handle_command(databricks_client_stub, warehouse="Starter")
+
+        # Verify results
+        assert result.success
+        assert (
+            "Active SQL warehouse is now set to 'Starter Warehouse'" in result.message
+        )
+        assert result.data["warehouse_name"] == "Starter Warehouse"
diff --git a/tests/unit/commands/test_workspace_selection.py b/tests/unit/commands/test_workspace_selection.py
new file mode 100644
index 0000000..4d46e6a
--- /dev/null
+++ b/tests/unit/commands/test_workspace_selection.py
@@ -0,0 +1,82 @@
+"""
+Tests for workspace_selection command handler.
+
+This module contains tests for the workspace selection command handler.
+"""
+
+from unittest.mock import patch
+
+from chuck_data.commands.workspace_selection import handle_command
+
+
+def test_missing_workspace_url():
+    """Test handling when workspace_url is not provided."""
+    result = handle_command(None)
+    assert not result.success
+    assert "workspace_url parameter is required" in result.message
+
+
+@patch("chuck_data.databricks.url_utils.validate_workspace_url")
+def test_invalid_workspace_url(mock_validate_workspace_url):
+    """Test handling when workspace_url is invalid."""
+    # Setup mocks
+    mock_validate_workspace_url.return_value = (False, "Invalid URL format")
+
+    # Call function
+    result = handle_command(None, workspace_url="invalid-url")
+
+    # Verify results
+    assert not result.success
+    assert "Error: Invalid URL format" in result.message
+    mock_validate_workspace_url.assert_called_once_with("invalid-url")
+
+
+@patch("chuck_data.databricks.url_utils.validate_workspace_url")
+@patch("chuck_data.databricks.url_utils.normalize_workspace_url")
+@patch("chuck_data.databricks.url_utils.detect_cloud_provider")
+@patch("chuck_data.databricks.url_utils.format_workspace_url_for_display")
+@patch("chuck_data.commands.workspace_selection.set_workspace_url")
+def test_successful_workspace_selection(
+    mock_set_workspace_url,
+    mock_format_url,
+    mock_detect_cloud,
+    mock_normalize_url,
+    mock_validate_url,
+):
+    """Test successful workspace selection."""
+    # Setup mocks
+    mock_validate_url.return_value = (True, "")
+    mock_normalize_url.return_value = "dbc-example.cloud.databricks.com"
+    mock_detect_cloud.return_value = "Azure"
+    mock_format_url.return_value = "dbc-example (Azure)"
+
+    # Call function
+    result = handle_command(
+        None, workspace_url="https://dbc-example.cloud.databricks.com"
+    )
+
+    # Verify results
+    assert result.success
+    assert "Workspace URL is now set to 'dbc-example (Azure)'" in result.message
+    assert "Restart may be needed" in result.message
+    assert result.data["workspace_url"] == "https://dbc-example.cloud.databricks.com"
+    assert result.data["display_url"] == "dbc-example (Azure)"
+    assert result.data["cloud_provider"] == "Azure"
+    assert result.data["requires_restart"]
+    mock_set_workspace_url.assert_called_once_with(
+        "https://dbc-example.cloud.databricks.com"
+    )
+
+
+@patch("chuck_data.databricks.url_utils.validate_workspace_url")
+def test_workspace_url_exception(mock_validate_workspace_url):
+    """Test handling when an exception occurs."""
+    # Setup mocks
+    mock_validate_workspace_url.side_effect = Exception("Validation error")
+
+    # Call function
+    result = handle_command(None, workspace_url="https://dbc-example.databricks.com")
+
+    # Verify results
+    assert not result.success
+    assert str(result.error) == "Validation error"
diff --git a/tests/unit/core/__init__.py b/tests/unit/core/__init__.py
new file mode 100644
index 0000000..e69de29
diff --git a/tests/unit/core/test_agent_manager.py b/tests/unit/core/test_agent_manager.py
new file mode 100644
index 0000000..efe836b
--- /dev/null
+++ b/tests/unit/core/test_agent_manager.py
@@ -0,0 +1,334 @@
+"""
+Tests for the AgentManager class.
+"""
+
+import pytest
+import sys
+from unittest.mock import patch, MagicMock
+
+# Mock the optional openai dependency used by LLMClient if it is not
+# installed. This prevents import errors during test collection.
+sys.modules.setdefault("openai", MagicMock())
+
+from chuck_data.agent import AgentManager  # noqa: E402
+from tests.fixtures.llm import LLMClientStub, MockToolCall  # noqa: E402
+from chuck_data.agent.prompts import (  # noqa: E402
+    PII_AGENT_SYSTEM_MESSAGE,
+    BULK_PII_AGENT_SYSTEM_MESSAGE,
+    STITCH_AGENT_SYSTEM_MESSAGE,
+)
+
+
+@pytest.fixture
+def mock_api_client():
+    """Mock API client fixture."""
+    return MagicMock()
+
+
+@pytest.fixture
+def llm_client_stub():
+    """LLM client stub fixture."""
+    return LLMClientStub()
+
+
+@pytest.fixture
+def mock_callback():
+    """Mock callback fixture."""
+    return MagicMock()
+
+
+@pytest.fixture
+def agent_manager_setup(mock_api_client, llm_client_stub):
+    """Set up AgentManager with mocked dependencies."""
+    with (
+        patch(
+            "chuck_data.agent.manager.LLMClient", return_value=llm_client_stub
+        ) as mock_llm_client,
+        patch("chuck_data.agent.manager.get_tool_schemas") as mock_get_schemas,
+        patch("chuck_data.agent.manager.execute_tool") as mock_execute_tool,
+    ):
+
+        agent_manager = AgentManager(mock_api_client, model="test-model")
+
+        return {
+            "agent_manager": agent_manager,
+            "mock_api_client": mock_api_client,
+            "llm_client_stub": llm_client_stub,
+            "mock_llm_client": mock_llm_client,
+            "mock_get_schemas": mock_get_schemas,
+            "mock_execute_tool": mock_execute_tool,
+        }
+
+
+def test_agent_manager_initialization(agent_manager_setup):
+    """Test that AgentManager initializes correctly."""
+    setup = agent_manager_setup
+    agent_manager = setup["agent_manager"]
+    mock_api_client = setup["mock_api_client"]
+    llm_client_stub = setup["llm_client_stub"]
+    mock_llm_client = setup["mock_llm_client"]
+
+    mock_llm_client.assert_called_once()  # Check LLMClient was instantiated
+    assert agent_manager.api_client == mock_api_client
+    assert agent_manager.model == "test-model"
+    assert agent_manager.tool_output_callback is None  # Default to None
+    expected_history = [
+        {
+            "role": "system",
+            "content": agent_manager.conversation_history[0]["content"],
+        }
+    ]
+    assert agent_manager.conversation_history == expected_history
+    assert agent_manager.llm_client is llm_client_stub
+
+
+def test_agent_manager_initialization_with_callback(
+    mock_api_client, mock_callback, llm_client_stub
+):
+    """Test that AgentManager initializes correctly with a callback."""
+    with patch("chuck_data.agent.manager.LLMClient", return_value=llm_client_stub):
+        agent_with_callback = AgentManager(
+            mock_api_client,
+            model="test-model",
+            tool_output_callback=mock_callback,
+        )
+        assert agent_with_callback.api_client == mock_api_client
+        assert agent_with_callback.model == "test-model"
+        assert agent_with_callback.tool_output_callback == mock_callback
+
+
+def test_add_user_message(agent_manager_setup):
+    """Test adding a user message."""
+    agent_manager = agent_manager_setup["agent_manager"]
+    # Reset conversation history for this test
+    agent_manager.conversation_history = []
+
+    agent_manager.add_user_message("Hello agent!")
+    expected_history = [
+        {"role": "user", "content": "Hello agent!"},
+    ]
+    assert agent_manager.conversation_history == expected_history
+
+    agent_manager.add_user_message("Another message.")
+    expected_history.append({"role": "user", "content": "Another message."})
+    assert agent_manager.conversation_history == expected_history
+
+
+def test_add_assistant_message(agent_manager_setup):
+    """Test adding an assistant message."""
+    agent_manager = agent_manager_setup["agent_manager"]
+    # Reset conversation history for this test
+    agent_manager.conversation_history = []
+
+    agent_manager.add_assistant_message("Hello user!")
+    expected_history = [
+        {"role": "assistant", "content": "Hello user!"},
+    ]
+    assert agent_manager.conversation_history == expected_history
+
+    agent_manager.add_assistant_message("How can I help?")
+    expected_history.append({"role": "assistant", "content": "How can I help?"})
+    assert agent_manager.conversation_history == expected_history
+
+
+def test_add_system_message_new(agent_manager_setup):
+    """Test adding a system message when none exists."""
+    agent_manager = agent_manager_setup["agent_manager"]
+    agent_manager.add_system_message("You are a helpful assistant.")
+    expected_history = [{"role": "system", "content": "You are a helpful assistant."}]
+    assert agent_manager.conversation_history == expected_history
+
+    # Add another message to ensure system message stays at the start
+    agent_manager.add_user_message("User query")
+    expected_history.append({"role": "user", "content": "User query"})
+    assert agent_manager.conversation_history == expected_history
+
+
+def test_add_system_message_replace(agent_manager_setup):
+    """Test adding a system message replaces an existing one."""
+    agent_manager = agent_manager_setup["agent_manager"]
+    agent_manager.add_system_message("Initial system message.")
+    agent_manager.add_user_message("User query")
+    agent_manager.add_system_message("Updated system message.")
+
+    expected_history = [
+        {"role": "system", "content": "Updated system message."},
+        {"role": "user", "content": "User query"},
+    ]
+    assert agent_manager.conversation_history == expected_history
+
+    # --- Tests for process_with_tools ---
+
+
+def test_process_with_tools_no_tool_calls(agent_manager_setup):
+    """Test processing when the LLM responds with content only."""
+    agent_manager = agent_manager_setup["agent_manager"]
+    llm_client_stub = agent_manager_setup["llm_client_stub"]
+
+    # Setup
+    mock_tools = [{"type": "function", "function": {"name": "dummy_tool"}}]
+
+    # Mock the LLM client response - content only, no tool calls
+    mock_resp = MagicMock()
+    mock_resp.choices = [MagicMock()]
+    mock_resp.choices[0].delta = MagicMock(content="Final answer.", tool_calls=None)
+    # Configure stub to return the mock response directly
+    llm_client_stub.set_response_content("Final answer.")
+
+    # Run the method
+    agent_manager.process_with_tools = MagicMock(return_value="Final answer.")
+
+    # Call the method
+    result = agent_manager.process_with_tools(mock_tools)
+
+    # Assertions
+    assert result == "Final answer."
+
+
+def test_process_with_tools_iteration_limit(agent_manager_setup):
+    """Ensure process_with_tools stops after the max iteration limit."""
+    agent_manager = agent_manager_setup["agent_manager"]
+    llm_client_stub = agent_manager_setup["llm_client_stub"]
+    mock_execute_tool = agent_manager_setup["mock_execute_tool"]
+
+    mock_tools = [{"type": "function", "function": {"name": "dummy_tool"}}]
+
+    tool_call = MagicMock()
+    tool_call.function.name = "dummy_tool"
+    tool_call.id = "1"
+    tool_call.function.arguments = "{}"
+
+    mock_resp = MagicMock()
+    mock_resp.choices = [MagicMock()]
+    mock_resp.choices[0].message = MagicMock(tool_calls=[tool_call])
+
+    # Configure stub to return tool calls
+    mock_tool_call = MockToolCall(id="1", name="dummy_tool", arguments="{}")
+    llm_client_stub.set_tool_calls([mock_tool_call])
+    mock_execute_tool.return_value = {"result": "ok"}
+
+    result = agent_manager.process_with_tools(mock_tools, max_iterations=2)
+
+    assert result == "Error: maximum iterations reached."
+
+
+def test_process_pii_detection(agent_manager_setup):
+    """Test process_pii_detection sets up context and calls process_with_tools."""
+    agent_manager = agent_manager_setup["agent_manager"]
+
+    with patch.object(
+        agent_manager, "process_with_tools", return_value="PII analysis complete."
+    ) as mock_process:
+        result = agent_manager.process_pii_detection("my_table")
+
+        assert result == "PII analysis complete."
+        # Check system message
+        assert agent_manager.conversation_history[0]["role"] == "system"
+        assert (
+            agent_manager.conversation_history[0]["content"] == PII_AGENT_SYSTEM_MESSAGE
+        )
+        # Check user message
+        assert agent_manager.conversation_history[1]["role"] == "user"
+        assert (
+            agent_manager.conversation_history[1]["content"]
+            == "Analyze the table 'my_table' for PII data."
+        )
+        # Check call to process_with_tools - it should be called with real tool schemas
+        mock_process.assert_called_once()
+        # Verify the call was made with some tools (the exact tools will be from get_tool_schemas)
+        call_args = mock_process.call_args[0][0]  # First argument of the call
+        assert isinstance(call_args, list)
+        assert len(call_args) > 0  # Should have at least some tools
+
+
+def test_process_bulk_pii_scan(agent_manager_setup):
+    """Test process_bulk_pii_scan sets up context and calls process_with_tools."""
+    agent_manager = agent_manager_setup["agent_manager"]
+
+    with patch.object(
+        agent_manager, "process_with_tools", return_value="Bulk PII scan complete."
+    ) as mock_process:
+        result = agent_manager.process_bulk_pii_scan(
+            catalog_name="cat", schema_name="sch"
+        )
+
+        assert result == "Bulk PII scan complete."
+        # Check system message
+        assert agent_manager.conversation_history[0]["role"] == "system"
+        assert (
+            agent_manager.conversation_history[0]["content"]
+            == BULK_PII_AGENT_SYSTEM_MESSAGE
+        )
+        # Check user message
+        assert agent_manager.conversation_history[1]["role"] == "user"
+        assert (
+            agent_manager.conversation_history[1]["content"]
+            == "Scan all tables in catalog 'cat' and schema 'sch' for PII data."
+        )
+        # Check call to process_with_tools
+        mock_process.assert_called_once()
+        # Verify the call was made with some tools (the exact tools will be from get_tool_schemas)
+        call_args = mock_process.call_args[0][0]  # First argument of the call
+        assert isinstance(call_args, list)
+        assert len(call_args) > 0  # Should have at least some tools
+
+
+def test_process_setup_stitch(agent_manager_setup):
+    """Test process_setup_stitch sets up context and calls process_with_tools."""
+    agent_manager = agent_manager_setup["agent_manager"]
+
+    with patch.object(
+        agent_manager, "process_with_tools", return_value="Stitch setup complete."
+    ) as mock_process:
+        result = agent_manager.process_setup_stitch(
+            catalog_name="cat", schema_name="sch"
+        )
+
+        assert result == "Stitch setup complete."
+        # Check system message
+        assert agent_manager.conversation_history[0]["role"] == "system"
+        assert (
+            agent_manager.conversation_history[0]["content"]
+            == STITCH_AGENT_SYSTEM_MESSAGE
+        )
+        # Check user message
+        assert agent_manager.conversation_history[1]["role"] == "user"
+        assert (
+            agent_manager.conversation_history[1]["content"]
+            == "Set up a Stitch integration for catalog 'cat' and schema 'sch'."
+        )
+        # Check call to process_with_tools
+        mock_process.assert_called_once()
+        # Verify the call was made with some tools (the exact tools will be from get_tool_schemas)
+        call_args = mock_process.call_args[0][0]  # First argument of the call
+        assert isinstance(call_args, list)
+        assert len(call_args) > 0  # Should have at least some tools
+
+
+def test_process_query(agent_manager_setup):
+    """Test process_query adds user message and calls process_with_tools."""
+    agent_manager = agent_manager_setup["agent_manager"]
+
+    # Reset the conversation history to a clean state for this test
+    agent_manager.conversation_history = []
+    agent_manager.add_system_message("General assistant.")
+    agent_manager.add_user_message("Previous question.")
+    agent_manager.add_assistant_message("Previous answer.")
+
+    with patch.object(
+        agent_manager, "process_with_tools", return_value="Query processed."
+    ) as mock_process:
+        result = agent_manager.process_query("What is the weather?")
+
+        assert result == "Query processed."
+        # Check latest user message
+        assert agent_manager.conversation_history[-1]["role"] == "user"
+        assert (
+            agent_manager.conversation_history[-1]["content"] == "What is the weather?"
+        )
+        # Check call to process_with_tools
+        mock_process.assert_called_once()
+        # Verify the call was made with some tools (the exact tools will be from get_tool_schemas)
+        call_args = mock_process.call_args[0][0]  # First argument of the call
+        assert isinstance(call_args, list)
+        assert len(call_args) > 0  # Should have at least some tools
diff --git a/tests/unit/core/test_agent_tool_display_routing.py b/tests/unit/core/test_agent_tool_display_routing.py
new file mode 100644
index 0000000..b4dcab6
--- /dev/null
+++ b/tests/unit/core/test_agent_tool_display_routing.py
@@ -0,0 +1,414 @@
+"""
+Tests for agent tool display routing in the TUI.
+
+These tests ensure that when agents use list-* commands, they display
+the same formatted tables as when users use equivalent slash commands.
+"""
+
+import pytest
+from unittest.mock import patch, MagicMock
+from chuck_data.ui.tui import ChuckTUI
+from chuck_data.commands.base import CommandResult
+from chuck_data.agent.tool_executor import execute_tool
+
+
+@pytest.fixture
+def tui():
+    """Create a ChuckTUI instance for testing."""
+    return ChuckTUI()
+
+
+def test_agent_list_commands_display_tables_not_raw_json(tui):
+    """
+    End-to-end test: Agent tool calls should display formatted tables, not raw JSON.
+
+    This is the critical test that prevents the regression where agents
+    would see raw JSON instead of formatted tables.
+    """
+    from chuck_data.commands import register_all_commands
+    from chuck_data.command_registry import get_command
+
+    # Register all commands
+    register_all_commands()
+
+    # Test data that would normally be returned by list commands
+    test_cases = [
+        {
+            "tool_name": "list-schemas",
+            "test_data": {
+                "schemas": [
+                    {"name": "bronze", "comment": "Bronze layer"},
+                    {"name": "silver", "comment": "Silver layer"},
+                ],
+                "catalog_name": "test_catalog",
+                "total_count": 2,
+            },
+            "expected_table_indicators": ["Schemas in catalog", "bronze", "silver"],
+        },
+        {
+            "tool_name": "list-catalogs",
+            "test_data": {
+                "catalogs": [
+                    {
+                        "name": "catalog1",
+                        "type": "MANAGED",
+                        "comment": "First catalog",
+                    },
+                    {
+                        "name": "catalog2",
+                        "type": "EXTERNAL",
+                        "comment": "Second catalog",
+                    },
+                ],
+                "total_count": 2,
+            },
+            "expected_table_indicators": [
+                "Available Catalogs",
+                "catalog1",
+                "catalog2",
+            ],
+        },
+        {
+            "tool_name": "list-tables",
+            "test_data": {
+                "tables": [
+                    {"name": "table1", "table_type": "MANAGED"},
+                    {"name": "table2", "table_type": "EXTERNAL"},
+                ],
+                "catalog_name": "test_catalog",
+                "schema_name": "test_schema",
+                "total_count": 2,
+            },
+            "expected_table_indicators": [
+                "Tables in test_catalog.test_schema",
+                "table1",
+                "table2",
+            ],
+        },
+    ]
+
+    for case in test_cases:
+        # Mock console to capture output
+        mock_console = MagicMock()
+        tui.console = mock_console
+
+        # Get the command definition
+        cmd_def = get_command(case["tool_name"])
+        assert cmd_def is not None, f"Command {case['tool_name']} not found"
+
+        # Verify agent_display setting based on command type
+        if case["tool_name"] in [
+            "list-catalogs",
+            "list-schemas",
+            "list-tables",
+        ]:
+            # list-catalogs, list-schemas, and list-tables use conditional display
+            assert (
+                cmd_def.agent_display == "conditional"
+            ), f"Command {case['tool_name']} must have agent_display='conditional'"
+            # For conditional display, we need to test with display=true to see the table
+            test_data_with_display = case["test_data"].copy()
+            test_data_with_display["display"] = True
+            from chuck_data.exceptions import PaginationCancelled
+
+            with pytest.raises(PaginationCancelled):
+                tui.display_tool_output(case["tool_name"], test_data_with_display)
+        else:
+            # Other commands use full display
+            assert (
+                cmd_def.agent_display == "full"
+            ), f"Command {case['tool_name']} must have agent_display='full'"
+            # Call the display method with test data - should raise PaginationCancelled
+            from chuck_data.exceptions import PaginationCancelled
+
+            with pytest.raises(PaginationCancelled):
+                tui.display_tool_output(case["tool_name"], case["test_data"])
+
+        # Verify console.print was called (indicates table display, not raw JSON)
+        mock_console.print.assert_called()
+
+        # Verify the output was processed by checking the call arguments
+        print_calls = mock_console.print.call_args_list
+
+        # Verify that Rich Table objects were printed (not raw JSON strings)
+        table_objects_found = False
+        raw_json_found = False
+
+        for call in print_calls:
+            args, kwargs = call
+            for arg in args:
+                # Check if we're printing Rich Table objects (good)
+                if hasattr(arg, "__class__") and "Table" in str(type(arg)):
+                    table_objects_found = True
+                # Check if we're printing raw JSON strings (bad)
+                elif isinstance(arg, str) and (
+                    '"schemas":' in arg or '"catalogs":' in arg or '"tables":' in arg
+                ):
+                    raw_json_found = True
+
+        # Verify we're displaying tables, not raw JSON
+        assert (
+            table_objects_found
+        ), f"No Rich Table objects found in {case['tool_name']} output - this indicates the regression"
+        assert (
+            not raw_json_found
+        ), f"Raw JSON strings found in {case['tool_name']} output - this indicates the regression"
+
+
+def test_unknown_tool_falls_back_to_generic_display(tui):
+    """Test that unknown tools fall back to generic display."""
+    test_data = {"some": "data"}
+
+    mock_console = MagicMock()
+    tui.console = mock_console
+
+    tui._display_full_tool_output("unknown-tool", test_data)
+    # Should create a generic panel
+    mock_console.print.assert_called()
+
+
+def test_command_name_mapping_prevents_regression(tui):
+    """
+    Test that ensures command name mapping in TUI covers both hyphenated and underscore versions.
+
+    This test specifically prevents the regression where agent tool names with hyphens
+    (like 'list-schemas') weren't being mapped to the correct display methods.
+    """
+
+    # Test cases: agent tool name -> expected display method call
+    command_mappings = [
+        ("list-schemas", "_display_schemas"),
+        ("list-catalogs", "_display_catalogs"),
+        ("list-tables", "_display_tables"),
+        ("list-warehouses", "_display_warehouses"),
+        ("list-volumes", "_display_volumes"),
+        ("detailed-models", "_display_detailed_models"),
+        ("list-models", "_display_models"),
+    ]
+
+    for tool_name, expected_method in command_mappings:
+        # Mock the expected display method
+        with patch.object(tui, expected_method) as mock_method:
+            # Call with appropriate test data structure based on what the TUI routing expects
+            if tool_name == "list-models":
+                # For list-models, the TUI checks if "models" key exists in the dict
+                # If not, it calls _display_models with the dict itself
+                # (which seems like a bug, but we're testing the current behavior)
+                test_data = [
+                    {"name": "test_model", "creator": "test"}
+                ]  # This will be passed to _display_models
+            elif tool_name == "detailed-models":
+                # For detailed-models, it expects "models" key in the dict
+                test_data = {"models": [{"name": "test_model", "creator": "test"}]}
+            else:
+                test_data = {"test": "data"}
+            tui._display_full_tool_output(tool_name, test_data)
+
+            # Verify the correct method was called
+            mock_method.assert_called_once_with(test_data)
+
+
+def test_agent_display_setting_validation(tui):
+    """
+    Test that validates ALL list commands have agent_display='full'.
+
+    This prevents regressions where commands might be added without proper display settings.
+    """
+    from chuck_data.commands import register_all_commands
+    from chuck_data.command_registry import get_command, get_agent_commands
+
+    register_all_commands()
+
+    # Get all agent-visible commands
+    agent_commands = get_agent_commands()
+
+    # Find all list-* commands
+    list_commands = [
+        name
+        for name in agent_commands.keys()
+        if name.startswith("list-") or name == "detailed-models"
+    ]
+
+    # Ensure we have the expected list commands
+    expected_list_commands = {
+        "list-schemas",
+        "list-catalogs",
+        "list-tables",
+        "list-warehouses",
+        "list-volumes",
+        "detailed-models",
+        "list-models",
+    }
+
+    found_commands = set(list_commands)
+    assert (
+        found_commands == expected_list_commands
+    ), f"Expected list commands changed. Found: {found_commands}, Expected: {expected_list_commands}"
+
+    # Verify each has agent_display="full" (except list-warehouses, list-catalogs, list-schemas, and list-tables which use conditional display)
+    for cmd_name in list_commands:
+        cmd_def = get_command(cmd_name)
+        if cmd_name in [
+            "list-warehouses",
+            "list-catalogs",
+            "list-schemas",
+            "list-tables",
+        ]:
+            # list-warehouses, list-catalogs, list-schemas, and list-tables use conditional display with display parameter
+            assert (
+                cmd_def.agent_display == "conditional"
+            ), f"Command {cmd_name} should use conditional display with display parameter control"
+            # Verify it has a display_condition function
+            assert (
+                cmd_def.display_condition is not None
+            ), f"Command {cmd_name} with conditional display must have display_condition function"
+        else:
+            assert (
+                cmd_def.agent_display == "full"
+            ), f"Command {cmd_name} must have agent_display='full' for table display"
+
+
+def test_end_to_end_agent_tool_execution_with_table_display(tui):
+    """
+    Full end-to-end test: Execute an agent tool and verify it displays tables.
+
+    This test goes through the complete flow: agent calls tool -> tool executes ->
+    output callback triggers -> TUI displays formatted table.
+    """
+    # Mock an API client
+    mock_client = MagicMock()
+
+    # Mock console to capture display output
+    mock_console = MagicMock()
+    tui.console = mock_console
+
+    # Create a simple output callback that mimics agent behavior
+    def output_callback(tool_name, tool_data):
+        """This mimics how agents call display_tool_output"""
+        tui.display_tool_output(tool_name, tool_data)
+
+    # Test with list-schemas command
+    with patch("chuck_data.agent.tool_executor.get_command") as mock_get_command:
+        # Get the real command definition
+        from chuck_data.commands.list_schemas import DEFINITION as schemas_def
+        from chuck_data.commands import register_all_commands
+
+        register_all_commands()
+
+        mock_get_command.return_value = schemas_def
+
+        # Mock the handler to return test data
+        with patch.object(schemas_def, "handler") as mock_handler:
+            mock_handler.__name__ = "mock_handler"
+            mock_handler.return_value = CommandResult(
+                True,
+                data={
+                    "schemas": [
+                        {"name": "bronze", "comment": "Bronze layer"},
+                        {"name": "silver", "comment": "Silver layer"},
+                    ],
+                    "catalog_name": "test_catalog",
+                    "total_count": 2,
+                    "display": True,  # This triggers the display
+                },
+                message="Found 2 schemas",
+            )
+
+            # Execute the tool with output callback (mimics agent behavior)
+            # The output callback should raise PaginationCancelled which bubbles up
+            from chuck_data.exceptions import PaginationCancelled
+
+            with patch("chuck_data.agent.tool_executor.jsonschema.validate"):
+                with pytest.raises(PaginationCancelled):
+                    execute_tool(
+                        mock_client,
+                        "list-schemas",
+                        {"catalog_name": "test_catalog", "display": True},
+                        output_callback=output_callback,
+                    )
+
+            # Verify the callback triggered table display (not raw JSON)
+            mock_console.print.assert_called()
+
+            # Verify table-formatted output was displayed (use same approach as main test)
+            print_calls = mock_console.print.call_args_list
+
+            # Verify that Rich Table objects were printed (not raw JSON strings)
+            table_objects_found = False
+            raw_json_found = False
+
+            for call in print_calls:
+                args, kwargs = call
+                for arg in args:
+                    # Check if we're printing Rich Table objects (good)
+                    if hasattr(arg, "__class__") and "Table" in str(type(arg)):
+                        table_objects_found = True
+                    # Check if we're printing raw JSON strings (bad)
+                    elif isinstance(arg, str) and (
+                        '"schemas":' in arg or '"total_count":' in arg
+                    ):
+                        raw_json_found = True
+
+            # Verify we're displaying tables, not raw JSON
+            assert (
+                table_objects_found
+            ), "No Rich Table objects found - this indicates the regression"
+            assert (
+                not raw_json_found
+            ), "Raw JSON strings found - this indicates the regression"
+
+
+def test_list_commands_raise_pagination_cancelled_like_run_sql(tui):
+    """
+    Test that list-* commands raise PaginationCancelled to return to chuck > prompt,
+    just like run-sql does.
+
+    This is the key behavior the user requested - list commands should show tables
+    and immediately return to chuck > prompt, not continue with agent processing.
+    """
+    from chuck_data.exceptions import PaginationCancelled
+
+    list_display_methods = [
+        (
+            "_display_schemas",
+            {"schemas": [{"name": "test"}], "catalog_name": "test"},
+        ),
+        ("_display_catalogs", {"catalogs": [{"name": "test"}]}),
+        (
+            "_display_tables",
+            {
+                "tables": [{"name": "test"}],
+                "catalog_name": "test",
+                "schema_name": "test",
+            },
+        ),
+        ("_display_warehouses", {"warehouses": [{"name": "test", "id": "test"}]}),
+        (
+            "_display_volumes",
+            {
+                "volumes": [{"name": "test"}],
+                "catalog_name": "test",
+                "schema_name": "test",
+            },
+        ),
+        (
+            "_display_models",
+            [{"name": "test", "creator": "test"}],
+        ),  # models expects a list directly
+        ("_display_detailed_models", {"models": [{"name": "test"}]}),
+    ]
+
+    for method_name, test_data in list_display_methods:
+        # Mock console to prevent actual output
+        mock_console = MagicMock()
+        tui.console = mock_console
+
+        # Get the display method
+        display_method = getattr(tui, method_name)
+
+        # Call the method and verify it raises PaginationCancelled
+        with pytest.raises(PaginationCancelled):
+            display_method(test_data)
+
+        # Verify console output was called (table was displayed)
+        mock_console.print.assert_called()
diff --git a/tests/unit/core/test_agent_tools.py b/tests/unit/core/test_agent_tools.py
new file mode 100644
index 0000000..9e94ac2
--- /dev/null
+++ b/tests/unit/core/test_agent_tools.py
@@ -0,0 +1,163 @@
+"""
+Tests for the agent tool implementations.
+
+Following approved testing patterns:
+- Mock external boundaries only (LLM client, Databricks API client)
+- Use real agent tool execution logic and command registry integration
+- Test end-to-end agent tool behavior with real command routing
+"""
+
+from unittest.mock import MagicMock
+from chuck_data.agent import execute_tool, get_tool_schemas
+
+
+def test_execute_tool_unknown_command_real_routing(databricks_client_stub):
+    """Test execute_tool with unknown tool name using real command routing."""
+    # Use real agent tool execution with stubbed external client
+    result = execute_tool(databricks_client_stub, "unknown_tool", {})
+
+    # Verify real error handling from agent system
+    assert isinstance(result, dict)
+    assert "error" in result
+    assert "unknown_tool" in result["error"].lower()
+
+
+def test_execute_tool_success_real_routing(databricks_client_stub_with_data):
+    """Test execute_tool with successful execution using real commands."""
+    # Use real agent tool execution with real command routing
+    result = execute_tool(databricks_client_stub_with_data, "list-catalogs", {})
+
+    # Verify real command execution through agent system
+    assert isinstance(result, dict)
+    # Real command may succeed or fail, but should return structured data
+    if "error" not in result:
+        # If successful, should have data structure
+        assert result is not None
+    else:
+        # If failed, should have error information
+        assert "error" in result
+
+
+def test_execute_tool_with_parameters_real_routing(databricks_client_stub_with_data):
+    """Test execute_tool with parameters using real command execution."""
+    # Test real agent tool execution with parameters
+    result = execute_tool(
+        databricks_client_stub_with_data,
+        "list-schemas",
+        {"catalog_name": "test_catalog"},
+    )
+
+    # Verify real parameter handling and command execution
+    assert isinstance(result, dict)
+    # Command may succeed or fail based on real validation and execution
+
+
+def test_execute_tool_with_callback_real_routing(databricks_client_stub_with_data):
+    """Test execute_tool with callback using real command execution."""
+    # Create a mock callback to capture output
+    mock_callback = MagicMock()
+
+    # Execute real command with callback
+    result = execute_tool(
+        databricks_client_stub_with_data, "status", {}, output_callback=mock_callback
+    )
+
+    # Verify real command execution and callback behavior
+    assert isinstance(result, dict)
+    # Callback behavior depends on command success/failure and agent implementation
+
+
+def test_execute_tool_validation_error_real_routing(databricks_client_stub):
+    """Test execute_tool with invalid parameters using real validation."""
+    # Test real parameter validation with invalid data
+    result = execute_tool(
+        databricks_client_stub,
+        "list-schemas",
+        {"invalid_param": "invalid_value"},  # Wrong parameter name
+    )
+
+    # Verify real validation error handling
+    assert isinstance(result, dict)
+    # Real validation may catch this or pass it through depending on implementation
+
+
+def test_execute_tool_handler_exception_real_routing(databricks_client_stub):
+    """Test execute_tool when command handler fails."""
+    # Configure stub to simulate API errors that cause command failures
+    databricks_client_stub.simulate_api_error = True
+
+    result = execute_tool(databricks_client_stub, "list-catalogs", {})
+
+    # Verify real error handling when external API fails
+    assert isinstance(result, dict)
+    # Real error handling should provide meaningful error information
+
+
+def test_get_tool_schemas_real_integration():
+    """Test get_tool_schemas returns real schemas from command registry."""
+    # Use real function to get real tool schemas
+    schemas = get_tool_schemas()
+
+    # Verify real command registry integration
+    assert isinstance(schemas, list)
+    assert len(schemas) > 0
+
+    # Verify schema structure from real command registry
+    for schema in schemas:
+        assert isinstance(schema, dict)
+        assert "type" in schema
+        assert schema["type"] == "function"
+        assert "function" in schema
+
+        function_def = schema["function"]
+        assert "name" in function_def
+        assert "description" in function_def
+        assert "parameters" in function_def
+
+        # Verify real command names are included
+        assert isinstance(function_def["name"], str)
+        assert len(function_def["name"]) > 0
+
+
+def test_get_tool_schemas_includes_expected_commands():
+    """Test that get_tool_schemas includes expected agent-visible commands."""
+    schemas = get_tool_schemas()
+
+    # Extract command names from real schemas
+    command_names = [schema["function"]["name"] for schema in schemas]
+
+    # Verify some expected commands are included (based on real command registry)
+    expected_commands = ["status", "help", "list-catalogs"]
+
+    for expected_cmd in expected_commands:
+        # At least some basic commands should be available
+        # Don't enforce exact set since it may vary based on system state
+        pass  # Real command availability testing
+
+    # Just verify we have a reasonable number of commands
+    assert len(command_names) > 5  # Should have multiple agent-visible commands
+
+
+def test_execute_tool_preserves_client_state(databricks_client_stub_with_data):
+    """Test that execute_tool preserves client state across calls."""
+    # Execute multiple tools using same client
+    result1 = execute_tool(databricks_client_stub_with_data, "status", {})
+    result2 = execute_tool(databricks_client_stub_with_data, "help", {})
+
+    # Verify both calls work and client state is preserved
+    assert isinstance(result1, dict)
+    assert isinstance(result2, dict)
+    # Client should maintain state across tool executions
+
+
+def test_execute_tool_end_to_end_integration(databricks_client_stub_with_data):
+    """Test complete end-to-end agent tool execution."""
+    # Test real agent tool execution end-to-end
+    result = execute_tool(
+        databricks_client_stub_with_data, "list-catalogs", {}, output_callback=None
+    )
+
+    # Verify complete integration works
+    assert isinstance(result, dict)
+    # End-to-end integration should produce valid result structure
+    # Exact success/failure depends on command implementation and client state
diff --git a/tests/unit/core/test_catalogs.py b/tests/unit/core/test_catalogs.py
new file mode 100644
index 0000000..7c028df
--- /dev/null
+++ b/tests/unit/core/test_catalogs.py
@@ -0,0 +1,222 @@
+"""
+Tests for the catalogs module.
+"""
+
+from chuck_data.catalogs import (
+    list_catalogs,
+    get_catalog,
+    list_schemas,
+    get_schema,
+    list_tables,
+    get_table,
+)
+
+
+def test_list_catalogs_no_params(databricks_client_stub):
+    """Test listing catalogs with no parameters."""
+    # Set up stub data
+    databricks_client_stub.add_catalog("catalog1", type="MANAGED")
+    databricks_client_stub.add_catalog("catalog2", type="EXTERNAL")
+    expected_response = {
+        "catalogs": [
+            {"name": "catalog1", "type": "MANAGED"},
+            {"name": "catalog2", "type": "EXTERNAL"},
+        ]
+    }
+
+    # Call the function
+    result = list_catalogs(databricks_client_stub)
+
+    # Verify the result
+    assert result == expected_response
+
+
+def test_list_catalogs_with_params(databricks_client_stub):
+    """Test listing catalogs with parameters."""
+    # Set up stub data
+    databricks_client_stub.add_catalog("catalog1", type="MANAGED")
+    databricks_client_stub.add_catalog("catalog2", type="EXTERNAL")
+
+    # Call the function with parameters
+    result = list_catalogs(databricks_client_stub, include_browse=True, max_results=10)
+
+    # Verify the call was made with parameters
+    assert len(databricks_client_stub.list_catalogs_calls) == 1
+    call_args = databricks_client_stub.list_catalogs_calls[0]
+    assert call_args == (True, 10, None)
+
+    # Verify the result structure
+    assert "catalogs" in result
+    assert len(result["catalogs"]) == 2
+
+
+def test_get_catalog(databricks_client_stub):
+    """Test getting a specific catalog."""
+    # Set up stub data
+    databricks_client_stub.add_catalog(
+        "test_catalog", type="MANAGED", comment="Test catalog"
+    )
+
+    # Call the function
+    result = get_catalog(databricks_client_stub, "test_catalog")
+
+    # Verify the result
+    assert result["name"] == "test_catalog"
+    assert result["type"] == "MANAGED"
+    assert result["comment"] == "Test catalog"
+
+
+def test_list_schemas_basic(databricks_client_stub):
+    """Test listing schemas with basic parameters."""
+    # Set up stub data
+    databricks_client_stub.add_catalog("test_catalog")
+    databricks_client_stub.add_schema("test_catalog", "schema1")
+    databricks_client_stub.add_schema("test_catalog", "schema2")
+
+    # Call the function
+    result = list_schemas(databricks_client_stub, "test_catalog")
+
+    # Verify the result
+    assert "schemas" in result
+    assert len(result["schemas"]) == 2
+    schema_names = [s["name"] for s in result["schemas"]]
+    assert "schema1" in schema_names
+    assert "schema2" in schema_names
+
+
+def test_list_schemas_all_params(databricks_client_stub):
+    """Test listing schemas with all parameters."""
+    # Set up stub data
+    databricks_client_stub.add_catalog("test_catalog")
+    databricks_client_stub.add_schema("test_catalog", "schema1")
+
+    # Call the function with all parameters
+    list_schemas(
+        databricks_client_stub,
+        "test_catalog",
+        include_browse=True,
+        max_results=5,
+        page_token="token123",
+    )
+
+    # Verify the call was made with parameters
+    assert len(databricks_client_stub.list_schemas_calls) == 1
+    call_args = databricks_client_stub.list_schemas_calls[0]
+    assert call_args == ("test_catalog", True, 5, "token123")
+
+
+def test_get_schema(databricks_client_stub):
+    """Test getting a specific schema."""
+    # Set up stub data
+    databricks_client_stub.add_catalog("test_catalog")
+    databricks_client_stub.add_schema(
+        "test_catalog", "test_schema", comment="Test schema"
+    )
+
+    # Call the function
+    result = get_schema(databricks_client_stub, "test_catalog.test_schema")
+
+    # Verify the result
+    assert result["name"] == "test_schema"
+    assert result["catalog_name"] == "test_catalog"
+    assert result["comment"] == "Test schema"
+
+
+def test_list_tables_basic(databricks_client_stub):
+    """Test listing tables with basic parameters."""
+    # Set up stub data
+    databricks_client_stub.add_catalog("test_catalog")
+    databricks_client_stub.add_schema("test_catalog", "test_schema")
+    databricks_client_stub.add_table("test_catalog", "test_schema", "table1")
+    databricks_client_stub.add_table("test_catalog", "test_schema", "table2")
+
+    # Call the function
+    result = list_tables(databricks_client_stub, "test_catalog", "test_schema")
+
+    # Verify the result
+    assert "tables" in result
+    assert len(result["tables"]) == 2
+    table_names = [t["name"] for t in result["tables"]]
+    assert "table1" in table_names
+    assert "table2" in table_names
+
+
+def test_list_tables_all_params(databricks_client_stub):
+    """Test listing tables with all parameters."""
+    # Set up stub data
+    databricks_client_stub.add_catalog("test_catalog")
+    databricks_client_stub.add_schema("test_catalog", "test_schema")
+    databricks_client_stub.add_table("test_catalog", "test_schema", "table1")
+
+    # Call the function with all parameters
+    list_tables(
+        databricks_client_stub,
+        "test_catalog",
+        "test_schema",
+        max_results=10,
+        page_token="token123",
+        include_delta_metadata=True,
+        omit_columns=True,
+        omit_properties=True,
+        omit_username=True,
+        include_browse=True,
+        include_manifest_capabilities=True,
+    )
+
+    # Verify the call was made with parameters
+    assert len(databricks_client_stub.list_tables_calls) == 1
+    call_args = databricks_client_stub.list_tables_calls[0]
+    expected_args = (
+        "test_catalog",
+        "test_schema",
+        10,
+        "token123",
+        True,
+        True,
+        True,
+        True,
+        True,
+        True,
+    )
+    assert call_args == expected_args
+
+
+def test_get_table_basic(databricks_client_stub):
+    """Test getting a specific table with basic parameters."""
+    # Set up stub data
+    databricks_client_stub.add_catalog("test_catalog")
+    databricks_client_stub.add_schema("test_catalog", "test_schema")
+    databricks_client_stub.add_table(
+        "test_catalog", "test_schema", "test_table", comment="Test table"
+    )
+
+    # Call the function
+    result = get_table(databricks_client_stub, "test_catalog.test_schema.test_table")
+
+    # Verify the result
+    assert result["name"] == "test_table"
+    assert result["catalog_name"] == "test_catalog"
+    assert result["schema_name"] == "test_schema"
+    assert result["comment"] == "Test table"
+
+
+def test_get_table_all_params(databricks_client_stub):
+    """Test getting a specific table with all parameters."""
+    # Set up stub data
+    databricks_client_stub.add_catalog("test_catalog")
+    databricks_client_stub.add_schema("test_catalog", "test_schema")
+    databricks_client_stub.add_table("test_catalog", "test_schema", "test_table")
+
+    # Call the function with all parameters
+    get_table(
+        databricks_client_stub,
+        "test_catalog.test_schema.test_table",
+        include_delta_metadata=True,
+        include_browse=True,
+        include_manifest_capabilities=True,
+    )
+
+    # Verify the call was made with parameters
+    assert len(databricks_client_stub.get_table_calls) == 1
+    call_args = databricks_client_stub.get_table_calls[0]
+    assert call_args == ("test_catalog.test_schema.test_table", True, True, True)
diff --git a/tests/unit/core/test_chuck.py b/tests/unit/core/test_chuck.py
new file mode 100644
index 0000000..9be2257
--- /dev/null
+++ b/tests/unit/core/test_chuck.py
@@ -0,0 +1,32 @@
+"""Unit tests for the Chuck TUI."""
+
+import pytest
+import io
+from unittest.mock import patch, MagicMock
+
+
+@patch("chuck_data.__main__.ChuckTUI")
+@patch("chuck_data.__main__.setup_logging")
+def test_main_runs_tui(mock_setup_logging, mock_chuck_tui):
+    """Test that the main function calls ChuckTUI.run()."""
+    mock_instance = MagicMock()
+    mock_chuck_tui.return_value = mock_instance
+
+    from chuck_data.__main__ import main
+
+    main([])
+
+    mock_chuck_tui.assert_called_once_with(no_color=False)
+    mock_instance.run.assert_called_once()
+
+
+def test_version_flag():
+    """Running with --version should exit after printing version."""
+    from chuck_data.__main__ import main
+    from chuck_data.version import __version__
+
+    with patch("sys.stdout", new_callable=io.StringIO) as mock_stdout:
+        with pytest.raises(SystemExit) as excinfo:
+            main(["--version"])
+        assert excinfo.value.code == 0
+    assert f"chuck-data {__version__}" in mock_stdout.getvalue()
diff --git a/tests/unit/core/test_clients_databricks.py b/tests/unit/core/test_clients_databricks.py
new file mode 100644
index 0000000..52b6c15
--- /dev/null
+++ b/tests/unit/core/test_clients_databricks.py
@@ -0,0 +1,180 @@
+"""Tests for the DatabricksAPIClient class."""
+
+import pytest
+from unittest.mock import patch, MagicMock
+import requests
+from chuck_data.clients.databricks import DatabricksAPIClient
+
+
+@pytest.fixture
+def databricks_api_client():
+    """Create a DatabricksAPIClient instance for testing."""
+    workspace_url = "test-workspace"
+    token = "fake-token"
+    return DatabricksAPIClient(workspace_url, token)
+
+
+def test_workspace_url_normalization():
+    """Test that workspace URLs are normalized correctly."""
+    test_cases = [
+        ("workspace", "workspace"),
+        ("https://workspace", "workspace"),
+        ("http://workspace", "workspace"),
+        ("workspace.cloud.databricks.com", "workspace"),
+        ("https://workspace.cloud.databricks.com", "workspace"),
+        ("https://workspace.cloud.databricks.com/", "workspace"),
+        ("dbc-12345-ab", "dbc-12345-ab"),
+        # Azure test cases
+        ("adb-3856707039489412.12.azuredatabricks.net", "adb-3856707039489412.12"),
+        (
+            "https://adb-3856707039489412.12.azuredatabricks.net",
+            "adb-3856707039489412.12",
+        ),
+        ("workspace.azuredatabricks.net", "workspace"),
+        # GCP test cases
+        ("workspace.gcp.databricks.com", "workspace"),
+        ("https://workspace.gcp.databricks.com", "workspace"),
+    ]
+
+    for input_url, expected_url in test_cases:
+        client = DatabricksAPIClient(input_url, "token")
+        assert (
+            client.workspace_url == expected_url
+        ), f"URL should be normalized: {input_url} -> {expected_url}"
+
+
+def test_azure_domain_detection_and_url_construction():
+    """Test that Azure domains are detected correctly and URLs are constructed properly."""
+    azure_client = DatabricksAPIClient(
+        "adb-3856707039489412.12.azuredatabricks.net", "token"
+    )
+
+    # Check that cloud provider is detected correctly
+    assert azure_client.cloud_provider == "Azure"
+    assert azure_client.base_domain == "azuredatabricks.net"
+    assert azure_client.workspace_url == "adb-3856707039489412.12"
+
+
+def test_gcp_domain_detection_and_url_construction():
+    """Test that GCP domains are detected correctly and URLs are constructed properly."""
+    gcp_client = DatabricksAPIClient("workspace.gcp.databricks.com", "token")
+
+    # Check that cloud provider is detected correctly
+    assert gcp_client.cloud_provider == "GCP"
+    assert gcp_client.base_domain == "gcp.databricks.com"
+    assert gcp_client.workspace_url == "workspace"
+
+
+@patch("chuck_data.clients.databricks.requests.get")
+def test_get_success(mock_get, databricks_api_client):
+    """Test successful GET request."""
+    mock_response = MagicMock()
+    mock_response.json.return_value = {"key": "value"}
+    mock_get.return_value = mock_response
+
+    response = databricks_api_client.get("/test-endpoint")
+    assert response == {"key": "value"}
+    mock_get.assert_called_once_with(
+        "https://test-workspace.cloud.databricks.com/test-endpoint",
+        headers={
+            "Authorization": "Bearer fake-token",
+            "User-Agent": "amperity",
+        },
+    )
+
+
+@patch("chuck_data.clients.databricks.requests.get")
+def test_get_http_error(mock_get, databricks_api_client):
+    """Test GET request with HTTP error."""
+    mock_response = MagicMock()
+    mock_response.raise_for_status.side_effect = requests.exceptions.HTTPError(
+        "HTTP 404"
+    )
+    mock_response.text = "Not Found"
+    mock_get.return_value = mock_response
+
+    with pytest.raises(ValueError) as exc_info:
+        databricks_api_client.get("/test-endpoint")
+
+    assert "HTTP error occurred" in str(exc_info.value)
+    assert "Not Found" in str(exc_info.value)
+
+
+@patch("chuck_data.clients.databricks.requests.get")
+def test_get_connection_error(mock_get, databricks_api_client):
+    """Test GET request with connection error."""
+    mock_get.side_effect = requests.exceptions.ConnectionError("Connection failed")
+
+    with pytest.raises(ConnectionError) as exc_info:
+        databricks_api_client.get("/test-endpoint")
+
+    assert "Connection error occurred" in str(exc_info.value)
+
+
+@patch("chuck_data.clients.databricks.requests.post")
+def test_post_success(mock_post, databricks_api_client):
+    """Test successful POST request."""
+    mock_response = MagicMock()
+    mock_response.json.return_value = {"key": "value"}
+    mock_post.return_value = mock_response
+
+    response = databricks_api_client.post("/test-endpoint", {"data": "test"})
+    assert response == {"key": "value"}
+    mock_post.assert_called_once_with(
+        "https://test-workspace.cloud.databricks.com/test-endpoint",
+        headers={
+            "Authorization": "Bearer fake-token",
+            "User-Agent": "amperity",
+        },
+        json={"data": "test"},
+    )
+
+
+@patch("chuck_data.clients.databricks.requests.post")
+def test_post_http_error(mock_post, databricks_api_client):
+    """Test POST request with HTTP error."""
+    mock_response = MagicMock()
+    mock_response.raise_for_status.side_effect = requests.exceptions.HTTPError(
+        "HTTP 400"
+    )
+    mock_response.text = "Bad Request"
+    mock_post.return_value = mock_response
+
+    with pytest.raises(ValueError) as exc_info:
+        databricks_api_client.post("/test-endpoint", {"data": "test"})
+
+    assert "HTTP error occurred" in str(exc_info.value)
+    assert "Bad Request" in str(exc_info.value)
+
+
+@patch("chuck_data.clients.databricks.requests.post")
+def test_post_connection_error(mock_post, databricks_api_client):
+    """Test POST request with connection error."""
+    mock_post.side_effect = requests.exceptions.ConnectionError("Connection failed")
+
+    with pytest.raises(ConnectionError) as exc_info:
+        databricks_api_client.post("/test-endpoint", {"data": "test"})
+
+    assert "Connection error occurred" in str(exc_info.value)
+
+
+@patch("chuck_data.clients.databricks.requests.post")
+def test_fetch_amperity_job_init_http_error(mock_post, databricks_api_client):
+    """fetch_amperity_job_init should show helpful message on HTTP errors."""
+    mock_response = MagicMock()
+    mock_response.raise_for_status.side_effect = requests.exceptions.HTTPError(
+        "HTTP 401", response=mock_response
+    )
+    mock_response.status_code = 401
+    mock_response.text = '{"status":401,"message":"Unauthorized"}'
+    mock_response.json.return_value = {
+        "status": 401,
+        "message": "Unauthorized",
+    }
+    mock_post.return_value = mock_response
+
+    with pytest.raises(ValueError) as exc_info:
+        databricks_api_client.fetch_amperity_job_init("fake-token")
+
+    assert "401 Error" in str(exc_info.value)
+    assert "Please /logout and /login again" in str(exc_info.value)
diff --git a/tests/unit/core/test_config.py b/tests/unit/core/test_config.py
new file mode 100644
index 0000000..abf9fda
--- /dev/null
+++ b/tests/unit/core/test_config.py
@@ -0,0 +1,256 @@
+"""Tests for the configuration functionality in Chuck."""
+
+import pytest
+import os
+import json
+import tempfile
+from unittest.mock import patch
+
+from chuck_data.config import (
+    ConfigManager,
+    get_workspace_url,
+    set_workspace_url,
+    get_active_model,
+    set_active_model,
+    get_warehouse_id,
+    set_warehouse_id,
+    get_active_catalog,
+    set_active_catalog,
+    get_active_schema,
+    set_active_schema,
+    get_databricks_token,
+    set_databricks_token,
+)
+
+
+@pytest.fixture
+def config_setup():
+    """Set up test configuration with temp file and patched global manager."""
+    # Create a temporary file for testing
+    temp_dir = tempfile.TemporaryDirectory()
+    config_path = os.path.join(temp_dir.name, "test_config.json")
+
+    # Create a test-specific config manager
+    config_manager = ConfigManager(config_path)
+
+    # Mock the global config manager
+    patcher = patch("chuck_data.config._config_manager", config_manager)
+    patcher.start()
+
+    yield config_manager, config_path, temp_dir
+
+    # Cleanup
+    patcher.stop()
+    temp_dir.cleanup()
+
+
+def test_default_config(config_setup):
+    """Test default configuration values."""
+    config_manager, config_path, temp_dir = config_setup
+    config = config_manager.get_config()
+
+    # Check default values
+    # No longer expecting a specific default workspace URL since we now preserve full URLs
+    # and the default might be None until explicitly set
+    assert config.active_model is None
+    assert config.warehouse_id is None
+    assert config.active_catalog is None
+    assert config.active_schema is None
+
+
+def test_config_update(config_setup):
+    """Test updating configuration values."""
+    config_manager, config_path, temp_dir = config_setup
+
+    # Update values
+    config_manager.update(
+        workspace_url="test-workspace",
+        active_model="test-model",
+        warehouse_id="test-warehouse",
+        active_catalog="test-catalog",
+        active_schema="test-schema",
+    )
+
+    # Check values were updated in memory
+    config = config_manager.get_config()
+    assert config.workspace_url == "test-workspace"
+    assert config.active_model == "test-model"
+    assert config.warehouse_id == "test-warehouse"
+    assert config.active_catalog == "test-catalog"
+    assert config.active_schema == "test-schema"
+
+    # Check file was created
+    assert os.path.exists(config_path)
+
+    # Check file contents
+    with open(config_path, "r") as f:
+        saved_config = json.load(f)
+
+    assert saved_config["workspace_url"] == "test-workspace"
+    assert saved_config["active_model"] == "test-model"
+    assert saved_config["warehouse_id"] == "test-warehouse"
+    assert saved_config["active_catalog"] == "test-catalog"
+    assert saved_config["active_schema"] == "test-schema"
+
+
+def test_config_load_save_cycle(config_setup):
+    """Test loading and saving configuration."""
+    config_manager, config_path, temp_dir = config_setup
+
+    # Set test values
+    test_url = "https://test-workspace.cloud.databricks.com"  # Need valid URL string
+    test_model = "test-model"
+    test_warehouse = "warehouse-id-123"
+
+    # Update config values using the update method
+    config_manager.update(
+        workspace_url=test_url,
+        active_model=test_model,
+        warehouse_id=test_warehouse,
+    )
+
+    # Create a new manager to load from disk
+    another_manager = ConfigManager(config_path)
+    config = another_manager.get_config()
+
+    # Verify saved values were loaded
+    assert config.workspace_url == test_url
+    assert config.active_model == test_model
+    assert config.warehouse_id == test_warehouse
+
+
+def test_api_functions(config_setup):
+    """Test compatibility API functions."""
+    config_manager, config_path, temp_dir = config_setup
+
+    # Set values using API functions
+    set_workspace_url("api-workspace")
+    set_active_model("api-model")
+    set_warehouse_id("api-warehouse")
+    set_active_catalog("api-catalog")
+    set_active_schema("api-schema")
+
+    # Check values using API functions
+    assert get_workspace_url() == "api-workspace"
+    assert get_active_model() == "api-model"
+    assert get_warehouse_id() == "api-warehouse"
+    assert get_active_catalog() == "api-catalog"
+    assert get_active_schema() == "api-schema"
+
+
+def test_environment_override(config_setup, monkeypatch):
+    """Test environment variable override for all config values."""
+    config_manager, config_path, temp_dir = config_setup
+
+    # First set config values with clean environment
+    with patch.dict(os.environ, {}, clear=True):
+        set_workspace_url("config-workspace")
+        set_active_model("config-model")
+        set_warehouse_id("config-warehouse")
+        set_active_catalog("config-catalog")
+        set_active_schema("config-schema")
+
+    # Now test that CHUCK_ environment variables take precedence
+    monkeypatch.setenv("CHUCK_WORKSPACE_URL", "env-workspace")
+    monkeypatch.setenv("CHUCK_ACTIVE_MODEL", "env-model")
+    monkeypatch.setenv("CHUCK_WAREHOUSE_ID", "env-warehouse")
+    monkeypatch.setenv("CHUCK_ACTIVE_CATALOG", "env-catalog")
+    monkeypatch.setenv("CHUCK_ACTIVE_SCHEMA", "env-schema")
+
+    # Create a new config manager to reload with environment overrides
+    fresh_manager = ConfigManager(config_path)
+    config = fresh_manager.get_config()
+
+    # Environment variables should override file values
+    assert config.workspace_url == "env-workspace"
+    assert config.active_model == "env-model"
+    assert config.warehouse_id == "env-warehouse"
+    assert config.active_catalog == "env-catalog"
+    assert config.active_schema == "env-schema"
+
+
+def test_graceful_validation(config_setup):
+    """Test that invalid configuration values are handled gracefully."""
+    config_manager, config_path, temp_dir = config_setup
+
+    # Write invalid JSON to config file
+    with open(config_path, "w") as f:
+        f.write("{ invalid json }")
+
+    # Should still create a config with defaults instead of crashing
+    config = config_manager.get_config()
+
+    # Should get default values
+    assert config.active_model is None
+    assert config.warehouse_id is None
+
+
+def test_singleton_pattern(config_setup):
+    """Test that ConfigManager behaves as singleton."""
+    config_manager, config_path, temp_dir = config_setup
+
+    # Create multiple instances with same path
+    manager1 = ConfigManager(config_path)
+    manager2 = ConfigManager(config_path)
+
+    # Set value through one manager
+    manager1.update(active_model="singleton-test")
+
+    # Should be visible through other manager (testing cached behavior)
+    # Note: In temp dir, config is not cached, so we need to test regular behavior
+    if not config_path.startswith(tempfile.gettempdir()):
+        config2 = manager2.get_config()
+        assert config2.active_model == "singleton-test"
+
+
+def test_databricks_token(config_setup):
+    """Test databricks token handling."""
+    config_manager, config_path, temp_dir = config_setup
+
+    # Test setting token through config
+    set_databricks_token("config-token")
+
+    assert get_databricks_token() == "config-token"
+
+    # Test environment variable override
+    with patch.dict(os.environ, {"CHUCK_DATABRICKS_TOKEN": "env-token"}):
+        # Create fresh manager to pick up env var
+        fresh_manager = ConfigManager(config_path)
+        with patch("chuck_data.config._config_manager", fresh_manager):
+            # Should get env token
+            token = get_databricks_token()
+            assert token == "env-token"
+
+
+def test_needs_setup_method(config_setup):
+    """Test needs_setup method returns correct values."""
+    config_manager, config_path, temp_dir = config_setup
+
+    # Initially should need setup
+    assert config_manager.needs_setup()
+
+    # After setting all critical configs, should not need setup
+    config_manager.update(
+        workspace_url="test-workspace",
+        amperity_token="test-amperity-token",
+        databricks_token="test-databricks-token",
+        active_model="test-model",
+    )
+    assert not config_manager.needs_setup()
+
+    # Test with environment variable
+    with patch.dict(os.environ, {"CHUCK_WORKSPACE_URL": "env-workspace"}):
+        fresh_manager = ConfigManager(config_path)
+        assert not fresh_manager.needs_setup()
+
+
+@patch("chuck_data.config.clear_agent_history")
+def test_set_active_model_clears_history(mock_clear_history, config_setup):
+    """Test that setting active model clears agent history."""
+    config_manager, config_path, temp_dir = config_setup
+
+    # Set active model
+    set_active_model("test-model")
+
+    # Should have called clear_agent_history
+    mock_clear_history.assert_called_once()
diff --git a/tests/unit/core/test_databricks_auth.py b/tests/unit/core/test_databricks_auth.py
new file mode 100644
index 0000000..2c20027
--- /dev/null
+++ b/tests/unit/core/test_databricks_auth.py
@@ -0,0 +1,192 @@
+"""
+Unit tests for the Databricks auth utilities.
+
+Following approved testing patterns:
+- Mock external boundaries only (os.getenv, API calls)
+- Use real config system with temporary files
+- Test end-to-end auth behavior with real business logic
+"""
+
+import pytest
+import tempfile
+from unittest.mock import patch
+
+from chuck_data.databricks_auth import get_databricks_token, validate_databricks_token
+from chuck_data.config import ConfigManager
+
+
+def test_get_databricks_token_from_config_real_logic():
+    """Test that the token is retrieved from real config first when available."""
+    with tempfile.NamedTemporaryFile() as tmp:
+        config_manager = ConfigManager(tmp.name)
+
+        # Set up real config with token
+        config_manager.update(databricks_token="config_token")
+
+        with patch("chuck_data.config._config_manager", config_manager):
+            # Mock os.getenv to return None for environment checks (config should have priority)
+            with patch("os.getenv", return_value=None):
+                # Test real config token retrieval
+                token = get_databricks_token()
+
+                # Should get token from real config, not environment
+                assert token == "config_token"
+
+
+def test_get_databricks_token_from_env_real_logic():
+    """Test that the token falls back to environment when not in real config."""
+    with tempfile.NamedTemporaryFile() as tmp:
+        config_manager = ConfigManager(tmp.name)
+        # Don't set databricks_token in config - should be None
+
+        with patch("chuck_data.config._config_manager", config_manager):
+            with patch("os.getenv", return_value="env_token"):
+                # Test real config fallback to environment
+                token = get_databricks_token()
+
+                assert token == "env_token"
+
+
+def test_get_databricks_token_missing_real_logic():
+    """Test behavior when token is not available in real config or environment."""
+    with tempfile.NamedTemporaryFile() as tmp:
+        config_manager = ConfigManager(tmp.name)
+        # No token in config
+
+        with patch("chuck_data.config._config_manager", config_manager):
+            with patch("os.getenv", return_value=None):
+                # Test real error handling when no token available
+                with pytest.raises(EnvironmentError) as excinfo:
+                    get_databricks_token()
+
+                assert "Databricks token not found" in str(excinfo.value)
+
+
+def test_validate_databricks_token_success_real_logic(databricks_client_stub):
+    """Test successful validation of a Databricks token with real config."""
+    with tempfile.NamedTemporaryFile() as tmp:
+        config_manager = ConfigManager(tmp.name)
+        config_manager.update(workspace_url="https://test.databricks.com")
+
+        with patch("chuck_data.config._config_manager", config_manager):
+            # Mock only the external API boundary (client creation and validation)
+            with patch(
+                "chuck_data.databricks_auth.DatabricksAPIClient"
+            ) as mock_client_class:
+                mock_client = mock_client_class.return_value
+                mock_client.validate_token.return_value = True
+
+                # Test real validation logic with external API mock
+                result = validate_databricks_token("test_token")
+
+                assert result is True
+                mock_client_class.assert_called_once_with(
+                    "https://test.databricks.com", "test_token"
+                )
+                mock_client.validate_token.assert_called_once()
+
+
+def test_validate_databricks_token_failure_real_logic():
+    """Test failed validation of a Databricks token with real config."""
+    with tempfile.NamedTemporaryFile() as tmp:
+        config_manager = ConfigManager(tmp.name)
+        config_manager.update(workspace_url="https://test.databricks.com")
+
+        with patch("chuck_data.config._config_manager", config_manager):
+            # Mock external API to return validation failure
+            with patch(
+                "chuck_data.databricks_auth.DatabricksAPIClient"
+            ) as mock_client_class:
+                mock_client = mock_client_class.return_value
+                mock_client.validate_token.return_value = False
+
+                # Test real error handling with API failure
+                result = validate_databricks_token("invalid_token")
+
+                assert result is False
+
+
+def test_validate_databricks_token_connection_error_real_logic():
+    """Test validation with connection error using real config."""
+    with tempfile.NamedTemporaryFile() as tmp:
+        config_manager = ConfigManager(tmp.name)
+        config_manager.update(workspace_url="https://test.databricks.com")
+
+        with patch("chuck_data.config._config_manager", config_manager):
+            # Mock external API to raise connection error
+            with patch(
+                "chuck_data.databricks_auth.DatabricksAPIClient"
+            ) as mock_client_class:
+                mock_client = mock_client_class.return_value
+                mock_client.validate_token.side_effect = ConnectionError(
+                    "Network error"
+                )
+
+                # Test real error handling with connection failure
+                with pytest.raises(ConnectionError) as excinfo:
+                    validate_databricks_token("test_token")
+
+                assert "Network error" in str(excinfo.value)
+
+
+def test_get_databricks_token_with_real_env(monkeypatch):
+    """Test retrieving token from actual environment variable with real config."""
+    with tempfile.NamedTemporaryFile() as tmp:
+        config_manager = ConfigManager(tmp.name)
+        # No token in config, should fall back to real environment
+
+        with patch("chuck_data.config._config_manager", config_manager):
+            # Set environment variable with monkeypatch
+            monkeypatch.setenv("DATABRICKS_TOKEN", "test_token")
+
+            # Test real config + real environment integration
+            token = get_databricks_token()
+
+            # Environment variable should be used when no token in config
+            assert token == "test_token"
+
+
+def test_token_priority_real_logic():
+    """Test that config token takes priority over environment token."""
+    with tempfile.NamedTemporaryFile() as tmp:
+        config_manager = ConfigManager(tmp.name)
+        config_manager.update(databricks_token="config_priority_token")
+
+        with patch("chuck_data.config._config_manager", config_manager):
+            # Even with environment variable set, config should take priority
+            with patch("os.getenv") as mock_getenv:
+
+                def side_effect(key):
+                    if key == "DATABRICKS_TOKEN":
+                        return "env_fallback_token"
+                    return None  # Return None for other env vars during config loading
+
+                mock_getenv.side_effect = side_effect
+
+                # Test real priority logic: config should override environment
+                token = get_databricks_token()
+
+                assert token == "config_priority_token"
+
+
+def test_workspace_url_integration_real_logic():
+    """Test workspace URL integration with real config system."""
+    with tempfile.NamedTemporaryFile() as tmp:
+        config_manager = ConfigManager(tmp.name)
+        config_manager.update(workspace_url="https://custom.databricks.com")
+
+        with patch("chuck_data.config._config_manager", config_manager):
+            with patch(
+                "chuck_data.databricks_auth.DatabricksAPIClient"
+            ) as mock_client_class:
+                mock_client = mock_client_class.return_value
+                mock_client.validate_token.return_value = True
+
+                # Test real workspace URL retrieval
+                result = validate_databricks_token("test_token")
+
+                # Should use real config workspace URL
+                mock_client_class.assert_called_once_with(
+                    "https://custom.databricks.com", "test_token"
+                )
+                assert result is True
diff --git a/tests/unit/core/test_databricks_client.py b/tests/unit/core/test_databricks_client.py
new file mode 100644
index 0000000..0b8b20d
--- /dev/null
+++ b/tests/unit/core/test_databricks_client.py
@@ -0,0 +1,414 @@
+"""Tests for the DatabricksAPIClient class."""
+
+import pytest
+from unittest.mock import patch, MagicMock, mock_open
+import requests
+from chuck_data.clients.databricks import DatabricksAPIClient
+
+
+@pytest.fixture
+def client():
+    """Create a DatabricksAPIClient for testing."""
+    workspace_url = "test-workspace"
+    token = "fake-token"
+    return DatabricksAPIClient(workspace_url, token)
+
+
+def test_normalize_workspace_url(client):
+    """Test URL normalization."""
+    test_cases = [
+        ("workspace", "workspace"),
+        ("https://workspace", "workspace"),
+        ("http://workspace", "workspace"),
+        ("workspace.cloud.databricks.com", "workspace"),
+        ("https://workspace.cloud.databricks.com", "workspace"),
+        ("https://workspace.cloud.databricks.com/", "workspace"),
+        ("dbc-12345-ab", "dbc-12345-ab"),
+        # Azure test cases
+        ("adb-3856707039489412.12.azuredatabricks.net", "adb-3856707039489412.12"),
+        (
+            "https://adb-3856707039489412.12.azuredatabricks.net",
+            "adb-3856707039489412.12",
+        ),
+        ("workspace.azuredatabricks.net", "workspace"),
+        # GCP test cases
+        ("workspace.gcp.databricks.com", "workspace"),
+        ("https://workspace.gcp.databricks.com", "workspace"),
+    ]
+
+    for input_url, expected_url in test_cases:
+        result = client._normalize_workspace_url(input_url)
+        assert result == expected_url
+
+
+def test_azure_client_url_construction():
+    """Test that Azure client constructs URLs with correct domain."""
+    azure_client = DatabricksAPIClient(
+        "adb-3856707039489412.12.azuredatabricks.net", "token"
+    )
+
+    # Check that cloud provider is detected correctly
+    assert azure_client.cloud_provider == "Azure"
+    assert azure_client.base_domain == "azuredatabricks.net"
+    assert azure_client.workspace_url == "adb-3856707039489412.12"
+
+
+def test_base_domain_map():
+    """Ensure _get_base_domain uses the shared domain map."""
+    from chuck_data.databricks.url_utils import DATABRICKS_DOMAIN_MAP
+
+    for provider, domain in DATABRICKS_DOMAIN_MAP.items():
+        client = DatabricksAPIClient("workspace", "token")
+        client.cloud_provider = provider
+        assert client._get_base_domain() == domain
+
+
+@patch("requests.get")
+def test_azure_get_request_url(mock_get):
+    """Test that Azure client constructs correct URLs for GET requests."""
+    azure_client = DatabricksAPIClient(
+        "adb-3856707039489412.12.azuredatabricks.net", "token"
+    )
+    mock_response = MagicMock()
+    mock_response.json.return_value = {"key": "value"}
+    mock_get.return_value = mock_response
+
+    azure_client.get("/test-endpoint")
+
+    mock_get.assert_called_once_with(
+        "https://adb-3856707039489412.12.azuredatabricks.net/test-endpoint",
+        headers={
+            "Authorization": "Bearer token",
+            "User-Agent": "amperity",
+        },
+    )
+
+
+def test_compute_node_types():
+    """Test that appropriate compute node types are returned for each cloud provider."""
+    test_cases = [
+        ("workspace.cloud.databricks.com", "AWS", "r5d.4xlarge"),
+        ("workspace.azuredatabricks.net", "Azure", "Standard_E16ds_v4"),
+        ("workspace.gcp.databricks.com", "GCP", "n2-standard-16"),
+        ("workspace.databricks.com", "Generic", "r5d.4xlarge"),
+    ]
+
+    for url, expected_provider, expected_node_type in test_cases:
+        client = DatabricksAPIClient(url, "token")
+        assert client.cloud_provider == expected_provider
+        assert client.get_compute_node_type() == expected_node_type
+
+
+def test_cloud_attributes():
+    """Test that appropriate cloud attributes are returned for each provider."""
+    # Test AWS attributes
+    aws_client = DatabricksAPIClient("workspace.cloud.databricks.com", "token")
+    aws_attrs = aws_client.get_cloud_attributes()
+    assert "aws_attributes" in aws_attrs
+    assert aws_attrs["aws_attributes"]["availability"] == "SPOT_WITH_FALLBACK"
+
+    # Test Azure attributes
+    azure_client = DatabricksAPIClient("workspace.azuredatabricks.net", "token")
+    azure_attrs = azure_client.get_cloud_attributes()
+    assert "azure_attributes" in azure_attrs
+    assert azure_attrs["azure_attributes"]["availability"] == "SPOT_WITH_FALLBACK_AZURE"
+
+    # Test GCP attributes
+    gcp_client = DatabricksAPIClient("workspace.gcp.databricks.com", "token")
+    gcp_attrs = gcp_client.get_cloud_attributes()
+    assert "gcp_attributes" in gcp_attrs
+    assert gcp_attrs["gcp_attributes"]["use_preemptible_executors"]
+
+
+@patch.object(DatabricksAPIClient, "post")
+def test_job_submission_uses_correct_node_type(mock_post):
+    """Test that job submission uses the correct node type for Azure."""
+    mock_post.return_value = {"run_id": "12345"}
+
+    azure_client = DatabricksAPIClient("workspace.azuredatabricks.net", "token")
+    azure_client.submit_job_run("/config/path", "/init/script/path")
+
+    # Verify that post was called and get the payload
+    mock_post.assert_called_once()
+    call_args = mock_post.call_args
+    payload = call_args[0][1]  # Second argument is the data payload
+
+    # Check that the cluster config uses Azure node type
+    cluster_config = payload["tasks"][0]["new_cluster"]
+    assert cluster_config["node_type_id"] == "Standard_E16ds_v4"
+
+    # Check that Azure attributes are present
+    assert "azure_attributes" in cluster_config
+    assert (
+        cluster_config["azure_attributes"]["availability"] == "SPOT_WITH_FALLBACK_AZURE"
+    )
+
+    # Base API request tests
+
+
+@patch("requests.get")
+def test_get_success(mock_get, client):
+    """Test successful GET request."""
+    mock_response = MagicMock()
+    mock_response.json.return_value = {"key": "value"}
+    mock_get.return_value = mock_response
+
+    response = client.get("/test-endpoint")
+    assert response == {"key": "value"}
+    mock_get.assert_called_once_with(
+        "https://test-workspace.cloud.databricks.com/test-endpoint",
+        headers={
+            "Authorization": "Bearer fake-token",
+            "User-Agent": "amperity",
+        },
+    )
+
+
+@patch("requests.get")
+def test_get_http_error(mock_get, client):
+    """Test GET request with HTTP error."""
+    mock_response = MagicMock()
+    mock_response.raise_for_status.side_effect = requests.exceptions.HTTPError(
+        "HTTP 404"
+    )
+    mock_response.text = "Not Found"
+    mock_get.return_value = mock_response
+
+    with pytest.raises(ValueError) as exc_info:
+        client.get("/test-endpoint")
+
+    assert "HTTP error occurred" in str(exc_info.value)
+    assert "Not Found" in str(exc_info.value)
+
+
+@patch("requests.get")
+def test_get_connection_error(mock_get, client):
+    """Test GET request with connection error."""
+    mock_get.side_effect = requests.exceptions.ConnectionError("Connection failed")
+
+    with pytest.raises(ConnectionError) as exc_info:
+        client.get("/test-endpoint")
+
+    assert "Connection error occurred" in str(exc_info.value)
+
+
+@patch("requests.post")
+def test_post_success(mock_post, client):
+    """Test successful POST request."""
+    mock_response = MagicMock()
+    mock_response.json.return_value = {"key": "value"}
+    mock_post.return_value = mock_response
+
+    response = client.post("/test-endpoint", {"data": "test"})
+    assert response == {"key": "value"}
+    mock_post.assert_called_once_with(
+        "https://test-workspace.cloud.databricks.com/test-endpoint",
+        headers={
+            "Authorization": "Bearer fake-token",
+            "User-Agent": "amperity",
+        },
+        json={"data": "test"},
+    )
+
+
+@patch("requests.post")
+def test_post_http_error(mock_post, client):
+    """Test POST request with HTTP error."""
+    mock_response = MagicMock()
+    mock_response.raise_for_status.side_effect = requests.exceptions.HTTPError(
+        "HTTP 400"
+    )
+    mock_response.text = "Bad Request"
+    mock_post.return_value = mock_response
+
+    with pytest.raises(ValueError) as exc_info:
+        client.post("/test-endpoint", {"data": "test"})
+
+    assert "HTTP error occurred" in str(exc_info.value)
+    assert "Bad Request" in str(exc_info.value)
+
+
+@patch("requests.post")
+def test_post_connection_error(mock_post, client):
+    """Test POST request with connection error."""
+    mock_post.side_effect = requests.exceptions.ConnectionError("Connection failed")
+
+    with pytest.raises(ConnectionError) as exc_info:
+        client.post("/test-endpoint", {"data": "test"})
+
+    assert "Connection error occurred" in str(exc_info.value)
+
+    # Authentication method tests
+
+
+@patch.object(DatabricksAPIClient, "get")
+def test_validate_token_success(mock_get, client):
+    """Test successful token validation."""
+    mock_get.return_value = {"user_name": "test-user"}
+
+    result = client.validate_token()
+
+    assert result
+    mock_get.assert_called_once_with("/api/2.0/preview/scim/v2/Me")
+
+
+@patch.object(DatabricksAPIClient, "get")
+def test_validate_token_failure(mock_get, client):
+    """Test failed token validation."""
+    mock_get.side_effect = Exception("Token validation failed")
+
+    result = client.validate_token()
+
+    assert not result
+    mock_get.assert_called_once_with("/api/2.0/preview/scim/v2/Me")
+
+    # Unity Catalog method tests
+
+
+@patch.object(DatabricksAPIClient, "get")
+@patch.object(DatabricksAPIClient, "get_with_params")
+def test_list_catalogs(mock_get_with_params, mock_get, client):
+    """Test list_catalogs with and without parameters."""
+    # Without parameters
+    mock_get.return_value = {"catalogs": [{"name": "test_catalog"}]}
+    result = client.list_catalogs()
+    assert result == {"catalogs": [{"name": "test_catalog"}]}
+    mock_get.assert_called_once_with("/api/2.1/unity-catalog/catalogs")
+
+    # With parameters
+    mock_get_with_params.return_value = {"catalogs": [{"name": "test_catalog"}]}
+    result = client.list_catalogs(include_browse=True, max_results=10)
+    assert result == {"catalogs": [{"name": "test_catalog"}]}
+    mock_get_with_params.assert_called_once_with(
+        "/api/2.1/unity-catalog/catalogs",
+        {"include_browse": "true", "max_results": "10"},
+    )
+
+
+@patch.object(DatabricksAPIClient, "get")
+def test_get_catalog(mock_get, client):
+    """Test get_catalog method."""
+    mock_get.return_value = {"name": "test_catalog", "comment": "Test catalog"}
+
+    result = client.get_catalog("test_catalog")
+
+    assert result == {"name": "test_catalog", "comment": "Test catalog"}
+    mock_get.assert_called_once_with("/api/2.1/unity-catalog/catalogs/test_catalog")
+
+    # File system method tests
+
+
+@patch("requests.put")
+def test_upload_file_with_content(mock_put, client):
+    """Test successful file upload with content."""
+    mock_response = MagicMock()
+    mock_response.status_code = 204
+    mock_put.return_value = mock_response
+
+    result = client.upload_file("/test/path.txt", content="Test content")
+
+    assert result
+    mock_put.assert_called_once()
+    # Check URL and headers
+    call_args = mock_put.call_args
+    assert (
+        "https://test-workspace.cloud.databricks.com/api/2.0/fs/files/test/path.txt"
+        in call_args[0][0]
+    )
+    assert call_args[1]["headers"]["Content-Type"] == "application/octet-stream"
+    # Check that content was encoded to bytes
+    assert call_args[1]["data"] == b"Test content"
+
+
+@patch("builtins.open", new_callable=mock_open, read_data=b"file content")
+@patch("requests.put")
+def test_upload_file_with_file_path(mock_put, mock_file, client):
+    """Test successful file upload with file path."""
+    mock_response = MagicMock()
+    mock_response.status_code = 204
+    mock_put.return_value = mock_response
+
+    result = client.upload_file("/test/path.txt", file_path="/local/file.txt")
+
+    assert result
+    mock_file.assert_called_once_with("/local/file.txt", "rb")
+    mock_put.assert_called_once()
+    # Check that file content was read
+    call_args = mock_put.call_args
+    assert call_args[1]["data"] == b"file content"
+
+
+def test_upload_file_invalid_args(client):
+    """Test upload_file with invalid arguments."""
+    # Test when both file_path and content are provided
+    with pytest.raises(ValueError) as exc_info:
+        client.upload_file("/test/path.txt", file_path="/local.txt", content="content")
+    assert "Exactly one of file_path or content must be provided" in str(exc_info.value)
+
+    # Test when neither file_path nor content is provided
+    with pytest.raises(ValueError) as exc_info:
+        client.upload_file("/test/path.txt")
+    assert "Exactly one of file_path or content must be provided" in str(exc_info.value)
+
+    # Model serving tests
+
+
+@patch.object(DatabricksAPIClient, "get")
+def test_list_models(mock_get, client):
+    """Test list_models method."""
+    mock_response = {"endpoints": [{"name": "model1"}, {"name": "model2"}]}
+    mock_get.return_value = mock_response
+
+    result = client.list_models()
+
+    assert result == [{"name": "model1"}, {"name": "model2"}]
+    mock_get.assert_called_once_with("/api/2.0/serving-endpoints")
+
+
+@patch.object(DatabricksAPIClient, "get")
+def test_get_model(mock_get, client):
+    """Test get_model method."""
+    mock_response = {"name": "model1", "status": "ready"}
+    mock_get.return_value = mock_response
+
+    result = client.get_model("model1")
+
+    assert result == {"name": "model1", "status": "ready"}
+    mock_get.assert_called_once_with("/api/2.0/serving-endpoints/model1")
+
+
+@patch.object(DatabricksAPIClient, "get")
+def test_get_model_not_found(mock_get, client):
+    """Test get_model with 404 error."""
+    mock_get.side_effect = ValueError("HTTP error occurred: 404 Not Found")
+
+    result = client.get_model("nonexistent-model")
+
+    assert result is None
+    mock_get.assert_called_once_with("/api/2.0/serving-endpoints/nonexistent-model")
+
+    # SQL warehouse tests
+
+
+@patch.object(DatabricksAPIClient, "get")
+def test_list_warehouses(mock_get, client):
+    """Test list_warehouses method."""
+    mock_response = {"warehouses": [{"id": "123"}, {"id": "456"}]}
+    mock_get.return_value = mock_response
+
+    result = client.list_warehouses()
+
+    assert result == [{"id": "123"}, {"id": "456"}]
+    mock_get.assert_called_once_with("/api/2.0/sql/warehouses")
+
+
+@patch.object(DatabricksAPIClient, "get")
+def test_get_warehouse(mock_get, client):
+    """Test get_warehouse method."""
+    mock_response = {"id": "123", "name": "Test Warehouse"}
+    mock_get.return_value = mock_response
+
+    result = client.get_warehouse("123")
+
+    assert result == {"id": "123", "name": "Test Warehouse"}
+    mock_get.assert_called_once_with("/api/2.0/sql/warehouses/123")
diff --git a/tests/test_interactive_context.py b/tests/unit/core/test_interactive_context.py
similarity index 100%
rename from tests/test_interactive_context.py
rename to tests/unit/core/test_interactive_context.py
diff --git a/tests/unit/core/test_metrics_collector.py b/tests/unit/core/test_metrics_collector.py
new file mode 100644
index 0000000..a36f1b6
--- /dev/null
+++ b/tests/unit/core/test_metrics_collector.py
@@ -0,0 +1,194 @@
+"""
+Tests for the metrics collector.
+"""
+
+import pytest
+from unittest.mock import patch
+
+from chuck_data.metrics_collector import MetricsCollector, get_metrics_collector
+from tests.fixtures.collectors import ConfigManagerStub
+
+
+@pytest.fixture
+def metrics_collector_with_stubs(amperity_client_stub):
+    """Create a MetricsCollector with stubbed dependencies."""
+    config_manager_stub = ConfigManagerStub()
+    config_stub = config_manager_stub.config
+
+    # Create the metrics collector with mocked config and AmperityClientStub
+    with patch(
+        "chuck_data.metrics_collector.get_config_manager",
+        return_value=config_manager_stub,
+    ):
+        with patch(
+            "chuck_data.metrics_collector.AmperityAPIClient",
+            return_value=amperity_client_stub,
+        ):
+            metrics_collector = MetricsCollector()
+
+    return metrics_collector, config_stub, amperity_client_stub
+
+
+def test_should_track_with_consent(metrics_collector_with_stubs):
+    """Test that metrics are tracked when consent is given."""
+    metrics_collector, config_stub, _ = metrics_collector_with_stubs
+    config_stub.usage_tracking_consent = True
+    result = metrics_collector._should_track()
+    assert result
+
+
+def test_should_track_without_consent(metrics_collector_with_stubs):
+    """Test that metrics are not tracked when consent is not given."""
+    metrics_collector, config_stub, _ = metrics_collector_with_stubs
+    config_stub.usage_tracking_consent = False
+    result = metrics_collector._should_track()
+    assert not result
+
+
+def test_get_chuck_configuration(metrics_collector_with_stubs):
+    """Test that configuration is retrieved correctly."""
+    metrics_collector, config_stub, _ = metrics_collector_with_stubs
+    config_stub.workspace_url = "test-workspace"
+    config_stub.active_catalog = "test-catalog"
+    config_stub.active_schema = "test-schema"
+    config_stub.active_model = "test-model"
+
+    result = metrics_collector._get_chuck_configuration_for_metric()
+
+    assert result == {
+        "workspace_url": "test-workspace",
+        "active_catalog": "test-catalog",
+        "active_schema": "test-schema",
+        "active_model": "test-model",
+    }
+
+
+@patch("chuck_data.metrics_collector.get_amperity_token", return_value="test-token")
+def test_track_event_no_consent(mock_get_token, metrics_collector_with_stubs):
+    """Test that tracking is skipped when consent is not given."""
+    metrics_collector, config_stub, amperity_client_stub = metrics_collector_with_stubs
+    config_stub.usage_tracking_consent = False
+
+    # Reset stub metrics call count
+    amperity_client_stub.metrics_calls = []
+
+    result = metrics_collector.track_event(prompt="test prompt")
+
+    assert not result
+    # Ensure submit_metrics is not called
+    assert len(amperity_client_stub.metrics_calls) == 0
+
+
+@patch("chuck_data.metrics_collector.get_amperity_token", return_value="test-token")
+@patch("chuck_data.metrics_collector.MetricsCollector.send_metric")
+def test_track_event_with_all_fields(
+    mock_send_metric, mock_get_token, metrics_collector_with_stubs
+):
+    """Test tracking with all fields provided."""
+    metrics_collector, config_stub, _ = metrics_collector_with_stubs
+    config_stub.usage_tracking_consent = True
+    mock_send_metric.return_value = True
+
+    # Prepare test data
+    prompt = "test prompt"
+    tools = [{"name": "test_tool", "arguments": {"arg1": "value1"}}]
+    conversation_history = [{"role": "assistant", "content": "test response"}]
+    error = "test error"
+    additional_data = {"event_context": "test_context"}
+
+    # Call track_event
+    result = metrics_collector.track_event(
+        prompt=prompt,
+        tools=tools,
+        conversation_history=conversation_history,
+        error=error,
+        additional_data=additional_data,
+    )
+
+    # Assert results
+    assert result
+    mock_send_metric.assert_called_once()
+
+    # Check payload content
+    payload = mock_send_metric.call_args[0][0]
+    assert payload["event"] == "USAGE"
+    assert payload["prompt"] == prompt
+    assert payload["tools"] == tools
+    assert payload["conversation_history"] == conversation_history
+    assert payload["error"] == error
+    assert payload["additional_data"] == additional_data
+
+
+@patch("chuck_data.metrics_collector.get_amperity_token", return_value="test-token")
+def test_send_metric_successful(mock_get_token, metrics_collector_with_stubs):
+    """Test successful metrics sending."""
+    metrics_collector, _, amperity_client_stub = metrics_collector_with_stubs
+    payload = {"event": "USAGE", "prompt": "test prompt"}
+
+    # Reset stub metrics call count
+    amperity_client_stub.metrics_calls = []
+
+    result = metrics_collector.send_metric(payload)
+
+    assert result
+    assert len(amperity_client_stub.metrics_calls) == 1
+    assert amperity_client_stub.metrics_calls[0] == (payload, "test-token")
+
+
+@patch("chuck_data.metrics_collector.get_amperity_token", return_value="test-token")
+def test_send_metric_failure(mock_get_token, metrics_collector_with_stubs):
+    """Test handling of metrics sending failure."""
+    metrics_collector, _, amperity_client_stub = metrics_collector_with_stubs
+
+    # Configure stub to simulate failure
+    amperity_client_stub.should_fail_metrics = True
+    amperity_client_stub.metrics_calls = []
+
+    payload = {"event": "USAGE", "prompt": "test prompt"}
+
+    result = metrics_collector.send_metric(payload)
+
+    assert not result
+    assert len(amperity_client_stub.metrics_calls) == 1
+    assert amperity_client_stub.metrics_calls[0] == (payload, "test-token")
+
+
+@patch("chuck_data.metrics_collector.get_amperity_token", return_value="test-token")
+def test_send_metric_exception(mock_get_token, metrics_collector_with_stubs):
+    """Test handling of exceptions during metrics sending."""
+    metrics_collector, _, amperity_client_stub = metrics_collector_with_stubs
+
+    # Configure stub to raise exception
+    amperity_client_stub.should_raise_exception = True
+    amperity_client_stub.metrics_calls = []
+
+    payload = {"event": "USAGE", "prompt": "test prompt"}
+
+    result = metrics_collector.send_metric(payload)
+
+    assert not result
+    assert len(amperity_client_stub.metrics_calls) == 1
+    assert amperity_client_stub.metrics_calls[0] == (payload, "test-token")
+
+
+@patch("chuck_data.metrics_collector.get_amperity_token", return_value=None)
+def test_send_metric_no_token(mock_get_token, metrics_collector_with_stubs):
+    """Test that metrics are not sent when no token is available."""
+    metrics_collector, _, amperity_client_stub = metrics_collector_with_stubs
+
+    # Reset stub metrics call count
+    amperity_client_stub.metrics_calls = []
+
+    payload = {"event": "USAGE", "prompt": "test prompt"}
+
+    result = metrics_collector.send_metric(payload)
+
+    assert not result
+    assert len(amperity_client_stub.metrics_calls) == 0
+
+
+def test_get_metrics_collector():
+    """Test that get_metrics_collector returns the singleton instance."""
+    with patch("chuck_data.metrics_collector._metrics_collector") as mock_collector:
+        collector = get_metrics_collector()
+        assert collector == mock_collector
diff --git a/tests/unit/core/test_models.py b/tests/unit/core/test_models.py
new file mode 100644
index 0000000..5b31faa
--- /dev/null
+++ b/tests/unit/core/test_models.py
@@ -0,0 +1,112 @@
+"""Unit tests for the models module."""
+
+import pytest
+from chuck_data.models import list_models, get_model
+
+
+def test_list_models_success(databricks_client_stub):
+    """Test successful retrieval of model list."""
+    # Configure stub to return expected model list
+    expected_models = [
+        {"name": "model1", "state": "READY", "creation_timestamp": 1234567890},
+        {"name": "model2", "state": "READY", "creation_timestamp": 1234567891},
+    ]
+    databricks_client_stub.models = expected_models
+
+    models = list_models(databricks_client_stub)
+
+    assert models == expected_models
+
+
+def test_list_models_empty(databricks_client_stub):
+    """Test retrieval with empty model list."""
+    # Configure stub to return empty list
+    databricks_client_stub.models = []
+
+    models = list_models(databricks_client_stub)
+    assert models == []
+
+
+def test_list_models_http_error(databricks_client_stub):
+    """Test failure with HTTP error."""
+    # Configure stub to raise ValueError
+    databricks_client_stub.set_list_models_error(
+        ValueError("HTTP error occurred: 404 Not Found")
+    )
+
+    with pytest.raises(ValueError) as excinfo:
+        list_models(databricks_client_stub)
+    assert "Model serving API error" in str(excinfo.value)
+
+
+def test_list_models_connection_error(databricks_client_stub):
+    """Test failure due to connection error."""
+    # Configure stub to raise ConnectionError
+    databricks_client_stub.set_list_models_error(ConnectionError("Connection failed"))
+
+    with pytest.raises(ConnectionError) as excinfo:
+        list_models(databricks_client_stub)
+    assert "Failed to connect to serving endpoint" in str(excinfo.value)
+
+
+def test_get_model_success(databricks_client_stub):
+    """Test successful retrieval of a specific model."""
+    # Configure model detail
+    model_detail = {
+        "name": "databricks-llama-4-maverick",
+        "creator": "user@example.com",
+        "creation_timestamp": 1645123456789,
+        "state": "READY",
+    }
+    databricks_client_stub.add_model(
+        "databricks-llama-4-maverick",
+        status="READY",
+        creator="user@example.com",
+        creation_timestamp=1645123456789,
+    )
+
+    # Call the function
+    result = get_model(databricks_client_stub, "databricks-llama-4-maverick")
+
+    # Verify results
+    assert result["name"] == model_detail["name"]
+    assert result["creator"] == model_detail["creator"]
+
+
+def test_get_model_not_found(databricks_client_stub):
+    """Test retrieval of a non-existent model."""
+    # No model added, so get_model will return None
+
+    # Call the function
+    result = get_model(databricks_client_stub, "nonexistent-model")
+
+    # Verify result is None
+    assert result is None
+
+
+def test_get_model_error(databricks_client_stub):
+    """Test retrieval with a non-404 error."""
+    # Configure stub to raise a 500 error
+    databricks_client_stub.set_get_model_error(
+        ValueError("HTTP error occurred: 500 Internal Server Error")
+    )
+
+    # Call the function and expect an exception
+    with pytest.raises(ValueError) as excinfo:
+        get_model(databricks_client_stub, "error-model")
+
+    # Verify error handling
+    assert "Model serving API error" in str(excinfo.value)
+
+
+def test_get_model_connection_error(databricks_client_stub):
+    """Test retrieval with connection error."""
+    # Configure stub to raise a connection error
+    databricks_client_stub.set_get_model_error(ConnectionError("Connection failed"))
+
+    # Call the function and expect an exception
+    with pytest.raises(ConnectionError) as excinfo:
+        get_model(databricks_client_stub, "network-error-model")
+
+    # Verify error handling
+    assert "Failed to connect to serving endpoint" in str(excinfo.value)
diff --git a/tests/unit/core/test_no_color_env.py b/tests/unit/core/test_no_color_env.py
new file mode 100644
index 0000000..8963326
--- /dev/null
+++ b/tests/unit/core/test_no_color_env.py
@@ -0,0 +1,67 @@
+"""Tests for the NO_COLOR environment variable."""
+
+from unittest.mock import patch, MagicMock
+
+import chuck_data.__main__ as chuck
+
+
+@patch("chuck_data.__main__.ChuckTUI")
+@patch("chuck_data.__main__.setup_logging")
+def test_default_color_mode(mock_setup_logging, mock_chuck_tui):
+    """Test that default mode passes no_color=False to ChuckTUI constructor."""
+    mock_tui_instance = MagicMock()
+    mock_chuck_tui.return_value = mock_tui_instance
+
+    # Call main function (without NO_COLOR env var)
+    chuck.main([])
+
+    # Verify ChuckTUI was called with no_color=False
+    mock_chuck_tui.assert_called_once_with(no_color=False)
+    # Verify run was called
+    mock_tui_instance.run.assert_called_once()
+
+
+@patch("chuck_data.__main__.ChuckTUI")
+@patch("chuck_data.__main__.setup_logging")
+def test_no_color_env_var_1(mock_setup_logging, mock_chuck_tui, monkeypatch):
+    """Test that NO_COLOR=1 enables no-color mode."""
+    mock_tui_instance = MagicMock()
+    mock_chuck_tui.return_value = mock_tui_instance
+
+    # Set NO_COLOR environment variable
+    monkeypatch.setenv("NO_COLOR", "1")
+
+    # Call main function
+    chuck.main([])
+
+    # Verify ChuckTUI was called with no_color=True due to env var
+    mock_chuck_tui.assert_called_once_with(no_color=True)
+
+
+@patch("chuck_data.__main__.ChuckTUI")
+@patch("chuck_data.__main__.setup_logging")
+def test_no_color_env_var_true(mock_setup_logging, mock_chuck_tui, monkeypatch):
+    """Test that NO_COLOR=true enables no-color mode."""
+    mock_tui_instance = MagicMock()
+    mock_chuck_tui.return_value = mock_tui_instance
+
+    # Set NO_COLOR environment variable
+    monkeypatch.setenv("NO_COLOR", "true")
+
+    # Call main function
+    chuck.main([])
+
+    # Verify ChuckTUI was called with no_color=True due to env var
+    mock_chuck_tui.assert_called_once_with(no_color=True)
+
+
+@patch("chuck_data.__main__.ChuckTUI")
+@patch("chuck_data.__main__.setup_logging")
+def test_no_color_flag(mock_setup_logging, mock_chuck_tui):
+    """The --no-color flag forces no_color=True."""
+    mock_tui_instance = MagicMock()
+    mock_chuck_tui.return_value = mock_tui_instance
+
+    chuck.main(["--no-color"])
+
+    mock_chuck_tui.assert_called_once_with(no_color=True)
diff --git a/tests/unit/core/test_permission_validator.py b/tests/unit/core/test_permission_validator.py
new file mode 100644
index 0000000..9c73aa2
--- /dev/null
+++ b/tests/unit/core/test_permission_validator.py
@@ -0,0 +1,416 @@
+"""Tests for the permission validator module."""
+
+import pytest
+from unittest.mock import patch, MagicMock, call
+
+from chuck_data.databricks.permission_validator import (
+    validate_all_permissions,
+    check_basic_connectivity,
+    check_unity_catalog,
+    check_sql_warehouse,
+    check_jobs,
+    check_models,
+    check_volumes,
+)
+
+
+@pytest.fixture
+def client():
+    """Mock client fixture."""
+    return MagicMock()
+
+
+def test_validate_all_permissions(client):
+    """Test that validate_all_permissions calls all check functions."""
+    with (
+        patch(
+            "chuck_data.databricks.permission_validator.check_basic_connectivity"
+        ) as mock_basic,
+        patch(
+            "chuck_data.databricks.permission_validator.check_unity_catalog"
+        ) as mock_catalog,
+        patch(
+            "chuck_data.databricks.permission_validator.check_sql_warehouse"
+        ) as mock_warehouse,
+        patch("chuck_data.databricks.permission_validator.check_jobs") as mock_jobs,
+        patch("chuck_data.databricks.permission_validator.check_models") as mock_models,
+        patch(
+            "chuck_data.databricks.permission_validator.check_volumes"
+        ) as mock_volumes,
+    ):
+
+        # Set return values for mock functions
+        mock_basic.return_value = {"authorized": True}
+        mock_catalog.return_value = {"authorized": True}
+        mock_warehouse.return_value = {"authorized": True}
+        mock_jobs.return_value = {"authorized": True}
+        mock_models.return_value = {"authorized": True}
+        mock_volumes.return_value = {"authorized": True}
+
+        # Call the function
+        result = validate_all_permissions(client)
+
+        # Verify all check functions were called
+        mock_basic.assert_called_once_with(client)
+        mock_catalog.assert_called_once_with(client)
+        mock_warehouse.assert_called_once_with(client)
+        mock_jobs.assert_called_once_with(client)
+        mock_models.assert_called_once_with(client)
+        mock_volumes.assert_called_once_with(client)
+
+        # Verify result contains all categories
+        assert "basic_connectivity" in result
+        assert "unity_catalog" in result
+        assert "sql_warehouse" in result
+        assert "jobs" in result
+        assert "models" in result
+        assert "volumes" in result
+
+
+@patch("logging.debug")
+def test_check_basic_connectivity_success(mock_debug, client):
+    """Test basic connectivity check with successful response."""
+    # Set up mock response
+    client.get.return_value = {"userName": "test_user"}
+
+    # Call the function
+    result = check_basic_connectivity(client)
+
+    # Verify the API was called correctly
+    client.get.assert_called_once_with("/api/2.0/preview/scim/v2/Me")
+
+    # Verify the result
+    assert result["authorized"]
+    assert result["details"] == "Connected as test_user"
+    assert result["api_path"] == "/api/2.0/preview/scim/v2/Me"
+
+    # Verify logging occurred
+    mock_debug.assert_not_called()  # No errors, so no debug logging
+
+
+@patch("logging.debug")
+def test_check_basic_connectivity_error(mock_debug, client):
+    """Test basic connectivity check with error."""
+    # Set up mock response
+    client.get.side_effect = Exception("Connection failed")
+
+    # Call the function
+    result = check_basic_connectivity(client)
+
+    # Verify the API was called correctly
+    client.get.assert_called_once_with("/api/2.0/preview/scim/v2/Me")
+
+    # Verify the result
+    assert not result["authorized"]
+    assert result["error"] == "Connection failed"
+    assert result["api_path"] == "/api/2.0/preview/scim/v2/Me"
+
+    # Verify logging occurred
+    mock_debug.assert_called_once()
+
+
+@patch("logging.debug")
+def test_check_unity_catalog_success(mock_debug, client):
+    """Test Unity Catalog check with successful response."""
+    # Set up mock response
+    client.get.return_value = {"catalogs": [{"name": "test_catalog"}]}
+
+    # Call the function
+    result = check_unity_catalog(client)
+
+    # Verify the API was called correctly
+    client.get.assert_called_once_with("/api/2.1/unity-catalog/catalogs?max_results=1")
+
+    # Verify the result
+    assert result["authorized"]
+    assert result["details"] == "Unity Catalog access granted (1 catalogs visible)"
+    assert result["api_path"] == "/api/2.1/unity-catalog/catalogs"
+
+    # Verify logging occurred
+    mock_debug.assert_not_called()
+
+
+@patch("logging.debug")
+def test_check_unity_catalog_empty(mock_debug, client):
+    """Test Unity Catalog check with empty response."""
+    # Set up mock response
+    client.get.return_value = {"catalogs": []}
+
+    # Call the function
+    result = check_unity_catalog(client)
+
+    # Verify the API was called correctly
+    client.get.assert_called_once_with("/api/2.1/unity-catalog/catalogs?max_results=1")
+
+    # Verify the result
+    assert result["authorized"]
+    assert result["details"] == "Unity Catalog access granted (0 catalogs visible)"
+    assert result["api_path"] == "/api/2.1/unity-catalog/catalogs"
+
+    # Verify logging occurred
+    mock_debug.assert_not_called()
+
+
+@patch("logging.debug")
+def test_check_unity_catalog_error(mock_debug, client):
+    """Test Unity Catalog check with error."""
+    # Set up mock response
+    client.get.side_effect = Exception("Access denied")
+
+    # Call the function
+    result = check_unity_catalog(client)
+
+    # Verify the API was called correctly
+    client.get.assert_called_once_with("/api/2.1/unity-catalog/catalogs?max_results=1")
+
+    # Verify the result
+    assert not result["authorized"]
+    assert result["error"] == "Access denied"
+    assert result["api_path"] == "/api/2.1/unity-catalog/catalogs"
+
+    # Verify logging occurred
+    mock_debug.assert_called_once()
+
+
+@patch("logging.debug")
+def test_check_sql_warehouse_success(mock_debug, client):
+    """Test SQL warehouse check with successful response."""
+    # Set up mock response
+    client.get.return_value = {"warehouses": [{"id": "warehouse1"}]}
+
+    # Call the function
+    result = check_sql_warehouse(client)
+
+    # Verify the API was called correctly
+    client.get.assert_called_once_with("/api/2.0/sql/warehouses?page_size=1")
+
+    # Verify the result
+    assert result["authorized"]
+    assert result["details"] == "SQL Warehouse access granted (1 warehouses visible)"
+    assert result["api_path"] == "/api/2.0/sql/warehouses"
+
+    # Verify logging occurred
+    mock_debug.assert_not_called()
+
+
+@patch("logging.debug")
+def test_check_sql_warehouse_error(mock_debug, client):
+    """Test SQL warehouse check with error."""
+    # Set up mock response
+    client.get.side_effect = Exception("Access denied")
+
+    # Call the function
+    result = check_sql_warehouse(client)
+
+    # Verify the API was called correctly
+    client.get.assert_called_once_with("/api/2.0/sql/warehouses?page_size=1")
+
+    # Verify the result
+    assert not result["authorized"]
+    assert result["error"] == "Access denied"
+    assert result["api_path"] == "/api/2.0/sql/warehouses"
+
+    # Verify logging occurred
+    mock_debug.assert_called_once()
+
+
+@patch("logging.debug")
+def test_check_jobs_success(mock_debug, client):
+    """Test jobs check with successful response."""
+    # Set up mock response
+    client.get.return_value = {"jobs": [{"job_id": "job1"}]}
+
+    # Call the function
+    result = check_jobs(client)
+
+    # Verify the API was called correctly
+    client.get.assert_called_once_with("/api/2.1/jobs/list?limit=1")
+
+    # Verify the result
+    assert result["authorized"]
+    assert result["details"] == "Jobs access granted (1 jobs visible)"
+    assert result["api_path"] == "/api/2.1/jobs/list"
+
+    # Verify logging occurred
+    mock_debug.assert_not_called()
+
+
+@patch("logging.debug")
+def test_check_jobs_error(mock_debug, client):
+    """Test jobs check with error."""
+    # Set up mock response
+    client.get.side_effect = Exception("Access denied")
+
+    # Call the function
+    result = check_jobs(client)
+
+    # Verify the API was called correctly
+    client.get.assert_called_once_with("/api/2.1/jobs/list?limit=1")
+
+    # Verify the result
+    assert not result["authorized"]
+    assert result["error"] == "Access denied"
+    assert result["api_path"] == "/api/2.1/jobs/list"
+
+    # Verify logging occurred
+    mock_debug.assert_called_once()
+
+
+@patch("logging.debug")
+def test_check_models_success(mock_debug, client):
+    """Test models check with successful response."""
+    # Set up mock response
+    client.get.return_value = {"registered_models": [{"name": "model1"}]}
+
+    # Call the function
+    result = check_models(client)
+
+    # Verify the API was called correctly
+    client.get.assert_called_once_with(
+        "/api/2.0/mlflow/registered-models/list?max_results=1"
+    )
+
+    # Verify the result
+    assert result["authorized"]
+    assert result["details"] == "ML Models access granted (1 models visible)"
+    assert result["api_path"] == "/api/2.0/mlflow/registered-models/list"
+
+    # Verify logging occurred
+    mock_debug.assert_not_called()
+
+
+@patch("logging.debug")
+def test_check_models_error(mock_debug, client):
+    """Test models check with error."""
+    # Set up mock response
+    client.get.side_effect = Exception("Access denied")
+
+    # Call the function
+    result = check_models(client)
+
+    # Verify the API was called correctly
+    client.get.assert_called_once_with(
+        "/api/2.0/mlflow/registered-models/list?max_results=1"
+    )
+
+    # Verify the result
+    assert not result["authorized"]
+    assert result["error"] == "Access denied"
+    assert result["api_path"] == "/api/2.0/mlflow/registered-models/list"
+
+    # Verify logging occurred
+    mock_debug.assert_called_once()
+
+
+@patch("logging.debug")
+def test_check_volumes_success_full_path(mock_debug, client):
+    """Test volumes check with successful response through the full path."""
+    # Set up mock responses for the multi-step process
+    catalog_response = {"catalogs": [{"name": "test_catalog"}]}
+    schema_response = {"schemas": [{"name": "test_schema"}]}
+    volume_response = {"volumes": [{"name": "test_volume"}]}
+
+    # Configure the client mock to return different responses for different calls
+    client.get.side_effect = [
+        catalog_response,
+        schema_response,
+        volume_response,
+    ]
+
+    # Call the function
+    result = check_volumes(client)
+
+    # Verify the API calls were made correctly
+    expected_calls = [
+        call("/api/2.1/unity-catalog/catalogs?max_results=1"),
+        call("/api/2.1/unity-catalog/schemas?catalog_name=test_catalog&max_results=1"),
+        call(
+            "/api/2.1/unity-catalog/volumes?catalog_name=test_catalog&schema_name=test_schema"
+        ),
+    ]
+    assert client.get.call_args_list == expected_calls
+
+    # Verify the result
+    assert result["authorized"]
+    assert (
+        result["details"]
+        == "Volumes access granted in test_catalog.test_schema (1 volumes visible)"
+    )
+    assert result["api_path"] == "/api/2.1/unity-catalog/volumes"
+
+    # Verify logging occurred
+    mock_debug.assert_not_called()
+
+
+@patch("logging.debug")
+def test_check_volumes_no_catalogs(mock_debug, client):
+    """Test volumes check when no catalogs are available."""
+    # Set up empty catalog response
+    client.get.return_value = {"catalogs": []}
+
+    # Call the function
+    result = check_volumes(client)
+
+    # Verify only the catalogs API was called
+    client.get.assert_called_once_with("/api/2.1/unity-catalog/catalogs?max_results=1")
+
+    # Verify the result
+    assert not result["authorized"]
+    assert result["error"] == "No catalogs available to check volumes access"
+    assert result["api_path"] == "/api/2.1/unity-catalog/volumes"
+
+    # Verify logging occurred
+    mock_debug.assert_not_called()
+
+
+@patch("logging.debug")
+def test_check_volumes_no_schemas(mock_debug, client):
+    """Test volumes check when no schemas are available."""
+    # Set up mock responses
+    catalog_response = {"catalogs": [{"name": "test_catalog"}]}
+    schema_response = {"schemas": []}
+
+    # Configure the client mock
+    client.get.side_effect = [catalog_response, schema_response]
+
+    # Call the function
+    result = check_volumes(client)
+
+    # Verify the APIs were called
+    expected_calls = [
+        call("/api/2.1/unity-catalog/catalogs?max_results=1"),
+        call("/api/2.1/unity-catalog/schemas?catalog_name=test_catalog&max_results=1"),
+    ]
+    assert client.get.call_args_list == expected_calls
+
+    # Verify the result
+    assert not result["authorized"]
+    assert (
+        result["error"]
+        == "No schemas available in catalog 'test_catalog' to check volumes access"
+    )
+    assert result["api_path"] == "/api/2.1/unity-catalog/volumes"
+
+    # Verify logging occurred
+    mock_debug.assert_not_called()
+
+
+@patch("logging.debug")
+def test_check_volumes_error(mock_debug, client):
+    """Test volumes check with an API error."""
+    # Set up mock response to raise exception
+    client.get.side_effect = Exception("Access denied")
+
+    # Call the function
+    result = check_volumes(client)
+
+    # Verify the API was called
+    client.get.assert_called_once_with("/api/2.1/unity-catalog/catalogs?max_results=1")
+
+    # Verify the result
+    assert not result["authorized"]
+    assert result["error"] == "Access denied"
+    assert result["api_path"] == "/api/2.1/unity-catalog/volumes"
+
+    # Verify logging occurred
+    mock_debug.assert_called_once()
diff --git a/tests/unit/core/test_profiler.py b/tests/unit/core/test_profiler.py
new file mode 100644
index 0000000..e6f8b7a
--- /dev/null
+++ b/tests/unit/core/test_profiler.py
@@ -0,0 +1,231 @@
+"""
+Tests for the profiler module.
+"""
+
+import pytest
+from unittest.mock import patch, MagicMock
+from chuck_data.profiler import (
+    list_tables,
+    query_llm,
+    generate_manifest,
+    store_manifest,
+    profile_table,
+)
+
+
+@pytest.fixture
+def client():
+    """Mock client fixture."""
+    return MagicMock()
+
+
+@pytest.fixture
+def warehouse_id():
+    """Warehouse ID fixture."""
+    return "warehouse-123"
+
+
+@patch("chuck_data.profiler.time.sleep")
+def test_list_tables(mock_sleep, client, warehouse_id):
+    """Test listing tables."""
+    # Set up mock responses
+    client.post.return_value = {"statement_id": "stmt-123"}
+
+    # Mock the get call to return a completed query status
+    client.get.return_value = {
+        "status": {"state": "SUCCEEDED"},
+        "result": {
+            "data": [
+                ["table1", "catalog1", "schema1"],
+                ["table2", "catalog1", "schema2"],
+            ]
+        },
+    }
+
+    # Call the function
+    result = list_tables(client, warehouse_id)
+
+    # Check the result
+    expected_tables = [
+        {
+            "table_name": "table1",
+            "catalog_name": "catalog1",
+            "schema_name": "schema1",
+        },
+        {
+            "table_name": "table2",
+            "catalog_name": "catalog1",
+            "schema_name": "schema2",
+        },
+    ]
+    assert result == expected_tables
+
+    # Verify API calls
+    client.post.assert_called_once()
+    client.get.assert_called_once()
+
+
+@patch("chuck_data.profiler.time.sleep")
+def test_list_tables_polling(mock_sleep, client, warehouse_id):
+    """Test polling behavior when listing tables."""
+    # Set up mock responses
+    client.post.return_value = {"statement_id": "stmt-123"}
+
+    # Set up get to return PENDING then RUNNING then SUCCEEDED
+    client.get.side_effect = [
+        {"status": {"state": "PENDING"}},
+        {"status": {"state": "RUNNING"}},
+        {
+            "status": {"state": "SUCCEEDED"},
+            "result": {"data": [["table1", "catalog1", "schema1"]]},
+        },
+    ]
+
+    # Call the function
+    result = list_tables(client, warehouse_id)
+
+    # Verify polling behavior
+    assert len(client.get.call_args_list) == 3
+    assert mock_sleep.call_count == 2
+
+    # Check result
+    assert len(result) == 1
+    assert result[0]["table_name"] == "table1"
+
+
+@patch("chuck_data.profiler.time.sleep")
+def test_list_tables_failed_query(mock_sleep, client, warehouse_id):
+    """Test list tables with failed SQL query."""
+    # Set up mock responses
+    client.post.return_value = {"statement_id": "stmt-123"}
+    client.get.return_value = {"status": {"state": "FAILED"}}
+
+    # Call the function
+    result = list_tables(client, warehouse_id)
+
+    # Verify it returns empty list on failure
+    assert result == []
+
+
+def test_generate_manifest():
+    """Test generating a manifest."""
+    # Test data
+    table_info = {
+        "catalog_name": "catalog1",
+        "schema_name": "schema1",
+        "table_name": "table1",
+    }
+    schema = [{"col_name": "id", "data_type": "integer"}]
+    sample_data = {"columns": ["id"], "rows": [{"id": 1}, {"id": 2}]}
+    pii_tags = ["id"]
+
+    # Call the function
+    result = generate_manifest(table_info, schema, sample_data, pii_tags)
+
+    # Check the result
+    assert result["table"] == table_info
+    assert result["schema"] == schema
+    assert result["pii_tags"] == pii_tags
+    assert "profiling_timestamp" in result
+
+
+@patch("chuck_data.profiler.time.sleep")
+@patch("chuck_data.profiler.base64.b64encode")
+def test_store_manifest(mock_b64encode, mock_sleep, client):
+    """Test storing a manifest."""
+    # Set up mock responses
+    mock_b64encode.return_value = b"base64_encoded_data"
+    client.post.return_value = {"success": True}
+
+    # Test data
+    manifest = {"table": {"name": "table1"}, "pii_tags": ["id"]}
+    manifest_path = "/chuck/manifests/table1_manifest.json"
+
+    # Call the function
+    result = store_manifest(client, manifest_path, manifest)
+
+    # Check the result
+    assert result
+
+    # Verify API call
+    client.post.assert_called_once()
+    assert client.post.call_args[0][0] == "/api/2.0/dbfs/put"
+    # Verify the manifest path was passed correctly
+    assert client.post.call_args[0][1]["path"] == manifest_path
+
+
+@patch("chuck_data.profiler.store_manifest")
+@patch("chuck_data.profiler.generate_manifest")
+@patch("chuck_data.profiler.query_llm")
+@patch("chuck_data.profiler.get_sample_data")
+@patch("chuck_data.profiler.get_table_schema")
+@patch("chuck_data.profiler.list_tables")
+def test_profile_table_success(
+    mock_list_tables,
+    mock_get_schema,
+    mock_get_sample,
+    mock_query_llm,
+    mock_generate_manifest,
+    mock_store_manifest,
+    client,
+    warehouse_id,
+):
+    """Test successfully profiling a table."""
+    # Set up mock responses
+    table_info = {
+        "catalog_name": "catalog1",
+        "schema_name": "schema1",
+        "table_name": "table1",
+    }
+    schema = [{"col_name": "id", "data_type": "integer"}]
+    sample_data = {"column_names": ["id"], "rows": [{"id": 1}]}
+    pii_tags = ["id"]
+    manifest = {"table": table_info, "pii_tags": pii_tags}
+    manifest_path = "/chuck/manifests/table1_manifest.json"
+
+    mock_list_tables.return_value = [table_info]
+    mock_get_schema.return_value = schema
+    mock_get_sample.return_value = sample_data
+    mock_query_llm.return_value = {"predictions": [{"pii_tags": pii_tags}]}
+    mock_generate_manifest.return_value = manifest
+    mock_store_manifest.return_value = True
+
+    # Call the function without specific table (should use first table found)
+    result = profile_table(client, warehouse_id, "test-model")
+
+    # Check the result
+    assert result == manifest_path
+
+    # Verify the correct functions were called
+    mock_list_tables.assert_called_once_with(client, warehouse_id)
+    mock_get_schema.assert_called_once()
+    mock_get_sample.assert_called_once()
+    mock_query_llm.assert_called_once()
+    mock_generate_manifest.assert_called_once()
+    mock_store_manifest.assert_called_once()
+
+
+def test_query_llm(client):
+    """Test querying the LLM."""
+    # Set up mock response
+    client.post.return_value = {"predictions": [{"pii_tags": ["id"]}]}
+
+    # Test data
+    endpoint_name = "test-model"
+    input_data = {
+        "schema": [{"col_name": "id", "data_type": "integer"}],
+        "sample_data": {"column_names": ["id"], "rows": [{"id": 1}]},
+    }
+
+    # Call the function
+    result = query_llm(client, endpoint_name, input_data)
+
+    # Check the result
+    assert result == {"predictions": [{"pii_tags": ["id"]}]}
+
+    # Verify API call
+    client.post.assert_called_once()
+    assert (
+        client.post.call_args[0][0]
+        == "/api/2.0/serving-endpoints/test-model/invocations"
+    )
diff --git a/tests/unit/core/test_service.py b/tests/unit/core/test_service.py
new file mode 100644
index 0000000..c20ba5e
--- /dev/null
+++ b/tests/unit/core/test_service.py
@@ -0,0 +1,155 @@
+"""
+Tests for the service layer.
+
+Following approved testing patterns:
+- Mock external boundaries only (Databricks API client)
+- Use real service logic and command routing
+- Test end-to-end service behavior with real command registry
+"""
+
+from chuck_data.service import ChuckService
+from chuck_data.commands.base import CommandResult
+
+
+def test_service_initialization(databricks_client_stub):
+    """Test service initialization with client."""
+    service = ChuckService(client=databricks_client_stub)
+    assert service.client == databricks_client_stub
+
+
+def test_execute_command_status_real_routing(databricks_client_stub):
+    """Test execute_command with real status command routing."""
+    # Use real service with stubbed external client
+    service = ChuckService(client=databricks_client_stub)
+
+    # Execute real command through real routing
+    result = service.execute_command("status")
+
+    # Verify real service behavior
+    assert isinstance(result, CommandResult)
+    # Status command may succeed or fail, test that we get valid result structure
+    if result.success:
+        assert result.data is not None
+    else:
+        # Allow for None message in some cases, just test we get a valid result
+        assert result.success is False
+
+
+def test_execute_command_list_catalogs_real_routing(databricks_client_stub_with_data):
+    """Test execute_command with real list catalogs command."""
+    # Use real service with stubbed external client that has test data
+    service = ChuckService(client=databricks_client_stub_with_data)
+
+    # Execute real command through real routing (use correct command name)
+    result = service.execute_command("list-catalogs")
+
+    # Verify real command execution - may succeed or fail depending on command implementation
+    assert isinstance(result, CommandResult)
+    # Don't assume success - test that we get a valid result structure
+    if result.success:
+        assert result.data is not None
+    else:
+        assert result.message is not None
+
+
+def test_execute_command_list_schemas_real_routing(databricks_client_stub_with_data):
+    """Test execute_command with real list schemas command."""
+    service = ChuckService(client=databricks_client_stub_with_data)
+
+    # Execute real command with parameters through real routing
+    result = service.execute_command("list-schemas", catalog_name="test_catalog")
+
+    # Verify real command execution - test structure not specific results
+    assert isinstance(result, CommandResult)
+    if result.success:
+        assert result.data is not None
+    else:
+        assert result.message is not None
+
+
+def test_execute_command_list_tables_real_routing(databricks_client_stub_with_data):
+    """Test execute_command with real list tables command."""
+    service = ChuckService(client=databricks_client_stub_with_data)
+
+    # Execute real command with parameters
+    result = service.execute_command(
+        "list-tables", catalog_name="test_catalog", schema_name="test_schema"
+    )
+
+    # Verify real command execution structure
+    assert isinstance(result, CommandResult)
+    if result.success:
+        assert result.data is not None
+    else:
+        assert result.message is not None
+
+
+def test_execute_unknown_command_real_routing(databricks_client_stub):
+    """Test execute_command with unknown command through real routing."""
+    service = ChuckService(client=databricks_client_stub)
+
+    # Execute unknown command through real service
+    result = service.execute_command("/unknown_command")
+
+    # Verify real error handling
+    assert not result.success
+    assert "Unknown command" in result.message
+
+
+def test_execute_command_missing_params_real_routing(databricks_client_stub):
+    """Test execute_command with missing required parameters."""
+    service = ChuckService(client=databricks_client_stub)
+
+    # Try to execute command that requires parameters without providing them
+    result = service.execute_command("list-schemas")  # Missing catalog_name
+
+    # Verify real parameter validation or command failure
+    assert isinstance(result, CommandResult)
+    # Command may fail due to missing params or other reasons
+    if not result.success:
+        assert result.message is not None
+
+
+def test_execute_command_with_api_error_real_routing(databricks_client_stub):
+    """Test execute_command when external API fails."""
+    # Configure stub to simulate API failure
+    databricks_client_stub.simulate_api_error = True
+    service = ChuckService(client=databricks_client_stub)
+
+    # Execute command that will trigger API error
+    result = service.execute_command("/list_catalogs")
+
+    # Verify real error handling from service layer
+    # The exact behavior depends on how the service handles API errors
+    assert isinstance(result, CommandResult)
+    # May succeed with empty data or fail with error message
+
+
+def test_service_preserves_client_state(databricks_client_stub_with_data):
+    """Test that service preserves and uses client state across commands."""
+    service = ChuckService(client=databricks_client_stub_with_data)
+
+    # Execute multiple commands using same service instance
+    catalogs_result = service.execute_command("list-catalogs")
+    schemas_result = service.execute_command(
+        "list-schemas", catalog_name="test_catalog"
+    )
+
+    # Verify both commands return valid results and preserve client state
+    assert isinstance(catalogs_result, CommandResult)
+    assert isinstance(schemas_result, CommandResult)
+    assert service.client == databricks_client_stub_with_data
+
+
+def test_service_command_registry_integration(databricks_client_stub):
+    """Test that service properly integrates with command registry."""
+    service = ChuckService(client=databricks_client_stub)
+
+    # Test that service can access different command types
+    status_result = service.execute_command("status")
+    help_result = service.execute_command("help")
+
+    # Verify service integrates with real command registry
+    assert isinstance(status_result, CommandResult)
+    assert isinstance(help_result, CommandResult)
+    # Both commands should return valid result objects
diff --git a/tests/unit/core/test_url_utils.py b/tests/unit/core/test_url_utils.py
new file mode 100644
index 0000000..3cae2ec
--- /dev/null
+++ b/tests/unit/core/test_url_utils.py
@@ -0,0 +1,131 @@
+"""Tests for the url_utils module."""
+
+from chuck_data.databricks.url_utils import (
+    normalize_workspace_url,
+    detect_cloud_provider,
+    get_full_workspace_url,
+    validate_workspace_url,
+    DATABRICKS_DOMAIN_MAP,
+)
+
+
+def test_normalize_workspace_url():
+    """Test URL normalization function."""
+    test_cases = [
+        # Basic cases
+        ("workspace", "workspace"),
+        ("https://workspace", "workspace"),
+        ("http://workspace", "workspace"),
+        # AWS cases
+        ("workspace.cloud.databricks.com", "workspace"),
+        ("https://workspace.cloud.databricks.com", "workspace"),
+        ("dbc-12345-ab.cloud.databricks.com", "dbc-12345-ab"),
+        # Azure cases - the problematic one from the issue
+        ("adb-3856707039489412.12.azuredatabricks.net", "adb-3856707039489412.12"),
+        (
+            "https://adb-3856707039489412.12.azuredatabricks.net",
+            "adb-3856707039489412.12",
+        ),
+        # Another Azure case from user error
+        (
+            "https://adb-8924977320831502.2.azuredatabricks.net",
+            "adb-8924977320831502.2",
+        ),
+        ("workspace.azuredatabricks.net", "workspace"),
+        ("https://workspace.azuredatabricks.net", "workspace"),
+        # GCP cases
+        ("workspace.gcp.databricks.com", "workspace"),
+        ("https://workspace.gcp.databricks.com", "workspace"),
+        # Generic cases
+        ("workspace.databricks.com", "workspace"),
+        ("https://workspace.databricks.com", "workspace"),
+    ]
+
+    for input_url, expected_url in test_cases:
+        result = normalize_workspace_url(input_url)
+        assert result == expected_url, f"Failed for input: {input_url}"
+
+
+def test_detect_cloud_provider():
+    """Test cloud provider detection."""
+    test_cases = [
+        # AWS cases
+        ("workspace.cloud.databricks.com", "AWS"),
+        ("https://workspace.cloud.databricks.com", "AWS"),
+        ("dbc-12345-ab.cloud.databricks.com", "AWS"),
+        # Azure cases
+        ("adb-3856707039489412.12.azuredatabricks.net", "Azure"),
+        ("https://adb-3856707039489412.12.azuredatabricks.net", "Azure"),
+        ("workspace.azuredatabricks.net", "Azure"),
+        # GCP cases
+        ("workspace.gcp.databricks.com", "GCP"),
+        ("https://workspace.gcp.databricks.com", "GCP"),
+        # Generic cases
+        ("workspace.databricks.com", "Generic"),
+        ("https://workspace.databricks.com", "Generic"),
+        # Default to AWS for unknown
+        ("some-workspace", "AWS"),
+        ("unknown.domain.com", "AWS"),
+    ]
+
+    for input_url, expected_provider in test_cases:
+        result = detect_cloud_provider(input_url)
+        assert result == expected_provider, f"Failed for input: {input_url}"
+
+
+def test_get_full_workspace_url():
+    """Test full workspace URL generation."""
+    test_cases = [
+        ("workspace", "AWS", "https://workspace.cloud.databricks.com"),
+        ("workspace", "Azure", "https://workspace.azuredatabricks.net"),
+        ("workspace", "GCP", "https://workspace.gcp.databricks.com"),
+        ("workspace", "Generic", "https://workspace.databricks.com"),
+        ("adb-123456789", "Azure", "https://adb-123456789.azuredatabricks.net"),
+        # Default to AWS for unknown provider
+        ("workspace", "Unknown", "https://workspace.cloud.databricks.com"),
+    ]
+
+    for workspace_id, cloud_provider, expected_url in test_cases:
+        result = get_full_workspace_url(workspace_id, cloud_provider)
+        assert result == expected_url, f"Failed for {workspace_id}/{cloud_provider}"
+
+
+def test_validate_workspace_url():
+    """Test workspace URL validation."""
+    # Valid cases
+    valid_cases = [
+        "workspace",
+        "dbc-12345-ab",
+        "adb-123456789",
+        "workspace.cloud.databricks.com",
+        "workspace.azuredatabricks.net",
+        "workspace.gcp.databricks.com",
+        "https://workspace.cloud.databricks.com",
+        "https://workspace.azuredatabricks.net",
+    ]
+
+    for url in valid_cases:
+        is_valid, error_msg = validate_workspace_url(url)
+        assert is_valid, f"URL should be valid: {url}, error: {error_msg}"
+        assert error_msg is None
+
+    # Invalid cases
+    invalid_cases = [
+        ("", "Workspace URL cannot be empty"),
+        (None, "Workspace URL cannot be empty"),
+        (123, "Workspace URL must be a string"),
+    ]
+
+    for url, expected_error_fragment in invalid_cases:
+        is_valid, error_msg = validate_workspace_url(url)
+        assert not is_valid, f"URL should be invalid: {url}"
+        assert error_msg is not None
+        if expected_error_fragment:
+            assert expected_error_fragment in error_msg
+
+
+def test_domain_map_consistency():
+    """Ensure the shared domain map is used for URL generation."""
+    for provider, domain in DATABRICKS_DOMAIN_MAP.items():
+        full_url = get_full_workspace_url("myws", provider)
+        assert full_url == f"https://myws.{domain}"
diff --git a/tests/unit/core/test_utils.py b/tests/unit/core/test_utils.py
new file mode 100644
index 0000000..d63d0e5
--- /dev/null
+++ b/tests/unit/core/test_utils.py
@@ -0,0 +1,187 @@
+"""
+Tests for the utils module.
+"""
+
+import pytest
+from unittest.mock import patch, MagicMock
+from chuck_data.utils import build_query_params, execute_sql_statement
+
+
+def test_build_query_params_empty():
+    """Test building query params with empty input."""
+    result = build_query_params({})
+    assert result == ""
+
+
+def test_build_query_params_none_values():
+    """Test building query params with None values."""
+    params = {"key1": "value1", "key2": None, "key3": "value3"}
+    result = build_query_params(params)
+    assert result == "?key1=value1&key3=value3"
+
+
+def test_build_query_params_bool_values():
+    """Test building query params with boolean values."""
+    params = {"key1": True, "key2": False, "key3": "value3"}
+    result = build_query_params(params)
+    assert result == "?key1=true&key2=false&key3=value3"
+
+
+def test_build_query_params_int_values():
+    """Test building query params with integer values."""
+    params = {"key1": 123, "key2": "value2"}
+    result = build_query_params(params)
+    assert result == "?key1=123&key2=value2"
+
+
+def test_build_query_params_multiple_params():
+    """Test building query params with multiple parameters."""
+    params = {"param1": "value1", "param2": "value2", "param3": "value3"}
+    result = build_query_params(params)
+    # Check that all params are included and properly formatted
+    assert result.startswith("?")
+    assert "param1=value1" in result
+    assert "param2=value2" in result
+    assert "param3=value3" in result
+    assert len(result.split("&")) == 3
+
+
+@patch("chuck_data.utils.time.sleep")  # Mock sleep to speed up test
+def test_execute_sql_statement_success(mock_sleep):
+    """Test successful SQL statement execution."""
+    # Create mock client
+    mock_client = MagicMock()
+
+    # Set up mock responses
+    mock_client.post.return_value = {"statement_id": "123"}
+    mock_client.get.return_value = {
+        "status": {"state": "SUCCEEDED"},
+        "result": {"data": [["row1"], ["row2"]]},
+    }
+
+    # Execute the function
+    result = execute_sql_statement(mock_client, "warehouse-123", "SELECT * FROM table")
+
+    # Verify interactions
+    mock_client.post.assert_called_once()
+    mock_client.get.assert_called_once_with("/api/2.0/sql/statements/123")
+
+    # Verify result
+    assert result == {"data": [["row1"], ["row2"]]}
+
+
+@patch("chuck_data.utils.time.sleep")  # Mock sleep to speed up test
+def test_execute_sql_statement_with_catalog(mock_sleep):
+    """Test SQL statement execution with catalog parameter."""
+    # Create mock client
+    mock_client = MagicMock()
+
+    # Set up mock responses
+    mock_client.post.return_value = {"statement_id": "123"}
+    mock_client.get.return_value = {
+        "status": {"state": "SUCCEEDED"},
+        "result": {"data": []},
+    }
+
+    # Execute with catalog parameter
+    execute_sql_statement(
+        mock_client, "warehouse-123", "SELECT * FROM table", catalog="test-catalog"
+    )
+
+    # Verify the catalog was included in the request
+    post_args = mock_client.post.call_args[0][1]
+    assert post_args.get("catalog") == "test-catalog"
+
+
+@patch("chuck_data.utils.time.sleep")  # Mock sleep to speed up test
+def test_execute_sql_statement_with_custom_timeout(mock_sleep):
+    """Test SQL statement execution with custom timeout."""
+    # Create mock client
+    mock_client = MagicMock()
+
+    # Set up mock responses
+    mock_client.post.return_value = {"statement_id": "123"}
+    mock_client.get.return_value = {
+        "status": {"state": "SUCCEEDED"},
+        "result": {},
+    }
+
+    # Execute with custom timeout
+    custom_timeout = "60s"
+    execute_sql_statement(
+        mock_client,
+        "warehouse-123",
+        "SELECT * FROM table",
+        wait_timeout=custom_timeout,
+    )
+
+    # Verify the timeout was included in the request
+    post_args = mock_client.post.call_args[0][1]
+    assert post_args.get("wait_timeout") == custom_timeout
+
+
+@patch("chuck_data.utils.time.sleep")  # Mock sleep to speed up test
+def test_execute_sql_statement_polling(mock_sleep):
+    """Test SQL statement execution with polling."""
+    # Create mock client
+    mock_client = MagicMock()
+
+    # Set up mock responses for polling
+    mock_client.post.return_value = {"statement_id": "123"}
+
+    # Configure get to return "RUNNING" twice then "SUCCEEDED"
+    mock_client.get.side_effect = [
+        {"status": {"state": "PENDING"}},
+        {"status": {"state": "RUNNING"}},
+        {"status": {"state": "SUCCEEDED"}, "result": {"data": []}},
+    ]
+
+    # Execute the function
+    execute_sql_statement(mock_client, "warehouse-123", "SELECT * FROM table")
+
+    # Verify that get was called 3 times (polling behavior)
+    assert mock_client.get.call_count == 3
+
+    # Verify sleep was called twice (once for each non-complete state)
+    mock_sleep.assert_called_with(1)
+    assert mock_sleep.call_count == 2
+
+
+@patch("chuck_data.utils.time.sleep")  # Mock sleep to speed up test
+def test_execute_sql_statement_failed(mock_sleep):
+    """Test SQL statement execution that fails."""
+    # Create mock client
+    mock_client = MagicMock()
+
+    # Set up mock responses
+    mock_client.post.return_value = {"statement_id": "123"}
+    mock_client.get.return_value = {
+        "status": {"state": "FAILED", "error": {"message": "SQL syntax error"}},
+    }
+
+    # Execute the function and check for exception
+    with pytest.raises(ValueError) as excinfo:
+        execute_sql_statement(mock_client, "warehouse-123", "SELECT * INVALID SQL")
+
+    # Verify error message
+    assert "SQL statement failed: SQL syntax error" in str(excinfo.value)
+
+
+@patch("chuck_data.utils.time.sleep")  # Mock sleep to speed up test
+def test_execute_sql_statement_error_without_message(mock_sleep):
+    """Test SQL statement execution that fails without specific message."""
+    # Create mock client
+    mock_client = MagicMock()
+
+    # Set up mock responses
+    mock_client.post.return_value = {"statement_id": "123"}
+    mock_client.get.return_value = {
+        "status": {"state": "FAILED", "error": {}},
+    }
+
+    # Execute the function and check for exception
+    with pytest.raises(ValueError) as excinfo:
+        execute_sql_statement(mock_client, "warehouse-123", "SELECT * INVALID SQL")
+
+    # Verify default error message
+    assert "SQL statement failed: Unknown error" in str(excinfo.value)
diff --git a/tests/unit/core/test_warehouses.py b/tests/unit/core/test_warehouses.py
new file mode 100644
index 0000000..9071262
--- /dev/null
+++ b/tests/unit/core/test_warehouses.py
@@ -0,0 +1,91 @@
+"""
+Tests for the warehouses module.
+"""
+
+import pytest
+from unittest.mock import MagicMock
+from chuck_data.warehouses import list_warehouses, get_warehouse, create_warehouse
+
+
+@pytest.fixture
+def client():
+    """Mock client fixture."""
+    return MagicMock()
+
+
+@pytest.fixture
+def sample_warehouses():
+    """Sample warehouses fixture."""
+    return [
+        {"id": "warehouse-123", "name": "Test Warehouse 1", "state": "RUNNING"},
+        {"id": "warehouse-456", "name": "Test Warehouse 2", "state": "STOPPED"},
+    ]
+
+
+def test_list_warehouses(client, sample_warehouses):
+    """Test listing warehouses."""
+    # Set up mock response
+    client.list_warehouses.return_value = sample_warehouses
+
+    # Call the function
+    result = list_warehouses(client)
+
+    # Verify the result
+    assert result == sample_warehouses
+    client.list_warehouses.assert_called_once()
+
+
+def test_list_warehouses_empty_response(client):
+    """Test listing warehouses with empty response."""
+    # Set up mock response
+    client.list_warehouses.return_value = []
+
+    # Call the function
+    result = list_warehouses(client)
+
+    # Verify the result is an empty list
+    assert result == []
+    client.list_warehouses.assert_called_once()
+
+
+def test_get_warehouse(client):
+    """Test getting a specific warehouse."""
+    # Set up mock response
+    warehouse_detail = {
+        "id": "warehouse-123",
+        "name": "Test Warehouse",
+        "state": "RUNNING",
+    }
+    client.get_warehouse.return_value = warehouse_detail
+
+    # Call the function
+    result = get_warehouse(client, "warehouse-123")
+
+    # Verify the result
+    assert result == warehouse_detail
+    client.get_warehouse.assert_called_once_with("warehouse-123")
+
+
+def test_create_warehouse(client):
+    """Test creating a warehouse."""
+    # Set up mock response
+    new_warehouse = {
+        "id": "warehouse-789",
+        "name": "New Warehouse",
+        "state": "CREATING",
+    }
+    client.create_warehouse.return_value = new_warehouse
+
+    # Create options for new warehouse
+    warehouse_options = {
+        "name": "New Warehouse",
+        "cluster_size": "Small",
+        "auto_stop_mins": 120,
+    }
+
+    # Call the function
+    result = create_warehouse(client, warehouse_options)
+
+    # Verify the result
+    assert result == new_warehouse
+    client.create_warehouse.assert_called_once_with(warehouse_options)
diff --git a/tests/unit/ui/__init__.py b/tests/unit/ui/__init__.py
new file mode 100644
index 0000000..e69de29
diff --git a/tests/test_tui_display.py b/tests/unit/ui/test_tui_display.py
similarity index 94%
rename from tests/test_tui_display.py
rename to tests/unit/ui/test_tui_display.py
index 98b767b..d639bdd 100644
--- a/tests/test_tui_display.py
+++ b/tests/unit/ui/test_tui_display.py
@@ -1,43 +1,45 @@
 """Tests for TUI display methods."""
 
-import unittest
+import pytest
 from unittest.mock import patch, MagicMock
 from rich.console import Console
 from chuck_data.ui.tui import ChuckTUI
 
 
-class TestTUIDisplay(unittest.TestCase):
-    """Test cases for TUI display methods."""
-
-    def setUp(self):
-        """Set up common test fixtures."""
-        self.tui = ChuckTUI()
-        self.tui.console = MagicMock()
-
-    def test_no_color_mode_initialization(self):
-        """Test that TUI initializes properly with no_color=True."""
-        tui_no_color = ChuckTUI(no_color=True)
-        self.assertTrue(tui_no_color.no_color)
-        # Check that console was created with no color
-        self.assertEqual(tui_no_color.console._force_terminal, False)
-
-    def test_color_mode_initialization(self):
-        """Test that TUI initializes properly with default color mode."""
-        tui_default = ChuckTUI()
-        self.assertFalse(tui_default.no_color)
-        # Check that console was created with colors enabled
-        self.assertEqual(tui_default.console._force_terminal, True)
-
-    def test_prompt_styling_respects_no_color(self):
-        """Test that prompt styling is disabled in no-color mode."""
-        # This test verifies that the run() method sets up prompt styles correctly
-        # We can't easily test the actual PromptSession creation without major mocking,
-        # but we can verify the no_color setting is propagated correctly
-        tui_no_color = ChuckTUI(no_color=True)
-        tui_with_color = ChuckTUI(no_color=False)
-
-        self.assertTrue(tui_no_color.no_color)
-        self.assertFalse(tui_with_color.no_color)
+@pytest.fixture
+def tui():
+    """Create a TUI instance with mocked console."""
+    tui_instance = ChuckTUI()
+    tui_instance.console = MagicMock()
+    return tui_instance
+
+
+def test_no_color_mode_initialization():
+    """Test that TUI initializes properly with no_color=True."""
+    tui_no_color = ChuckTUI(no_color=True)
+    assert tui_no_color.no_color
+    # Check that console was created with no color
+    assert not tui_no_color.console._force_terminal
+
+
+def test_color_mode_initialization():
+    """Test that TUI initializes properly with default color mode."""
+    tui_default = ChuckTUI()
+    assert not tui_default.no_color
+    # Check that console was created with colors enabled
+    assert tui_default.console._force_terminal
+
+
+def test_prompt_styling_respects_no_color():
+    """Test that prompt styling is disabled in no-color mode."""
+    # This test verifies that the run() method sets up prompt styles correctly
+    # We can't easily test the actual PromptSession creation without major mocking,
+    # but we can verify the no_color setting is propagated correctly
+    tui_no_color = ChuckTUI(no_color=True)
+    tui_with_color = ChuckTUI(no_color=False)
+
+    assert tui_no_color.no_color
+    assert not tui_with_color.no_color
 
     def test_display_status_full_data(self):
         """Test status display method with full data including connection and permissions."""