Prepare sample database script test cases by AwaisKamran · Pull Request #128 · Conrad-X/text2SQL

AwaisKamran · 2025-05-22T19:25:05Z

Description

This PR corresponds to the following Write-Test-Cases-For-Prepare-Sample-Dataset

…pare-sample-database-script

…o_questions

…ing method

…pare-sample-database-script

… for get_train_file method

…pare-sample-database-script

Copilot

Pull Request Overview

This PR adds unit tests for the sample dataset preparation pipeline and updates the signature and in-call usage of the add_schema_used function.

Added tests for functions such as get_train_file_path, create_train_file, copy_bird_train_file, get_train_data, and add_schema_used.
Updated add_schema_used to accept an additional train_file parameter and modified calls accordingly.

Reviewed Changes

Copilot reviewed 3 out of 3 changed files in this pull request and generated 3 comments.

File	Description
server/test/preprocess/test_prepare_sample_dataset.py	Adds comprehensive test cases for sample dataset functions.
server/preprocess/prepare_sample_dataset.py	Updates the add_schema_used function signature and call sites to pass train_file as a Path object; adjusts file path handling.
server/init.py	Introduces package documentation for the server package.

Copilot · 2025-05-22T19:25:57Z

server/preprocess/prepare_sample_dataset.py


    if train_data:  
-        add_schema_used(train_data, dataset_type)
+        add_schema_used(train_data, dataset_type, Path(train_file))


Both the public function and its caller share the same name 'add_schema_used' but now with an added parameter, which could lead to confusion or unintended recursion; consider renaming one of these to clearly differentiate their responsibilities.

Copilot · 2025-05-22T19:25:57Z

server/preprocess/prepare_sample_dataset.py

    train_file = get_train_file_path()
    dataset_type = PATH_CONFIG.sample_dataset_type
-    train_data = get_train_data(train_file)
+    train_data = get_train_data(Path(train_file))


[nitpick] Consider using consistent types for file paths across your functions; either update get_train_data to accept a Path object or convert the Path to a string before passing it.

Suggested change

train_data = get_train_data(Path(train_file))

train_data = get_train_data(Path(train_file)) # Ensure get_train_data supports Path objects

Copilot · 2025-05-22T19:25:58Z

server/test/preprocess/test_prepare_sample_dataset.py

+    @patch('os.path.exists', return_value=True)
+    @patch('os.makedirs')
+    @patch('shutil.copyfile')
+    @patch('server.preprocess.prepare_sample_dataset.add_sequential_ids_to_questions')  # Mocking API key during test


[nitpick] The comment 'Mocking API key during test' is misleading for add_sequential_ids_to_questions; please update it to accurately reflect the mocked functionality.

AwaisKamran added 30 commits May 15, 2025 16:07

refactor: prepare_sample_dataset updated

e6c212e

refactor: misleading parameters removed from function

7307e36

Merge branch 'main' of github.com:Conrad-X/text2SQL into refactor-pre…

05994f9

…pare-sample-database-script

fix: replace add_question_id_for_bird_train with add_sequential_ids_t…

9091924

…o_questions

feat: Replace alive bar with tqdm

479da2c

refactor: replace write_train_data_to_file with save_json_to_file

5390337

refactor: update docstrings

b206ccc

refactor: rename error_messages.py to response_messages.py

760e0f9

refactor: reused bird utils constants

1bba42a

refactor: response messages updated for consistency

75be5ec

refactor: update import paths

bc96600

refactor: use with statement instead of conventional connection creat…

daced9a

…ing method

refactor: create sql connection explicitly

8afeb3d

refactor: update import statements

af5dd1c

docs: add docstring to create_database_connection function

f74ab77

refactor: fixed docstrings

0a26d3a

refactor: copilot suggestions accommodated

37363b2

refactor: segregated current_db and train_data fetching

51a1c32

refactor: typehints added

7dd755e

docs: add module docstring to indexing_constants

b6e5a09

Merge branch 'main' of github.com:Conrad-X/text2SQL into refactor-pre…

3c5a6a7

…pare-sample-database-script

feat: updated TODO comment for add_database_descriptions

b0cfc7d

fix: import statement changed

cf1b0ea

chore: merge conflicts resolved

b503583

refactor: error message name issue

68fa986

refactor: add_schema_used updated

7d659c8

fix: update parameters for add_schema_used method

8cd8470

chore: LLMConfig added to add_database_descriptions & typehints added…

b8f2238

… for get_train_file method

chore: connection removed from add_schema_used

85d1a07

fix: fixed docstrings

bb9e8bc

AwaisKamran added 6 commits May 22, 2025 11:47

chore: added try/catch around updating schema_used

49b44fc

chore: item[SCHEMA_USED] updated

db53d4b

fix: close_connection method removed

0a5c46b

Merge branch 'main' of github.com:Conrad-X/text2SQL into refactor-pre…

c52ebaf

…pare-sample-database-script

Merge branch 'main' of github.com:Conrad-X/text2SQL into refactor-pre…

e526088

…pare-sample-database-script

chore: add test cases for prepare_sample_dataset.py

eeabf76

AwaisKamran self-assigned this May 22, 2025

Copilot AI review requested due to automatic review settings May 22, 2025 19:25

AwaisKamran added the enhancement New feature or request label May 22, 2025

Copilot AI reviewed May 22, 2025

View reviewed changes

AwaisKamran added 2 commits May 23, 2025 00:42

chore: update test cases

320d9ab

fix: update test cases

21dbc1e

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Prepare sample database script test cases#128

Prepare sample database script test cases#128
AwaisKamran wants to merge 38 commits intomainfrom
prepare-sample-database-script-test-cases

AwaisKamran commented May 22, 2025

Uh oh!

Copilot AI left a comment

Uh oh!

Copilot AI May 22, 2025

Uh oh!

Copilot AI May 22, 2025

Uh oh!

Copilot AI May 22, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

	train_data = get_train_data(Path(train_file))
	train_data = get_train_data(Path(train_file)) # Ensure get_train_data supports Path objects

Conversation

AwaisKamran commented May 22, 2025

Description

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull Request Overview

Reviewed Changes

Uh oh!

Copilot AI May 22, 2025

Choose a reason for hiding this comment

Uh oh!

Copilot AI May 22, 2025

Choose a reason for hiding this comment

Uh oh!

Copilot AI May 22, 2025

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants