
Example: Add s3 example #44

Open
yuye-aws wants to merge 1 commit into Jon-AtAWS:jh-dataprocess from yuye-aws:Feat/S3Example


@yuye-aws commented Oct 13, 2025

This PR adds an example showing how to use S3 as the vector engine (`s3vector`). The example closely follows the dense exact search example. To the best of my knowledge, S3 currently supports only exact search.

Since this PR only adds an example, I'm not including any unit tests (UTs) or integration tests (ITs).

@yuye-aws yuye-aws requested a review from Jon-AtAWS October 13, 2025 09:01
}

- def __init__(self, max_number_of_docs: int = -1):
+ def __init__(self, dataset_path: str = DEFAULT_DATASET_PATH, max_number_of_docs: int = -1):

I'm adding this change to allow users to specify the dataset directory path.
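
For illustration, here is a minimal sketch of how a constructor with this signature might be used. The class name `ExampleDataset`, the value of `DEFAULT_DATASET_PATH`, the `limit` helper, and the reading of `-1` as "no limit" are all assumptions made for this sketch; none of them are taken from the PR.

```python
from pathlib import Path

# Hypothetical default; the actual DEFAULT_DATASET_PATH value is not shown in this PR view.
DEFAULT_DATASET_PATH = "datasets/example"


class ExampleDataset:
    """Hypothetical stand-in for the dataset class touched by this change."""

    def __init__(self, dataset_path: str = DEFAULT_DATASET_PATH, max_number_of_docs: int = -1):
        # Callers can now point the dataset at any directory.
        self.dataset_path = Path(dataset_path)
        self.max_number_of_docs = max_number_of_docs

    def limit(self, docs):
        """Cap docs at max_number_of_docs; a negative value is assumed to mean no cap."""
        docs = list(docs)
        if self.max_number_of_docs < 0:
            return docs
        return docs[: self.max_number_of_docs]


# Existing callers that pass nothing keep working with the default path.
default_ds = ExampleDataset()
# New callers can override the directory.
custom_ds = ExampleDataset(dataset_path="/tmp/my_dataset", max_number_of_docs=2)
```

Because `dataset_path` has a default value, existing keyword calls such as `ExampleDataset(max_number_of_docs=10)` are unaffected. A positional call like `ExampleDataset(10)` would now bind to `dataset_path`, so any such call sites would need updating.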

Signed-off-by: yuye-aws <yuyezhu@amazon.com>
"chunk_embedding": {
"type": "knn_vector",
"dimension": model_dimensions,
"method": {"engine": "s3vector"},

This is where the `s3vector` engine is specified.
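
As a rough sketch, the mapping fragment in the diff could be assembled like this. The helper name `build_s3vector_mapping` and the surrounding `mappings`/`properties` wrapper are assumptions for illustration; only the `chunk_embedding` field definition comes from the diff. Index settings and the client call that would create the index are deliberately left out.

```python
def build_s3vector_mapping(model_dimensions: int) -> dict:
    """Build an index mapping body whose vector field uses the s3vector engine.

    Hypothetical helper: only the chunk_embedding field definition mirrors the diff.
    """
    if not isinstance(model_dimensions, int) or model_dimensions <= 0:
        raise ValueError("model_dimensions must be a positive integer")
    return {
        "mappings": {
            "properties": {
                "chunk_embedding": {
                    "type": "knn_vector",
                    "dimension": model_dimensions,
                    # Selecting the engine is the only S3-specific part of the field.
                    "method": {"engine": "s3vector"},
                }
            }
        }
    }


# Example: a mapping for a model that produces 384-dimensional embeddings.
mapping = build_s3vector_mapping(384)
```

Keeping the engine name in a single place like this would make it straightforward to compare against the dense exact search example, which presumably differs mainly in the `method` block.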
