
Conversation


Nathan-Kulas-Tyler commented Aug 2, 2024

Issue #26

DynamoDB writes are failing for large plans

Description of changes:

drs-plan-automation/cfn/lambda/drs-plan-automation/src/drs_automation_plan.py:
This change checks the size of the item to be written and writes it to S3 instead of DynamoDB when the size is too large. The item written to DynamoDB is reduced to the partition key and sort key fields, plus S3Bucket and S3Key fields pointing to where the full item is stored.
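
For context, a minimal sketch of the offload pattern is below. This is not the actual implementation: the write_result helper, the planId/executionId key names, the S3 key format, and the MAX_ITEM_SIZE threshold are all illustrative assumptions.

    import json
    import boto3

    # DynamoDB items are limited to 400 KB; the exact threshold used here is illustrative.
    MAX_ITEM_SIZE = 350 * 1024

    dynamodb = boto3.resource('dynamodb')
    s3_client = boto3.client('s3')

    def write_result(table_name, bucket, item):
        """Write item to DynamoDB, offloading the full record to S3 when it is too large.

        planId/executionId stand in for the table's actual partition and sort keys.
        """
        table = dynamodb.Table(table_name)
        body = json.dumps(item)
        if len(body.encode('utf-8')) <= MAX_ITEM_SIZE:
            table.put_item(Item=item)
            return
        # Store the full record in S3 and keep only a key stub plus a pointer in DynamoDB.
        key = f"results/{item['planId']}/{item['executionId']}.json"
        s3_client.put_object(Bucket=bucket, Key=key, Body=body)
        table.put_item(Item={
            'planId': item['planId'],
            'executionId': item['executionId'],
            'S3Bucket': bucket,
            'S3Key': key,
        })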

I tested this condition by extracting the data in our environment from the CloudWatch logs, which included a dump of the DynamoDB item to be written. I converted this back to JSON, wrote it to a file, and added that file to the Lambda function. I then commented out the majority of the Lambda handler code and replaced it with:

    # Load the captured DynamoDB item (extracted from CloudWatch logs) from the bundled file.
    result_path = os.path.join(os.path.dirname(__file__), 'result.json')
    print("loading result file")
    with open(result_path, 'r') as result_file:
        result = json.load(result_file)

    record_status(result, ddb_client, s3_client)

I confirmed the full item was written to S3 and that the item written to the DynamoDB table used the updated format. I performed these steps in a UAT environment, using an existing appId, planId, and executionId in the JSON payload.

drs-plan-automation/cfn/lambda/drs-plan-automation/template.yaml:
Updated the Lambda function's template to add a new environment variable for the S3 bucket; the bucket already created as part of this solution is reused for this purpose. Also updated the function's permission set to include S3 permissions.

drs-plan-automation/cfn/lambda/drs-plan-automation-api/src/app.js:
The API is updated to process records that are stored in S3 when applicable. The two functions that retrieve results now check each record for the S3Bucket and S3Key fields that are inserted by the Lambda. If those fields exist, the object is read from S3, its JSON is parsed, and the returned items are updated with the full data. I confirmed that the Disaster Recovery Accelerator page loads the records that are stored in S3.
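
The actual retrieval logic lives in app.js and uses the AWS SDK for JavaScript; purely to illustrate the flow, an equivalent Python/boto3 sketch with an assumed hydrate_record helper might look like:

    import json
    import boto3

    s3_client = boto3.client('s3')

    def hydrate_record(record):
        """Return the full result for a record, fetching it from S3 when the item
        is only a stub containing S3Bucket and S3Key (field names assumed)."""
        if 'S3Bucket' in record and 'S3Key' in record:
            obj = s3_client.get_object(Bucket=record['S3Bucket'], Key=record['S3Key'])
            return json.loads(obj['Body'].read())
        return record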

drs-plan-automation/cfn/lambda/drs-plan-automation-api/template.yaml:
Updated the Lambda function's template so its permission set includes S3 permissions. I had originally added an environment variable for the S3 bucket to this template as well, but commented it out since the bucket name is stored in the DynamoDB item.

By submitting this pull request, I confirm that you can use, modify, copy, and redistribute this contribution, under the terms of your choice.

Commits:
Dynamo DB writes are failing for large data sets
S3 needed ARN not bucket name
Indented the Sid of the S3 policy to the wrong level in the api policy