Skip to content

[optional] Thorough docs review PR#177

Draft
mglbleta wants to merge 4 commits intomainfrom
docs_review
Draft

[optional] Thorough docs review PR#177
mglbleta wants to merge 4 commits intomainfrom
docs_review

Conversation

@mglbleta
Copy link
Copy Markdown
Collaborator

@mglbleta mglbleta commented Mar 13, 2026

📄 Description

PR for branch for making documentation edits as part of thorough review. Welcome other to make edits on this branch/PR. If it doesn't fall within the "code freeze" goals, or code revision priorities before handoff, I can save edits locally and apply to the codebase in the next UC Berkeley owned repo.

I will flag high level concerns for discussion on Tuesday meeting. UW team can decide if it is important that they revise docs before handoff, or can be left to UC Berkeley/LBL team for later.

✅ Checklist

  • I ran pre-commit run --all-files and all checks pass
  • [] Tests added/updated where needed
  • Docs added/updated if applicable
  • I have linked the issue this PR closes (if any)

🔗 Related Issues

Resolves #163

💡 Type of change

Type Checked?
🐞 Bug fix [ ]
✨ New feature [ ]
📝 Documentation [x]
♻️ Refactor [ ]
🛠️ Build/CI [ ]
Other (explain) [ ]

🧪 How to test

📝 Notes to reviewers

Docs review is in progress. More observations/considerations are noted in issue #163 for discussion than actual edits made in this PR. Future commits to this PR will populate as more documents are reviewed.

Summary by CodeRabbit

  • Documentation
    • Updated architecture documentation with clearer production and development environment details, including database version specifications and Google Cloud Platform configurations.
    • Clarified Jupyter notebook setup documentation.
    • Standardized documentation formatting across files.

… unsure what the project_filebrowser.db file in my staged changes is but it will not allow me to not commit. feel free to review and discard edit.
Copilot AI review requested due to automatic review settings March 13, 2026 00:59
@coderabbitai
Copy link
Copy Markdown
Contributor

coderabbitai bot commented Mar 13, 2026

📝 Walkthrough

Walkthrough

This pull request updates documentation files across the repository, primarily normalizing file endings with trailing newlines. Substantive changes include updates to architecture.md with enhanced cloud infrastructure and database specifications, along with a clarified purpose statement in notebook_setup.md.

Changes

Cohort / File(s) Summary
Architecture Documentation
docs/architecture.md
Removed Overview section; updated database references to include versions (PostgreSQL 13+/17+); expanded GCP details with production/dev environment specifics, Cloud SQL, Cloud Run, Artifact Registry, and Secret Manager; added TODO comment blocks for potential future documentation improvements.
Notebook Setup Clarification
docs/notebook_setup.md
Minor text update to Purpose statement: changed em-dash to colon and appended "namespace packages used in this repository" for clarity.
Documentation Trailing Newline Normalization
docs/CODE_OF_CONDUCT.md, docs/CONTRIBUTING.md, docs/ERD_VIEW.md, docs/README.md, docs/datamodels/README.md, docs/deployment/README.md, docs/pipeline/README.md, docs/resources/README.md, docs/webservice/README.md
Added or normalized trailing newlines to ensure consistent file formatting across documentation.

Estimated code review effort

🎯 2 (Simple) | ⏱️ ~12 minutes

Suggested reviewers

  • lsetiawan

Poem

🐰 Hops through docs with careful grace,
Newlines tidied, every place,
Architecture shines so bright,
Cloud and database set right!
Documentation's now pristine—
The clearest repo ever seen!

🚥 Pre-merge checks | ✅ 4 | ❌ 1

❌ Failed checks (1 inconclusive)

Check name Status Explanation Resolution
Title check ❓ Inconclusive The title '[optional] Thorough docs review PR' is somewhat vague and includes unnecessary meta-language like '[optional]' that doesn't convey the specific changes made. Consider a more descriptive title that clearly indicates the scope of documentation changes, such as 'Update documentation with architecture details and formatting improvements' or remove the '[optional]' prefix for clarity.
✅ Passed checks (4 passed)
Check name Status Explanation
Description check ✅ Passed The description mostly follows the template with context about the review process and decision-making, though it lacks detailed specifics about actual file changes made in this PR.
Linked Issues check ✅ Passed The PR addresses issue #163 which requests thorough documentation review and reporting of required changes. The PR shows multiple documentation file updates including architecture improvements and formatting fixes.
Out of Scope Changes check ✅ Passed All changes in the PR are documentation-related (docs directory files), which align with issue #163's requirement to thoroughly review and update documentation. No code logic changes are present.
Docstring Coverage ✅ Passed No functions found in the changed files to evaluate docstring coverage. Skipping docstring coverage check.

✏️ Tip: You can configure your own custom pre-merge checks in the settings.

✨ Finishing Touches
🧪 Generate unit tests (beta)
  • Create PR with unit tests
  • Post copyable unit tests in a comment
  • Commit unit tests in branch docs_review
📝 Coding Plan
  • Generate coding plan for human review comments

Thanks for using CodeRabbit! It's free for OSS, and your support helps us grow. If you like it, consider giving us a shout-out.

❤️ Share

Comment @coderabbitai help to get the list of available commands and usage tips.

Tip

You can validate your CodeRabbit configuration file in your editor.

If your editor has YAML language server, you can enable auto-completion and validation by adding # yaml-language-server: $schema=https://coderabbit.ai/integrations/schema.v2.json at the top of your CodeRabbit configuration file.

@mglbleta
Copy link
Copy Markdown
Collaborator Author

My pixi run pre-commit-all does the whitespace trimming thing, so it makes it look like more edits were made than they actually were with "13 files changed" when actually its about 2.

Copy link
Copy Markdown
Contributor

Copilot AI left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Pull request overview

Updates project documentation and documentation pointers as part of a broader docs review, while also modifying the Pixi lockfile configuration for PyPI resolution and local package installation mode.

Changes:

  • Edited docs/architecture.md and docs/notebook_setup.md for clarity/accuracy and added TODO notes.
  • Updated multiple docs/**/README.md pointer files (appear to be symlink-like stubs).
  • Updated pixi.lock (removed pypi-prerelease-mode options; marked local packages as editable: true).

Reviewed changes

Copilot reviewed 2 out of 13 changed files in this pull request and generated 3 comments.

Show a summary per file
File Description
pixi.lock Alters Pixi/PyPI resolution options and local package install mode (editable).
docs/architecture.md Updates infra/version statements and adds TODO notes; removes the prior “Overview” section.
docs/notebook_setup.md Minor wording/punctuation change in the purpose statement.
docs/webservice/README.md Updates a pointer/stub line (potentially symlink-related).
docs/resources/README.md Updates a pointer/stub line (potentially symlink-related).
docs/pipeline/README.md Updates a pointer/stub line (potentially symlink-related).
docs/deployment/README.md Updates a pointer/stub line (potentially symlink-related).
docs/datamodels/README.md Updates a pointer/stub line (potentially symlink-related).
docs/README.md Updates a pointer/stub line (potentially symlink-related).
docs/ERD_VIEW.md Updates a pointer/stub line (potentially symlink-related).
docs/CONTRIBUTING.md Updates a pointer/stub line (potentially symlink-related).
docs/CODE_OF_CONDUCT.md Updates a pointer/stub line (potentially symlink-related).

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

You can also share your feedback on Copilot code review. Take the survey.

- **Production deployment:** All core infrastructure (database, application
containers, orchestration, and secrets) runs on GCP using Cloud SQL, Cloud
Run, Artifact Registry, and Secret Manager
- **Database Hosting:** PostgreSQL 17+ with PostGIS (Cloud SQL on GCP for
Copy link
Copy Markdown
Contributor

@coderabbitai coderabbitai bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Actionable comments posted: 1

🧹 Nitpick comments (2)
docs/architecture.md (2)

315-318: Good practice to flag potentially outdated content.

The TODO comment appropriately signals to reviewers and future maintainers that this section needs verification. Consider creating a tracking issue to ensure this review happens.

🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In `@docs/architecture.md` around lines 315 - 318, The TODO note marking the "Core
Domain Models" section as potentially outdated should be replaced with an
actionable tracking reference: create a ticket (e.g., in your issue tracker) to
review and update the "Core Domain Models" section and then update the comment
to include the ticket ID/URL, an assignee, and a target review date;
specifically edit the TODO block in docs/architecture.md around the "Core Domain
Models" header to reference the new issue, include a short checklist (verify
model names, relationships, and examples) and mark the task as pending so
reviewers can see status until the doc is updated.

509-512: Good practice to flag future documentation improvements.

The TODO comment appropriately notes the opportunity to make this section more descriptive. Consider creating a tracking issue to prioritize this work.

🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In `@docs/architecture.md` around lines 509 - 512, Create a tracking issue in the
project tracker for the documentation rewrite referenced by the TODO and update
the TODO in docs/architecture.md (the comment containing "TODO (2026-03-12)") to
reference that issue ID or URL, include an owner and priority, and add a short
note about the expected scope (reduce bullet reliance, add descriptive language
and future architecture considerations) so the TODO is actionable and
discoverable.
🤖 Prompt for all review comments with AI agents
Verify each finding against the current code and only fix it if needed.

Inline comments:
In `@docs/architecture.md`:
- Line 89: Update the inconsistent database version wording so the "Database"
bullet (the line starting with "**Database**: PostgreSQL 13+ (17 in dev/staging)
with PostGIS extension") and the later sentence that reads "PostgreSQL 17+ with
PostGIS (Cloud SQL on GCP for production, local PostGIS for development)" convey
the same environment mapping; explicitly state which PostgreSQL major version is
used for production, staging, and development (e.g., "Production: PostgreSQL 17+
on Cloud SQL with PostGIS; Staging: PostgreSQL 17; Development: PostgreSQL 13+
with local PostGIS") and replace one of the conflicting lines with that
definitive mapping so both the "**Database**" bullet and the "PostgreSQL 17+
with PostGIS..." sentence match.

---

Nitpick comments:
In `@docs/architecture.md`:
- Around line 315-318: The TODO note marking the "Core Domain Models" section as
potentially outdated should be replaced with an actionable tracking reference:
create a ticket (e.g., in your issue tracker) to review and update the "Core
Domain Models" section and then update the comment to include the ticket ID/URL,
an assignee, and a target review date; specifically edit the TODO block in
docs/architecture.md around the "Core Domain Models" header to reference the new
issue, include a short checklist (verify model names, relationships, and
examples) and mark the task as pending so reviewers can see status until the doc
is updated.
- Around line 509-512: Create a tracking issue in the project tracker for the
documentation rewrite referenced by the TODO and update the TODO in
docs/architecture.md (the comment containing "TODO (2026-03-12)") to reference
that issue ID or URL, include an owner and priority, and add a short note about
the expected scope (reduce bullet reliance, add descriptive language and future
architecture considerations) so the TODO is actionable and discoverable.

ℹ️ Review info
⚙️ Run configuration

Configuration used: defaults

Review profile: CHILL

Plan: Pro

Run ID: 1387c6e3-6167-4845-ab6b-99867e6ec2f5

📥 Commits

Reviewing files that changed from the base of the PR and between dffea8d and 39bd4ed.

⛔ Files ignored due to path filters (2)
  • anaconda_projects/db/project_filebrowser.db is excluded by !**/*.db
  • pixi.lock is excluded by !**/*.lock
📒 Files selected for processing (11)
  • docs/CODE_OF_CONDUCT.md
  • docs/CONTRIBUTING.md
  • docs/ERD_VIEW.md
  • docs/README.md
  • docs/architecture.md
  • docs/datamodels/README.md
  • docs/deployment/README.md
  • docs/notebook_setup.md
  • docs/pipeline/README.md
  • docs/resources/README.md
  • docs/webservice/README.md


- **Programming Language**: Python 3.12+
- **Database**: PostgreSQL 13+ with PostGIS extension
- **Database**: PostgreSQL 13+ (17 in dev/staging) with PostGIS extension
Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

⚠️ Potential issue | 🟡 Minor

Database version inconsistency with line 127-128.

Line 89 states PostgreSQL 17 is used in "dev/staging", but lines 127-128 state "PostgreSQL 17+ with PostGIS (Cloud SQL on GCP for production, local PostGIS for development)". This creates confusion about which environments use which PostgreSQL versions.

Please clarify:

  • Is PostgreSQL 17+ used in production (as line 127 suggests)?
  • Is PostgreSQL 13+ still the minimum for development?
  • What version is used in dev/staging vs production?
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In `@docs/architecture.md` at line 89, Update the inconsistent database version
wording so the "Database" bullet (the line starting with "**Database**:
PostgreSQL 13+ (17 in dev/staging) with PostGIS extension") and the later
sentence that reads "PostgreSQL 17+ with PostGIS (Cloud SQL on GCP for
production, local PostGIS for development)" convey the same environment mapping;
explicitly state which PostgreSQL major version is used for production, staging,
and development (e.g., "Production: PostgreSQL 17+ on Cloud SQL with PostGIS;
Staging: PostgreSQL 17; Development: PostgreSQL 13+ with local PostGIS") and
replace one of the conflicting lines with that definitive mapping so both the
"**Database**" bullet and the "PostgreSQL 17+ with PostGIS..." sentence match.

@lsetiawan lsetiawan marked this pull request as draft March 19, 2026 22:51
Rewrite docs symlink entries without trailing newline payloads so Windows checkout/restore operations do not produce unreadable links.

No content changes to target markdown files; only symlink metadata normalization.
@read-the-docs-community
Copy link
Copy Markdown

read-the-docs-community bot commented Mar 21, 2026

Documentation build overview

📚 ca-biositing | 🛠️ Build #32063906 | 📁 Comparing ca5ea30 against latest (14f5740)


🔍 Preview build

Show files changed (31 files in total): 📝 25 modified | ➕ 2 added | ➖ 4 deleted
File Status
404.html 📝 modified
index.html 📝 modified
CODE_OF_CONDUCT/index.html 📝 modified
CONTRIBUTING/index.html 📝 modified
ERD_VIEW/index.html ➖ deleted
NASS_API_COMMODITY_ANALYSIS/index.html ➕ added
api/index.html 📝 modified
architecture/index.html 📝 modified
ca_biositing_local_setup/index.html 📝 modified
datamodels/index.html 📝 modified
deployment/index.html 📝 modified
notebook_setup/index.html 📝 modified
pipeline/index.html 📝 modified
resources/index.html 📝 modified
webservice/index.html 📝 modified
api/datamodels/index.html 📝 modified
api/pipeline/index.html 📝 modified
datamodels/CONTRIBUTING/index.html ➖ deleted
datamodels/SCHEMA_DRIFT/index.html 📝 modified
datamodels/SQL_FIRST_WORKFLOW/index.html 📝 modified
pipeline/ALEMBIC_WORKFLOW/index.html 📝 modified
pipeline/CONTRIBUTING/index.html ➖ deleted
pipeline/DOCKER_WORKFLOW/index.html 📝 modified
pipeline/ETL_WORKFLOW/index.html 📝 modified
pipeline/GCP_SETUP/index.html 📝 modified
pipeline/PREFECT_WORKFLOW/index.html 📝 modified
webservice/CONTRIBUTING/index.html ➖ deleted
pipeline/USDA/ANALYTICS_HANDOFF/index.html 📝 modified
pipeline/USDA/FUTURE_TODO_FULL_COMMODITY_MAPPING/index.html ➕ added
pipeline/USDA/USDA_ETL_GUIDE/index.html 📝 modified
pipeline/USDA/USDA_ETL_HANDOFF/index.html 📝 modified

- remove ERD_VIEW docs page references/files

- update mkdocs repo URL to sustainability-software-lab

- refresh pipeline and datamodels readmes for current structure

- apply contributing/deployment workflow doc edits
Previous overneiw was redundant with documentation overview page and contained misleading language. This new version maintains benefits of an overview paragraph while removing misleading language.

Co-authored-by: Copilot Autofix powered by AI <175728472+Copilot@users.noreply.github.com>
@mglbleta
Copy link
Copy Markdown
Collaborator Author

mglbleta commented Apr 9, 2026

This PR is largely superseded by PR #225. Some revisions may not be captured in #225 and manual review and migration of edits from this PR to that one or a future PR is recommended.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

Chore: Thoroughly double check Documentation

5 participants