Skip to content

Conversation

@keatonwilson
Copy link
Collaborator

Vignette Improvements: Enhanced Readability and Clarity

Overview

This PR significantly improves the DuckDB vignette by fixing technical inaccuracies, clarifying database structure, and simplifying visualizations for better readability. It also represents a manual QA/QC to ensure all vignette code operates after column name changes to database schema.

Changes Made

Documentation

  • Small updates to comments and documentation language - mostly fixing typos and improving code block commenting.

Visualization Improvements

Time Analysis Plot (Multi-table Block, Plot 1)

  • Addressed crowded 26-genera x-axis issue
  • Maintained core comparison of water potential by genus and time of day
  • Improved readability through better layout options

Study Timeline Plot (Multi-table Block, Plot 2)

  • Replaced overwhelming 258-study individual timeline with aggregated view
  • Now shows count of active studies over time by region using stacked bar chart
  • More effectively communicates temporal coverage patterns without overwhelming detail

Species Bar Chart (Visualizations Section)

  • Reduced from 410 species to top 20 species with lowest (most extreme) water potentials
  • Added clear title "Top 20 Species with Lowest Water Potential" indicating the filtering criterion
  • Significantly improved readability while maintaining key insights about stress tolerance

Impact

These changes make the vignette more accurate as a reference document, easier to follow for new users, and provide clearer visual insights into PSInet database structure and contents. The visualizations now effectively communicate key patterns without overwhelming readers with excessive detail.

Fixed spelling of "associated" and "Auxiliary" in the database structure section.
- Switch water potential example from Sorghum to Quercus
- Add tidyr library import
- Improve plot aesthetics (flip axes, filter by sample size)
- Add study timeline filtering for better visualization
- Expand time series analysis section with daily aggregation, autocorrelation, and decomposition
- Fix join conditions to include plot_id
- Add data completeness checks (drop_na, sample size filtering)
…tials

Modified the chamber water potential example to display only the 20 species with the lowest (most extreme) water potentials instead of all species, improving readability and adding a fill color to the bar plot.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants