Skip to content

Docs: Add folder READMEs and repository map#26

Open
Copilot wants to merge 3 commits intoEEbrami-LISSY-pathfrom
copilot/docsfolder-readmes
Open

Docs: Add folder READMEs and repository map#26
Copilot wants to merge 3 commits intoEEbrami-LISSY-pathfrom
copilot/docsfolder-readmes

Conversation

Copy link
Contributor

Copilot AI commented Oct 22, 2025

Overview

This PR adds comprehensive folder-level documentation and a centralized repository map to improve navigation and onboarding for contributors. Each important folder now has a dedicated README with purpose, quick start commands, contents overview, conventions, and maintainer information.

Changes

Repository Map (Root README)

Added a new "Repository Map" section to the root README with a navigable table linking to 14 folder READMEs:

Category Folders Documented
Analysis & Toolkit DART, DART/MIMA, analysis, analysis/data-availability, analysis/data-availability/visuals
Remote Execution LISSY, LISSY/Tutorial, LISSY/DART_Validation, LISSY/MIMA5
Infrastructure scripts, docs, METIS-LIS
Data & Conversion xlsxFiles, xlsxConverted

New Folder READMEs (10 Created)

  • DART/README.md — Data Analysis & Reproducibility Toolkit overview with validation and plotting commands
  • LISSY/README.md — Remote-execution system documentation with dataset syntax reference and security conventions
  • LISSY/DART_Validation/README.md — Validation workflow comparing LISSY results with DART tables
  • LISSY/MIMA5/README.md — MIMA5 analysis scripts and plotting utilities
  • analysis/README.md — Cross-national poverty and data availability analyses
  • scripts/README.md — Repository-wide utility scripts for conversion
  • docs/README.md — Updated from placeholder to proper documentation folder overview
  • METIS-LIS/README.md — LIS metadata system resources (codebooks, indicators, waves)
  • xlsxFiles/README.md — Source Excel files documentation
  • xlsxConverted/README.md — Machine-readable converted outputs (CSV/JSON/MD)

Refined Existing READMEs (2 Updated)

  • analysis/data-availability/README.md — Streamlined algorithm descriptions, made more concise
  • analysis/data-availability/visuals/README.md — Condensed visualization documentation while preserving key information

Preserved Excellence (4 Kept As-Is)

  • DART/MIMA/README.md — Already comprehensive MIMA workflow documentation
  • LISSY/Tutorial/README.md — Already excellent self-teaching materials
  • Tutorial subdirectory READMEs preserved

Documentation Standards

All READMEs follow a consistent template:

# Folder Name

## Purpose
1-2 sentence description of what lives here and why it matters

## Quick Start
Runnable commands tailored to actual files in the folder

## Contents
- List of key files/subdirectories with relative links
- Brief description of each item

## Conventions
- What not to commit (secrets, large files)
- How to regenerate outputs
- Dependencies and environment setup

## Maintainers
- @EEbrami

Quality Metrics

Concise: 15-40 lines per README (average 30 lines)
Actionable: Real commands that run against actual repository files
Consistent: Uniform structure across all folders
Professional: Clear, action-oriented language
Linked: Repository map provides easy navigation
Privacy-Aware: Security guidance for LISSY folders (credentials, data handling)

Files Modified

  • 13 files changed: 10 created, 2 refined, 1 updated (root README)
  • +444/-145 lines: Net addition of 299 lines of documentation
  • No code changes: Documentation only (markdown files)

Benefits

  1. Improved Onboarding: New contributors can quickly understand repository structure
  2. Easy Navigation: Repository map provides one-click access to all documentation
  3. Clear Conventions: Each folder documents what to commit/ignore and how to regenerate outputs
  4. Consistent Experience: Uniform documentation style across all folders
  5. Security Guidance: LISSY folders include privacy and credentials handling instructions

Folders Excluded (with Rationale)

Infrastructure and build artifact folders intentionally excluded from README creation:

  • .github/ — GitHub workflows (infrastructure)
  • .git/ — Git metadata (infrastructure)
  • xlsxConverted/csvFiles, jsonFiles, mdFiles — Build artifact subfolders (parent folder has README)

Ready for review. Each folder README can be reviewed independently. The repository map in the root README provides a comprehensive overview of the documentation structure.

Original prompt

Prompt for the Copilot coding agent:
Task: Add or update README.md files for important folders and update the root README with a “Repository map” that links to each folder’s README.

Repository and branch:

  • Repo: EEbrami/Poverty-Project
  • Base branch: EEbrami-LISSY-path
  • Create a working branch from EEbrami-LISSY-path named docs/folder-readmes

Scope:

  1. Discover folders that matter:

    • Enumerate all top-level and key subfolders used by contributors (code, scripts, analysis, examples, docs, pipelines, data-availability, LISSY, DART, etc.).
    • Exclude noisy or infrastructure-only folders from README creation unless they already exist or are user-facing: e.g., .github, .git, pycache, .venv, node_modules, .idea, .vscode, build, dist, large data dumps.
    • Propose the list “Folders that matter” with a short 1-line rationale per folder. I must approve the list in your PR comment before merge.
  2. For each “matters” folder:

    • If README.md exists: keep existing content, tighten it to a concise professional style, and add any missing standard sections (Purpose, Quick start, Contents, Conventions, Maintainers).
    • If README.md does not exist: create one using the template below and adapt to the folder’s real contents.
    • Use clear, action-oriented “Quick start” commands that actually run (tailor to R/Python/etc. present in the folder).
    • If the folder generates large outputs, state what is not committed and how to regenerate.
    • If the folder handles data or remote systems (e.g., LISSY), include privacy/secrets guidance and link to onboarding.
  3. Update the root README.md:

    • Add a new section “Repository map” linking to every folder README you created/updated.
    • Use a compact table: Folder | Short description | Link (relative links).
    • Keep tone concise and professional.
  4. Quality and consistency:

    • Keep each folder README to ~10–30 lines. Prefer runnable examples over prose.
    • Match repo conventions (headings, tone, casing).
    • Do not overwrite useful existing details—merge and improve.
    • Run markdown lint/formatter if available; otherwise ensure clean headings and lists.
    • Use separate commits per folder for easier review, with messages like:
      • docs(readme): add README for
      • docs(readme): refine README for
      • docs: add repository map linking folder READMEs
  5. Output/PR etiquette:

    • Push commits to docs/folder-readmes.
    • Open a PR against EEbrami-LISSY-path titled “Docs: add folder READMEs and repository map”.
    • In the PR description, include:
      • The “Folders that matter” list with rationale (checkbox list).
      • A table of files added/modified.
      • Screenshots of the rendered “Repository map” section (optional).
    • Add a top-level PR comment showing the diff snippets for each README and wait for my approval before merging.

Templates to use and adapt:
Use these as starting points. Fill in concrete commands, inputs, and outputs for each folder.

# <Folder name>

Purpose
- One- or two-sentence description of what lives here and why it matters to contributors.

Quick start
- Example command to run or reproduce a key result.
- Note any required inputs or environment variables.

Contents
- scripts/ — main runnable steps
- notebooks/ — exploration and examples
- figures/ or outputs/ — generated artifacts (do not commit large binaries)
- data/ — sample or manifest listing required upstream data

Conventions
- Dependencies/versions that commonly trip people up.
- What not to commit (secrets, large files) and how to regenerate locally.

Maintainers
- @EEbrami
# LISSY (Remote-execution system docs & tutorials)

Purpose
- Onboarding and examples for submitting reproducible jobs to the LISSY remote-execution system.

Quick start
- Read getting-started.md for setup.
- Run an example (adjust to actual files): R: `Rscript examples/R/submit_example.R`
- Use job_templates/ as a base; annotate expected runtime and memory.

Contents
- getting-started.md — environment, auth, submission basics
- examples/ — language-specific runnable examples (R, SAS, SPSS, Stata)
- job_templates/ — reusable job scripts/manifests

Conventions
- Do not commit secrets; store credentials externally.
- Document resource hints (CPU/RAM/time) and I/O paths in each job.

Maintainers
- @EEbrami
# DART (Data Analysis & Reproducibility Toolkit)

Purpose
- Shared analysis utilities, validation/preprocessing scripts, and methodological notes for income/poverty analyses.

Quick start
- Reproduce main tables: `Rscript scripts/analyze.R --output outputs/`
- See scripts/ for steps and notebooks/ for exploration.

Contents
- scripts/ — analysis and preprocessing
- notebooks/ — exploratory work
- outputs/ — generated tables/figures (avoid committing large files)

Conventions
- Keep outputs deterministic; pin critical package versions if needed.
...

</details>



<!-- START COPILOT CODING AGENT TIPS -->
---

✨ Let Copilot coding agent [set things up for you](https://github.com/EEbrami/Poverty-Project/issues/new?title=✨+Set+up+Copilot+instructions&body=Configure%20instructions%20for%20this%20repository%20as%20documented%20in%20%5BBest%20practices%20for%20Copilot%20coding%20agent%20in%20your%20repository%5D%28https://gh.io/copilot-coding-agent-tips%29%2E%0A%0A%3COnboard%20this%20repo%3E&assignees=copilot) — coding agent works faster and does higher quality work when set up for your repo.

Copilot AI and others added 2 commits October 22, 2025 03:17
…es, xlsxConverted, analysis, and LISSY subfolders

Co-authored-by: EEbrami <148649132+EEbrami@users.noreply.github.com>
Co-authored-by: EEbrami <148649132+EEbrami@users.noreply.github.com>
Copilot AI changed the title [WIP] Add and update README files for key folders with repository map Docs: Add folder READMEs and repository map Oct 22, 2025
Copilot AI requested a review from EEbrami October 22, 2025 03:23
@EEbrami EEbrami marked this pull request as ready for review October 22, 2025 04:19
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants