youtube-crawl

Local-first YouTube transcript desk for loading a video, verifying the source, reading the raw transcript, and only then choosing whether to generate summary, detail notes, or transcript-aware chat with your own OpenAI, Claude, or Google API key.

No account system, no hosted vector store, and no eager AI generation. The default workflow is source first, analysis second.

Product Tour

Home	Preview

Workspace	Settings

Design Principles

Preview before analysis. The app loads the transcript, metadata, language, and opening lines on /preview so the user can verify the source before spending tokens.
Transcript first. /workspace opens on the raw script tab and treats summary, detail, and chat as optional lenses instead of the default UI.
On-demand AI only. Summary and detail are requested when their tabs are opened, not during the initial load path.
Retrieval-grounded chat. The chat route scores transcript chunks against the current question plus recent conversation, then sends only the relevant evidence instead of the full transcript every turn.
Local-first persistence. Workspace state, recent history, provider choice, model names, instructions, and API keys stay in browser local storage on the current device.
Resilient transcript loading. The input parser accepts watch URLs, short URLs, Shorts URLs, embed URLs, and raw video IDs, then the fetcher tries multiple YouTube client strategies before failing.
No Python runtime on the current main branch. Transcript loading now uses a pure JavaScript fetcher against YouTube endpoints.

How It Works

Normalize the input. The app extracts a valid 11-character video ID from multiple YouTube URL formats or a raw ID.
Fetch transcript and metadata in parallel. /api/transcript calls the transcript fetcher and YouTube oEmbed together, then cleans entities, decorates segments with timestamps, and builds plain plus timestamped transcript text.
Persist the working set locally. The selected video, transcript payload, generated documents, chat history, and settings are cached in browser storage so the workspace can be reopened quickly on the same machine.
Generate long-form documents with chunk-and-merge. /api/assistant splits long transcripts into bounded chunks, generates structured notes for each chunk, and merges them into one summary or detailed reading companion in the requested language.
Answer chat questions with targeted evidence. The chat path builds searchable transcript chunks, ranks them with lightweight token scoring, keeps only the latest conversation history, and returns source previews alongside the answer.

App Flow

flowchart LR
    A["Paste a YouTube link or raw video ID"] --> B["Normalize input"]
    B --> C["Fetch transcript + metadata"]
    C --> D["Preview metadata and opening lines"]
    D --> E["Open transcript workspace"]
    E --> F["Read raw transcript"]
    F --> G["Generate summary/detail on demand"]
    F --> H["Ask transcript-aware follow-up questions"]

Technical Notes

Transcript fetching is implemented in pure JS and uses YouTube caption endpoints with multiple client contexts to improve reliability.
The transcript payload includes raw segments, plain transcript text, timestamped transcript text, and derived stats such as duration, segment count, word count, and character count.
Summary and detail prompts enforce a structured output format instead of free-form dumping.
Chat answers are instructed to stay inside transcript evidence and cite timestamps for factual claims.
Recent history is keyed by video ID so revisiting the same video updates the existing saved entry instead of duplicating it.

Stack

Next.js 16
React 19
Tailwind CSS 4
Pure JS YouTube transcript fetcher
Bring-your-own OpenAI, Claude, and Google provider support
Electrobun desktop shell scripts for desktop packaging

Quick Start

Requirements:

Node.js 22+
npm 10+

Install dependencies:

npm install

Start the local app:

npm run dev

Open http://localhost:3000.

Scripts

npm run dev: start the Next.js development server
npm run build: create a production build
npm run lint: run ESLint
npm run desktop:dev: run the web app and Electrobun shell together
npm run desktop:build: bundle the desktop build
npm run desktop:run: run the packaged desktop entrypoint

Project Docs

Verification

The repository has been checked locally with:

npm run lint
npm run build

README screenshots were generated with Playwright using scripts/capture-readme-screenshots.mjs.

License

MIT

Name		Name	Last commit message	Last commit date
Latest commit History 20 Commits
.claude		.claude
.github		.github
icon.iconset		icon.iconset
output/playwright/readme		output/playwright/readme
public		public
scripts		scripts
src		src
.gitignore		.gitignore
CODE_OF_CONDUCT.md		CODE_OF_CONDUCT.md
CONTRIBUTING.md		CONTRIBUTING.md
LICENSE		LICENSE
README.ko.md		README.ko.md
README.md		README.md
SECURITY.md		SECURITY.md
electrobun.config.ts		electrobun.config.ts
eslint.config.mjs		eslint.config.mjs
next.config.ts		next.config.ts
package-lock.json		package-lock.json
package.json		package.json
postcss.config.mjs		postcss.config.mjs
tsconfig.json		tsconfig.json

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

youtube-crawl

Product Tour

Design Principles

How It Works

App Flow

Technical Notes

Stack

Quick Start

Scripts

Project Docs

Verification

License

About

Uh oh!

Releases 1

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

youtube-crawl

Product Tour

Design Principles

How It Works

App Flow

Technical Notes

Stack

Quick Start

Scripts

Project Docs

Verification

License

About

Resources

License

Code of conduct

Contributing

Security policy

Uh oh!

Stars

Watchers

Forks

Releases 1

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages