The Ultimate Research & Implementation Assistant for Claude Code & Cursor
Sherlock is a curated collection of high-powered AI skills designed to transform your research workflow. It doesn't just read papers; it deduces, visualizes, and reconstructs them into working code.
From deep-diving into literature to converting methodology into Python, Sherlock equips your AI agent with the tools of a master detective and a master engineer.
Sherlock equips your agent with these specialized skills:
Paper2Code: Transforms academic theory into executable reality.
- 4-Stage Pipeline: Systematically converts research papers into code (Algorithm Extraction → Concept Analysis → Planning → Implementation).
- No Hallucinations: Forces a structured intermediate representation (YAML) before writing a single line of code.
- Reproducibility First: Prioritizes accuracy and paper fidelity over "clever" coding.
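The idea of requiring a structured plan before any code is written can be sketched as follows. This is a minimal illustration, not the skill's actual implementation: `AlgorithmSpec`, `validate_spec`, and `generate_stub` are hypothetical names, and a plain dataclass stands in for the YAML document.

```python
from dataclasses import dataclass, field


@dataclass
class AlgorithmSpec:
    """Hypothetical intermediate representation (stands in for the YAML file)."""
    name: str
    inputs: list[str] = field(default_factory=list)
    outputs: list[str] = field(default_factory=list)
    steps: list[str] = field(default_factory=list)


def validate_spec(spec: AlgorithmSpec) -> list[str]:
    """Return a list of problems; an empty list means the spec is complete."""
    problems = []
    if not spec.name.strip():
        problems.append("missing algorithm name")
    if not spec.inputs:
        problems.append("no inputs listed")
    if not spec.outputs:
        problems.append("no outputs listed")
    if not spec.steps:
        problems.append("no steps extracted from the paper")
    return problems


def generate_stub(spec: AlgorithmSpec) -> str:
    """Emit a Python function stub, but only from a validated spec."""
    problems = validate_spec(spec)
    if problems:
        raise ValueError("spec incomplete: " + "; ".join(problems))
    lines = [f"def {spec.name}({', '.join(spec.inputs)}):"]
    # Each extracted step becomes a comment the implementation must follow.
    lines += [f"    # Step {i}: {step}" for i, step in enumerate(spec.steps, 1)]
    lines.append(f"    raise NotImplementedError  # should return: {', '.join(spec.outputs)}")
    return "\n".join(lines) + "\n"
```

Because `generate_stub` refuses an incomplete spec, gaps in the extraction surface as an explicit error instead of being filled in by guesswork during code generation.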
Deep Research: The heavy lifter for comprehensive reports.
- Multi-Pass Drafting: Spawns parallel sub-agents to draft different sections of a report.
- Evidence Tracking: Maintains strictly cited evidence tables; no claim goes unsourced.
- High Fidelity: Produces professional-grade reports with strict formatting compliance.
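One way to picture the evidence-tracking rule is a table that refuses to render while any claim lacks a source. The sketch below is an assumption about how such a check could look; the `Evidence` record and both helper functions are hypothetical and not taken from the repository.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Evidence:
    claim: str
    source: str  # a URL or citation key; blank means the claim is unsourced


def unsourced_claims(table: list[Evidence]) -> list[str]:
    """Return every claim whose source field is blank."""
    return [row.claim for row in table if not row.source.strip()]


def render_table(table: list[Evidence]) -> str:
    """Render the evidence table as numbered lines, rejecting unsourced rows."""
    missing = unsourced_claims(table)
    if missing:
        raise ValueError(f"unsourced claims: {missing}")
    return "\n".join(
        f"[{i}] {row.claim} ({row.source})" for i, row in enumerate(table, 1)
    )
```

With placeholder data, `render_table([Evidence("claim A", "source-1")])` returns `[1] claim A (source-1)`, while a row with an empty source raises `ValueError` before any report text is produced.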
Paper Comic: Explains the unexplainable through visual narratives.
- Visual Translation: Turns dense academic text into educational comics.
- Style Adaptive: Choose from Classic, Tech/Futuristic, Warm, or Chalkboard art styles.
- Gemini Powered: Uses the genimg-gemini-web skill to keep characters consistent across panels.
paper-analyzer: X-Ray vision for PDFs.
- MinerU Integration: High-precision parsing of formulas, tables, and LaTeX from PDFs.
- Style Rewrite: Can rewrite complex papers into "Storytelling", "Academic", or "Concise" formats.
- Metadata Extraction: Automatically pulls title, authors, and citations.
genimg-gemini-web: The visual engine.
- Image Generation: Generates images via Google's Gemini Web.
- Multi-Modal: Handles text-to-image and image-to-text tasks.
- Session Awareness: Maintains context across multi-turn conversations for consistent output.
Visual Architect: Blueprints for understanding.
- Schema Generation: Transforms methodology sections into structural visual schemas.
- Prompt Engineering: Generates high-precision prompts for DALL-E 3 or Midjourney based on paper logic.
- Layout Logic: Detects if a system is Linear, Cyclic, Hierarchical, or Parallel.
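The four layout categories can be illustrated with a small graph heuristic. This is a sketch under stated assumptions, not the skill's actual detection logic: it treats the system as a list of directed edges, and the `classify_layout` function and its decision rules are hypothetical.

```python
from collections import defaultdict


def classify_layout(edges: list[tuple[str, str]]) -> str:
    """Classify a directed graph as Cyclic, Linear, Hierarchical, or Parallel."""
    succ = defaultdict(list)
    indeg = defaultdict(int)
    nodes = set()
    for src, dst in edges:
        succ[src].append(dst)
        indeg[dst] += 1
        nodes.update((src, dst))

    # Kahn's algorithm: if a topological order cannot cover every node,
    # the graph contains a cycle.
    remaining = {n: indeg[n] for n in nodes}
    ready = [n for n in nodes if remaining[n] == 0]
    visited = 0
    while ready:
        node = ready.pop()
        visited += 1
        for nxt in succ[node]:
            remaining[nxt] -= 1
            if remaining[nxt] == 0:
                ready.append(nxt)
    if visited < len(nodes):
        return "Cyclic"

    max_out = max((len(succ[n]) for n in nodes), default=0)
    max_in = max((indeg[n] for n in nodes), default=0)
    if max_out <= 1 and max_in <= 1:
        return "Linear"  # a simple chain of stages
    if max_in <= 1:
        return "Hierarchical"  # branches fan out and never merge
    return "Parallel"  # at least one node merges several branches
```

Under these rules, a chain `a → b → c` is Linear, adding `c → a` makes it Cyclic, a root with separate children is Hierarchical, and a fan-out that rejoins at one node is Parallel.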
- Clone the Repository
  git clone https://github.com/proyecto26/research-skills.git .claude
- Configure .claude/config.json (Optional but Recommended): Ensure your skills are registered with your agent runtime (Claude Code / Cursor MCP).
- Authentication: Some skills (like genimg-gemini-web and paper-analyzer) require API tokens or login sessions.
  - Gemini: Run npx -y bun .claude/skills/genimg-gemini-web/scripts/main.ts --login
  - MinerU: Export MINERU_TOKEN in your environment.
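Before running a skill that depends on MinerU, it can help to confirm the token is actually visible to the process. A minimal sketch, assuming only the `MINERU_TOKEN` variable from the authentication step; the `require_token` helper is hypothetical and not part of the repository.

```python
import os


def require_token(name: str = "MINERU_TOKEN", env=None) -> str:
    """Return the token from the environment, or raise with a clear hint."""
    env = os.environ if env is None else env
    token = env.get(name, "").strip()
    if not token:
        raise RuntimeError(f"{name} is not set; export it before running the skill")
    return token
```

Passing a plain dictionary as `env` makes the check easy to exercise without touching the real environment.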
"Sherlock, implement this paper for me."
Triggers Paper2Code to read the PDF, plan the architecture, and write the Python implementation.
"Sherlock, explain this complex transformer architecture as a comic."
Triggers Paper Comic to create a visual storyboard explaining the concept.
"Sherlock, I need a deep research report on the state of LLM reasoning."
Triggers Deep Research to crawl, analyze, and compile a multi-page cited report.
"Sherlock, visualize the flow of data in this system."
Triggers Visual Architect to design a schematic diagram prompt.
skills/
├── deep-research/      # Report generation & evidence tracking
├── paper2code/         # Paper implementation pipeline
├── paper-comic/        # Educational comic generator
├── paper-analyzer/     # PDF parsing & style rewriting
├── genimg-gemini-web/  # Image generation backend
└── visual-architect/   # Visual schema design
Sherlock learns from the community. If you have a new skill module or an improvement to an existing detective tool, please open a PR!
Made with ❤️