Skip to content

feat: 네이버 블로그 리서치 스킬 추가#107

Merged
vkehfdl1 merged 3 commits intoNomaDamas:mainfrom
cockyb:feat/naver-blog-research
Apr 12, 2026
Merged

feat: 네이버 블로그 리서치 스킬 추가#107
vkehfdl1 merged 3 commits intoNomaDamas:mainfrom
cockyb:feat/naver-blog-research

Conversation

@cockyb
Copy link
Copy Markdown
Contributor

@cockyb cockyb commented Apr 11, 2026

Summary

  • API 키 없이 python3 표준 라이브러리만으로 동작하는 네이버 블로그 리서치 스킬 추가
  • 블로그 검색(naver_search.py), 원문 읽기(naver_read.py), 이미지 다운로드(naver_download_images.py) 3개 스크립트 구성
  • 모바일 버전(m.blog.naver.com) 활용으로 iframe 없이 본문 직접 추출

주요 기능

  • 네이버 블로그 키워드 검색 (관련도순/최신순, 페이지네이션 지원)
  • 블로그 포스트 원문 텍스트 및 이미지 URL 추출
  • 블로그 이미지 CDN에서 로컬 다운로드 (ThreadPoolExecutor 병렬 처리)
  • stdin 파이프 체이닝 지원 (naver_read.py | naver_download_images.py)

Test plan

  • python3 scripts/naver_search.py "검색어" --count 5 실행하여 JSON 결과 확인
  • python3 scripts/naver_read.py "블로그URL" 실행하여 본문 텍스트 추출 확인
  • naver_read.py 출력을 naver_download_images.py에 파이프하여 이미지 다운로드 확인
  • --insecure 플래그가 네이버 도메인에만 허용되는지 확인

🤖 Generated with Claude Code

cockyb and others added 3 commits April 12, 2026 02:30
API 키 없이 python3 표준 라이브러리만으로 네이버 블로그 검색, 원문 읽기, 이미지 다운로드를 수행하는 스킬.

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
Naver's current blog search surface does not honor the older
where=blog + sort query pattern used by this skill. The request
now targets the blog tab surface, uses the observed NSO sort
controls, and trims each parsed page to the visible 15-result
window so count-based pagination returns distinct results.

Constraint: Must keep using stdlib-only HTTP scraping without adding dependencies
Constraint: Current Naver blog tab behavior requires ssc/tab parameters plus nso sort controls
Rejected: Keep where=blog and tune start values only | still returned repeated first-page results
Rejected: Leave sort=date as-is | current endpoint ignored it and returned relevance ordering
Confidence: medium
Scope-risk: narrow
Reversibility: clean
Directive: Re-verify request params against live Naver markup before changing paging or sort semantics again
Tested: python3 -m py_compile on naver-blog-research scripts and new regression test; PYTHONPATH=.:scripts python3 -m unittest scripts.test_naver_blog_search; npm run lint; live naver_search.py --count 20/30 --sort sim; live naver_search.py --count 10/20 --sort date
Not-tested: Full npm run test remains blocked by unrelated local pyexpat/libexpat environment failures in patent-search tests
PR 107 adds the skill and feature guide, but the repository landing page
still omitted it from the user-facing capability list. This commit keeps the
README aligned with the actual shipped skill set so users can discover the
new entry point from the main docs.

Constraint: README capability tables and feature lists should stay aligned with docs/features entries
Rejected: Leave README unchanged until merge | hides the new skill from the main index during PR review
Confidence: high
Scope-risk: narrow
Reversibility: clean
Directive: When adding a new skill guide, update both the summary table and the included-features list together
Tested: README diff review; verified docs/features/naver-blog-research.md link target exists
Not-tested: Full npm run ci (docs-only change)
@vkehfdl1 vkehfdl1 merged commit 4f015c5 into NomaDamas:main Apr 12, 2026
1 check passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants