feat(main): use thread pool for concurrent scraping #83

Mirza-Samad-Ahmed-Baig · 2025-07-05T08:03:13Z

This pull request introduces concurrent scraping to the spider_some_note function in main.py using a ThreadPoolExecutor, significantly improving the performance of scraping multiple notes—especially when retrieving all notes from a user.

Problem
The previous implementation of spider_some_note scraped notes sequentially, which was:
Slow for users with many notes
Inefficient, as it did not leverage available system resources

Solution
Refactored the spider_some_note function to use Python’s ThreadPoolExecutor, enabling:
Concurrent scraping of notes
Parallel execution, reducing overall scraping time

Benefits
Improved Performance: Dramatically faster scraping of multiple notes
Increased Efficiency: Maximizes use of system resources by running scraping tasks in parallel

feat(main): use thread pool for concurrent scraping

8639c1a

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat(main): use thread pool for concurrent scraping #83

feat(main): use thread pool for concurrent scraping #83

Mirza-Samad-Ahmed-Baig commented Jul 5, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

feat(main): use thread pool for concurrent scraping #83

Are you sure you want to change the base?

feat(main): use thread pool for concurrent scraping #83

Conversation

Mirza-Samad-Ahmed-Baig commented Jul 5, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant