🚀 🚀 🚀 A crawler for fetching the historical articles of WeChat public accounts. It runs on Windows, Linux, and macOS and is designed for stable, efficient extraction of large volumes of article data.
- Bulk scraping of historical articles from WeChat public accounts
- Support for extracting article content, authors, publication dates, and more
- Multiple data storage formats (Excel, CSV, JSON)
- Python 3.x
- requests
- BeautifulSoup4
- pandas
- logging
- Archiving content from public accounts
- Data analysis of articles
- Content backup management
- Python 3.7+
pip install -r requirements.txt
- COOKIE: WeChat cookie information
- X_WECHAT_KEY: WeChat key
- X_WECHAT_UIN: User identifier
- EXPORTKEY: Export key
- USER_AGENT: Browser identifier
- PASS_TICKET: Pass ticket
- BIZ: Unique identifier for the public account
- Configuration parameter retrieval and update
  - Support for reading configuration files
  - Automated parameter validation
  - Automatic configuration template generation
- Article list scraping
  - Pagination for list retrieval
  - Automatic page navigation
  - Error retry mechanism
- Content parsing of articles
  - Title extraction
  - Author information
  - Publication date
  - Geolocation data
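
The list-scraping behaviour described above (pagination, automatic page navigation, retry on error) can be sketched roughly as follows. This is an illustration under assumptions, not the project's implementation: `fetch_page` is a hypothetical callable that takes an offset and returns a list of article records, and the retry count and linear back-off are arbitrary choices.

```python
import time


def fetch_with_retry(fetch_page, offset, retries=3, delay=1.0):
    """Call fetch_page(offset), retrying on any exception up to `retries` attempts."""
    for attempt in range(1, retries + 1):
        try:
            return fetch_page(offset)
        except Exception:
            if attempt == retries:
                raise  # give up after the last attempt
            time.sleep(delay * attempt)  # simple linear back-off


def iter_articles(fetch_page, retries=3, delay=1.0):
    """Yield article records page by page until an empty page is returned."""
    offset = 0
    while True:
        articles = fetch_with_retry(fetch_page, offset, retries, delay)
        if not articles:
            break
        yield from articles
        offset += len(articles)  # advance to the next page
```

Passing the request function in as an argument keeps the paging logic independent of how the HTTP call is made, so it can be exercised with a stub before real credentials are available.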
Download Charles
Cracking Charles
Install the certificate. For certificate installation and other unexpected problems, see: https://blog.csdn.net/m0_63892927/article/details/136680867
Runtime Issues
Errors occasionally occur while the crawler is running, mostly because the article content has been restricted (controlled) by the platform. Such cases can simply be ignored.
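
One way to ignore such cases without stopping a long run is to log the failure and move on to the next article. The sketch below assumes a hypothetical `parse_article(url)` callable that raises an exception when an article cannot be parsed; the logger name `crawler` is also an assumption.

```python
import logging

logging.basicConfig(level=logging.INFO)
log = logging.getLogger("crawler")  # hypothetical logger name


def parse_all(urls, parse_article):
    """Parse each URL, logging and skipping the ones that fail."""
    results, skipped = [], []
    for url in urls:
        try:
            results.append(parse_article(url))
        except Exception as exc:
            # Restricted or removed articles end up here and are skipped.
            log.warning("skipping %s: %s", url, exc)
            skipped.append(url)
    return results, skipped
```

Returning the skipped URLs alongside the results makes it possible to review or retry them later.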





