WebMirror is a high-performance website cloning tool powered by HTTrack. It allows you to download a World Wide Web site from the Internet to a local directory, building recursively all directories, getting HTML, images, and other files from the server to your device.
WebMirror is optimized for Google Colab, providing a clean, professional interface using Colab Forms so you don't even have to look at the code!
- ✅ Full Website Download: HTML, CSS, JS, and Images.
- ✅ Recursive Extraction: Follows links to download subpages.
- ✅ Colab UI: Simple forms for URL and folder name input.
- ✅ Automated Packaging: Automatically zips the cloned site for download.
- ✅ Professional Look: Designed for GitHub portfolios and high visibility.
- Open the Notebook: Click on the
WebMirror_Colab.ipynbfile in this repository. - Run Step 1: Install the necessary
httrackbinaries. - Configure: Enter the Target URL of the website you want to clone.
- Clone: Wait for the process to complete (time depends on site size).
- Download: Run the last cell to download your cloned site as a
.zipfile.
WebMirror uses the following core commands:
# Install HTTrack
!apt-get install httrack -y
# Clone Website
!httrack [URL] -O [Project_Name] "+*.png" "+*.gif" "+*.jpg" "+*.css" "+*.js" -vThis project was meticulously developed to meet professional standards. If you encounter any issues or need customization, feel free to reach out.
🌟 If you find this tool useful, don't forget to give it a STAR! 🌟
