An AI-driven recruitment tool designed to scrape and analyze GitHub and Google Scholar profiles to identify top candidates for AI and ML roles.
AIrecruiter/
├── src/
│ ├── core/ # Core data models and classes
│ │ ├── candidate.py
│ │ ├── person.py
│ │ ├── data.py
│ │ └── prof.py
│ ├── scrapers/ # Web scraping modules
│ │ ├── github.py
│ │ ├── googlescholar.py
│ │ ├── linkedin.py
│ │ └── authors.py
│ ├── scanners/ # Scanning and filtering modules
│ │ ├── scangit.py
│ │ ├── scangs.py
│ │ └── scanauth.py
│ └── utils/ # Utility functions
│ ├── llm.py
│ ├── normalise.py
│ ├── combiner.py
│ └── query_classifier.py
├── data/ # Data files and CSV outputs
├── templates/ # Flask HTML templates
├── static/ # Static CSS files
├── app.py # Main Flask application
└── requirements.txt # Python dependencies
- Query Classification: Automatically classifies recruitment queries
- GitHub Profile Analysis: Scores GitHub profiles based on AI/ML relevance
- Google Scholar Integration: Fetches and analyzes academic profiles
- Author Filtering: Filters and classifies co-authors from research papers
- Location-Based Search: Supports location-specific candidate searches
- Web Interface: Flask-based web UI for easy interaction
- Clone the repository
- Create a virtual environment:
python -m venv venv source venv/bin/activate # On Windows: venv\Scripts\activate
- Install dependencies:
pip install -r requirements.txt
python app.pyThe application will start on http://localhost:5000
- "Find top 6 students who have worked on TensorFlow and have a strong GitHub presence in Boston."
- "Recruit top 5 students in California who have worked on computer vision projects."
- "Find top 8 programmers in Seattle who have worked on GPT-3 and have published papers on NLP."
- Query Classification: Input query is classified to determine search type (GitHub, Scholar, Student)
- Profile Fetching: Relevant profiles are fetched based on classification
- Scoring: Profiles are scored using relevance algorithms
- Filtering: Results are filtered and normalized
- Output: Results are saved to CSV files in the
data/directory
- Python 3.7+
- See
requirements.txtfor full dependency list
MIT License