Skip to content

Published at GGC-SD - coy of itskills for Fall 2025 to develop responsive website

License

Notifications You must be signed in to change notification settings

GGC-SD/GrizzlyPaths

 
 

Repository files navigation

JobsMatch: Web/Mobile application used by IT students for matching jobs to GGC courses

Description

This tool is designed to assist IT students in exploring different entry-level positions and then view the specific courses at GGC which would provide the entry-level skills for each position they are interested in. The scope of this project for Spring 2021 is to gather job listing datasets, clean and filter the initial datasets, manually test the project's goal, and statistically analyze data about job skills. The result of the project is that we have proven that our process works and that important job-skills data can be captured, filtered, displayed, and can be ultimately used in an application. Significant further development is required. Other IT students at GGC will have the opportunity to provide more research into different entry-level positions and provide the actual development of the Web/Mobile application to create a highly-beneficial working tool.

Spring 2025 Scope: Further develop the website tool by cleaning, analyzing, and creating visualizations from webscraped job posting sites such as LinkedIn, Glassdoor, and Indeed.com. These results will be used to attain the top 5-10 jobs and their correlating primary technical and soft skills depending on major or field. The user selected job/skill will result in the tool displaying courses that teach those necessary skills, ensuring the student will learn the skills necessary to attain their dream job. The team is working to update and revamp the current website to be more interactive and dynamic for users.

Updates Fall 2025: GrizzlyPaths is a web application that is meant to help upcoming IT majors by showing them a road map of skills that will be required for jobs in the desired industry. Our focus is on creating a road map that will display 10 job titles that will be displayed based on the occurrence within multiple job searching websites. As they choose the job, hard and soft skills will then be displayed based on importance from a variety of companies. Those skills will have corresponding courses at Georgia Gwinnett College that students must attend.

Project Demo Video

Project Website

Spring '23: Grizzly Path

Notebook

Final Report

Fall '25 Team

  • Students: Sidibaba Simpara - Project Manager & UI/UX design, Hieu Do - Lead Programmer & Documentation Lead, Charles Sarpong - Testing Lead, William Chokbengboune - Data Modeler & Client Liaison
  • Advisor: Dr. Anca Doloc-Mihu
  • Team Flyer
  • Team Photo with Clients
  • Coding Documentation

Spring '25 Team

  • Student: Michelle Webb - Data Analyzer/Project Manager, Nikhita Nikhita- Visualization/Project Documenter, Krishan Bhalsod - Data Modeler/Data Analyzer/data cleaning/collection, Lucas Leon Visualization/Client_Liason
  • Advisor: Dr. Anca Doloc-Mihu, Assistant Professor of Information Technology

Fall '23 Team

  • Students: Sam Downs
  • Advisor: Dr. Anca Doloc-Mihu

Spring '23 Team

alt text

  • Students: Anel Coralic, Sam Downs, Ashley Mendez
  • Advisor: Dr. Anca Doloc-Mihu, Assistant Professor of Information Technology

Spring '22 Team

  • Student: Michael Murillo Martinez
  • Advisor: Dr. Anca Doloc-Mihu, Assistant Professor of Information Technology

Summer '21 Team

  • Student: Hugh Smith
  • Advisor: Dr. Anca Doloc-Mihu, Assistant Professor of Information Technology

Publications

STaRS Symposium Poster

CREATE Symposium

Outreach Activities

  • ITEC 2140 Introduction to Java, Profession Xin Xu, April 27, 2021
  • ITEC 2140 Introduction to Java, Profession Xin Xu, April 28, 2021

Technology

Fall '25

  • Backend: Firebase
  • Frontend: HTML/CSS, JavaScript
  • Project Dependencies: React + Vite, Jest, Vitest, Boostrap, Paraphase, CSV-parser, Chart, Firebase SDK, Firebase CLI
  • Version Control: Git & Github
  • Project Management: Jira

Spring '25

Fall '23

Spring '23

Summer '21

I utilized the website kaggle.com to obtain the job listing datasets, Microsoft Excel for data cleansing, Google Drive for online file location, and Google Colab Notebook for python development, data analysis, and display.

  1. https://www.kaggle.com/
  2. https://www.microsoft.com/en-us/microsoft-365/excel
  3. https://drive.google.com/drive/my-drive
  4. https://colab.research.google.com/notebooks/intro.ipynb?utm_source=scs-index
  5. Technical Results: https://github.com/GGC-DSA/itskills/blob/main/media/Technical%20Results/SD%20Skills.png

Project Setup/Installation

  1. Consulted with advisor (and searched online) to find job listing dataset sources
  2. Researched online which job titles are considered entry-level within GGC IT concentrations (SD and DSA)
  3. Downloaded datasets which likely contained job titles in the concentration researched
  4. Created a master spreadsheet to summarize all datasets downloaded
  5. Used MS Excel to explore and filter down to entry-level job titles in each dataset Made sure each dataset of job listings contained a column where the details about required skills, abilities, and responsibilities were specified by employers
  6. Saved dataset copies on Google Drive
  7. Saved a specific subset file for each dataset containing enough of one type of job title:
    1. For example: I created a dataset called DS-8
    2. Using excel I filtered titles down to just "Junior Software Developer" and save that file as "DS-8 Junior Software Developer"
  8. Signed up for access to Google Colab and created the Colab notebook for this project
  9. Created Python code that...
    1. Linked notebook to my Google drive
    2. Loaded single job title datasets (from step 7)
    3. Created dictionary to strip out common words during analysis
    4. Stripped meaningless characters out of data
    5. Created counter object to automatically rank top 1000 words
    6. Created dictionaries to home in on select words during analysis
      • Created MS Excel file "SD Key Skills Python Dictionary Builder"
      • Copied Top 1000 words/ranks from Python results to this file
      • Worked through list to create common word and skill set dictionaries
      • Formatted to put back into Python coding to build dictionaries
    7. Created analysis loop to filter out common words and filter down to skill words
    8. Created histogram plot to display skills words in a ranked order

Updates Fall 2025:

  1. Install Visual Studio Code
  2. Install Git
  3. Install Node.js
    • 3.1: Verify installation
      • node -v
      • npm -v
  4. Clone the repository:
    • git clone https://github.com/GGC-SD/GrizzlyPaths.git
    • cd GrizzlyPaths/docs-Fall2025
  5. Install project dependencies:
    • React + Vite: npm create vite@latest my-react-app -- --template (Replace my-react-app with your desired project name)
    • React-router-dom: npm install react-router-dom
    • Bootstrap: npm install bootstrap@5.3.3
    • Bootstrap-icon: npm install bootstrap-icons
    • React-chartjs-2: npm install react-chartjs-2
    • Paraphase: npm install papaparse
    • Csv-parser: npm install csv-parser
    • Firebase CLI globally for deployment: npm install -g firebase-tools
    • Install Firebase SDK in the project: npm install firebase
    • Vitest: npm install vitest --save-dev
    • Jest: npm install jest --save-dev
  6. Run the project:
    • npm run dev
  7. Run the test:
    • npm run test

Usage

  1. Open up colab Notebook for project
  2. Click on left side arrow buttons at each step starting from the top
    1. Run access to Google Drive at start of each session
    2. For each new dataset
      1. Run dataset load for specific subset on Google Drive
      2. Run definitions for excluding common words and garbage characters
      3. Run long list analysis of top 1000 words
      4. Run definitions for focusing in on skill-set words
      5. Run short list analysis of specific skill-sets
        1. Will print lists of skill words in different orders
        2. Will display histogram showing top skill words in chosen order

Fall '25:

  1. Log in to the website. If you don’t have an account, click the link to create one. If you have an account but you forgot your password, click the link to reset password
  2. In the dashboard, you can change the major by using the dropdown.
  3. You can click on the Recommend Course link to see 3000/4000 level courses for the each major.
  4. You can click on the Roadmap to see the visualize of the job posting for each major. When you click the major that you want to choose, it will show the job posting. When you click the job posting, it will show the technical skill of that job postings. Then, you can click on the technical skill to show the classes that you need to take. To change jobs, click the arrow in the bottom of the screen to reload the wheel. The wheel will then disappear and only show the majors.
  5. The logout is located in the upper right corner of the Dashboard.

Project Status

  1. Datasets collected from Jan '23
  2. Cleaned and Analyzed for common skills and job titles
  3. Grizzly Path Website up to date since April '23

Datasets

Fall 2025

Spring 2025

Cleaned

Final

Fall 2023

Cleaned

Original

Main methods for Analysis, ML/AI

Fall '25

  • Firebase - Log in page with full function.
  • Show the general of student information in dashboard.
  • There is a general list of 3000/4000 recommended courses in each major.
  • Have a chart to show to top 5 jobs for each major.
  • Import button for cleaned csv of merged data is in place.

Spr '25:

  • Python - value counts
  • Prince module for MCA
  • MCA/K-Means for classification and clustering

Fall '23:

  • Python - value_counts()
  • Predicting job titles - Naive Byes, Logistic Regression, Support Vector Machine, Random Forest

2 Main Results

Spring '25:

Tree Map with Courses and Related Skills Systems Security Jobs with SKills

Fall '23:

Systems and Security Common Job Titles Software Developer Common Skills for Web Developer

Fall 2025 Remaining Scope

  1. Admin authentication must be implemented in further iterations of project.
  2. Visualization of information can be improved if necessary.
  3. Page formatting needs to be improved.
  4. Webscraping and updating button must be added to admin box.
  5. Course history section that has intractability with roadmap needs to be added to student.
  6. Main page needs to be separated into 3 view for student, admin, and guest, so that login isn’t necessary.

Spring 2025 Remaining Scope

  1. Have a continuous influx of data with live job posting data
  2. Create a survey for IT faculty to gather which specific skills are taught for each class - Dr. Anca can distribute
  3. Include full range of courses available for domains
  4. Incorporate more data from other popular job sites or elsewhere
  5. Incorporating more interactive visuals as we could not implement our PowerBI dashboard into the website due to paywall
  6. Improve chatbot algorithm for better matching courses with job skills

Fall 2023 Remaining Scope

  1. Create/Update GGC class survey and ask collect data from students using the survey.
  2. Collect more information about Entriprise Systems Classes.
  3. Add a disclaimer page
  4. Refactor the TreeCreation.js file

Spring 2022 Remaing Scope of Project

  1. Create/Update GGC class survey. Needs to be user friendly and easier to extract data.
  2. Ask GGC IT students to complete survey
  3. Associate skills from job titles to GGC courses.
  4. Update Grizzly Path website with GGC courses

About

Published at GGC-SD - coy of itskills for Fall 2025 to develop responsive website

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Jupyter Notebook 98.8%
  • Other 1.2%