GitHub - copypastin/MOAIS: Analysis project to detect similarities between student-written automated code and human-written code.

Measure of Artificial Intelligence Similarity 🗿 (MOAIS)

MOAIS is a tool designed to detect the similarity between automated code snippets and human-written code. This project is inspired by Stanford's Measure of Software Similarity (MOSS), which aims to automatically detect software plagiarism.

See the original MOSS paper for more details:

(https://theory.stanford.edu/~aiken/moss/) (https://theory.stanford.edu/~aiken/publications/papers/sigmod03.pdf)

How does it work?

In short...

MOAIS takes a programming assignment and rubric as input.
It uses AI to generate hundreds of variations of solutions to the problem.
These generated solutions are then compared against the student-written code to assess similarity.

In detail...

MOAIS leverages the GitHub Copilot CLI's wide-range of agents to generate code snippets based on the rubric.
- Additionally, prompts are also designed to push for more unique responses.
Using these generated snippets, MOAIS creates a comprehensive dataset of potential solutions.
Using the Winnowing algorithm, it breaks code snippets into small hashes.
These hashes are then compared using Jaccard Similarity Index to identify similarities and potential plagiarism through similarities in code structure and logic.

Theories to be tested

AI-generated code can be used as a benchmark to evaluate the originality of student submissions as they follow similar patterns and structures.
Monte Carlo methods can be employed to estimate the distribution of similarities between code snippets.

Installation Requirements

Python 3.8 or higher (and packages listed in requirements.txt)
Copilot CLI
GitHub CLI

TODO

Create a more robust prompt engineering strategy to generate diverse code snippets.
Create a better winnowing algorithm to improve code similarity detection.
- The downside of the current is that variable names and other syntactic elements can significantly affect the similarity score, even if the underlying logic is the same.

Name		Name	Last commit message	Last commit date
Latest commit History 11 Commits
.vscode		.vscode
src		src
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Measure of Artificial Intelligence Similarity 🗿 (MOAIS)

How does it work?

Theories to be tested

Installation Requirements

TODO

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Measure of Artificial Intelligence Similarity 🗿 (MOAIS)

How does it work?

Theories to be tested

Installation Requirements

TODO

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages