Deception-Detection

Aditya Sahai (MT24009) ; Sharad Jain (MT24138) ; Shreyas Rajendra Gore (MT24087)

Overview

The QANTA Diplomacy project focuses on detecting deceptive messages between players in the strategy game Diplomacy. The task involves analyzing text and metadata to classify messages as either deceptive or truthful. This has implications for NLP, decision-making tasks, game theory, and security applications.

Dataset

17,289 in-game messages from the Diplomacy game
Each message annotated by sender (truthful/deceptive) and receiver (perceived deception)
Significant class imbalance: deceptive messages constitute only ~5% of examples
Includes message-level, speaker-level, and conversation-level metadata
Metadata includes: message text, countries involved, game scores, message indices, and temporal context

Methodology

The project compared four modeling approaches of increasing complexity:

BiLSTM+Attention: Simple baseline using BiLSTM with attention mechanism and structured metadata
BiLSTM+Power+RoBERTa: Combined BiLSTM with frozen RoBERTa embeddings and metadata features
LLM2Vec+GNN: Message and player interactions modeled as a heterogeneous graph with DistilBERT embeddings
MLDM (Multi-Level Deception Model): Best-performing model combining DistilBERT embeddings with dialogue act predictions, power difference embeddings, and graph encoding

Key Techniques

Data Augmentation: Synonym replacement on deceptive messages
Balanced Sampling: Oversampling of deceptive class to handle imbalance
Metadata Integration: Incorporating structured game data with text representations
Graph Construction: Building heterogeneous graphs representing message-message and player-message relationships

Results

Model	MacroF1	Accuracy
BiLSTM+Attention	0.47	0.67
BiLSTM+RoBERTa	0.49	0.68
LLM2Vec+GNN	0.53	0.81
MLDM	0.54	0.83

Inference Steps

To run inference with the MLDM model:

Required Files:
- deception/train.jsonl - Training dataset
- deception/validation.jsonl - Validation dataset
- deception/test.jsonl - Test dataset
- best_model_checkpoint.pt - Pretrained model
- u_cache.pt - Precomputed DistilBERT embeddings cache

Setup Environment:

pip install torch torch-geometric transformers nlpaug scikit-learn pandas numpy matplotlib seaborn tqdm

Run Inference: inference.ipynb
Inference Pipeline:
- The script loads the test data from JSONL files
- Generates DistilBERT embeddings for each message
- Constructs a graph representing message-speaker relationships
- Loads the pretrained MLDM model weights
- Runs inference with a threshold of 0.60 for deception detection
- Outputs accuracy, Macro F1, and Deceptive F1 scores
- Saves predictions to test_predictions.csv
Inference Results on Custom Examples:

Conclusion

The best performance was achieved by the MLDM model which fuses linguistic, structural, and strategic context. Graph-based architectures outperformed sequential baselines by modeling message relationships and player interactions. The project demonstrates the importance of combining pretrained language models with relational graph modeling and metadata fusion for effective deception detection in strategic environments.

Team

Aditya Sahai (MT24009)
Shreyas Rajendra Gore (MT24087)
Sharad Jain (MT24138)

Name		Name	Last commit message	Last commit date
Latest commit History 22 Commits
23_midevaluation		23_midevaluation
Baseline-2_BiLstm_Roberta		Baseline-2_BiLstm_Roberta
Baseline-3_LLM2Vec_Gnn		Baseline-3_LLM2Vec_Gnn
data		data
23_PPT.pptx		23_PPT.pptx
23_Report.pdf		23_Report.pdf
LICENSE		LICENSE
MLDM.ipynb		MLDM.ipynb
README.md		README.md
deception-inference.ipynb		deception-inference.ipynb
inference.png		inference.png
results.png		results.png

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Deception-Detection

Aditya Sahai (MT24009) ; Sharad Jain (MT24138) ; Shreyas Rajendra Gore (MT24087)

Overview

Dataset

Methodology

Key Techniques

Results

Inference Steps

Conclusion

Team

About

Uh oh!

Releases

Packages

Contributors 3

Uh oh!

Languages

License

AdityaSahai123/Deception-Detection

Folders and files

Latest commit

History

Repository files navigation

Deception-Detection

Aditya Sahai (MT24009) ; Sharad Jain (MT24138) ; Shreyas Rajendra Gore (MT24087)

Overview

Dataset

Methodology

Key Techniques

Results

Inference Steps

Conclusion

Team

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Contributors 3

Uh oh!

Languages

Packages