MohiCodeHub

Mohammed Talab MohiCodeHub

2nd Year Computer Science Student at University College London (UCL).

6 followers · 7 following

in/mohammed-talab-9b6816335

Highlights

Pro

Pinned

emergent-misaligned-agents Public

This repository explores emergent misalignment in tool-augmented large language models. Tested models are fine-tuned on partially incorrect datasets to induce misa…

Python

Protocol66 Public

Protocol 66 is an automated red‑teaming harness for AI agents. It inspects an agent’s configuration, synthesizes adversarial prompts tailored to its tools and permissions, simulates responses via C…

TypeScript

dorukersoy47/lighthouse-ai Public

Python

multi-step-agent-rl-infra Public

RL training infrastructure for multi-step web agents. Generates tasks at a controllable difficulty (planning horizon), validates them with an oracle, and evaluates the efficiency-accuracy tradeoff.

Python