Subject to Change (Foster 2024)

Welcome! This repository contains code that uses machine learning to estimate "change" years for organized non-state violent conflict actors whose activities are tracked by the Uppsala Conflict Data Program.

You can explore the data at the associated dashboard, hosted by Render. Heads up that it will take about a minute to load. If you like what you see, you can download the processed .csv and incorporate it into your workflow.

Slides for a high-level presentation of the project here. A link to the peer-reviewed manuscript is forthcoming.

Project Overview

The output summarizes potential years in which third-party observers changed the underlying thematic presentation of non-state miltiant groups tracked in the UCDP's Georeferenced Events Dataset v 21.1).

A single group-level example looks like:

.

The model plots the Topic 1 vs Topic 2 assignment of news articles associated with ASG for each year and summaries the relative distribution with the green dotted line. Operationalizing "change" as a year with the majority topic changing from one year to the next, it estimates that the Philippines- based jihadi insurgency underwent "change" periods in approximately 2001, 2003, 2007/2008, and 2019.

One might use the measure to improve predictions of conflict dynamics, such as , which presents coefficient estimates of a Cox proportional hazards survival model indicating that non-state actors with any change years tend to be associated with longer conflicts, as do groups with more "change" periods.

Results Summary

This plot produces a high-level visualization of the output, via an aggegation of the number of group "changes" assigned to regions of operation, by year:

The STC Visualizer dashboard allows interested users to drill down into the trajectory for a specific group. Selecting a region in the first drop-down will automatically populate the second drop down with the UCDP groups associated with that region (and which had sufficient articles to model).

The processed change variable can be downloaded in the Data subdirectory. It includes the UCDP Name for each modeled non-state actor, the year, the proportion of documents assigned to Topic One, proportion assigned to Topic Two, and, where possible, the FREX words associated with the dominant topic. The data is more fully characterized in the summary Jupyter Notebook.

Code Overview:

The code in this repository:

Takes event-level news articles from the UCDP GED and aggregates them for non-state actors
Runs a Structural Topic Model on the corpus of news snippets associated with each group.
Aggregates the topic model output by group year
Models the group-level trends to identify points where third-party writers change how they write about the activities of actors
Creates a dataset that operationalizes the above into a:
- binary "actor change" variable
- time-series actor-country-year change records
Inserts the variable in a previous study about the effects of uncertainty on the length of substate conflict, more directly capturing the dynamic of interest and increasing the precision of the study's estimate

Data

The data used for the analysis (along with copies of code, plots, and full run logs) can be found in the Harvard Dataverse repository associated with this project. The .zip file includes the underlying event data, media precision, and study extension data as well as the intermediate .Rdata files produced from the R code.

Using the repository

This repository replicates the analysis needed to produce the data and results in the paper.

It also features Jupyter Notebook scripts that allow potential users to interact with the new "change" variable.

Replication

The workflow to do a full replication is:

Knit STC_R_Replication_Log.Rmd This Markdown file calls each of the scripts in sequence to take in the UCDP GED data and produce the change measurement. It uses the GroundhogR dependency management framework to ensure library consistency. It concludes with some light directory cleanup. Each R script called by STC_R_Replication_Log.Rmd produces an html log, housed in the Logs/ subdirectory. (STC_R_Replication_Log.Rmd takes 6 - 8 hours to run on an Apple M1 Pro laptop.)
Open STATA and call: do STC_STATA_Replication.do do STC_STATA_Rep_All.do

The application of the measurement is done via a replication of a STATA script, STC_STATA_Replication.do An evaluation of the effects of changing inclusion thresholds is in STC_STATA_Rep_All.do. The log of these scripts is STC_STATA_Log.pdf.

Knit Replication_Figures.Rmd

The logs themselves are long and complex (with a lot of printouts), so for convenience, I have aggregated all of the tables and figures into a single printout. Replication_Figures.Rmd, which produces a pdf (Replication_Figures.pdf) of the figures and tables that are featured in the Manuscript and appendix.

Logs

Logs for the entire project are available at:

(3a) R-based analysis

./STC_R_Replication_Log.html ./Logs/[R_file_name_here].html

(3b) STATA-based analysis

A log of the STATA run can be found in:

./Logs/STC_STATA_Log.pdf ./Logs/STC_STATA_Log.smcl

Name		Name	Last commit message	Last commit date
Latest commit History 34 Commits
Logs		Logs
data		data
images		images
.gitignore		.gitignore
00articleModelingRep.R		00articleModelingRep.R
00dominantFramingRep.R		00dominantFramingRep.R
01measurementFullDataTinyThreshRep.R		01measurementFullDataTinyThreshRep.R
02tinyThreshTransformVarRep.R		02tinyThreshTransformVarRep.R
03BanalysisFullDataTinyThreshWithTimeRep.R		03BanalysisFullDataTinyThreshWithTimeRep.R
03ClocationPrecisionRep.R		03ClocationPrecisionRep.R
03analysisFullDataTinyThreshRep.R		03analysisFullDataTinyThreshRep.R
04nsPrepRecurrance.R		04nsPrepRecurrance.R
04nsPrepTerminationRep.R		04nsPrepTerminationRep.R
05Replication-Recurrence-analysis.do		05Replication-Recurrence-analysis.do
05replication-Termination-analysis-ISQ.do		05replication-Termination-analysis-ISQ.do
06ThresholdPanelDescriptives.R		06ThresholdPanelDescriptives.R
06a-Robustness.R		06a-Robustness.R
06c-Robustness.R		06c-Robustness.R
06d-RobustnessVisualization.R		06d-RobustnessVisualization.R
06dRobustnessTerminationComparision_10_1.do		06dRobustnessTerminationComparision_10_1.do
06dRobustnessTerminationComparision_10_75.do		06dRobustnessTerminationComparision_10_75.do
06dRobustnessTerminationComparision_10_90.do		06dRobustnessTerminationComparision_10_90.do
06dRobustnessTerminationComparision_1_1.do		06dRobustnessTerminationComparision_1_1.do
06dRobustnessTerminationComparision_1_75.do		06dRobustnessTerminationComparision_1_75.do
06dRobustnessTerminationComparision_1_90.do		06dRobustnessTerminationComparision_1_90.do
06dRobustnessTerminationComparision_5_1.do		06dRobustnessTerminationComparision_5_1.do
06dRobustnessTerminationComparision_5_75.do		06dRobustnessTerminationComparision_5_75.do
06dRobustnessTerminationComparision_5_90.do		06dRobustnessTerminationComparision_5_90.do
07_Introductory_Viz.ipynb		07_Introductory_Viz.ipynb
07_Prep_For_Dist.R		07_Prep_For_Dist.R
07_Prep_For_Dist.ipynb		07_Prep_For_Dist.ipynb
08_Dash_Dev.ipynb		08_Dash_Dev.ipynb
7a_DataPrep.ipynb		7a_DataPrep.ipynb
README.md		README.md
RepFiguresBackup.R		RepFiguresBackup.R
ReplicationRLog.Rmd		ReplicationRLog.Rmd
ReplicationRLog.html		ReplicationRLog.html
Replication_Figures.Rmd		Replication_Figures.Rmd
Replication_Figures.pdf		Replication_Figures.pdf
RmarkdownLog.Rmd		RmarkdownLog.Rmd
RobustnessPanel.R		RobustnessPanel.R
STATAReplicationLog.html		STATAReplicationLog.html
STC_R_Replication_Log.Rmd		STC_R_Replication_Log.Rmd
STC_R_Replication_Log.html		STC_R_Replication_Log.html
STC_RecurranceReplicationForLog.do		STC_RecurranceReplicationForLog.do
STC_ReplicationForLog.do		STC_ReplicationForLog.do
STC_STATA_Rep_All.do		STC_STATA_Rep_All.do
STC_STATA_Replication.do		STC_STATA_Replication.do
Scrap-for-descriptives.R		Scrap-for-descriptives.R
StataReplicationLog.txt		StataReplicationLog.txt
ThresholdReplication.do		ThresholdReplication.do
ThresholdReplicationLog.html		ThresholdReplicationLog.html
ThresholdReplicationLog.txt		ThresholdReplicationLog.txt
analysisGroupPlotsRep.R		analysisGroupPlotsRep.R
aqapthreetopics.txt		aqapthreetopics.txt
articleModelingAlt.r		articleModelingAlt.r
changeOverviewAll.html		changeOverviewAll.html
checkAlternateSpecTiny.R		checkAlternateSpecTiny.R
implementAltPKK.r		implementAltPKK.r
implementAltPKKAQAP.R		implementAltPKKAQAP.R
matchingPairAQAPStories.txt		matchingPairAQAPStories.txt
process.txt		process.txt
replication_main.R		replication_main.R
too-short.txt		too-short.txt
topicSearchK.R		topicSearchK.R

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Subject to Change (Foster 2024)

Project Overview

Results Summary

Code Overview:

Data

Using the repository

Replication

Logs

About

Uh oh!

Packages

Uh oh!

Languages

margaretfoster/SubjectToChange

Folders and files

Latest commit

History

Repository files navigation

Subject to Change (Foster 2024)

Project Overview

Results Summary

Code Overview:

Data

Using the repository

Replication

Logs

About

Resources

Uh oh!

Stars

Watchers

Forks

Packages 0

Uh oh!

Languages

Packages