-
Notifications
You must be signed in to change notification settings - Fork 4
Description
Several scripts that format tweet data for social network analysis have been developed for project-specific use cases but these are not broadly generalizable or consistent with newer RITHM conventions for handling input and output procedures. It will be beneficial to develop a script for consistent implementation of basic network analysis within the RITHM framework.
This will start with a procedure that (1) links retweet IDs to original tweet IDs that are present in existing metadata. Continued work will include (2) linking tweet response IDs to original tweet IDs and (3) linking quoted tweet IDs to original tweet IDs. This process should be flexible enough that these various relationships (1-3) can be handled separately and so that output can be aggregated for user-level analysis. Output should include well-formed dictionary objects (pickle format) that can be updated as new data are added, as well as node and edge files (TSV format) that can be used in third-party analysis software.