1
0
Analysis scripts and code for studying lifecycles of MediaWiki projects
Go to file
2025-10-27 17:54:14 -07:00
.ipynb_checkpoints updating some of the text analysis 2025-03-01 17:08:16 -08:00
analysis_data adding trial survival test and more information about adac variables 2025-10-27 17:54:14 -07:00
artifact_figures unifying current data and some repo cleaning 2025-09-29 14:10:39 -07:00
dsl adding trial survival test and more information about adac variables 2025-10-27 17:54:14 -07:00
p1 new task plot 2025-07-29 14:48:52 -07:00
p2 adding trial survival test and more information about adac variables 2025-10-27 17:54:14 -07:00
.gitignore reorganizing 2025-05-18 16:50:20 -07:00
.RData adding trial survival test and more information about adac variables 2025-10-27 17:54:14 -07:00
.sh_history saving cleaned, unified csv for text modeling 2025-07-14 08:19:11 -07:00
.wget-hsts backup, trying to parse text 2025-02-20 15:44:54 -08:00
batched-mw-olmo-info-cat.log olmo batched for getting the title in there too, i think 2025-09-22 19:21:15 -05:00
english-ewt-ud-2.5-191206.udpipe backup, trying to parse text 2025-02-20 15:44:54 -08:00
README.md unified new data and cleaned project directory 2025-10-24 09:03:54 -07:00
rstudio-server.job exploratory data analysis 2025-01-26 11:51:25 -08:00

mw-lifecycle-analysis

Analysis scripts and code for studying the deployment processes of three MediaWiki/Wikimedia features (2013-2015)

/analysis_data

  • contains the unified and constituent data for final analysis

/dsl

  • contains the aggregation code for turning unified task data into DSL form
  • contains the R scripts for fitting and interpreting a DSL model

/p1

  • contains the scripts and plots from the first phase of analysis

/p2

  • contains the scripts and plots from the second phase of analysis
  • /p2_EDA contains small scripts for analysis and interpretation of initial data
  • /quest contains tools for using HPC
  • /slurm_jobs contains the bash scripts used to orchestrate slurm jobs on HPC
  • /python_scripts contains the code to run different labeling and analysis tasks