-
f073136e09
updating with new descriptive stats for the methods section following data cleaning
main
Matthew Gaughan
2025-12-16 21:42:43 -0800
-
32fb4ca67c
updating with some plots for new results, results updated with new work
Matthew Gaughan
2025-12-16 21:26:11 -0800
-
1584e2cd5f
updating new analysis with re-labeled data, gerrit is out and bzimport is its own thing
Matthew Gaughan
2025-12-16 17:55:51 -0800
-
df1dcf1224
updating with new PCA run
mgaughan
2025-12-16 15:35:27 -0600
-
e0cb055ff7
updating and de-duplicating results for new PCA run
Matthew Gaughan
2025-12-16 12:40:44 -0800
-
30a828fc56
updating neurobiber PCA values
mgaughan
2025-12-16 11:14:04 -0600
-
0ded278e9e
adding misc odd statistics to write-up
Matthew Gaughan
2025-12-15 20:37:59 -0800
-
11cd084a6c
updated with new work for RQ0 on different commits to different libraries
Matthew Gaughan
2025-12-13 16:52:29 -0800
-
b0d4950bee
updating with revised plots and data
Matthew Gaughan
2025-12-08 20:21:58 -0800
-
e7e1bb3458
analysis and plots for draft results section
Matthew Gaughan
2025-12-08 10:53:07 -0800
-
c010e9f9cf
updating rq1 plots
Matthew Gaughan
2025-12-07 13:32:35 -0800
-
108b8aacd6
updating with new olmo labels
Matthew Gaughan
2025-12-07 10:15:14 -0800
-
cec9d82d41
updated olmo categorization
mgaughan
2025-12-07 10:10:15 -0600
-
a9ec0b19ef
updating figures for tentative printout
Matthew Gaughan
2025-12-04 14:06:22 -0800
-
d513e245b5
updating some plots for results section, also saving model to file
Matthew Gaughan
2025-12-02 14:20:38 -0800
-
90594d1ce3
new plot for results section
Matthew Gaughan
2025-12-02 09:18:50 -0800
-
86ee932c67
updating with new analysis/new information for write up
Matthew Gaughan
2025-12-01 13:44:49 -0800
-
a0545ad8de
adding small updates to results scripts
Matthew Gaughan
2025-11-26 13:10:20 -0800
-
f37dac73f4
updates to DSL modeling
Matthew Gaughan
2025-11-21 09:06:12 -0800
-
495be027e7
adding next iteration of model fit to dsl
Matthew Gaughan
2025-11-20 15:02:19 -0800
-
13d2113b73
updating dsl fitting
Matthew Gaughan
2025-11-18 13:00:07 -0800
-
6092e21977
running first DSL fit and trying to poke at FELM issue
Matthew Gaughan
2025-11-17 14:03:35 -0800
-
fb490e37f5
preliminary re-aggregstion of DSL df, preliminary drafting of DSL model
Matthew Gaughan
2025-11-17 09:15:37 -0800
-
7555259a3e
adding some more metadata to the DSL aggregation files
Matthew Gaughan
2025-11-10 14:32:14 -0800
-
be587982d7
adding updated human labels etc.
Matthew Gaughan
2025-11-09 15:48:09 -0800
-
7ace52a559
adding new batch of olmo labels
mgaughan
2025-11-09 16:29:01 -0600
-
43984fb605
backing up few-shot
mgaughan
2025-11-05 22:56:58 -0600
-
6f2858dd72
updating for new bivariate plots
Matthew Gaughan
2025-11-03 10:04:42 -0800
-
2efd961fed
adding trial survival test and more information about adac variables
Matthew Gaughan
2025-10-27 17:54:14 -0700
-
ab1cb3efea
updated DSL data aggregation
Matthew Gaughan
2025-10-27 10:28:08 -0700
-
e955b4f50f
adding some analysis of modal terms and olmo labels
Matthew Gaughan
2025-10-24 14:10:49 -0700
-
e5ca779900
unified new data and cleaned project directory
Matthew Gaughan
2025-10-24 09:03:54 -0700
-
d6965a33cb
new batched OLMO labels
mgaughan
2025-10-24 10:03:28 -0500
-
0ed72af495
add scripts for other aggregation and merge tasks
Matthew Gaughan
2025-10-23 13:50:27 -0700
-
e3748fa55f
updating collation scripts, more work TODO
Matthew Gaughan
2025-10-21 19:41:36 -0700
-
90311ca136
updating with new human labels
Matthew Gaughan
2025-10-21 15:19:13 -0700
-
b198781aa0
updating some of the scripts for PCA analysis
Matthew Gaughan
2025-10-20 11:09:04 -0700
-
f146016eac
re-done total pca
mgaughan
2025-10-20 12:38:44 -0500
-
2e8b85d3e9
removing erroneous PCA df, going to re-run
Matthew Gaughan
2025-10-20 10:31:54 -0700
-
bf4bc88083
running PCA across both description and reply comment types
mgaughan
2025-10-20 11:30:38 -0500
-
c40e87ff80
updating the repo, cleaning up misc. printout
Matthew Gaughan
2025-10-20 09:13:48 -0700
-
d86233abca
updated PCA analysis
Matthew Gaughan
2025-10-15 10:45:29 -0700
-
0843685707
final run of olmo sentence categorization
mgaughan
2025-10-15 09:51:33 -0500
-
f60f3ef120
updating PCA to account for sentence count and median length
mgaughan
2025-10-14 23:15:14 -0500
-
cb2fe737cd
updating batching script, preparing for run
mgaughan
2025-10-11 07:38:11 -0500
-
186a26f261
backing up renewed PCA analysis
Matthew Gaughan
2025-10-08 14:55:31 -0700
-
840b32a2e4
simple bivariate plots to look at variance, or lack thereof.
Matthew Gaughan
2025-10-07 15:00:59 -0700
-
6fb1801b2a
updating with basic seniority and affiliation data
Matthew Gaughan
2025-10-06 13:55:03 -0700
-
b982973f37
updating human sampling
Matthew Gaughan
2025-10-06 09:37:06 -0700
-
a14b08cfd8
pulling sample for human_labeling
Matthew Gaughan
2025-10-06 09:14:00 -0700
-
83bcc15811
updated with new outcome variable
Matthew Gaughan
2025-10-03 12:01:37 -0700
-
5f157ef532
some updates to PCA
Matthew Gaughan
2025-10-02 09:22:36 -0700
-
7f89fd1966
updated PCA analysis, ready for rob tomorrow
Matthew Gaughan
2025-10-01 20:58:55 -0700
-
f636969541
updated PCA results with dropped rows
mgaughan
2025-10-01 21:28:12 -0500
-
e61d3b6599
updating with DSL power analysis
Matthew Gaughan
2025-09-30 20:17:09 -0700
-
b7c2c9fcd6
unifying current data and some repo cleaning
Matthew Gaughan
2025-09-29 14:10:39 -0700
-
acd8964e73
preliminary EDA on the PCA analysis
Matthew Gaughan
2025-09-25 14:09:39 -0700
-
b21ecb02c3
running PCA on subcomment values, adding new plot for closed_relevance
mgaughan
2025-09-25 10:11:47 -0500
-
e29d4bf59c
cleaning working directory and re-running PCA with final neurobiber vectors
mgaughan
2025-09-25 09:48:23 -0500
-
9d1359af36
updating biberplus and olmo_batched results
mgaughan
2025-09-25 09:20:40 -0500
-
265b930578
updating library to account for re-running PCA
mgaughan
2025-09-23 16:41:32 -0500
-
032975c4f0
updating to collect new batch job labels
mgaughan
2025-09-23 15:09:45 -0500
-
b4f0c8f885
trying to sample the human label rows again
mgaughan
2025-09-22 20:34:31 -0500
-
bcfa688e11
olmo batched for getting the title in there too, i think
mgaughan
2025-09-22 19:18:11 -0500
-
e2413ed955
update to gerrit metadata extraction regex
Matthew Gaughan
2025-09-16 11:37:46 -0700
-
bb67fea96b
hopefully last update to human sampling
mgaughan
2025-09-16 12:16:10 -0500
-
89969daab5
updating labeling sample to be, uh, correct
mgaughan
2025-09-16 11:43:28 -0500
-
d83022f184
sampled comments for human labeling
mgaughan
2025-09-16 11:22:45 -0500
-
f68372572f
updating some scripts
mgaughan
2025-09-14 11:14:16 -0500
-
f9c12bb445
shelving some of the merge work for now
Matthew Gaughan
2025-09-14 09:11:33 -0700
-
77fc3ec541
preparing DSL modeling, looking at OLMO category data
Matthew Gaughan
2025-09-07 13:21:45 -0700
-
99c702fe20
adding batched OLMO results
mgaughan
2025-09-07 11:10:31 -0500
-
6de62f2447
some neurobiber PCA analysis
Matthew Gaughan
2025-09-05 14:59:07 -0700
-
a96fd6db2f
updates and re-running the batched olmo categorization
mgaughan
2025-09-05 13:43:00 -0500
-
f2afb7c981
should be updated and refined pca analysis
mgaughan
2025-09-04 15:47:11 -0500
-
a770d9c668
looking at kpca
mgaughan
2025-09-04 14:30:34 -0500
-
5d4df28f94
backing up the morning' before taking a few meetings
mgaughan
2025-09-04 11:21:07 -0500
-
6a5f07872d
looking at subcomment authorship
mgaughan
2025-09-04 11:13:31 -0500
-
0e569ac714
comment_type PCA
mgaughan
2025-09-04 10:57:11 -0500
-
5be22d3bfb
looking at ticket status
mgaughan
2025-09-04 10:46:33 -0500
-
68c95cdb8a
trying to look for the pca, with more specificity
mgaughan
2025-09-04 10:37:57 -0500
-
ccf434db38
looking for new phase pca
mgaughan
2025-09-04 10:25:00 -0500
-
809e858bbf
updating with new pca results
mgaughan
2025-09-04 10:12:34 -0500
-
a3c1a48dc7
trying to run olmo cat distributed, also running kernelPCA.
mgaughan
2025-09-04 09:35:41 -0500
-
a36226eab9
trying to look at the pca_plot 3
mgaughan
2025-09-02 16:04:06 -0500
-
dc23065cc8
trying to look at the pca_plot 2
mgaughan
2025-09-02 15:55:27 -0500
-
b8c12f987b
trying to look at the pca_plot 1
mgaughan
2025-09-02 15:50:47 -0500
-
d97b6e141c
trying to look at the pca_plot 0
mgaughan
2025-09-02 15:37:07 -0500
-
89105b7660
first pass at implementing pca for the style vectors
mgaughan
2025-09-02 15:30:50 -0500
-
b714e8dedb
updates to new script, I guess
Matthew Gaughan
2025-09-02 12:32:41 -0700
-
2d396ceb26
scaffolding out some work TODO on getting the olmo categories to be sentence-level
mgaughan
2025-09-02 12:48:11 -0500
-
53775c51db
removing stale todo list
mgaughan
2025-09-02 12:36:16 -0500
-
1c709f9a69
updating with gerrit information now
Matthew Gaughan
2025-08-07 19:03:20 -0700
-
41de0cbc7a
drop the labels from the FOSSY closed by plot
Matthew Gaughan
2025-07-31 21:51:51 -0700
-
7232f095e0
update to FOSSY tasks resolved plot
Matthew Gaughan
2025-07-31 21:49:32 -0700
-
34c376dbc3
FOSSY resolution share
Matthew Gaughan
2025-07-31 21:47:24 -0700
-
5239a8458a
adding renewed FOSSY heatmap
Matthew Gaughan
2025-07-31 18:30:23 -0700
-
822103ec3a
new task plot
Matthew Gaughan
2025-07-29 14:48:52 -0700
-
b624109f8d
updating with new heatmap for FOSSY presentation
Matthew Gaughan
2025-07-29 14:25:19 -0700
-
c5966518ef
updating similarity vectors
Matthew Gaughan
2025-07-29 13:38:50 -0700