1
0
Commit Graph

170 Commits

Author SHA1 Message Date
Matthew Gaughan
d513e245b5 updating some plots for results section, also saving model to file 2025-12-02 14:20:38 -08:00
Matthew Gaughan
90594d1ce3 new plot for results section 2025-12-02 09:18:50 -08:00
Matthew Gaughan
86ee932c67 updating with new analysis/new information for write up 2025-12-01 13:44:49 -08:00
Matthew Gaughan
a0545ad8de adding small updates to results scripts 2025-11-26 13:10:20 -08:00
Matthew Gaughan
f37dac73f4 updates to DSL modeling 2025-11-21 09:06:12 -08:00
Matthew Gaughan
495be027e7 adding next iteration of model fit to dsl 2025-11-20 15:02:19 -08:00
Matthew Gaughan
13d2113b73 updating dsl fitting 2025-11-18 13:00:07 -08:00
Matthew Gaughan
6092e21977 running first DSL fit and trying to poke at FELM issue 2025-11-17 14:03:35 -08:00
Matthew Gaughan
fb490e37f5 preliminary re-aggregstion of DSL df, preliminary drafting of DSL model 2025-11-17 09:15:37 -08:00
Matthew Gaughan
7555259a3e adding some more metadata to the DSL aggregation files 2025-11-10 14:32:14 -08:00
Matthew Gaughan
be587982d7 adding updated human labels etc. 2025-11-09 15:48:09 -08:00
mgaughan
7ace52a559 adding new batch of olmo labels 2025-11-09 16:29:01 -06:00
mgaughan
43984fb605 backing up few-shot 2025-11-05 22:56:58 -06:00
Matthew Gaughan
6f2858dd72 updating for new bivariate plots 2025-11-03 10:04:42 -08:00
Matthew Gaughan
2efd961fed adding trial survival test and more information about adac variables 2025-10-27 17:54:14 -07:00
Matthew Gaughan
ab1cb3efea updated DSL data aggregation 2025-10-27 10:28:08 -07:00
Matthew Gaughan
e955b4f50f adding some analysis of modal terms and olmo labels 2025-10-24 14:10:49 -07:00
Matthew Gaughan
e5ca779900 unified new data and cleaned project directory 2025-10-24 09:03:54 -07:00
mgaughan
d6965a33cb new batched OLMO labels 2025-10-24 10:03:36 -05:00
Matthew Gaughan
0ed72af495 add scripts for other aggregation and merge tasks 2025-10-23 13:50:27 -07:00
Matthew Gaughan
e3748fa55f updating collation scripts, more work TODO 2025-10-21 19:41:36 -07:00
Matthew Gaughan
90311ca136 updating with new human labels 2025-10-21 15:19:13 -07:00
Matthew Gaughan
b198781aa0 updating some of the scripts for PCA analysis 2025-10-20 11:09:04 -07:00
mgaughan
f146016eac re-done total pca 2025-10-20 12:38:44 -05:00
Matthew Gaughan
2e8b85d3e9 removing erroneous PCA df, going to re-run 2025-10-20 10:31:54 -07:00
mgaughan
bf4bc88083 running PCA across both description and reply comment types 2025-10-20 11:30:38 -05:00
Matthew Gaughan
c40e87ff80 updating the repo, cleaning up misc. printout 2025-10-20 09:13:48 -07:00
Matthew Gaughan
d86233abca updated PCA analysis 2025-10-15 10:45:29 -07:00
mgaughan
0843685707 final run of olmo sentence categorization 2025-10-15 09:51:33 -05:00
mgaughan
f60f3ef120 updating PCA to account for sentence count and median length 2025-10-14 23:15:14 -05:00
mgaughan
cb2fe737cd updating batching script, preparing for run 2025-10-11 07:39:13 -05:00
Matthew Gaughan
186a26f261 backing up renewed PCA analysis 2025-10-08 14:55:31 -07:00
Matthew Gaughan
840b32a2e4 simple bivariate plots to look at variance, or lack thereof. 2025-10-07 15:00:59 -07:00
Matthew Gaughan
6fb1801b2a updating with basic seniority and affiliation data 2025-10-06 13:55:03 -07:00
Matthew Gaughan
b982973f37 updating human sampling 2025-10-06 09:37:06 -07:00
Matthew Gaughan
a14b08cfd8 pulling sample for human_labeling 2025-10-06 09:14:00 -07:00
Matthew Gaughan
83bcc15811 updated with new outcome variable 2025-10-03 12:01:37 -07:00
Matthew Gaughan
5f157ef532 some updates to PCA 2025-10-02 09:22:36 -07:00
Matthew Gaughan
7f89fd1966 updated PCA analysis, ready for rob tomorrow 2025-10-01 20:58:55 -07:00
mgaughan
f636969541 updated PCA results with dropped rows 2025-10-01 21:28:12 -05:00
Matthew Gaughan
e61d3b6599 updating with DSL power analysis 2025-09-30 20:17:09 -07:00
Matthew Gaughan
b7c2c9fcd6 unifying current data and some repo cleaning 2025-09-29 14:10:39 -07:00
Matthew Gaughan
acd8964e73 preliminary EDA on the PCA analysis 2025-09-25 14:09:39 -07:00
mgaughan
b21ecb02c3 running PCA on subcomment values, adding new plot for closed_relevance 2025-09-25 10:11:47 -05:00
mgaughan
e29d4bf59c cleaning working directory and re-running PCA with final neurobiber vectors 2025-09-25 09:48:23 -05:00
mgaughan
9d1359af36 updating biberplus and olmo_batched results 2025-09-25 09:20:40 -05:00
mgaughan
265b930578 updating library to account for re-running PCA 2025-09-23 16:41:32 -05:00
mgaughan
032975c4f0 updating to collect new batch job labels 2025-09-23 15:09:45 -05:00
mgaughan
b4f0c8f885 trying to sample the human label rows again 2025-09-22 20:34:31 -05:00
mgaughan
bcfa688e11 olmo batched for getting the title in there too, i think 2025-09-22 19:21:15 -05:00