Matthew Gaughan
|
2efd961fed
|
adding trial survival test and more information about adac variables
|
2025-10-27 17:54:14 -07:00 |
|
Matthew Gaughan
|
ab1cb3efea
|
updated DSL data aggregation
|
2025-10-27 10:28:08 -07:00 |
|
Matthew Gaughan
|
e955b4f50f
|
adding some analysis of modal terms and olmo labels
|
2025-10-24 14:10:49 -07:00 |
|
Matthew Gaughan
|
e5ca779900
|
unified new data and cleaned project directory
|
2025-10-24 09:03:54 -07:00 |
|
mgaughan
|
d6965a33cb
|
new batched OLMO labels
|
2025-10-24 10:03:36 -05:00 |
|
Matthew Gaughan
|
0ed72af495
|
add scripts for other aggregation and merge tasks
|
2025-10-23 13:50:27 -07:00 |
|
Matthew Gaughan
|
e3748fa55f
|
updating collation scripts, more work TODO
|
2025-10-21 19:41:36 -07:00 |
|
Matthew Gaughan
|
90311ca136
|
updating with new human labels
|
2025-10-21 15:19:13 -07:00 |
|
Matthew Gaughan
|
b198781aa0
|
updating some of the scripts for PCA analysis
|
2025-10-20 11:09:04 -07:00 |
|
mgaughan
|
f146016eac
|
re-done total pca
|
2025-10-20 12:38:44 -05:00 |
|
Matthew Gaughan
|
2e8b85d3e9
|
removing erroneous PCA df, going to re-run
|
2025-10-20 10:31:54 -07:00 |
|
mgaughan
|
bf4bc88083
|
running PCA across both description and reply comment types
|
2025-10-20 11:30:38 -05:00 |
|
Matthew Gaughan
|
c40e87ff80
|
updating the repo, cleaning up misc. printout
|
2025-10-20 09:13:48 -07:00 |
|
Matthew Gaughan
|
d86233abca
|
updated PCA analysis
|
2025-10-15 10:45:29 -07:00 |
|
mgaughan
|
0843685707
|
final run of olmo sentence categorization
|
2025-10-15 09:51:33 -05:00 |
|
mgaughan
|
f60f3ef120
|
updating PCA to account for sentence count and median length
|
2025-10-14 23:15:14 -05:00 |
|
mgaughan
|
cb2fe737cd
|
updating batching script, preparing for run
|
2025-10-11 07:39:13 -05:00 |
|
Matthew Gaughan
|
186a26f261
|
backing up renewed PCA analysis
|
2025-10-08 14:55:31 -07:00 |
|
Matthew Gaughan
|
840b32a2e4
|
simple bivariate plots to look at variance, or lack thereof.
|
2025-10-07 15:00:59 -07:00 |
|
Matthew Gaughan
|
6fb1801b2a
|
updating with basic seniority and affiliation data
|
2025-10-06 13:55:03 -07:00 |
|
Matthew Gaughan
|
b982973f37
|
updating human sampling
|
2025-10-06 09:37:06 -07:00 |
|
Matthew Gaughan
|
a14b08cfd8
|
pulling sample for human_labeling
|
2025-10-06 09:14:00 -07:00 |
|
Matthew Gaughan
|
83bcc15811
|
updated with new outcome variable
|
2025-10-03 12:01:37 -07:00 |
|
Matthew Gaughan
|
5f157ef532
|
some updates to PCA
|
2025-10-02 09:22:36 -07:00 |
|
Matthew Gaughan
|
7f89fd1966
|
updated PCA analysis, ready for rob tomorrow
|
2025-10-01 20:58:55 -07:00 |
|
mgaughan
|
f636969541
|
updated PCA results with dropped rows
|
2025-10-01 21:28:12 -05:00 |
|
Matthew Gaughan
|
e61d3b6599
|
updating with DSL power analysis
|
2025-09-30 20:17:09 -07:00 |
|
Matthew Gaughan
|
b7c2c9fcd6
|
unifying current data and some repo cleaning
|
2025-09-29 14:10:39 -07:00 |
|
Matthew Gaughan
|
acd8964e73
|
preliminary EDA on the PCA analysis
|
2025-09-25 14:09:39 -07:00 |
|
mgaughan
|
b21ecb02c3
|
running PCA on subcomment values, adding new plot for closed_relevance
|
2025-09-25 10:11:47 -05:00 |
|
mgaughan
|
e29d4bf59c
|
cleaning working directory and re-running PCA with final neurobiber vectors
|
2025-09-25 09:48:23 -05:00 |
|
mgaughan
|
9d1359af36
|
updating biberplus and olmo_batched results
|
2025-09-25 09:20:40 -05:00 |
|
mgaughan
|
265b930578
|
updating library to account for re-running PCA
|
2025-09-23 16:41:32 -05:00 |
|
mgaughan
|
032975c4f0
|
updating to collect new batch job labels
|
2025-09-23 15:09:45 -05:00 |
|
mgaughan
|
b4f0c8f885
|
trying to sample the human label rows again
|
2025-09-22 20:34:31 -05:00 |
|
mgaughan
|
bcfa688e11
|
olmo batched for getting the title in there too, i think
|
2025-09-22 19:21:15 -05:00 |
|
Matthew Gaughan
|
e2413ed955
|
update to gerrit metadata extraction regex
|
2025-09-16 11:37:46 -07:00 |
|
mgaughan
|
bb67fea96b
|
hopefully last update to human sampling
|
2025-09-16 12:16:10 -05:00 |
|
mgaughan
|
89969daab5
|
updating labeling sample to be, uh, correct
|
2025-09-16 11:43:28 -05:00 |
|
mgaughan
|
d83022f184
|
sampled comments for human labeling
|
2025-09-16 11:22:45 -05:00 |
|
mgaughan
|
f68372572f
|
updating some scripts
|
2025-09-14 11:14:49 -05:00 |
|
Matthew Gaughan
|
f9c12bb445
|
shelving some of the merge work for now
|
2025-09-14 09:11:33 -07:00 |
|
Matthew Gaughan
|
77fc3ec541
|
preparing DSL modeling, looking at OLMO category data
|
2025-09-07 13:21:45 -07:00 |
|
mgaughan
|
99c702fe20
|
adding batched OLMO results
|
2025-09-07 11:11:00 -05:00 |
|
Matthew Gaughan
|
6de62f2447
|
some neurobiber PCA analysis
|
2025-09-05 14:59:07 -07:00 |
|
mgaughan
|
a96fd6db2f
|
updates and re-running the batched olmo categorization
|
2025-09-05 13:43:00 -05:00 |
|
mgaughan
|
f2afb7c981
|
should be updated and refined pca analysis
|
2025-09-04 15:47:11 -05:00 |
|
mgaughan
|
a770d9c668
|
looking at kpca
|
2025-09-04 14:30:34 -05:00 |
|
mgaughan
|
5d4df28f94
|
backing up the morning' before taking a few meetings
|
2025-09-04 11:21:07 -05:00 |
|
mgaughan
|
6a5f07872d
|
looking at subcomment authorship
|
2025-09-04 11:13:31 -05:00 |
|