1
0
Commit Graph

38 Commits

Author SHA1 Message Date
mgaughan
a770d9c668 looking at kpca 2025-09-04 14:30:34 -05:00
mgaughan
5d4df28f94 backing up the morning' before taking a few meetings 2025-09-04 11:21:07 -05:00
mgaughan
6a5f07872d looking at subcomment authorship 2025-09-04 11:13:31 -05:00
mgaughan
0e569ac714 comment_type PCA 2025-09-04 10:57:11 -05:00
mgaughan
5be22d3bfb looking at ticket status 2025-09-04 10:46:33 -05:00
mgaughan
68c95cdb8a trying to look for the pca, with more specificity 2025-09-04 10:37:57 -05:00
mgaughan
ccf434db38 looking for new phase pca 2025-09-04 10:25:00 -05:00
mgaughan
809e858bbf updating with new pca results 2025-09-04 10:12:34 -05:00
mgaughan
a3c1a48dc7 trying to run olmo cat distributed, also running kernelPCA. 2025-09-04 09:35:41 -05:00
mgaughan
a36226eab9 trying to look at the pca_plot 3 2025-09-02 16:04:06 -05:00
mgaughan
dc23065cc8 trying to look at the pca_plot 2 2025-09-02 15:55:27 -05:00
mgaughan
b8c12f987b trying to look at the pca_plot 1 2025-09-02 15:50:47 -05:00
mgaughan
d97b6e141c trying to look at the pca_plot 0 2025-09-02 15:37:07 -05:00
mgaughan
89105b7660 first pass at implementing pca for the style vectors 2025-09-02 15:30:50 -05:00
Matthew Gaughan
b714e8dedb updates to new script, I guess 2025-09-02 12:32:41 -07:00
mgaughan
2d396ceb26 scaffolding out some work TODO on getting the olmo categories to be sentence-level 2025-09-02 12:48:11 -05:00
mgaughan
53775c51db removing stale todo list 2025-09-02 12:37:24 -05:00
Matthew Gaughan
1c709f9a69 updating with gerrit information now 2025-08-07 19:03:20 -07:00
Matthew Gaughan
41de0cbc7a drop the labels from the FOSSY closed by plot 2025-07-31 21:51:51 -07:00
Matthew Gaughan
34c376dbc3 FOSSY resolution share 2025-07-31 21:47:24 -07:00
Matthew Gaughan
5239a8458a adding renewed FOSSY heatmap 2025-07-31 18:30:23 -07:00
Matthew Gaughan
822103ec3a new task plot 2025-07-29 14:48:52 -07:00
Matthew Gaughan
b624109f8d updating with new heatmap for FOSSY presentation 2025-07-29 14:25:19 -07:00
Matthew Gaughan
c5966518ef updating similarity vectors 2025-07-29 13:38:50 -07:00
mgaughan
23ef7acd01 updating 072525 biberplus labels to reflect that they have been pre-processes 2025-07-29 13:03:46 -05:00
mgaughan
3e21ac1bb7 updating with OLMO-generated classifications 2025-07-28 17:09:23 -05:00
mgaughan
9e4c05e347 almost done with the classification task 2025-07-25 15:37:32 -05:00
mgaughan
862643d5df building out olmo classification pipeline 2025-07-25 14:18:27 -05:00
Matthew Gaughan
a08a49d04e adding in analysis of biberplus vectors 2025-07-23 14:22:20 -07:00
mgaughan
b0584ec1be adding biberplus labels 2025-07-23 15:20:26 -05:00
mgaughan
edd17d3269 updating with biberplus implementation, though not quite solved yet 2025-07-22 16:44:07 -05:00
Matthew Gaughan
2e0665488c updating with dbscan clustering etc. 2025-07-16 14:03:51 -07:00
Matthew Gaughan
90e69975d2 preliminary EDA around neurobiber 2025-07-15 15:15:01 -07:00
mgaughan
43fb346318 updated the labels to try to store in a better format 2025-07-15 14:17:46 -05:00
mgaughan
7e8fb1982b updating with tentative neurobiber labels, need to verify outputs 2025-07-14 15:38:23 -05:00
Matthew Gaughan
c4dd45e344 saving cleaned, unified csv for text modeling 2025-07-14 08:19:11 -07:00
mgaughan
8f2409feb0 updating with some structure for discussion analysis stuff 2025-07-11 16:13:26 -05:00
mgaughan
68ec9c75f6 restructuring the repo for the second phase 2025-07-11 15:14:24 -05:00