mgaughan
|
f60f3ef120
|
updating PCA to account for sentence count and median length
|
2025-10-14 23:15:14 -05:00 |
|
mgaughan
|
cb2fe737cd
|
updating batching script, preparing for run
|
2025-10-11 07:39:13 -05:00 |
|
Matthew Gaughan
|
186a26f261
|
backing up renewed PCA analysis
|
2025-10-08 14:55:31 -07:00 |
|
Matthew Gaughan
|
5f157ef532
|
some updates to PCA
|
2025-10-02 09:22:36 -07:00 |
|
Matthew Gaughan
|
7f89fd1966
|
updated PCA analysis, ready for rob tomorrow
|
2025-10-01 20:58:55 -07:00 |
|
mgaughan
|
f636969541
|
updated PCA results with dropped rows
|
2025-10-01 21:28:12 -05:00 |
|
Matthew Gaughan
|
b7c2c9fcd6
|
unifying current data and some repo cleaning
|
2025-09-29 14:10:39 -07:00 |
|
Matthew Gaughan
|
acd8964e73
|
preliminary EDA on the PCA analysis
|
2025-09-25 14:09:39 -07:00 |
|
mgaughan
|
b21ecb02c3
|
running PCA on subcomment values, adding new plot for closed_relevance
|
2025-09-25 10:11:47 -05:00 |
|
mgaughan
|
e29d4bf59c
|
cleaning working directory and re-running PCA with final neurobiber vectors
|
2025-09-25 09:48:23 -05:00 |
|
mgaughan
|
9d1359af36
|
updating biberplus and olmo_batched results
|
2025-09-25 09:20:40 -05:00 |
|
mgaughan
|
265b930578
|
updating library to account for re-running PCA
|
2025-09-23 16:41:32 -05:00 |
|
mgaughan
|
032975c4f0
|
updating to collect new batch job labels
|
2025-09-23 15:09:45 -05:00 |
|
mgaughan
|
b4f0c8f885
|
trying to sample the human label rows again
|
2025-09-22 20:34:31 -05:00 |
|
mgaughan
|
bb67fea96b
|
hopefully last update to human sampling
|
2025-09-16 12:16:10 -05:00 |
|
mgaughan
|
89969daab5
|
updating labeling sample to be, uh, correct
|
2025-09-16 11:43:28 -05:00 |
|
mgaughan
|
d83022f184
|
sampled comments for human labeling
|
2025-09-16 11:22:45 -05:00 |
|
mgaughan
|
f68372572f
|
updating some scripts
|
2025-09-14 11:14:49 -05:00 |
|
Matthew Gaughan
|
f9c12bb445
|
shelving some of the merge work for now
|
2025-09-14 09:11:33 -07:00 |
|
Matthew Gaughan
|
77fc3ec541
|
preparing DSL modeling, looking at OLMO category data
|
2025-09-07 13:21:45 -07:00 |
|
mgaughan
|
99c702fe20
|
adding batched OLMO results
|
2025-09-07 11:11:00 -05:00 |
|
Matthew Gaughan
|
6de62f2447
|
some neurobiber PCA analysis
|
2025-09-05 14:59:07 -07:00 |
|
mgaughan
|
a96fd6db2f
|
updates and re-running the batched olmo categorization
|
2025-09-05 13:43:00 -05:00 |
|
mgaughan
|
f2afb7c981
|
should be updated and refined pca analysis
|
2025-09-04 15:47:11 -05:00 |
|
mgaughan
|
a770d9c668
|
looking at kpca
|
2025-09-04 14:30:34 -05:00 |
|
mgaughan
|
5d4df28f94
|
backing up the morning' before taking a few meetings
|
2025-09-04 11:21:07 -05:00 |
|
mgaughan
|
6a5f07872d
|
looking at subcomment authorship
|
2025-09-04 11:13:31 -05:00 |
|
mgaughan
|
0e569ac714
|
comment_type PCA
|
2025-09-04 10:57:11 -05:00 |
|
mgaughan
|
5be22d3bfb
|
looking at ticket status
|
2025-09-04 10:46:33 -05:00 |
|
mgaughan
|
68c95cdb8a
|
trying to look for the pca, with more specificity
|
2025-09-04 10:37:57 -05:00 |
|
mgaughan
|
ccf434db38
|
looking for new phase pca
|
2025-09-04 10:25:00 -05:00 |
|
mgaughan
|
809e858bbf
|
updating with new pca results
|
2025-09-04 10:12:34 -05:00 |
|
mgaughan
|
a3c1a48dc7
|
trying to run olmo cat distributed, also running kernelPCA.
|
2025-09-04 09:35:41 -05:00 |
|
mgaughan
|
a36226eab9
|
trying to look at the pca_plot 3
|
2025-09-02 16:04:06 -05:00 |
|
mgaughan
|
dc23065cc8
|
trying to look at the pca_plot 2
|
2025-09-02 15:55:27 -05:00 |
|
mgaughan
|
b8c12f987b
|
trying to look at the pca_plot 1
|
2025-09-02 15:50:47 -05:00 |
|
mgaughan
|
d97b6e141c
|
trying to look at the pca_plot 0
|
2025-09-02 15:37:07 -05:00 |
|
mgaughan
|
89105b7660
|
first pass at implementing pca for the style vectors
|
2025-09-02 15:30:50 -05:00 |
|
mgaughan
|
2d396ceb26
|
scaffolding out some work TODO on getting the olmo categories to be sentence-level
|
2025-09-02 12:48:11 -05:00 |
|
mgaughan
|
53775c51db
|
removing stale todo list
|
2025-09-02 12:37:24 -05:00 |
|
Matthew Gaughan
|
1c709f9a69
|
updating with gerrit information now
|
2025-08-07 19:03:20 -07:00 |
|
Matthew Gaughan
|
5239a8458a
|
adding renewed FOSSY heatmap
|
2025-07-31 18:30:23 -07:00 |
|
Matthew Gaughan
|
b624109f8d
|
updating with new heatmap for FOSSY presentation
|
2025-07-29 14:25:19 -07:00 |
|
Matthew Gaughan
|
c5966518ef
|
updating similarity vectors
|
2025-07-29 13:38:50 -07:00 |
|
mgaughan
|
23ef7acd01
|
updating 072525 biberplus labels to reflect that they have been pre-processes
|
2025-07-29 13:03:46 -05:00 |
|
mgaughan
|
3e21ac1bb7
|
updating with OLMO-generated classifications
|
2025-07-28 17:09:23 -05:00 |
|
mgaughan
|
9e4c05e347
|
almost done with the classification task
|
2025-07-25 15:37:32 -05:00 |
|
mgaughan
|
862643d5df
|
building out olmo classification pipeline
|
2025-07-25 14:18:27 -05:00 |
|
Matthew Gaughan
|
a08a49d04e
|
adding in analysis of biberplus vectors
|
2025-07-23 14:22:20 -07:00 |
|
mgaughan
|
b0584ec1be
|
adding biberplus labels
|
2025-07-23 15:20:26 -05:00 |
|