| Matthew Gaughan | 2efd961fed | adding trial survival test and more information about adac variables | 2025-10-27 17:54:14 -07:00 |
| Matthew Gaughan | ab1cb3efea | updated DSL data aggregation | 2025-10-27 10:28:08 -07:00 |
| Matthew Gaughan | e955b4f50f | adding some analysis of modal terms and olmo labels | 2025-10-24 14:10:49 -07:00 |
| Matthew Gaughan | e5ca779900 | unified new data and cleaned project directory | 2025-10-24 09:03:54 -07:00 |
| mgaughan | d6965a33cb | new batched OLMO labels | 2025-10-24 10:03:36 -05:00 |
| Matthew Gaughan | 0ed72af495 | add scripts for other aggregation and merge tasks | 2025-10-23 13:50:27 -07:00 |
| Matthew Gaughan | e3748fa55f | updating collation scripts, more work TODO | 2025-10-21 19:41:36 -07:00 |
| Matthew Gaughan | 90311ca136 | updating with new human labels | 2025-10-21 15:19:13 -07:00 |
| Matthew Gaughan | b198781aa0 | updating some of the scripts for PCA analysis | 2025-10-20 11:09:04 -07:00 |
| mgaughan | f146016eac | re-done total pca | 2025-10-20 12:38:44 -05:00 |
| Matthew Gaughan | 2e8b85d3e9 | removing erroneous PCA df, going to re-run | 2025-10-20 10:31:54 -07:00 |
| mgaughan | bf4bc88083 | running PCA across both description and reply comment types | 2025-10-20 11:30:38 -05:00 |
| Matthew Gaughan | c40e87ff80 | updating the repo, cleaning up misc. printout | 2025-10-20 09:13:48 -07:00 |
| Matthew Gaughan | d86233abca | updated PCA analysis | 2025-10-15 10:45:29 -07:00 |
| mgaughan | 0843685707 | final run of olmo sentence categorization | 2025-10-15 09:51:33 -05:00 |
| mgaughan | f60f3ef120 | updating PCA to account for sentence count and median length | 2025-10-14 23:15:14 -05:00 |
| mgaughan | cb2fe737cd | updating batching script, preparing for run | 2025-10-11 07:39:13 -05:00 |
| Matthew Gaughan | 186a26f261 | backing up renewed PCA analysis | 2025-10-08 14:55:31 -07:00 |
| Matthew Gaughan | 840b32a2e4 | simple bivariate plots to look at variance, or lack thereof. | 2025-10-07 15:00:59 -07:00 |
| Matthew Gaughan | 6fb1801b2a | updating with basic seniority and affiliation data | 2025-10-06 13:55:03 -07:00 |
| Matthew Gaughan | b982973f37 | updating human sampling | 2025-10-06 09:37:06 -07:00 |
| Matthew Gaughan | a14b08cfd8 | pulling sample for human_labeling | 2025-10-06 09:14:00 -07:00 |
| Matthew Gaughan | 83bcc15811 | updated with new outcome variable | 2025-10-03 12:01:37 -07:00 |
| Matthew Gaughan | 5f157ef532 | some updates to PCA | 2025-10-02 09:22:36 -07:00 |
| Matthew Gaughan | 7f89fd1966 | updated PCA analysis, ready for rob tomorrow | 2025-10-01 20:58:55 -07:00 |
| mgaughan | f636969541 | updated PCA results with dropped rows | 2025-10-01 21:28:12 -05:00 |
| Matthew Gaughan | e61d3b6599 | updating with DSL power analysis | 2025-09-30 20:17:09 -07:00 |
| Matthew Gaughan | b7c2c9fcd6 | unifying current data and some repo cleaning | 2025-09-29 14:10:39 -07:00 |
| Matthew Gaughan | acd8964e73 | preliminary EDA on the PCA analysis | 2025-09-25 14:09:39 -07:00 |
| mgaughan | b21ecb02c3 | running PCA on subcomment values, adding new plot for closed_relevance | 2025-09-25 10:11:47 -05:00 |
| mgaughan | e29d4bf59c | cleaning working directory and re-running PCA with final neurobiber vectors | 2025-09-25 09:48:23 -05:00 |
| mgaughan | 9d1359af36 | updating biberplus and olmo_batched results | 2025-09-25 09:20:40 -05:00 |
| mgaughan | 265b930578 | updating library to account for re-running PCA | 2025-09-23 16:41:32 -05:00 |
| mgaughan | 032975c4f0 | updating to collect new batch job labels | 2025-09-23 15:09:45 -05:00 |
| mgaughan | b4f0c8f885 | trying to sample the human label rows again | 2025-09-22 20:34:31 -05:00 |
| mgaughan | bcfa688e11 | olmo batched for getting the title in there too, i think | 2025-09-22 19:21:15 -05:00 |
| Matthew Gaughan | e2413ed955 | update to gerrit metadata extraction regex | 2025-09-16 11:37:46 -07:00 |
| mgaughan | bb67fea96b | hopefully last update to human sampling | 2025-09-16 12:16:10 -05:00 |
| mgaughan | 89969daab5 | updating labeling sample to be, uh, correct | 2025-09-16 11:43:28 -05:00 |
| mgaughan | d83022f184 | sampled comments for human labeling | 2025-09-16 11:22:45 -05:00 |
| mgaughan | f68372572f | updating some scripts | 2025-09-14 11:14:49 -05:00 |
| Matthew Gaughan | f9c12bb445 | shelving some of the merge work for now | 2025-09-14 09:11:33 -07:00 |
| Matthew Gaughan | 77fc3ec541 | preparing DSL modeling, looking at OLMO category data | 2025-09-07 13:21:45 -07:00 |
| mgaughan | 99c702fe20 | adding batched OLMO results | 2025-09-07 11:11:00 -05:00 |
| Matthew Gaughan | 6de62f2447 | some neurobiber PCA analysis | 2025-09-05 14:59:07 -07:00 |
| mgaughan | a96fd6db2f | updates and re-running the batched olmo categorization | 2025-09-05 13:43:00 -05:00 |
| mgaughan | f2afb7c981 | should be updated and refined pca analysis | 2025-09-04 15:47:11 -05:00 |
| mgaughan | a770d9c668 | looking at kpca | 2025-09-04 14:30:34 -05:00 |
| mgaughan | 5d4df28f94 | backing up the morning' before taking a few meetings | 2025-09-04 11:21:07 -05:00 |
| mgaughan | 6a5f07872d | looking at subcomment authorship | 2025-09-04 11:13:31 -05:00 |