1
0
mw-lifecycle-analysis/p2/quest/archived_data/101325_description_neurobiber-pca.log
2025-10-24 09:03:54 -07:00

240 lines
7.5 KiB
Plaintext

starting the job at: Tue Oct 14 15:08:48 CDT 2025
setting up the environment
running the neurobiber labeling script
[[13. ]
[14. ]
[11. ]
...
[10. ]
[14. ]
[12.5]]
Number of PCs explaining 90% variance: 15
Variance of each PCA component: [138.60156907 44.29951603 25.63179594 21.39857213 14.99271754
10.88014877 8.72969328 8.11497994 6.78712318 5.50912497
5.25006184 4.96444801 4.62359041 3.68257699 3.28506433]
PC1:
median_sentence_length: 0.994
normalized_CAP: -0.069
normalized_NNP: -0.050
normalized_NOMZ: -0.029
normalized_NUM: 0.026
normalized_DET: 0.024
normalized_ART: 0.020
normalized_PREP: 0.019
normalized_PIN: 0.019
normalized_RB: 0.016
PC2:
normalized_CAP: 0.555
normalized_NNP: 0.554
normalized_DET: -0.298
normalized_ART: -0.232
normalized_PREP: -0.220
normalized_PIN: -0.220
sentence_count: -0.189
normalized_RB: -0.125
normalized_PRP: -0.110
normalized_SBJP: -0.110
PC3:
normalized_NN: 0.509
normalized_PREP: 0.491
normalized_PIN: 0.491
normalized_CAP: 0.304
normalized_NNP: 0.279
normalized_DET: 0.143
sentence_count: -0.115
normalized_ART: 0.109
normalized_NOMZ: -0.098
normalized_INF: 0.095
PC4:
normalized_NN: 0.683
sentence_count: -0.412
normalized_NNP: -0.295
normalized_PIN: -0.217
normalized_PREP: -0.217
normalized_CAP: -0.174
normalized_PRP: -0.173
normalized_SBJP: -0.173
normalized_RB: -0.142
normalized_JJ: 0.117
PC5:
sentence_count: 0.718
normalized_NN: 0.358
normalized_DET: 0.228
normalized_PIN: -0.223
normalized_PREP: -0.223
normalized_ART: 0.221
normalized_NOMZ: -0.190
normalized_CAP: 0.186
normalized_INF: -0.137
normalized_JJ: -0.123
PC6:
normalized_DET: 0.538
normalized_ART: 0.483
sentence_count: -0.398
normalized_PREP: -0.216
normalized_PIN: -0.216
normalized_CAP: 0.206
normalized_VPRT: 0.204
normalized_INDA: 0.186
normalized_NN: -0.142
normalized_X: -0.132
PC7:
normalized_RB: 0.442
normalized_CAP: 0.343
normalized_PRP: 0.313
normalized_SBJP: 0.313
normalized_NNP: -0.278
normalized_VPRT: 0.234
normalized_ART: -0.232
normalized_NN: 0.229
normalized_DET: -0.208
normalized_NOMZ: -0.164
PC8:
normalized_JJ: 0.504
normalized_CAP: 0.502
normalized_NNP: -0.468
normalized_NOMZ: 0.296
sentence_count: 0.150
normalized_X: -0.146
normalized_QUOT: -0.145
normalized_NN: -0.142
normalized_VPRT: -0.131
normalized_RB: -0.128
PC9:
normalized_JJ: 0.637
normalized_VPRT: 0.357
normalized_NNP: 0.337
normalized_CAP: -0.265
normalized_INF: -0.258
normalized_QUOT: -0.224
normalized_RB: 0.204
normalized_X: -0.145
normalized_AUXB: 0.143
sentence_count: 0.135
PC10:
normalized_INF: 0.691
normalized_QUOT: -0.415
normalized_VPRT: -0.263
normalized_RB: 0.222
normalized_TO: 0.184
normalized_PRP: -0.155
normalized_SBJP: -0.155
normalized_CONT: -0.138
normalized_NNP: 0.122
normalized_PIN: -0.120
PC11:
normalized_QUOT: 0.714
normalized_JJ: 0.402
normalized_CONT: 0.295
normalized_INF: 0.294
normalized_NOMZ: -0.246
normalized_PRP: -0.126
normalized_SBJP: -0.126
normalized_X: 0.107
normalized_NUM: -0.076
normalized_CAP: 0.067
PC12:
normalized_RB: 0.521
normalized_PRP: -0.426
normalized_SBJP: -0.426
normalized_JJ: -0.236
normalized_INF: -0.229
normalized_FPP1: -0.211
normalized_NNP: -0.184
normalized_CONJ: 0.129
normalized_XX0: 0.125
normalized_TO: -0.124
PC13:
normalized_X: 0.808
normalized_NOMZ: -0.391
normalized_QUOT: -0.249
normalized_JJ: 0.163
sentence_count: -0.146
normalized_NNP: -0.126
normalized_CAP: 0.107
normalized_CONT: -0.097
normalized_VPRT: 0.096
normalized_RB: -0.071
PC14:
normalized_VPRT: 0.514
normalized_AUXB: 0.496
normalized_RB: -0.346
normalized_PASS: 0.221
normalized_INF: 0.218
normalized_NOMZ: 0.215
normalized_BEMA: 0.161
normalized_JJ: -0.161
normalized_VBD: -0.137
normalized_NUM: -0.136
PC15:
normalized_NOMZ: 0.554
normalized_NUM: -0.544
normalized_X: 0.438
normalized_RB: 0.239
sentence_count: 0.146
normalized_NNP: 0.135
normalized_AUXB: -0.116
normalized_NN: 0.105
normalized_CONT: 0.104
normalized_INF: -0.101
Top 10 PC1 values:
PC1 PC2 ... AuthorPHID date_created
16080 525.102703 48.280630 ... PHID-USER-zjzhrhmn36icnzbckqy4 1350678600
18859 77.466344 3.706703 ... PHID-USER-ll6tmaogat2b5q7tnqas 1405358040
20378 69.473292 -5.921977 ... PHID-USER-ynivjflmc2dcl6w5ut5v 1407551580
8874 67.305410 6.587019 ... PHID-USER-ydswvwhh5pm4lshahjje 1371667800
6468 52.113083 12.698065 ... PHID-USER-azy72hrp3tpetr52aob6 1378208100
18692 43.220624 8.230008 ... PHID-USER-arjqb24x4oae7awzpfp6 1411431840
5607 42.720768 1.581160 ... PHID-USER-ynivjflmc2dcl6w5ut5v 1360124400
19479 41.065047 8.286151 ... PHID-USER-ynivjflmc2dcl6w5ut5v 1406854860
13751 38.405351 6.445956 ... PHID-USER-v7vgzvvcw7v2umf737ri 1380947640
6503 37.060191 -2.635433 ... PHID-USER-qgqq35kbi5wss2tlgmhg 1377865740
[10 rows x 25 columns]
Bottom 10 PC1 values:
PC1 PC2 ... AuthorPHID date_created
19173 -14.819594 38.839843 ... PHID-USER-doeppszazlm3r7xah4il 1416964345
23533 -14.098760 31.956092 ... PHID-USER-sai77mtxmpqnm6pycyvz 1424498718
24553 -14.098553 31.953701 ... PHID-USER-sai77mtxmpqnm6pycyvz 1424498559
23532 -14.098346 31.951309 ... PHID-USER-sai77mtxmpqnm6pycyvz 1424498772
129 -13.767257 2.442547 ... PHID-USER-hyfm4swq76s4j642w46x 1375120080
22245 -12.327433 30.418183 ... PHID-USER-v7vgzvvcw7v2umf737ri 1438377936
752 -12.170613 17.171274 ... PHID-USER-sx63fwaih5kjt7bz4u6z 1380590700
2120 -11.607147 -10.509373 ... PHID-USER-xfe43w2lb5gpvglf4coa 1367008080
22153 -11.098587 7.351805 ... PHID-USER-a6p24cvyblhfzc7we7nc 1438982860
24847 -10.908633 15.377024 ... PHID-USER-srhlj2447vmpmrfhqnfa 1417632210
[10 rows x 25 columns]
Top 10 PC2 values:
PC1 PC2 ... AuthorPHID date_created
16080 525.102703 48.280630 ... PHID-USER-zjzhrhmn36icnzbckqy4 1350678600
19173 -14.819594 38.839843 ... PHID-USER-doeppszazlm3r7xah4il 1416964345
23127 -1.787399 32.727692 ... PHID-USER-myidf5vlkwvrgp2iwn76 1433839792
23533 -14.098760 31.956092 ... PHID-USER-sai77mtxmpqnm6pycyvz 1424498718
24553 -14.098553 31.953701 ... PHID-USER-sai77mtxmpqnm6pycyvz 1424498559
23532 -14.098346 31.951309 ... PHID-USER-sai77mtxmpqnm6pycyvz 1424498772
18500 13.647382 30.709395 ... PHID-USER-hbffue25ov3attlvclze 1387662960
22245 -12.327433 30.418183 ... PHID-USER-v7vgzvvcw7v2umf737ri 1438377936
22023 -7.400000 29.037196 ... PHID-USER-a6p24cvyblhfzc7we7nc 1440568477
14809 -2.186555 28.072103 ... PHID-USER-zjzhrhmn36icnzbckqy4 1379900100
[10 rows x 25 columns]
Bottom 10 PC2 values:
PC1 PC2 ... AuthorPHID date_created
23485 -1.065773 -15.903250 ... PHID-USER-u7udgblfyop6qd5wxot6 1425991276
22060 4.042133 -15.132236 ... PHID-USER-2nnm76h4ykalvvref2ye 1440412099
5792 -2.696513 -15.036399 ... PHID-USER-grpjkpfolt5gz4ljlbfg 1355334540
1436 -0.480107 -15.016999 ... PHID-USER-tyjmn7xcw6s2b6rqagj7 1373878680
22799 -5.139569 -14.977697 ... PHID-USER-fjve3gq5wsmaaccti7pb 1430752987
22845 -0.877723 -14.762675 ... PHID-USER-2nnm76h4ykalvvref2ye 1440085454
7451 9.897529 -14.392291 ... PHID-USER-ysftv67jxeaxdwcakvwo 1374347580
9423 10.013728 -14.381035 ... PHID-USER-zzvqlvm6i6kml4tfnqvq 1369411380
1228 -2.448487 -13.906291 ... PHID-USER-ysftv67jxeaxdwcakvwo 1374765240
2775 3.664323 -13.485623 ... PHID-USER-dw53c5cb2qfhyemej57o 1377068880
[10 rows x 25 columns]
job finished, cleaning up
job pau at: Tue Oct 14 15:09:18 CDT 2025