1
0
mw-lifecycle-analysis/p2/quest/100125_description_neurobiber-pca.log
2025-10-01 21:28:12 -05:00

300 lines
7.8 KiB
Plaintext

starting the job at: Wed Oct 1 20:55:40 CDT 2025
setting up the environment
running the neurobiber labeling script
Number of PCs explaining 90% variance: 21
Variance of each PCA component: [44.14465236 25.51079987 20.02977026 11.84052754 8.73144858 8.38589906
6.95245699 5.64852989 5.25245119 4.98015739 4.87640589 3.84009303
3.46134099 2.49633957 2.31075199 2.07408882 1.83990439 1.83715267
1.69163987 1.34972345 1.21923888]
PC1:
BIN_CAP: 0.575
BIN_NNP: 0.568
BIN_DET: -0.296
BIN_ART: -0.232
BIN_PREP: -0.226
BIN_PIN: -0.226
BIN_RB: -0.126
BIN_INF: -0.109
BIN_PRP: -0.105
BIN_SBJP: -0.105
PC2:
BIN_PREP: 0.498
BIN_PIN: 0.498
BIN_NN: 0.460
BIN_CAP: 0.334
BIN_NNP: 0.313
BIN_DET: 0.148
BIN_NOMZ: -0.112
BIN_ART: 0.111
BIN_INF: 0.097
BIN_CONJ: 0.075
PC3:
BIN_NN: 0.811
BIN_PIN: -0.235
BIN_PREP: -0.235
BIN_NNP: -0.223
BIN_PRP: -0.196
BIN_SBJP: -0.196
BIN_RB: -0.175
BIN_INF: -0.130
BIN_FPP1: -0.091
BIN_VPRT: -0.085
PC4:
BIN_DET: 0.587
BIN_ART: 0.528
BIN_PREP: -0.282
BIN_PIN: -0.282
BIN_CAP: 0.252
BIN_INDA: 0.183
BIN_VPRT: 0.178
BIN_JJ: -0.137
BIN_NOMZ: -0.130
BIN_NNP: 0.123
PC5:
BIN_RB: 0.439
BIN_CAP: 0.348
BIN_PRP: 0.313
BIN_SBJP: 0.313
BIN_NNP: -0.285
BIN_ART: -0.234
BIN_VPRT: 0.231
BIN_NN: 0.229
BIN_DET: -0.210
BIN_NOMZ: -0.160
PC6:
BIN_JJ: 0.552
BIN_CAP: 0.454
BIN_NNP: -0.397
BIN_NOMZ: 0.374
BIN_X: -0.208
BIN_QUOT: -0.184
BIN_NN: -0.160
BIN_NUM: -0.146
BIN_CONT: -0.117
BIN_ART: 0.088
PC7:
BIN_JJ: 0.552
BIN_NNP: 0.417
BIN_VPRT: 0.374
BIN_CAP: -0.333
BIN_RB: 0.258
BIN_QUOT: -0.224
BIN_X: -0.175
BIN_INF: -0.172
BIN_AUXB: 0.157
BIN_XX0: 0.091
PC8:
BIN_INF: 0.720
BIN_QUOT: -0.330
BIN_VPRT: -0.252
BIN_RB: 0.200
BIN_TO: 0.190
BIN_NOMZ: 0.159
BIN_NUM: -0.152
BIN_NNP: 0.147
BIN_PRP: -0.132
BIN_SBJP: -0.132
PC9:
BIN_QUOT: 0.681
BIN_JJ: 0.417
BIN_INF: 0.317
BIN_CONT: 0.281
BIN_NOMZ: -0.266
BIN_PRP: -0.139
BIN_SBJP: -0.139
BIN_X: 0.129
BIN_RB: 0.084
BIN_CAP: 0.072
PC10:
BIN_RB: 0.507
BIN_PRP: -0.411
BIN_SBJP: -0.411
BIN_NNP: -0.204
BIN_X: 0.202
BIN_FPP1: -0.195
BIN_INF: -0.193
BIN_NOMZ: -0.158
BIN_NUM: 0.156
BIN_JJ: -0.154
PC11:
BIN_X: 0.632
BIN_NOMZ: -0.436
BIN_QUOT: -0.379
BIN_JJ: 0.317
BIN_CONT: -0.171
BIN_NUM: 0.159
BIN_RB: -0.149
BIN_PRP: 0.119
BIN_SBJP: 0.119
BIN_INF: 0.106
PC12:
BIN_VPRT: 0.495
BIN_X: 0.445
BIN_AUXB: 0.381
BIN_NUM: -0.346
BIN_NOMZ: 0.291
BIN_RB: -0.234
BIN_PASS: 0.177
BIN_JJ: -0.159
BIN_VBD: -0.118
BIN_BEMA: 0.112
PC13:
BIN_NUM: 0.440
BIN_X: -0.437
BIN_RB: -0.347
BIN_NOMZ: -0.338
BIN_AUXB: 0.333
BIN_VPRT: 0.223
BIN_INF: 0.210
BIN_PASS: 0.141
BIN_TO: 0.132
BIN_BEMA: 0.127
PC14:
BIN_AUXB: 0.473
BIN_VPRT: -0.443
BIN_NUM: -0.405
BIN_VBD: 0.282
BIN_CONT: -0.239
BIN_NOMZ: -0.211
BIN_PASS: 0.209
BIN_BEMA: 0.165
BIN_INF: -0.156
BIN_CCONJ: 0.149
PC15:
BIN_NUM: 0.581
BIN_NOMZ: 0.428
BIN_AUXB: 0.327
BIN_VPRT: -0.213
BIN_PGAS: -0.197
BIN_X: 0.187
BIN_RB: 0.164
BIN_BEMA: 0.163
BIN_QUOT: 0.143
BIN_CCONJ: -0.143
PC16:
BIN_PGAS: 0.702
BIN_CONJ: -0.428
BIN_CCONJ: -0.371
BIN_SCONJ: 0.217
BIN_WH: 0.138
BIN_WZPRES: 0.132
BIN_TO: 0.132
BIN_GER: 0.090
BIN_VBD: 0.089
BIN_NUM: 0.088
PC17:
BIN_CCONJ: 0.462
BIN_PGAS: 0.459
BIN_CONJ: 0.395
BIN_CONT: -0.333
BIN_QUOT: 0.184
BIN_NUM: 0.180
BIN_VPRT: 0.177
BIN_VBD: -0.172
BIN_XX0: -0.170
BIN_SPAU: -0.139
PC18:
BIN_CCONJ: 0.691
BIN_CONJ: -0.502
BIN_CONT: 0.238
BIN_VBD: 0.152
BIN_NUM: 0.149
BIN_ANDC: 0.144
BIN_INDA: -0.122
BIN_XX0: 0.120
BIN_PRIV: -0.115
BIN_PHC: 0.101
PC19:
BIN_CONT: 0.563
BIN_CONJ: 0.459
BIN_PGAS: 0.332
BIN_SPAU: 0.255
BIN_XX0: 0.234
BIN_QUOT: -0.231
BIN_RB: -0.223
BIN_SCONJ: -0.172
BIN_AUXB: 0.163
BIN_PASS: 0.131
PC20:
BIN_INDA: 0.674
BIN_DET: -0.416
BIN_QUAN: -0.265
BIN_ART: 0.217
BIN_FPP1: -0.202
BIN_PGAS: -0.152
BIN_CONJ: -0.150
BIN_SCONJ: 0.141
BIN_CCONJ: 0.130
BIN_DEMO: -0.128
PC21:
BIN_SCONJ: 0.568
BIN_PRIV: 0.541
BIN_TO: -0.332
BIN_WH: 0.270
BIN_RB: -0.158
BIN_INDA: -0.139
BIN_COND: 0.134
BIN_VPRT: -0.130
BIN_CCONJ: 0.129
BIN_CONJ: 0.102
Top 10 PC1 values:
PC1 PC2 ... AuthorPHID date_created
19173 40.268860 26.736392 ... PHID-USER-doeppszazlm3r7xah4il 1416964345
23127 34.022257 7.573103 ... PHID-USER-myidf5vlkwvrgp2iwn76 1433839792
23533 33.055352 7.623438 ... PHID-USER-sai77mtxmpqnm6pycyvz 1424498718
24553 33.053151 7.621628 ... PHID-USER-sai77mtxmpqnm6pycyvz 1424498559
23532 33.050949 7.619818 ... PHID-USER-sai77mtxmpqnm6pycyvz 1424498772
22245 31.318686 5.617453 ... PHID-USER-v7vgzvvcw7v2umf737ri 1438377936
18500 29.657022 4.747496 ... PHID-USER-hbffue25ov3attlvclze 1387662960
22023 29.625085 9.081212 ... PHID-USER-a6p24cvyblhfzc7we7nc 1440568477
14809 28.210405 6.749195 ... PHID-USER-zjzhrhmn36icnzbckqy4 1379900100
22930 27.824399 14.949181 ... PHID-USER-fo56wm4wxiwpoofn2xdu 1436249770
[10 rows x 28 columns]
Bottom 10 PC1 values:
PC1 PC2 ... AuthorPHID date_created
23485 -16.873824 13.740160 ... PHID-USER-u7udgblfyop6qd5wxot6 1425991276
22060 -16.135690 12.174259 ... PHID-USER-2nnm76h4ykalvvref2ye 1440412099
22845 -15.391146 13.319574 ... PHID-USER-2nnm76h4ykalvvref2ye 1440085454
24795 -15.084050 14.347308 ... PHID-USER-5dwuaigmkz2vzg65lape 1419297091
7451 -14.541432 5.740545 ... PHID-USER-ysftv67jxeaxdwcakvwo 1374347580
23471 -13.857781 7.962597 ... PHID-USER-2nnm76h4ykalvvref2ye 1426228927
22443 -13.803016 7.605012 ... PHID-USER-fo56wm4wxiwpoofn2xdu 1435267334
23300 -13.605468 0.980452 ... PHID-USER-evd3wnvnlb66lrwulch4 1423322226
11814 -13.401241 7.881186 ... PHID-USER-5pyvkdz65d5h5vxebodc 1372684440
968 -13.313317 0.369182 ... PHID-USER-j5ma2nageni56xp567v5 1377621000
[10 rows x 28 columns]
Top 10 PC2 values:
PC1 PC2 ... AuthorPHID date_created
24610 6.265218 29.494190 ... PHID-USER-tafngdco2cilcyr7qhhg 1422645688
20963 27.578946 27.679075 ... PHID-USER-rooknayvbydy6sodz3lx 1436311793
24082 -4.360480 27.219954 ... PHID-USER-jcypqodpdpbcicgwgh7j 1419534643
19173 40.268860 26.736392 ... PHID-USER-doeppszazlm3r7xah4il 1416964345
24824 -2.967505 23.097004 ... PHID-USER-mdihg2tyzmlvyhn3h32y 1418230141
24818 20.182195 22.630740 ... PHID-USER-hbtlbu4zftxnz4i6f7yf 1418856731
13345 6.075708 22.048374 ... PHID-USER-ydswvwhh5pm4lshahjje 1371860160
21020 6.876811 21.888275 ... PHID-USER-zcsdm7lwcehnusyhh6xp 1435194938
20973 -7.021508 20.911008 ... PHID-USER-hxwwywcyzpooynxuo7a2 1435878993
22029 0.897428 20.736628 ... PHID-USER-a6p24cvyblhfzc7we7nc 1440568357
[10 rows x 28 columns]
Bottom 10 PC2 values:
PC1 PC2 ... AuthorPHID date_created
3134 5.691116 -12.652404 ... PHID-USER-ydswvwhh5pm4lshahjje 1374855900
654 -0.763875 -12.369520 ... PHID-USER-hbtlbu4zftxnz4i6f7yf 1366408980
16080 -0.816582 -12.352041 ... PHID-USER-zjzhrhmn36icnzbckqy4 1350678600
1207 4.758836 -12.101115 ... PHID-USER-slccyo5rqasgpljxny7g 1374857700
17982 6.571867 -11.954035 ... PHID-USER-kqibbfgfpgocyzwe32lv 1412196840
1885 15.905505 -11.884510 ... PHID-USER-hyfm4swq76s4j642w46x 1372088340
2934 0.131925 -11.738040 ... PHID-USER-it53o2f2kyryqyj33uzt 1375529520
2109 -2.111122 -11.398959 ... PHID-USER-p6hvqn5njgnxuagekh4b 1367215380
13276 15.471863 -11.316666 ... PHID-USER-z6nzrwuaij3spgyg23jt 1373035320
24126 -1.622360 -11.265986 ... PHID-USER-lhtlnmkdbzlz6pbxaqdd 1430156915
[10 rows x 28 columns]
job finished, cleaning up
job pau at: Wed Oct 1 20:56:13 CDT 2025