240 lines
7.5 KiB
Plaintext
240 lines
7.5 KiB
Plaintext
starting the job at: Tue Oct 14 15:08:48 CDT 2025
|
|
setting up the environment
|
|
running the neurobiber labeling script
|
|
[[13. ]
|
|
[14. ]
|
|
[11. ]
|
|
...
|
|
[10. ]
|
|
[14. ]
|
|
[12.5]]
|
|
Number of PCs explaining 90% variance: 15
|
|
Variance of each PCA component: [138.60156907 44.29951603 25.63179594 21.39857213 14.99271754
|
|
10.88014877 8.72969328 8.11497994 6.78712318 5.50912497
|
|
5.25006184 4.96444801 4.62359041 3.68257699 3.28506433]
|
|
PC1:
|
|
median_sentence_length: 0.994
|
|
normalized_CAP: -0.069
|
|
normalized_NNP: -0.050
|
|
normalized_NOMZ: -0.029
|
|
normalized_NUM: 0.026
|
|
normalized_DET: 0.024
|
|
normalized_ART: 0.020
|
|
normalized_PREP: 0.019
|
|
normalized_PIN: 0.019
|
|
normalized_RB: 0.016
|
|
PC2:
|
|
normalized_CAP: 0.555
|
|
normalized_NNP: 0.554
|
|
normalized_DET: -0.298
|
|
normalized_ART: -0.232
|
|
normalized_PREP: -0.220
|
|
normalized_PIN: -0.220
|
|
sentence_count: -0.189
|
|
normalized_RB: -0.125
|
|
normalized_PRP: -0.110
|
|
normalized_SBJP: -0.110
|
|
PC3:
|
|
normalized_NN: 0.509
|
|
normalized_PREP: 0.491
|
|
normalized_PIN: 0.491
|
|
normalized_CAP: 0.304
|
|
normalized_NNP: 0.279
|
|
normalized_DET: 0.143
|
|
sentence_count: -0.115
|
|
normalized_ART: 0.109
|
|
normalized_NOMZ: -0.098
|
|
normalized_INF: 0.095
|
|
PC4:
|
|
normalized_NN: 0.683
|
|
sentence_count: -0.412
|
|
normalized_NNP: -0.295
|
|
normalized_PIN: -0.217
|
|
normalized_PREP: -0.217
|
|
normalized_CAP: -0.174
|
|
normalized_PRP: -0.173
|
|
normalized_SBJP: -0.173
|
|
normalized_RB: -0.142
|
|
normalized_JJ: 0.117
|
|
PC5:
|
|
sentence_count: 0.718
|
|
normalized_NN: 0.358
|
|
normalized_DET: 0.228
|
|
normalized_PIN: -0.223
|
|
normalized_PREP: -0.223
|
|
normalized_ART: 0.221
|
|
normalized_NOMZ: -0.190
|
|
normalized_CAP: 0.186
|
|
normalized_INF: -0.137
|
|
normalized_JJ: -0.123
|
|
PC6:
|
|
normalized_DET: 0.538
|
|
normalized_ART: 0.483
|
|
sentence_count: -0.398
|
|
normalized_PREP: -0.216
|
|
normalized_PIN: -0.216
|
|
normalized_CAP: 0.206
|
|
normalized_VPRT: 0.204
|
|
normalized_INDA: 0.186
|
|
normalized_NN: -0.142
|
|
normalized_X: -0.132
|
|
PC7:
|
|
normalized_RB: 0.442
|
|
normalized_CAP: 0.343
|
|
normalized_PRP: 0.313
|
|
normalized_SBJP: 0.313
|
|
normalized_NNP: -0.278
|
|
normalized_VPRT: 0.234
|
|
normalized_ART: -0.232
|
|
normalized_NN: 0.229
|
|
normalized_DET: -0.208
|
|
normalized_NOMZ: -0.164
|
|
PC8:
|
|
normalized_JJ: 0.504
|
|
normalized_CAP: 0.502
|
|
normalized_NNP: -0.468
|
|
normalized_NOMZ: 0.296
|
|
sentence_count: 0.150
|
|
normalized_X: -0.146
|
|
normalized_QUOT: -0.145
|
|
normalized_NN: -0.142
|
|
normalized_VPRT: -0.131
|
|
normalized_RB: -0.128
|
|
PC9:
|
|
normalized_JJ: 0.637
|
|
normalized_VPRT: 0.357
|
|
normalized_NNP: 0.337
|
|
normalized_CAP: -0.265
|
|
normalized_INF: -0.258
|
|
normalized_QUOT: -0.224
|
|
normalized_RB: 0.204
|
|
normalized_X: -0.145
|
|
normalized_AUXB: 0.143
|
|
sentence_count: 0.135
|
|
PC10:
|
|
normalized_INF: 0.691
|
|
normalized_QUOT: -0.415
|
|
normalized_VPRT: -0.263
|
|
normalized_RB: 0.222
|
|
normalized_TO: 0.184
|
|
normalized_PRP: -0.155
|
|
normalized_SBJP: -0.155
|
|
normalized_CONT: -0.138
|
|
normalized_NNP: 0.122
|
|
normalized_PIN: -0.120
|
|
PC11:
|
|
normalized_QUOT: 0.714
|
|
normalized_JJ: 0.402
|
|
normalized_CONT: 0.295
|
|
normalized_INF: 0.294
|
|
normalized_NOMZ: -0.246
|
|
normalized_PRP: -0.126
|
|
normalized_SBJP: -0.126
|
|
normalized_X: 0.107
|
|
normalized_NUM: -0.076
|
|
normalized_CAP: 0.067
|
|
PC12:
|
|
normalized_RB: 0.521
|
|
normalized_PRP: -0.426
|
|
normalized_SBJP: -0.426
|
|
normalized_JJ: -0.236
|
|
normalized_INF: -0.229
|
|
normalized_FPP1: -0.211
|
|
normalized_NNP: -0.184
|
|
normalized_CONJ: 0.129
|
|
normalized_XX0: 0.125
|
|
normalized_TO: -0.124
|
|
PC13:
|
|
normalized_X: 0.808
|
|
normalized_NOMZ: -0.391
|
|
normalized_QUOT: -0.249
|
|
normalized_JJ: 0.163
|
|
sentence_count: -0.146
|
|
normalized_NNP: -0.126
|
|
normalized_CAP: 0.107
|
|
normalized_CONT: -0.097
|
|
normalized_VPRT: 0.096
|
|
normalized_RB: -0.071
|
|
PC14:
|
|
normalized_VPRT: 0.514
|
|
normalized_AUXB: 0.496
|
|
normalized_RB: -0.346
|
|
normalized_PASS: 0.221
|
|
normalized_INF: 0.218
|
|
normalized_NOMZ: 0.215
|
|
normalized_BEMA: 0.161
|
|
normalized_JJ: -0.161
|
|
normalized_VBD: -0.137
|
|
normalized_NUM: -0.136
|
|
PC15:
|
|
normalized_NOMZ: 0.554
|
|
normalized_NUM: -0.544
|
|
normalized_X: 0.438
|
|
normalized_RB: 0.239
|
|
sentence_count: 0.146
|
|
normalized_NNP: 0.135
|
|
normalized_AUXB: -0.116
|
|
normalized_NN: 0.105
|
|
normalized_CONT: 0.104
|
|
normalized_INF: -0.101
|
|
Top 10 PC1 values:
|
|
PC1 PC2 ... AuthorPHID date_created
|
|
16080 525.102703 48.280630 ... PHID-USER-zjzhrhmn36icnzbckqy4 1350678600
|
|
18859 77.466344 3.706703 ... PHID-USER-ll6tmaogat2b5q7tnqas 1405358040
|
|
20378 69.473292 -5.921977 ... PHID-USER-ynivjflmc2dcl6w5ut5v 1407551580
|
|
8874 67.305410 6.587019 ... PHID-USER-ydswvwhh5pm4lshahjje 1371667800
|
|
6468 52.113083 12.698065 ... PHID-USER-azy72hrp3tpetr52aob6 1378208100
|
|
18692 43.220624 8.230008 ... PHID-USER-arjqb24x4oae7awzpfp6 1411431840
|
|
5607 42.720768 1.581160 ... PHID-USER-ynivjflmc2dcl6w5ut5v 1360124400
|
|
19479 41.065047 8.286151 ... PHID-USER-ynivjflmc2dcl6w5ut5v 1406854860
|
|
13751 38.405351 6.445956 ... PHID-USER-v7vgzvvcw7v2umf737ri 1380947640
|
|
6503 37.060191 -2.635433 ... PHID-USER-qgqq35kbi5wss2tlgmhg 1377865740
|
|
|
|
[10 rows x 25 columns]
|
|
|
|
Bottom 10 PC1 values:
|
|
PC1 PC2 ... AuthorPHID date_created
|
|
19173 -14.819594 38.839843 ... PHID-USER-doeppszazlm3r7xah4il 1416964345
|
|
23533 -14.098760 31.956092 ... PHID-USER-sai77mtxmpqnm6pycyvz 1424498718
|
|
24553 -14.098553 31.953701 ... PHID-USER-sai77mtxmpqnm6pycyvz 1424498559
|
|
23532 -14.098346 31.951309 ... PHID-USER-sai77mtxmpqnm6pycyvz 1424498772
|
|
129 -13.767257 2.442547 ... PHID-USER-hyfm4swq76s4j642w46x 1375120080
|
|
22245 -12.327433 30.418183 ... PHID-USER-v7vgzvvcw7v2umf737ri 1438377936
|
|
752 -12.170613 17.171274 ... PHID-USER-sx63fwaih5kjt7bz4u6z 1380590700
|
|
2120 -11.607147 -10.509373 ... PHID-USER-xfe43w2lb5gpvglf4coa 1367008080
|
|
22153 -11.098587 7.351805 ... PHID-USER-a6p24cvyblhfzc7we7nc 1438982860
|
|
24847 -10.908633 15.377024 ... PHID-USER-srhlj2447vmpmrfhqnfa 1417632210
|
|
|
|
[10 rows x 25 columns]
|
|
Top 10 PC2 values:
|
|
PC1 PC2 ... AuthorPHID date_created
|
|
16080 525.102703 48.280630 ... PHID-USER-zjzhrhmn36icnzbckqy4 1350678600
|
|
19173 -14.819594 38.839843 ... PHID-USER-doeppszazlm3r7xah4il 1416964345
|
|
23127 -1.787399 32.727692 ... PHID-USER-myidf5vlkwvrgp2iwn76 1433839792
|
|
23533 -14.098760 31.956092 ... PHID-USER-sai77mtxmpqnm6pycyvz 1424498718
|
|
24553 -14.098553 31.953701 ... PHID-USER-sai77mtxmpqnm6pycyvz 1424498559
|
|
23532 -14.098346 31.951309 ... PHID-USER-sai77mtxmpqnm6pycyvz 1424498772
|
|
18500 13.647382 30.709395 ... PHID-USER-hbffue25ov3attlvclze 1387662960
|
|
22245 -12.327433 30.418183 ... PHID-USER-v7vgzvvcw7v2umf737ri 1438377936
|
|
22023 -7.400000 29.037196 ... PHID-USER-a6p24cvyblhfzc7we7nc 1440568477
|
|
14809 -2.186555 28.072103 ... PHID-USER-zjzhrhmn36icnzbckqy4 1379900100
|
|
|
|
[10 rows x 25 columns]
|
|
|
|
Bottom 10 PC2 values:
|
|
PC1 PC2 ... AuthorPHID date_created
|
|
23485 -1.065773 -15.903250 ... PHID-USER-u7udgblfyop6qd5wxot6 1425991276
|
|
22060 4.042133 -15.132236 ... PHID-USER-2nnm76h4ykalvvref2ye 1440412099
|
|
5792 -2.696513 -15.036399 ... PHID-USER-grpjkpfolt5gz4ljlbfg 1355334540
|
|
1436 -0.480107 -15.016999 ... PHID-USER-tyjmn7xcw6s2b6rqagj7 1373878680
|
|
22799 -5.139569 -14.977697 ... PHID-USER-fjve3gq5wsmaaccti7pb 1430752987
|
|
22845 -0.877723 -14.762675 ... PHID-USER-2nnm76h4ykalvvref2ye 1440085454
|
|
7451 9.897529 -14.392291 ... PHID-USER-ysftv67jxeaxdwcakvwo 1374347580
|
|
9423 10.013728 -14.381035 ... PHID-USER-zzvqlvm6i6kml4tfnqvq 1369411380
|
|
1228 -2.448487 -13.906291 ... PHID-USER-ysftv67jxeaxdwcakvwo 1374765240
|
|
2775 3.664323 -13.485623 ... PHID-USER-dw53c5cb2qfhyemej57o 1377068880
|
|
|
|
[10 rows x 25 columns]
|
|
job finished, cleaning up
|
|
job pau at: Tue Oct 14 15:09:18 CDT 2025
|