|
9345f9de94
|
make pass keyword arg to dataframe.drop
|
2023-05-31 09:47:21 -07:00 |
|
|
07b0dff9bc
|
changes for archiving.
|
2023-05-23 17:18:19 -07:00 |
|
|
811a0d87c4
|
changes from dirty branch.
|
2023-05-18 10:29:08 -07:00 |
|
|
c190791364
|
add 2 more umap parameters
|
2022-06-08 17:27:37 -07:00 |
|
|
5a40465a62
|
add support for umap->hdbscan clustering method
|
2022-06-08 17:01:27 -07:00 |
|
|
541e125b28
|
lsi support for weekly similarities
|
2021-08-11 22:48:33 -07:00 |
|
Nate E TeBlunthuis
|
cf86c7492c
|
update clustering scripts
|
2021-08-03 14:55:02 -07:00 |
|
Nate E TeBlunthuis
|
87ffaa6858
|
script for picking the best clustering given constraints
|
2021-05-14 19:10:36 -07:00 |
|
Nate E TeBlunthuis
|
582cf263ea
|
bug fix in affinity clustering
|
2021-05-13 22:26:15 -07:00 |
|
Nate E TeBlunthuis
|
4cb7eeec80
|
Refactor to make a decent api.
|
2021-05-10 13:46:49 -07:00 |
|
Nate E TeBlunthuis
|
f05cb962e0
|
refactor clustring in object oriented style
|
2021-05-07 22:33:26 -07:00 |
|
Nate E TeBlunthuis
|
8d1df5b26e
|
refactor clustering.py into method-specific files.
|
2021-05-03 11:28:48 -07:00 |
|
Nate E TeBlunthuis
|
7df8436067
|
Use Latent semantic indexing and hdbscan
|
2021-05-02 23:39:55 -07:00 |
|
Nate E TeBlunthuis
|
36b24ee933
|
reindex tfidf in memory instead of using spark
|
2021-04-30 12:48:19 -07:00 |
|
Nate E TeBlunthuis
|
37dd0ef55f
|
bugfixes in clustering selection.
|
2021-04-21 16:56:25 -07:00 |
|
Nate E TeBlunthuis
|
01a4c35358
|
grid sweep selection for clustering hyperparameters
|
2021-04-20 11:33:54 -07:00 |
|
Nate E TeBlunthuis
|
f0176d9f0d
|
Changes for cosine similarities on klone.
|
2021-04-05 23:21:06 -07:00 |
|
Nate E TeBlunthuis
|
4dc949de5f
|
Changes from hyak.
|
2021-02-22 16:03:48 -08:00 |
|
Nate E TeBlunthuis
|
4e20dce188
|
Updating to support wang-style user overlaps.
|
2020-12-24 22:38:04 -08:00 |
|
Nate E TeBlunthuis
|
56269deee3
|
Some improvements to run affinity clustering on larger dataset and
compute density.
|
2020-12-12 20:42:47 -08:00 |
|
Nate E TeBlunthuis
|
e6294b5b90
|
Refactor and reorganze.
|
2020-12-08 17:32:20 -08:00 |
|