Nathan TeBlunthuis groceryheist
  • Joined on 2024-04-18
groceryheist created branch compute-diffs in collective/mediawiki_dump_tools 2026-02-23 23:05:14 +00:00
groceryheist pushed to compute-diffs at collective/mediawiki_dump_tools 2026-02-23 23:05:14 +00:00
groceryheist created branch jsonl-output in collective/mediawiki_dump_tools 2026-02-23 19:52:44 +00:00
groceryheist pushed to jsonl-output at collective/mediawiki_dump_tools 2026-02-23 19:52:44 +00:00
c7eb374ceb use signalling to timeout mwparserfromhell instead of asyncio.
4b8288c016 add some debug lines.
8590e5f920 fix jsonl.d output.
93f6ed0ff5 fix bug by truncating corrupted jsonl lines.
5ebdb26d82 make resume with jsonl output fault tolerant.
Compare 10 commits »
groceryheist pushed to jsonl-output at collective/mediawiki_dump_tools 2026-01-07 20:42:56 +00:00
c7eb374ceb use signalling to timeout mwparserfromhell instead of asyncio.
groceryheist pushed to jsonl-output at collective/mediawiki_dump_tools 2026-01-07 03:58:28 +00:00
4b8288c016 add some debug lines.
groceryheist pushed to compute-diffs at collective/mediawiki_dump_tools 2025-07-08 03:59:02 +00:00
76d54ae597 support partitioning output parquet by namespace.
groceryheist pushed to compute-diffs at collective/mediawiki_dump_tools 2025-07-08 03:25:10 +00:00
c9fb94ccc0 fix tests.
groceryheist pushed to main at groceryheist/pywikidiff2 2025-07-06 03:56:07 +00:00
e596d00971 implement diff of a sequence (for speed and c++ play).
groceryheist pushed to main at groceryheist/pywikidiff2 2025-07-05 20:34:16 +00:00
5427a4defa try -O3
groceryheist pushed to main at groceryheist/pywikidiff2 2025-07-05 18:38:14 +00:00
63b7ae4ccb add test and disable debugging symbols.
groceryheist pushed to main at groceryheist/pywikidiff2 2025-07-05 18:31:41 +00:00
5d9517d7df attempt bugfix.
groceryheist pushed to main at groceryheist/pywikidiff2 2025-07-05 18:25:08 +00:00
714e52f9fe add test and enable debugging symbols.
groceryheist pushed to main at groceryheist/pywikidiff2 2025-07-05 18:02:18 +00:00
d811bc7a1c add test.
groceryheist pushed to main at groceryheist/pywikidiff2 2025-07-05 17:19:34 +00:00
83ebedfb41 improve tests and fix bug.
groceryheist pushed to main at groceryheist/pywikidiff2 2025-07-05 04:38:45 +00:00
1d73eef722 update version.
groceryheist pushed to main at groceryheist/pywikidiff2 2025-07-05 04:38:06 +00:00
0ad442d5eb fix bug with newlines.
groceryheist pushed to main at groceryheist/pywikidiff2 2025-07-05 03:55:40 +00:00
bf0739c25a this should work with python 3.9
groceryheist pushed to main at groceryheist/pywikidiff2 2025-07-05 03:54:15 +00:00
9d63263384 switch to GPL (for complience with wikidiff2) add the code.
groceryheist pushed to main at groceryheist/pywikidiff2 2025-07-05 03:53:03 +00:00
43ce33153e create repository to call wikidiff2 from python.