collective/mediawiki_dump_tools
32 Commits 10 Branches 0 Tags
7b856bec861fb0d3bf568218971cba18e05fbe91
Commit Graph

11 Commits

Author | SHA1 | Message | Date
groceryheist | 7b856bec86 | Merge branch 'master' into regex_scanner | 2019-10-05 18:17:03 -07:00
groceryheist | 324ccc8e26 | update baseline outputs | 2019-10-05 16:36:07 -07:00
sohyeonhwang | 7bf4559ceb | changes for regex scanner addition | 2019-10-05 15:36:58 -05:00
groceryheist | f7f5bf8fd4 | sub assertEquals assertEqual | 2018-09-03 11:21:49 -07:00
groceryheist | 7cd0bf3b9e | Add parameter for selecting specific namespaces. | 2018-08-23 18:49:32 -07:00
groceryheist | f468d1a5b6 | add support for persistence with segment matching | 2018-08-20 16:08:16 -07:00
groceryheist | bf396ad366 | Prefix page titles with namespace names. | 2018-07-09 22:11:17 -07:00
groceryheist | dba793c6ac | migrate to mwxml. This completes the migration away from python-mediawiki-utilities. Except for preserving legacy persistence behavior, we can safely use the nice updates from the mediawiki-utils project. | 2018-07-05 01:16:00 -07:00
groceryheist | d77b0a4965 | migrate to mwpersistence. this fixes many issues. We preserve legacy persistence behavior using the --persistence-legacy. | 2018-07-04 19:06:07 -07:00
groceryheist | e925ac9da1 | add tests for wikipedia, malformed xml, bzip2, correct bz2 bug in wikiq. | 2018-07-04 15:08:30 -07:00
groceryheist | d2746879d0 | create baseline tests for xml dump processing | 2018-07-03 23:43:47 -07:00