1
0
mw-convo-collections/data_collection/spark-warehouse/activity_schema.csv

8.4 KiB

1Field classField nameData typeComment
2Event_globalwiki_dbstringenwiki, dewiki, eswiktionary, etc.
3Event_globalevent_entitystringrevision, user or page
4Event_globalevent_typestringcreate, move, delete, etc. Detailed explanation in the docs under #Event_types
5Event_globalevent_timestampstringWhen this event ocurred
6Event_globalevent_commentstringComment related to this event, sourced from log_comment, rev_comment, etc.
7Event userevent_user_idbigintID of the user that caused the event. Null if the user is anonymous or if from a revision where the user has been revision deleted.
8Event userevent_user_text_historicalstringHistorical username (IP address for anonymous user) of the user that caused the event. Null for revisions where the user has been revision deleted.
9Event userevent_user_textstringCurrent username of the user that caused the event. Null for anonymous users (the IP is stored in event_user_text_historical). Null for revisions where the user has been revision deleted.
10Event userevent_user_blocks_historicalarray<string>Historical blocks of the user that caused the event
11Event userevent_user_blocksarray<string>Current blocks of the user that caused the event
12Event userevent_user_groups_historicalarray<string>Historical groups of the user that caused the event
13Event userevent_user_groupsarray<string>Current groups of the user that caused the event
14Event userevent_user_is_bot_by_historicalarray<string>Historical bot information of the user that caused the event, can contain values name or group
15Event userevent_user_is_bot_byarray<string>Bot information of the user that caused the event, can contain values name or group
16Event userevent_user_is_created_by_selfbooleanWhether the event_user created their own account
17Event userevent_user_is_created_by_systembooleanWhether the event_user account was created by mediawiki (eg. centralauth)
18Event userevent_user_is_created_by_peerbooleanWhether the event_user account was created by another user
19Event userevent_user_is_anonymousbooleanWhether the event_user is not registered, using the old way that surfaced the IP publicly. True for revisions where the user has been revision deleted, even if the user was actually registered.
20Event userevent_user_is_temporarybooleanWhether the event_user is not registered, using the new temporary account way. True for revisions where the user has been revision deleted, even if the user was actually registered.
21Event userevent_user_is_permanentbooleanWhether the event_user is registered.
22Event userevent_user_registration_timestampstringRegistration timestamp of the user that caused the event (from user table)
23Event userevent_user_creation_timestampstringCreation timestamp of the user that caused the event (from logging table)
24Event userevent_user_first_edit_timestampstringTimestamp of the first edit of the user that caused the event
25Event userevent_user_revision_countbigintNumber of revisions made by the event_user up to the historical time in this wiki_db (only available in revision-create events so far). For revision-create events, this includes the event itself.
26Event userevent_user_seconds_since_previous_revisionbigintIn revision events: seconds elapsed since the previous revision made by the current event_user_id (only available in revision-create events so far)
27pagepage_idbigintIn revision/page events: id of the page
28pagepage_title_historicalstringIn revision/page events: historical title of the page
29pagepage_titlestringIn revision/page events: current title of the page
30pagepage_namespace_historicalintIn revision/page events: historical namespace of the page.
31pagepage_namespace_is_content_historicalbooleanIn revision/page events: historical namespace of the page is categorized as content
32pagepage_namespaceintIn revision/page events: current namespace of the page
33pagepage_namespace_is_contentbooleanIn revision/page events: current namespace of the page is categorized as content
34pagepage_is_redirectbooleanIn revision/page events: whether the page is currently a redirect
35pagepage_is_deletedbooleanIn revision/page events: Whether the page is rebuilt from a delete event
36pagepage_creation_timestampstringIn revision/page events: creation timestamp of the page
37pagepage_first_edit_timestampstringIn revision/page events: timestamp of the page's first revision. Can be before the page_creation in some restore/merge cases (see revision_is_from_before_page_creation).
38pagepage_revision_countbigintIn revision/page events: Cumulative revision count per page for the current page_id (only available in revision-create events so far)
39pagepage_seconds_since_previous_revisionbigintIn revision/page events: seconds elapsed since the previous revision made on the current page_id (only available in revision-create events so far)
40useruser_idbigintIn user events: id of the user
41useruser_text_historicalstringIn user events: historical username or IP address of the user
42useruser_textstringIn user events: current username or IP address of the user
43useruser_blocks_historicalarray<string>In user events: historical user blocks
44useruser_blocksarray<string>In user events: current user blocks
45useruser_groups_historicalarray<string>In user events: historical user groups
46useruser_groupsarray<string>In user events: current user groups
47useruser_is_bot_by_historicalarray<string>In user events: Historical bot information of the user, can contain values name or group
48useruser_is_bot_byarray<string>In user events: Bot information of the user, can contain values name or group
49useruser_is_created_by_selfbooleanIn user events: whether the user created their own account
50useruser_is_created_by_systembooleanIn user events: whether the user account was created by mediawiki
51useruser_is_created_by_peerbooleanIn user events: whether the user account was created by another user
52useruser_is_anonymousbooleanIn user events: whether the user is not registered, using the old way that surfaced the IP publicly
53useruser_is_temporarybooleanIn user events: whether the user is not registered, using the new temporary account way
54useruser_is_permanentbooleanIn user events: whether the user is registered
55useruser_registration_timestampstringIn user events: registration timestamp of the user.
56useruser_creation_timestampstringIn user events: Creation timestamp of the user (from logging table)
57useruser_first_edit_timestampstringIn user events: Timestamp of the first edit of the user
58revisionrevision_idbigintIn revision events: id of the revision
59revisionrevision_parent_idbigintIn revision events: id of the parent revision
60revisionrevision_minor_editbooleanIn revision events: whether it is a minor edit or not
61revisionrevision_deleted_partsarray<string>In revision events: Deleted parts of the revision, can contain values text, comment and user
62revisionrevision_deleted_parts_are_suppressedbooleanIn revision events: Whether the deleted parts are deleted to admin as well (visible only by stewards)
63revisionrevision_text_bytesbigintIn revision events: number of bytes of revision
64revisionrevision_text_bytes_diffbigintIn revision events: change in bytes relative to parent revision (can be negative).
65revisionrevision_text_sha1stringIn revision events: sha1 hash of the revision
66revisionrevision_content_modelstringIn revision events: content model of revision
67revisionrevision_content_formatstringIn revision events: content format of revision
68revisionrevision_is_deleted_by_page_deletionbooleanIn revision events: whether this revision has been deleted (moved to archive table)
69revisionrevision_deleted_by_page_deletion_timestampstringIn revision events: the timestamp when the revision was deleted
70revisionrevision_is_identity_revertedbooleanIn revision events: whether this revision was reverted by another future revision
71revisionrevision_first_identity_reverting_revision_idbigintIn revision events: id of the revision that reverted this revision
72revisionrevision_seconds_to_identity_revertbigintIn revision events: seconds elapsed between revision posting and its revert (if there was one)
73revisionrevision_is_identity_revertbooleanIn revision events: whether this revision reverts other revisions
74revisionrevision_is_from_before_page_creationbooleanIn revision events: True if the revision timestamp is before the page creation (can happen with restore events)
75revisionrevision_tagsarray<string>In revision events: Tags associated to the revision