Known Undisclosed Paid Editors (English Wikipedia)

<div>This dataset contains a manually curated set of known undisclosed paid editor (UPE) accounts from Wikipedia. This is not a complete set of known editors. Editors who do not appear in this set are not guaranteed to not be paid editors.</div><div><br></div><div>See also https://en.wikipedia.org/wiki/Wikipedia:Paid-contribution_disclosure<br></div><div><br></div><div>The dataset contains four columns:</div><div><br></div><div> - <b>user_name</b>: The username of the UPE</div><div> - <b>case_page_name</b>: The page name (title) of a page describing the case through which paid editing was discovered.</div><div> - <b>type</b>: One of three types of UPEs (described below)</div><div> - <b>notes</b>: Any notes that a dataset curator chose to include with the example. <br></div><div><br></div><br><table><tr><td>Type 1</td><td>User makes just over 10 minor edits. Is quiet for a few days well waiting for autoconfirm (user right) to kick in (takes 4 days). Then creates a promotional article in one big edit followed by the account going silent.</td><td>This is the main priority. These are present in the largest numbers and are the clearest pattern. They also cause the most damage to our shared brand.</td></tr><tr><td>Type 2</td><td>User is an obvious newbie. Makes lots of mistakes. Often turns out to be internal staff. </td><td>Not a key priority. We already manage these cases fairly well as they are often so obvious.</td></tr><tr><td>Type 3</td><td>Undisclosed paid editor, but one who only moves on to new accounts once their current account gets detected. </td><td>A serious problem--these will be harder to detect as we will have smaller numbers of these cases. Also a long time will need to pass before a pattern becomes present</td></tr></table>