Jump to content

Wikipedia:Bots/Requests for approval/KMLbot

From Wikipedia, the free encyclopedia

This is an old revision of this page, as edited by The Earwig (talk | contribs) at 07:42, 9 October 2016 (Discussion: re). The present address (URL) is a permanent link to this revision, which may differ significantly from the current revision.

Operator: Evad37 (talk · contribs · SUL · edit count · logs · page moves · block log · rights log · ANI search)

Time filed: 05:33, Saturday, September 3, 2016 (UTC)

Automatic, Supervised, or Manual: Automatic

Programming language(s): SPARQL + PetScan + AWB

Source code available: Yes, see function details below

Function overview: Adds {{Attached KML}} to articles which have KML files available through Wikidata

Links to relevant discussions (where appropriate): Template talk:Attached KML § Proposal: Use Wikidata and new module

Edit period(s): Around once a week

Estimated number of pages affected: ~450 for initial run, probably much less in subsequent runs (depends on KML creation rate on other wikis)

Exclusion compliant (Yes/No): Not through {{bots}} Yes, per below

Already has a bot flag (Yes/No): No

Function details:

  • (1) Get a list of Wikidata items which have both a KML file and an article on English Wikipedia.
    Done manually with the following SPARQL query to Wikidata Query Service:
SELECT ?article
WHERE
{
        ?article schema:about ?item 
        ;       schema:isPartOf <https://en.wikipedia.org/> .
        ?item wdt:P3096 ?kml .
        SERVICE wikibase:label { bd:serviceParam wikibase:language "en" }
}
Output saved as CSV, opened with MS Excel. Titles extracted from URLs using formula like =LEFT(MID(A2,31,999),LEN(A2)-30). But these are precent-encoded, so then decode them using a web-based URL decoder.

Notes:

  • Applying here per Special:Diff/737483526
  • The page at User:KMLbot would have info like what I started drafting at User:Evad37/KML, but adjusted to match whatever approval is given for the bot.
  • The bot would be not only be exclusion-compliant through {{bots}}, but would also be exclusion-compliant through the (yet to be created) {{No KML}} template populating [[Category:Pages which should not use KML from Wikidata]] (hidden tracking category also yet to be created). This allows excluded pages to be filtered out with the PetScan query, would enable tracking of such article through the category, and would encourage editors to provide a reason why the KML shouldn't be used (so that problems could possibly be fixed for all wikis, rather than just ignored at the English Wikipedia). - Evad37 [talk] 05:33, 3 September 2016 (UTC)[reply]

Discussion

Unless you turn the option off, AWB is exclusion compliant via {{bots}}. — JJMC89(T·C) 05:55, 3 September 2016 (UTC)[reply]

Oh, okay. Didn't realise that, but I don't think it makes a big difference to the proposal. - Evad37 [talk] 06:25, 3 September 2016 (UTC) Adjusted above - Evad37 [talk] 06:27, 3 September 2016 (UTC)[reply]

Approved for trial (50 edits). Please provide a link to the relevant contributions and/or diffs when the trial is complete. — Earwig talk 17:16, 3 September 2016 (UTC)[reply]

Trial complete. See edits. Notes:
  • Edits 1-5: edit summary malformed
  • Edits 6-10: disambiguation pages included - because typo in petscan (capital D)
  • Edits 11-14: tried to fix in petscan - something went wrong, all of these had no KML in wikidata - reverted these edits
  • Edits 15-20: Started over (Wikidata query, decode titles, petscan query, save as UTF8 text file). Generally okay, but AWB inconsistent in adding new lines - some have a single space (as intended), some with 2 (which makes a gap in rendered page), some with none (??) - may be due to extra lines being rearranged by "Sort meta data after".
  • Edits 21-25: Try option "use 0 newlines" - all appears to be okay now;
  • Edits 25-50: Went ahead with remainder of trial edits. Spot-checked about 15 of these while the bot was editing and saw no problems - Evad37 [talk] 00:44, 6 September 2016 (UTC)[reply]
Some further notes: I left the KML files on the disambig pages in place. While I don't think dismbig pages are good candidates for automated KML addition, as what is ambiguous in one language might not be in another, in these cases the KML files did match the disambig listings. I reverted the bot's edit to Field of Mars ([1]), a set index article. With similar reasoning to disambigs, I'll exclude set index articles in future runs (by putting Category:All set index articles in the petscan query). All other edits have functioned as expected, and as of now (6 days later), none have been reverted, and no-one's posted anything at the bot's talk page. - Evad37 [talk] 04:14, 12 September 2016 (UTC)[reply]
A user has requested the attention of a member of the Bot Approvals Group. Once assistance has been rendered, please deactivate this tag by replacing it with {{t|BAG assistance needed}}. - Evad37 [talk] 07:18, 16 September 2016 (UTC)[reply]
@The Earwig: - Evad37 [talk] 00:36, 26 September 2016 (UTC)[reply]
@Evad37: Hmm. Do you think it's placing the template in the best location? This and a lot of others looks fairly unbalanced. Also, what's going on with the unreferenced tag here? — Earwig talk 01:03, 26 September 2016 (UTC)[reply]
The movement of unreferenced tags might be an AWB bug, I've reported it at WT:AWB#Unreferenced_stub_template (I think that might be AWB mistaking {{unreferenced stub}} for a stub tag, and thus resorting it to the end of the article.)
With regards to the location, the absolute ideal location could vary quite a lot based on what else is on the page. It should go somewhere under the last heading, and come before the categories and stub templates, but whether it would be better above or below the navboxes really depends on what exactly is above the navboxes. If its just a simple bulleted list, then it can float to the right of the list. But if there's already one or more floating-box templates, or a multi-column list (e.g. reflist), or if its a short article with a long infobox, placement above the navboxes would cause excessive whitespace. (For example, these pages which had the KML box manually added below the navboxes some years ago: Karrinyup Road, West Coast Highway, Perth.) But then again, if there's just one floating box template and 6 or so items in a bulleted list, then it may be better to place it immediately after the existing box, to float in space available on the right of the list. I don't think a bot can be much good at making these cosmetic decisions – I'd rather the bot place the template in an acceptable position (if not 100% optimal), that gives the links to readers without causing big whitespace issues, and allow humans editors to make aesthetic choices when further editing of the article occurs. - Evad37 [talk] 02:24, 26 September 2016 (UTC)[reply]
@The Earwig: Got an answer from WT:AWB – if I enable genfixes, then the {{unreferenced stub}} template redirect will be bypassed and replaced with {{unreferenced}}, but left where it is rather than being moved to the end of the article. This seems to work, I tested it by getting AWB to generate a diff (but not saving) for Nishiyatsushiro District, Yamanashi:
(the metadata sorting doesn't seem to work in userspace, otherwise I would have made a sandbox edit to show you) - Evad37 [talk] 04:17, 29 September 2016 (UTC)[reply]

@Evad37: Okay. Sorry, I haven't been around much lately, so this is going slowly. The other thing I notice is that most of the tagged pages are towns (where the KML file shows the town boundary) not routes like the help pages describe their use to be. Do you think we have consensus to add these? — Earwig talk 23:10, 8 October 2016 (UTC)[reply]

I think there is already an informal consensus to use KML files for polygons (areas) as well as lines (routes), as evidenced by the existing uses of KML for articles like Beaver Island State Park, Central Park, Des Moines, Iowa, and List of postcode areas in the United Kingdom. I think it is more that the documentation is out of date, and thus also the help page (as it was based on the template doc). - Evad37 [talk] 03:13, 9 October 2016 (UTC)[reply]
@The Earwig: I updated the template documentation and the help page to mention polygon features - Evad37 [talk] 03:24, 9 October 2016 (UTC)[reply]
Hm, are you sure about Central Park? I see it removed by an editor before the bot added it. (Ping epicgenius for input.) — Earwig talk 07:41, 9 October 2016 (UTC)[reply]