User:Legobot/Requests
Requests for work to be done by Legobot. Mainly used so I don't forget anything and for requests on IRC. You should probably talk to me before adding anything here.
Convert wayback links
- Replace web.archive.org* links with {{wayback}}
Regional Google links
- Convert various regional google links (google.co.uk) to a standard google.com
- List_of_google_domains may be of use.
- Is this a case of "if it's not broken, don't fix it"? Need to research whether it produces any actual change. Legoktm (talk) 16:42, 28 October 2012 (UTC)
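A rough sketch of what the host rewrite could look like, in case it turns out to be worth doing. The regex is only a guess at what a regional Google host looks like (a real run would check hosts against List_of_google_domains), and `normalize_google_host` is a made-up name, not something in Legobot:

```python
import re
from urllib.parse import urlsplit, urlunsplit

# Hypothetical pattern: optional subdomains, then google.<cc>, google.co.<cc>
# or google.com.<cc>. It is an approximation, not the authoritative domain list.
REGIONAL_GOOGLE = re.compile(
    r"^(?P<sub>(?:[a-z0-9-]+\.)*)google\.(?:co\.|com\.)?[a-z]{2}$", re.I
)

def normalize_google_host(url):
    """Return url with a regional Google host swapped for google.com."""
    parts = urlsplit(url)
    host = parts.hostname or ""
    match = REGIONAL_GOOGLE.match(host)
    if not match:
        return url  # not a regional Google host, leave untouched
    new_host = match.group("sub") + "google.com"
    # Note: any explicit port in the original URL is dropped here.
    return urlunsplit((parts.scheme, new_host, parts.path, parts.query, parts.fragment))
```

Links already on google.com come back unchanged, so the function can be run over every external link on a page and only real changes need saving.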
Convert google books to real book citations
- Find a Google Books URL, turn it into a {{cite book}}
- Use Google Books API ([1]) Combine with isbndb.com or another free source? (is this even needed?)
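A minimal sketch of the two offline pieces of this task: pulling the volume id out of a Google Books URL and formatting a {{cite book}} from metadata. The metadata lookup itself (Google Books API, isbndb.com or otherwise) is left out. The `meta` dict, its keys and both function names are assumptions for illustration only:

```python
from urllib.parse import urlsplit, parse_qs

def google_books_id(url):
    """Return the id= value of a Google Books URL, or None if it is not one."""
    parts = urlsplit(url)
    host = parts.hostname or ""
    if not (host.startswith("books.google.") or parts.path.startswith("/books")):
        return None
    ids = parse_qs(parts.query).get("id")
    return ids[0] if ids else None

def cite_book(meta, url):
    """Format {{cite book}} from a hypothetical metadata dict; empty fields are skipped."""
    fields = [
        ("title", meta.get("title")),
        ("author", meta.get("author")),
        ("publisher", meta.get("publisher")),
        ("year", meta.get("year")),
        ("isbn", meta.get("isbn")),
        ("url", url),
    ]
    body = " |".join("%s=%s" % (key, value) for key, value in fields if value)
    return "{{cite book |" + body + "}}"
```

Values containing a pipe character would break the template and would need escaping before this is used on real pages.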
Dead link tagging
- Approved. Will start doing batch runs in the next few days. Legoktm (talk) 17:31, 29 October 2012 (UTC)
Identify links of the following form which are not inside a {{wayback}} or {{WebCite}}, or used as a url= param for a citation template where an archiveurl= param is used:
- ap.google Done
- canadianpress.google Done
- ukpress.google.com Done
- .geocities.com On hold Some Geocities Plus sites are still up (ex: [2])
- .btinternet.co.uk Done
- .btinternet.com Done
and add a {{dead link}} tag.
Sfan00 IMG (talk) 12:08, 20 October 2012 (UTC)
http://www.google.com/buzz* appear to be dead links Sfan00 IMG (talk) 13:12, 15 November 2012 (UTC)
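One way the detection step could be sketched. It assumes templates are not nested (a simplification), it does not check for an existing {{dead link}} tag, and it only lists candidate URLs rather than editing the page. Geocities is left out because it is on hold. All names here are hypothetical, not Legobot's actual code:

```python
import re
from urllib.parse import urlsplit

# Host patterns marked Done in the list above (.geocities.com is on hold).
DEAD_PATTERNS = ("ap.google", "canadianpress.google", "ukpress.google.com",
                 ".btinternet.co.uk", ".btinternet.com")

TEMPLATE = re.compile(r"\{\{[^{}]*\}\}")   # simplification: non-nested templates only
URL = re.compile(r"https?://[^\s\]|}<]+")

def is_listed_dead(url):
    """True if the URL's host matches one of the listed patterns."""
    dotted = "." + (urlsplit(url).hostname or "").lower()
    needles = (p if p.startswith(".") else "." + p for p in DEAD_PATTERNS)
    return any(needle in dotted for needle in needles)

def is_archived_context(template_text):
    """True for {{wayback}}, {{WebCite}}, or a template with a non-empty archiveurl=."""
    name = template_text[2:].split("|", 1)[0].strip().lower()
    if name in ("wayback", "webcite"):
        return True
    return re.search(r"\|\s*archiveurl\s*=\s*[^\s|}]", template_text) is not None

def untagged_dead_links(wikitext):
    """List dead-host URLs that are not already covered by an archive template."""
    protected = [m.span() for m in TEMPLATE.finditer(wikitext)
                 if is_archived_context(m.group(0))]
    hits = []
    for m in URL.finditer(wikitext):
        if not is_listed_dead(m.group(0)):
            continue
        if any(start <= m.start() < end for start, end in protected):
            continue
        hits.append(m.group(0))
    return hits
```

Returning a list instead of editing in place keeps the tagging (and where exactly the tag goes relative to brackets and refs) as a separate step that can be reviewed per batch.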
Further:
Flagging 'Tertiary Sources'
Identify (and, if appropriate, comment out pending review) links to tertiary sources such as:
- Google Answers (answers.google.*)
- Yahoo Answers
- Google Knol (Knol.google.*)
Any others people know of? 13:17, 20 October 2012 (UTC)
- I don't like the idea of just commenting out a certain reference, maybe a database report/list of pages for humans to work on? Legoktm (talk) 16:35, 28 October 2012 (UTC)
TAP Bot
- TAP Bot's AFD task. Thine Antique Pen (talk) 10:18, 21 October 2012 (UTC)
Link expansion
The links here:-
http://en.wikipedia.org/w/index.php?title=Special%3ALinkSearch&target=goo.gl http://en.wikipedia.org/w/index.php?title=Special%3ALinkSearch&target=g.co
should probably be expanded to full ones. Sfan00 IMG (talk) 17:33, 22 October 2012 (UTC)
- API at [3] Legoktm (talk) 17:34, 22 October 2012 (UTC)
- Code has been written (github). Need to do research on why g.co links are not blacklisted, and exclude goo.gl/maps links (below)
Convert goo.gl/maps/ links to use coord
- Idea: convert /maps/ links to use {{coord}}. Probably another request. Legoktm (talk) 17:42, 22 October 2012 (UTC)
- API for that is [4]. Legoktm (talk) 20:11, 22 October 2012 (UTC)
- Licensing problems.
- OSM alternative: https://wiki.openstreetmap.org/wiki/XAPI, openstreetmap:Nominatim
Implementation brainstorm:
- If a {{coord}} is already present, skip and log
- Search for /maps/ links, expand them into full links, and extract coordinates or search query
  - If coordinates, parse them, add {{coord}} to the top, and log
  - If search query, use Nominatim to find coordinates
    - If only one result, apply {{coord}} to the top, and log
    - If multiple results, post a note on the talk page with copy/paste code for {{coord}} that can be applied by a human. Need a hidden string (the maps url?) to avoid re-posts. (log once done)
      - Any smart thinking can be used here? Ex: If we know subject is in California, two results for California and Illinois show up, bot auto-chooses California
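The brainstorm above could be sketched roughly as follows. It assumes the expanded URL carries coordinates in an `ll=` or `q=` query parameter and a search string in `q=`, which is only my assumption about the URL shape. The Nominatim lookup and the page edits are not included, and what to do with zero results is not decided above, so the sketch just skips. All function names and action labels are made up:

```python
import re
from urllib.parse import urlsplit, parse_qs

COORD = re.compile(r"^\s*(-?\d+(?:\.\d+)?)\s*,\s*(-?\d+(?:\.\d+)?)\s*$")

def classify_maps_url(expanded_url):
    """Return ("coords", (lat, lon)), ("query", text) or ("unknown", None)."""
    params = parse_qs(urlsplit(expanded_url).query)
    for key in ("ll", "q"):  # assumed parameter names
        for value in params.get(key, []):
            match = COORD.match(value)
            if match:
                lat, lon = float(match.group(1)), float(match.group(2))
                if -90 <= lat <= 90 and -180 <= lon <= 180:
                    return ("coords", (lat, lon))
    if params.get("q"):
        return ("query", params["q"][0])
    return ("unknown", None)

def plan_action(has_coord, kind, results=()):
    """Map the brainstormed rules to an action label; results are geocoder hits."""
    if has_coord:
        return "skip"            # {{coord}} already present: skip and log
    if kind == "coords":
        return "add_coord"
    if kind == "query":
        if len(results) == 1:
            return "add_coord"
        if len(results) > 1:
            return "talk_page_note"
    return "skip"                # zero results or unknown URL: assumption, not decided above

def coord_template(lat, lon):
    """Decimal-degree form of {{coord}}; placement on the page is still an open question."""
    return "{{coord|%s|%s}}" % (lat, lon)
```

Keeping the decision in `plan_action` separate from the parsing means the "smart thinking" idea (preferring a result in a known region) could later be added by filtering `results` before the call.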
Questions:
- Where does {{coord}} get placed?
- ...
Template parameters for AWB
- Implement Wikipedia:AutoWikiBrowser/Rename template parameters into awb_gen_fixes.py
Redirected media cleanup
Checking... There is a database report that lists unused or low use image redirects.
Would it be possible for a bot to periodically run through this list, updating links if:
- Image is used in a mainspace article.
- Image has been moved to a new name
- Image is used in articlespace under the old name.
Alongside this change the bot should carry out general fixes. Sfan00 IMG (talk) 22:34, 25 October 2012 (UTC)
- Combine with Wikipedia_talk:FMR#Have_a_bot_fix_backlinks_to_moved_files? Legoktm (talk) 16:44, 28 October 2012 (UTC)
Withdrawn BRFA's that were good ideas
- Wikipedia:Bots/Requests for approval/StubSyncBot - integrate with WPTagger?
- Wikipedia:Bots/Requests for approval/HairBot
De-cache Google links
- Some have format of: //google.com/search&q=cache
- Further on this
http://webcache.googleusercontent.com/search?q=cache:07_jk2n-Yt0J:the.honoluluadvertiser.com/current/ln/150photo/20061108 +monica+boyar+photo &hl=en &ct=clnk &cd=5 &gl=us &client=firefox-a\Monica
The original link is: http://the.honoluluadvertiser.com/current/ln/150photo/20061108
So it would seem you need to extract the part of the link between the third colon and the first + symbol.
Then the link needs de-escaping, and a check made for the link being 404 (in which case a [dead link] tag should be added outside the accompanying [ ] tags).
Sfan00 IMG (talk) 12:06, 15 November 2012 (UTC)
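The extraction rule described above (take what sits between the third colon and the first +, then de-escape) might look like this. Prepending http:// when the cached target has no scheme is my assumption, and the 404 check is left out since it needs a live request. `original_from_cache` is a hypothetical name:

```python
import re
from urllib.parse import unquote

# q=cache:<cache id>:<target>, where the target ends at the first "+" or "&".
CACHE = re.compile(r"[?&]q=cache:[^:+&]*:([^+&\s]+)")

def original_from_cache(cache_url):
    """Return the original URL embedded in a Google cache link, or None."""
    match = CACHE.search(cache_url)
    if not match:
        return None
    target = unquote(match.group(1))   # de-escape %xx sequences
    if "://" not in target:
        target = "http://" + target    # assumption: a missing scheme means http
    return target
```

Cache links without a cache id (q=cache:example.com/page) return None here and would need a second pattern.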
Use Google subdomains
- google.com/mars --> mars.google.com
- google.*/books --> books.google.com
- www.google.*/books --> books.google.com