User:Legobot/Requests

From Wikipedia, the free encyclopedia
  • Replace web.archive.org* with {{wayback}}

 Checking...

  • Convert various regional google links (google.co.uk) to a standard google.com
  • Is this a case of "it's not broken, don't fix it"? Need to research whether it produces any actual change. Legoktm (talk) 16:42, 28 October 2012 (UTC)
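
A minimal sketch of the host normalization, to make it easier to test whether the rewrite changes anything in practice. The pattern and the function name are assumptions for illustration, not the bot's actual code. It only touches google.<cc>, google.co.<cc> and google.com.<cc> hosts (with or without www.) and leaves other subdomains such as books. alone.

```python
import re

# Hypothetical pattern: a regional Google host at the start of the URL,
# e.g. google.de, www.google.co.uk, google.com.au. Plain google.com does not match.
REGIONAL_GOOGLE = re.compile(
    r"^(?P<scheme>https?://)(?:www\.)?google\.(?:com?\.)?[a-z]{2}(?=[/?#]|$)",
    re.IGNORECASE,
)


def normalize_google_host(url):
    """Rewrite a regional Google host to www.google.com; return other URLs unchanged."""
    return REGIONAL_GOOGLE.sub(r"\g<scheme>www.google.com", url, count=1)


if __name__ == "__main__":
    print(normalize_google_host("http://www.google.co.uk/search?q=x"))
```

Path and query string are carried over unchanged, so whether the resulting page is really equivalent still needs checking by hand.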

Convert google books to real book citations

  • Find a google books URL, turn it into a {{cite book}}
  • Use the Google Books API ([1]). Combine with isbndb.com or another free source? (Is this even needed?)
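
A rough sketch of the two offline steps: pulling the volume id out of a Google Books URL and formatting a {{cite book}} from a metadata dict. The function names are hypothetical, and the dict keys (title, authors, publisher, publishedDate, industryIdentifiers) are assumed to follow the shape of a Google Books API volume record; the API request itself is left out.

```python
from urllib.parse import urlparse, parse_qs


def google_books_id(url):
    """Return the volume id from a Google Books URL, or None if there is none."""
    parsed = urlparse(url)
    host = (parsed.hostname or "").lower()
    if not host.startswith("books.google."):
        return None
    ids = parse_qs(parsed.query).get("id")
    return ids[0] if ids else None


def cite_book(info, url):
    """Format a {{cite book}} from a metadata dict; empty fields are skipped."""
    # Assumed record shape: a list of {"type": ..., "identifier": ...} entries.
    isbn = next(
        (i.get("identifier") for i in info.get("industryIdentifiers", [])
         if i.get("type") in ("ISBN_13", "ISBN_10")),
        None,
    )
    fields = [
        ("author", "; ".join(info.get("authors", []))),
        ("title", info.get("title")),
        ("publisher", info.get("publisher")),
        ("date", info.get("publishedDate")),
        ("isbn", isbn),
        ("url", url),
    ]
    body = " ".join(f"|{name}={value}" for name, value in fields if value)
    return "{{cite book " + body + "}}"
```

Values are inserted as-is, so a title containing a pipe character would need escaping before this is used on real pages.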


 Approved. Will start doing batch runs in the next few days. Legoktm (talk) 17:31, 29 October 2012 (UTC)

Identify links of the following form that are not inside a {{wayback}} or {{WebCite}} template, and are not the url= parameter of a citation template that also has an archiveurl= parameter:

  • ap.google  Done
  • canadianpress.google  Done
  • ukpress.google.com  Done
  • .geocities.com  On hold: some Geocities Plus sites are still up (e.g. [2])
  • .btinternet.co.uk  Done
  • .btinternet.com  Done

and add a {{dead link}} tag. Sfan00 IMG (talk) 12:08, 20 October 2012 (UTC)
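
A simplified sketch of the tagging step, under stated assumptions: it only handles bracketed external links, skips a link that is directly followed by {{dead link}}, {{wayback}} or {{WebCite}}, and does not look inside citation templates for archiveurl= (that would need a real wikitext parser). The names are hypothetical. Geocities is left out of the host list because it is on hold.

```python
import re
from urllib.parse import urlparse

# Hosts from the list above that are marked as done.
DEAD_HOSTS = (
    "ap.google",
    "canadianpress.google",
    "ukpress.google.com",
    ".btinternet.co.uk",
    ".btinternet.com",
)

BRACKETED_LINK = re.compile(r"\[(https?://[^\s\]]+)[^\]]*\]")
ALREADY_HANDLED = re.compile(r"\s*\{\{\s*(dead link|wayback|webcite)", re.IGNORECASE)


def host_is_dead(url):
    """True if the URL's host contains one of DEAD_HOSTS on label boundaries."""
    host = (urlparse(url).hostname or "").lower()
    padded = "." + host + "."
    return any("." + p.strip(".") + "." in padded for p in DEAD_HOSTS)


def tag_dead_links(wikitext):
    """Append {{dead link}} after bracketed links to dead hosts, once only."""
    out = []
    pos = 0
    for m in BRACKETED_LINK.finditer(wikitext):
        out.append(wikitext[pos:m.end()])
        pos = m.end()
        if host_is_dead(m.group(1)) and not ALREADY_HANDLED.match(wikitext, pos):
            out.append("{{dead link}}")
    out.append(wikitext[pos:])
    return "".join(out)
```

Running the function twice on the same text adds no second tag, which matters for a bot that revisits pages.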

How many more press things does Google have??? Legoktm (talk) 01:24, 12 November 2012 (UTC)


Links matching http://www.google.com/buzz* appear to be dead. Sfan00 IMG (talk) 13:12, 15 November 2012 (UTC)

Further:

Flagging 'Tertiary Sources'

Identify (and, if appropriate, comment out pending review) links to tertiary sources such as:

  • Google Answers (answers.google.*)
  • Yahoo Answers
  • Google Knol (Knol.google.*)

Any others people know of? 13:17, 20 October 2012 (UTC)

I don't like the idea of just commenting out a certain reference. Maybe a database report or list of pages for humans to work on instead? Legoktm (talk) 16:35, 28 October 2012 (UTC)

TAP Bot

 Will code...

 On hold

The links listed here:

http://en.wikipedia.org/w/index.php?title=Special%3ALinkSearch&target=goo.gl
http://en.wikipedia.org/w/index.php?title=Special%3ALinkSearch&target=g.co

should probably be expanded to full ones. Sfan00 IMG (talk) 17:33, 22 October 2012 (UTC)

  • API at [3] Legoktm (talk) 17:34, 22 October 2012 (UTC)
    • Code has been written (GitHub). Need to research why g.co links are not blacklisted, and exclude goo.gl/maps links (see below)
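
This is not the code mentioned above, just a small sketch of the replacement logic with the goo.gl/maps exclusion. The lookup itself is passed in as a `resolve` callable (a hypothetical stand-in for whatever API or redirect check is used), so the sketch makes no network requests.

```python
import re

SHORT_LINK = re.compile(r"https?://(?:goo\.gl|g\.co)/[^\s\]|<>}]+", re.IGNORECASE)
MAPS_LINK = re.compile(r"https?://goo\.gl/maps(?:/|$)", re.IGNORECASE)


def expand_short_links(wikitext, resolve):
    """Replace goo.gl and g.co links with whatever resolve(url) returns.

    resolve is a caller-supplied function returning the full URL, or None
    when the short link cannot be expanded. goo.gl/maps links are skipped.
    """
    def replace(match):
        url = match.group(0)
        if MAPS_LINK.match(url):
            return url  # handled separately by the maps task
        return resolve(url) or url

    return SHORT_LINK.sub(replace, wikitext)


if __name__ == "__main__":
    # A dict lookup stands in for the real resolver here.
    table = {"http://goo.gl/abc": "http://example.org/page"}
    print(expand_short_links("[http://goo.gl/abc x] [http://goo.gl/maps/xyz m]", table.get))
```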

 Will code...

Implementation brainstorm:

  • If a {{coord}} is already present, skip and log
  • Search for /maps/ links, expand them into full links, and extract the coordinates or search query
    • If coordinates, parse them, add {{coord}} to the top, and log
    • If search query, use Nominatim to find coordinates
      • If only one result, apply {{coord}} to the top, and log
      • If multiple results, post a note on the talk page with copy-and-paste code for {{coord}} that can be applied by a human. Need a hidden string (the maps URL?) to avoid re-posts. (Log once done)
        • Can any smart heuristics be used here? E.g. if we know the subject is in California and results for both California and Illinois show up, the bot auto-chooses California
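
The first branch of the brainstorm (coordinates versus search query) could look roughly like the sketch below. It assumes the expanded link carries the position in an `ll=lat,lon` query parameter or the search text in `q=`; other URL shapes are not handled, the Nominatim lookup is left out, and the function names are made up for illustration.

```python
from urllib.parse import urlparse, parse_qs


def classify_maps_url(url):
    """Return ("coord", (lat, lon)), ("query", text), or None for a maps URL."""
    params = parse_qs(urlparse(url).query)
    # Coordinates may appear in ll= or be typed directly into q=.
    for key in ("ll", "q"):
        parts = params.get(key, [""])[0].split(",")
        if len(parts) != 2:
            continue
        try:
            lat, lon = float(parts[0]), float(parts[1])
        except ValueError:
            continue
        if -90 <= lat <= 90 and -180 <= lon <= 180:
            return ("coord", (lat, lon))
    query = params.get("q", [""])[0].strip()
    return ("query", query) if query else None


def coord_template(lat, lon):
    """Format decimal coordinates as a {{coord}} call; placement is up to the caller."""
    return "{{coord|%s|%s}}" % (lat, lon)
```

A "query" result would then go to the geocoding step, and a None result is a candidate for skip and log.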

Questions:

  • Where does {{coord}} get placed?
  • ...

Template parameters for AWB

Redirected media cleanup

 Checking... There is a database report that lists unused or low-use image redirects.

Would it be possible for a bot to periodically run through this list, updating if

  • Image is used in a mainspace article.
  • Image has been moved to a new name
  • Image is used in articlespace under the old name.

Alongside this change, the bot should carry out general fixes. Sfan00 IMG (talk) 22:34, 25 October 2012 (UTC)

Combine with Wikipedia_talk:FMR#Have_a_bot_fix_backlinks_to_moved_files? Legoktm (talk) 16:44, 28 October 2012 (UTC)

Withdrawn BRFAs that were good ideas

  • Some have the format: //google.com/search&q=cache
Further on this:

http://webcache.googleusercontent.com/search?q=cache:07_jk2n-Yt0J:the.honoluluadvertiser.com/current/ln/150photo/20061108 +monica+boyar+photo &hl=en &ct=clnk &cd=5 &gl=us &client=firefox-a\Monica

The original link is: http://the.honoluluadvertiser.com/current/ln/150photo/20061108

So it seems you need to extract the text between the third colon and the first + symbol.

Then the link needs de-escaping, and a check made for whether it returns a 404 (in which case a [dead link] tag should be added outside the accompanying [ ] brackets).
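
A minimal sketch of the extraction and de-escaping steps described above. It assumes the cache link has the q=cache:<id>:<target>+<terms> shape of the example, and that a target without a scheme should get http:// prepended; the 404 check is left out since it needs a live request. The function name is hypothetical.

```python
import re
from urllib.parse import unquote

# Text after the cache id's colon, up to the first "+", "&" or whitespace.
CACHE_TARGET = re.compile(r"[?&]q=cache:[^:+\s]*:([^+\s&]+)")


def original_from_cache(cache_url):
    """Return the cached page's original URL, or None if the link has another shape."""
    m = CACHE_TARGET.search(cache_url)
    if not m:
        return None
    target = unquote(m.group(1))  # de-escape %xx sequences
    if not re.match(r"https?://", target, re.IGNORECASE):
        target = "http://" + target  # assumption: cached targets default to http
    return target


if __name__ == "__main__":
    example = (
        "http://webcache.googleusercontent.com/search?q=cache:07_jk2n-Yt0J:"
        "the.honoluluadvertiser.com/current/ln/150photo/20061108+monica+boyar+photo&hl=en"
    )
    print(original_from_cache(example))
```

Cache links without an id segment (q=cache:example.com/page) return None here and would need their own pattern.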


Sfan00 IMG (talk) 12:06, 15 November 2012 (UTC)

Use Google subdomains

  • google.com/mars --> mars.google.com
  • google.*/books --> books.google.com
  • www.google.*/books --> books.google.com