Operator: —— Eagle101Need help?
Ok, before you folks freak out at the bot name, let me state first that it will not edit articles. Let me start of with the basic stuff. Its programmed in perl with the assistance of perlwikipedia, and my own framework that builds ontop of it. In addition the bot is making use of the aspell library. The bot is operated by the programmer, that is me. SpellCheckerBot will run daily, probably a category or two a day. It should not need to exceed more then about 2-3 edits a minute. As this proposal has come up before let me explain the function details.
The bot will load pages from a category, and do some preprocessing before applying the spellcheck. First it will find all words found inside of blue wikilinks (those that have articles) and add those to a temporary spelling database, to avoid flagging things like names, places ect. The bot will also avoid anything in ALL CAPS, or inside of <code> or other related tags. It will also ignore anything that is bolded or in italics. (also added to the temporary spelling database). All output will go to a main list of entries, that looks something like: article || word -- suggestions. I have done some trials on articles on my computer, and it looks to be fairly decent, getting about 3 wrong for every 100 reports, though I'd like to improve this of course, this is only after about 4 hours experimentation. :)
Depending on how successful it is, I might make up a signup page where users can sign up for X spelling errors to be delivered to them on a particular day of the week. There will also likely be a page where users who have signed up can submit common errors or words that the bot needs to have added to its database. —— Eagle101Need help? 08:22, 4 June 2007 (UTC)[reply]
So it edits in a userspace? E talk 08:24, 4 June 2007 (UTC)[reply]
Approved for trial. Please provide a link to the relevant contributions and/or diffs when the trial is complete. - 250 spelling errors to be reported to a subpage under the bot's user space, with a big flashy warning at the top (keep edit rate < 2 per min). Martinp23 15:44, 5 June 2007 (UTC)[reply]