Revision as of 18:27, 18 September 2006 editBluemoose (talk | contribs)29,151 editsNo edit summary← Previous edit | Latest revision as of 15:32, 25 December 2024 edit undoLowercase sigmabot III (talk | contribs)Bots, Template editors2,293,067 editsm Archiving 2 discussion(s) to Misplaced Pages talk:AutoWikiBrowser/Archive 34) (bot | ||
Line 1: | Line 1: | ||
{{AWB|notes={{Clickable button 2|Start a new discussion|url={{fullurl:Misplaced Pages talk:AutoWikiBrowser|action=edit§ion=new&dtenable=1}}|class=mw-ui-progressive|style=border-radius:4px; font-size:150%;}}}} | |||
{| class="infobox" | |||
This is the discussion page for the AutoWikiBrowser (AWB) project. It is also the place to discuss using the AWB program (for help, questions, or general inquiries about AWB). Specific guidelines on where to make particular reports or requests are provided in the ''']''' section below. Before asking a question, please refer to the '''read the ]''' below. | |||
{{archive box|auto=yes|search=yes|bot=lowercase sigmabot III|age=30}} | |||
{{User:HBC Archive Indexerbot/OptIn|target=Misplaced Pages talk:AutoWikiBrowser/Archive index|mask=Misplaced Pages talk:AutoWikiBrowser/Archive <#>|leading_zeros=0|indexhere=yes}} | |||
{{User:MiszaBot/config | |||
|maxarchivesize = 250K | |||
|counter = 34 | |||
|algo = old(30d) | |||
|archive = Misplaced Pages talk:AutoWikiBrowser/Archive %(counter)d | |||
| archiveheader = {{Talk archive navigation}} | |||
}} | |||
{{toc left}} | |||
{{clear}} | |||
= Before you post = | |||
{| class=wikitable | |||
|- | |- | ||
! Do you want to ... !! Please use | |||
!align="center"|]<br>] | |||
---- | |||
|- | |- | ||
| Report a bug or request a feature in AWB? || before . You do not need to create another account there; just log in with your global Wikimedia account. ] on how to report bugs and request features on Phabricator. | |||
| | |||
{{collapse top|1=Report a bug}} | |||
* ] | |||
Try to report bugs in the {{em|current}} version of the software. Update to the most recent version and check to make sure your bug has not already been reported on Phabricator. See for advice on how to write bug reports. | |||
* ] | |||
* ] | |||
* ] | |||
* ] | |||
* ] | |||
* ] | |||
* ] | |||
* ] | |||
|} | |||
Before posting anything related to non-], verify that the site is running a ''recent version'' of MediaWiki with enabled ]. Older versions of MediaWiki or without the Bot API are <em style="color:red;">not supported</em>. Be sure to mention the ''exact'' URL of your wiki. | |||
= <font size="6">Frequently asked questions</font> = | |||
{{collapse bottom}} | |||
*''When I start it up I get error "The application failed to initialize properly (0xc0000135). Click on OK to terminate the application."'' | |||
{{collapse top|Request a feature details}} | |||
*:This error means your computer does not have the .NET framework version 2 installed properly. | |||
Please use the feature request button to submit new feature requests. This format helps the developers track and manage requests efficiently. Before submitting, take a moment to search the archives—both and —to see if a similar request has already been discussed. | |||
{{collapse bottom}} | |||
|- | |||
| Report an incorrectly fixed typo? || ] | |||
|- | |||
| Request approval to use AWB? || ] | |||
|- | |||
| Ask a question about AWB or ask for help? || This page | |||
|}<!-- ] 00:55, 4 April 2114 (UTC) --> | |||
= Frequently asked questions = | |||
*''Will it ever work on linux?'' | |||
{{anchor|FAQ}}<!-- ] 00:55, 4 April 2114 (UTC) --> | |||
*:Probably not. | |||
{{collapse top|title=Frequently asked questions}} | |||
{{Misplaced Pages talk:AutoWikiBrowser/FAQ}} | |||
*''Does AWB work on other projects/languages?'' | |||
{{collapse bottom}} | |||
*:Many WikiMedia projects and languages are supported, see the "Select language and project" option in the file menu. Other languages will be added on request, though at the moment the interface is always in English. | |||
*''What interwiki link order does AWB use?'' | |||
*:It uses the orders specified at ], as all other bots do. | |||
*''I don't like or use Internet Explorer, please use FireFox instead.'' | |||
*:AWB does not use Internet Explorer, it does however happen to use the same web browser control that Internet Explorer does, the equivalent FireFox component does not provide the needed functionality. | |||
*''How do I open the page in another browser if I can't use the one in AWB?'' | |||
*:Right click on the edit box in the bottom right side of your screen. Select "Open page in browser" | |||
*''How do I edit a page that doesn't exist?'' | |||
*:Uncheck "Ignore non existing pages" in the "Skip articles" box. | |||
*''How do I skip certain articles?'' | |||
*:Use the "Skip if contains" and "Skip if doesn't contain" in the "(2) Set options" tab | |||
*''Can't you leave up a "stable" version, so I don't have to download new versions? | |||
*:It is important to keep people up to date with the latest versions, because their use of the software doesnt just affect them, but the whole of wikipedia. As any bugs that remain will be trivial, hopefully releases won't be so frequent anyway. | |||
=Discussion= | =Discussion= | ||
== Formatting templates == | |||
While I'm waiting to see what I can do on Wikinews with AWB, I'm trying out the MWiki-Browser. How would I go about formatting all occurrences of a template with it? | |||
From | |||
<nowiki>*{{source|url=http://somewhere.at.example.com|title=This is not the news you are looking for|author=|pub=Example.com|date=August 25, 2006}}</nowiki> | |||
To | |||
<nowiki>*{{source|url=http://somewhere.at.example.com</nowiki> | |||
<nowiki>|title=This is not the news you are looking for</nowiki> | |||
<nowiki>|author=</nowiki> | |||
<nowiki>|pub=Example.com</nowiki> | |||
<nowiki>|date=August 25, 2006}}</nowiki> | |||
And, as a (hopefully) minor feature request, can the protection of articles be performed from AWB. This is probably only of use to Wikinews where all articles 10 days old are protected. --] 08:19, 26 August 2006 (UTC) | |||
:AWB now works with wikinews. I think the best way to do that would be using the advanced find and replace to only do replacments inside templates, though i haven't really done anything like it before so i'm not sure what would be the best way. Protection of articles would currently be quite technically difficult. ] 08:37, 26 August 2006 (UTC) | |||
::Thanks for the information, I really need to read up on regex or find a friendly expert. :-) --] 10:56, 1 September 2006 (UTC) | |||
== Tagging talk pages of red-link articles == | |||
Kingbotk has had a few false positives, where I've tagged talk pages for articles deleted since I built my list. Looking at the message displayed above the edit box for new talk pages, I think AWB could very easily trap and avoid this. | |||
HTML: <pre>please verify that a page called <i><a href="/search/?title=There%27s_nothing_to_see_here%2C_move_along&action=edit" class="new" title="There's nothing to see here, move along">There's nothing to see here, move along</a></i> exists.</pre> | |||
HTML: <pre>"please verify that a page called <i><a href="/About_a_Book_Club_%28Hope_%26_Faith_episode%29" title="About a Book Club (Hope & Faith episode)">About a Book Club (Hope & Faith episode)</a></i> exists"</pre> | |||
It's a silly message really, because Mediawiki has had to look up whether the article exists or not!!! Anyway, it seems that class="new" is responsible for making the link red. | |||
Presuming that the message comes from the Mediawiki namespace somewhere and not from the PHP code directly, we can also leave a message on the Mediawiki talk page asking to be alerted of changes or of course just watchlist it :) --] 12:46, 26 August 2006 (UTC) | |||
:Stupid me... much easier way is to just check if the "article" tab is red or not. --] 10:11, 28 August 2006 (UTC) | |||
== OS X? == | |||
I'm not sure if this has been asked before, but is there a version of Mac OS X? ] 02:40, 27 August 2006 (UTC) | |||
:For there to be such a version the OS would need to support the .NET framework version 2 and have Internet Explorer. In other words, I doubt it. --] 09:59, 27 August 2006 (UTC) | |||
==Most (Mboverload) typos in one article?== | |||
Possibly ] "comunity → community (12), regulary → regularly, autorities → authorities (2), autority → authority, belived → believed (2), colaborators → collaborators (3), condemmed → cond" then the edit summary fins out of space. Approx 32 replacements. | |||
''] ]'' 08:51 ] ] (GMT). | |||
:Wow, that's mighty impressive, shame the article is probably going to get deleted! ] 10:15, 27 August 2006 (UTC) | |||
==New version ignoring option?== | |||
The new version seems to be enabling "add replacements to edit summary" even though I have this option disabled in the settings. It's just doing it anyway. Am I missing something or is this a bug? --] 13:19, 27 August 2006 (UTC) | |||
:It is a new bug, I improved the edit summary system and missed something, the older version is ok. thanks ] 13:25, 27 August 2006 (UTC) | |||
== changing link bug == | |||
here it only added a summary with no change, link should be changed like in here ] 06:30, 28 August 2006 (UTC) | |||
:It is most likely a bug in what ever find and replace strings were used, AWB has an option to ignore articles when no replacement was made anyway. ] 10:42, 28 August 2006 (UTC) | |||
==Nested square brackets bug== | |||
I quote ] who I think explains the symptom well | |||
<blockquote> | |||
The bot is causing some images to not show up. The image is Table 1 in the section called "Frequency of Incarceration." SmackBot deletes one of the brackets at the beginning of the external link in the caption. It also deletes the 3rd bracket at the end of the caption and link. It obviously is not recognizing the stacked brackets due to the combination of link and image coding ending at the same place. This is a serious problem because it is common to put a source link at the end of a sentence. Here is the correct image coding: <br><br> | |||
<nowiki> A U.S. Bureau of Justice Statistics report. The totals do not include people held in juvenile facilities. According to a 2006 OJJDP (Office of Juvenile Justice and Delinquency Prevention) report there were 97,000 held in juvenile facilities as of October 22, 2003. ]]</nowiki><br><br> | |||
SmackBot ends up with this coding below, and it causes the image to not show up:<br><br> | |||
<nowiki> A U.S. Bureau of Justice Statistics report. The totals do not include people held in juvenile facilities. According to a 2006 OJJDP (Office of Juvenile Justice and Delinquency Prevention) report there were 97,000 held in juvenile facilities as of October 22, 2003. ]</nowiki> | |||
</blockquote><br><br> | |||
Clearly a somewhat rare occurrence to have two external links in an image link, but there it is. Rgds, ''] ]'' 09:22 ] ] (GMT). | |||
:Ok, thanks I'll look into it. ] 09:38, 28 August 2006 (UTC) | |||
::Fixed by Martin, I believe, see ] below. ''] ]'' 20:39 ] ] (GMT). | |||
== Tip - Disappearing space on C: == | |||
This might be worthy of a mention on the project page. | |||
Since I started using AWB in anger, free space on C: has become an issue. I cleared 2GB of space and soon that was gone too. I searched for large files, cleared my internet cache regularly, but still the drive would be full. Well, exploring in Cygwin it would seem that IE creates a sh*tload of files in the Temporary Internet Files folder which ''don't get cleared even if you tell IE to clear it's cache''. What's more, they're hidden in Explorer even if you have it configured to show hidden files. Why they would do this I don't know - it's slightly sinister if you ask me - but, anyway, if you find that C: is gobbling up the gigs with no apparent cause this might be it. | |||
It's covered in more detail . The best tip seems to be: | |||
*Click Start, click Run, type the following command and click OK: | |||
:Shell:Cache\Content.IE5 | |||
--] 09:53, 28 August 2006 (UTC) | |||
:I haven't noticed a problem, but if it does does exist for some people it will only become noticed if they were doing 10s of 1000s of edits. The cache would probably clear itself after a period of time anyway, unless there is a massive flaw in how internet explorer works. ] 10:49, 28 August 2006 (UTC) | |||
::I'd call 386,156 files in ''just one'' of the subfolders - having clicked "clear cache" multiple times - a massive flaw! I'm only part of the way through clearing this crud and I've clawed back ''gigabytes''. This is insane! Anyrode, I hope the tip is useful to somebody, it's sure helped relieve my disk space issues. --] 10:55, 28 August 2006 (UTC) | |||
:::But it should be noted that your bot has made enourmous number of edits in a relatively short period of time, for the average user doing a few 1000 edits this will never be an issue. ] 11:09, 28 August 2006 (UTC) | |||
::::It's not that I'm blaming you Martin (unless you work on the MSIE team :)) but I think that a "clear cache" button which leaves several gigs of cached files in place, to the extent that a user's C drive is full and programs start crashing, is quite manifestly broken. Perhaps we'll have to agree to disagree on this point :) --] 19:38, 28 August 2006 (UTC) | |||
:::::For me clearing the cache of IE (in IE) clears "Shell:Cache\Content.IE5" (accessed by the procedure you've shown above). On a second note, AWB just uses the BrowserControl which is shared with IE. So we are rather bound to that with all the drawbacks/bugs. If you can provide a better control for browsing (or refer to one) or a tweak to the AWB code, you are of course very welcome :-). --] 08:48, 29 August 2006 (UTC) | |||
:::::::I know that - why does everyone seem to think I'm complaining?! Can't a guy indulge in a bit of gentle Microsoft bashing? :) --] 09:11, 29 August 2006 (UTC) | |||
::::::For your curiousity: The mozilla control which iirc can be downloaded , while nice, does not provide the critical functionality that AWB needs. ] 08:54, 29 August 2006 (UTC) | |||
:::::::Ooh. Interesting, thanks. --] 09:11, 29 August 2006 (UTC) | |||
:Thanks for the tip. ''] ]'' 10:20 ] ] (GMT). | |||
== small bug == | |||
There's a small bug if a URL is embedded in a image description link where AWB messes up the brackets: for example , you have <nowiki>]]</nowiki>, and awb removes one of the right square brackets and adds a left square bracket before the url. ] 10:45, 28 August 2006 (UTC) | |||
This is the same as two sections up, sorry :) ] 10:47, 28 August 2006 (UTC) | |||
:Fixed in newest release. ] 08:43, 29 August 2006 (UTC) | |||
== Weird bug with talk pages == | |||
I have a weird bug with AWB when prepending information to talk pages. When I want to prepend a msg to a list of talk pages, the diff blanks the whole page with just a "Modified" word. Here is the screenshot: ]. | |||
Is it me or is there something weird? | |||
Thanks, ] <sup>]</sup> 11:34, 28 August 2006 (UTC) | |||
:I'm trying to release a new version at the moment that cleans up a few issues, but sourceforge is giving me an "internal server error", I'm sure it will work soon. ] 11:45, 28 August 2006 (UTC) | |||
:Done it now, hopefully that will be the last release for a while. ] 11:48, 28 August 2006 (UTC) | |||
:: Woot, thanks!!! :) -- ] <sup>]</sup> 12:06, 28 August 2006 (UTC) | |||
== Auto-update? == | |||
How about a one button "upload new version" to make re-installing a snap when an existing version is superseded? ] ] 15:26, 28 August 2006 (UTC) | |||
:Not really possible while it is hosted at sourceforge, hopefully the frequency of releases will slow down now anyway. ] 08:43, 29 August 2006 (UTC) | |||
== Prepending to talk pages reloaded == | |||
] | |||
While testing my bot that currently delivers newsletters, I encountered a weird thing. While prepending to an un-existing page, well, nothing gets prepended. The diff is loaded but is not saved. And of course, the option "Ignore non-existing pages" is ''un''checked, as can be seen. And it works for a blank article talk page too. Obviously, it is only true for auto-mode. | |||
Is it a bug or a feature? | |||
Thanks, ] <sup>]</sup> 16:57, 28 August 2006 (UTC) | |||
:Hhhhmm, I tihnk it's fixed now, hopefully this really will be the last release for a while. ] 08:43, 29 August 2006 (UTC) | |||
== (Hopefully very easy) feature request == | |||
Hi Martin: can you create a way to add the contents of first-level subcategories to the article list? For example: when you make a list from a category, you get all the pages in the category plus the subcategories as part of the list. Do you think it would be possible to be able to double-click on the category in the list of articles to add the contents of that category to the list? Or something of the like...? It would be much easier than copy/paste, especially if you wanted the pages in all the subcategories of something with a huge amount of subcategories, such as ]. —<span style="font: small-caps 14px times; color: red;">] (])</span> 21:24, 28 August 2006 (UTC) | |||
:If you highlight the categories in the list and then open the context menu "Add selected to list..." and then "from category", this will get all the articles from those categories. ] 21:34, 28 August 2006 (UTC) | |||
::/me slaps himself in the head. Thanks :-) —<span style="font: small-caps 14px times; color: red;">] (])</span> 21:49, 28 August 2006 (UTC) | |||
== Unicode bug == | |||
When I was operating WinBot in 3.0.2.3 I was told that the unicodify in was a bad one. So I am wondering if there is a bug in here? Many thanks. --] <sup>(])</sup> 03:44, 29 August 2006 (UTC) | |||
: IOW, the bug discussed at ] appears to be back. ] 07:50, 29 August 2006 (UTC) | |||
:Ok, fixed in the newest verison. ] 08:43, 29 August 2006 (UTC) | |||
==Linux?== | |||
Can I run it on Linux, using Mono and Wine. --] 07:13, 29 August 2006 (UTC) | |||
:Afraid not. ] 08:43, 29 August 2006 (UTC) | |||
What about this: I have .NET farmework and IE6 instalated with Wine, mscoree.dll is also included in Mono. --] 13:50, 29 August 2006 (UTC) | |||
: You like to make things complicated, I see :)) -- ] <sup>]</sup> 13:59, 29 August 2006 (UTC) | |||
: Lol. It's not reasonable to expect Martin to support, erm, "esoteric" configurations like that. If you can get it to work, great - please report back - but the official line I suspect will remain the same :) --] 15:41, 29 August 2006 (UTC) | |||
== Plugins == | |||
===Feature requests=== | |||
*''(Being looked into)'' Access to XML settings. (If this isn't available I might try to go in the opposite direction, having the plugin control AWB's settings) | |||
*HasChanged boolean byval argument to ProcessArticle(), so that the plugin doesn't tell AWB to skip when ''AWB'' has made a change; ''or'' (easier) ignore the plugin's Skip value if AWB made a change | |||
*If the plugin has set a valid edit summary, have AWB not complain about empty edit summary box (but if in point above the Skip value is ignored and plugin returns an empty summary, AWB must use it's own summary) | |||
--] 15:49, 29 August 2006 (UTC) | |||
: Reading and writing AML settings is a possibility, actually changing the AWB settings in a definite no. Having the "HasChanged" variable would be tricky, I know things like that look easy but they are not, largely because it is fundamentally against how AWB works, i.e. if you are doing job x, then the article either needs job x doing (so save it) or it doesn't (so ignore it), also, anything that is done with find+replace wuithin AWB can be done easily in the plugin anyway. ] 16:50, 29 August 2006 (UTC) | |||
::True, true. Thanks. My plugin is working now anyway and hopefully will go into service later today - I have a backlog to catch up on! :) --] 17:15, 29 August 2006 (UTC) | |||
I'm going to be a pain in the arse now and suggest that - given your (well argued) line about moving all work to the plugin and not using AWB's skip/find/replace features at all, the code which calls the plugin ought to be moved back to where it was :) i.e. AWB gets article text, sends it to plugin for processing, and if plugin says skip that's the job done. What do you think? | |||
I took your advice by the way and moved my "skip this article" regex into the plugin. I hardly need worry about AWB settings now, it's all blank settings except for "make from file" and an edit summary of "Bot". --] 12:51, 30 August 2006 (UTC) | |||
:Well, you're the only person making a plugin, so i don't see the harm in moving it back. Also, I have added 3 methods to the interface for reading/writing XML and reseting the settings. It is a bit limited because of the complexity in dealing with plugins, I'll update my example when I have released the newest version. ] 12:57, 30 August 2006 (UTC) | |||
::Got the newest version, thanks. WriteXML() fires when saving settings (as expected), but ReadXML() doesn't ''seem'' to fire when loading settings. Bug? Also, under what circumstances is Reset() called? --] 18:17, 30 August 2006 (UTC) | |||
:::The read only fires when the XML node actually has some attributes. reset it called when the user clicks "reset" in the file menu. ] 18:28, 30 August 2006 (UTC) | |||
::::Cool. I shall now have a play with that, thanks Martin. --] 18:34, 30 August 2006 (UTC) | |||
Hopefully the last feature I ask for until you've had a rest :) In Initialise() could the plugin somehow get access to the options/start tabs? I'd like to add a tab which shows the status of my plugin and statistics - what it's doing, how many articles it's done/skipped/major edit/minor edit, etc etc. --] 13:55, 30 August 2006 (UTC) | |||
===Plugins in testing=== | |||
I have a plugin ready (]) and am testing it, should anybody be interested :) --] 15:49, 29 August 2006 (UTC) | |||
== Small categorisation bug == | |||
When recategorising articles, could you please fix it so that links that start ''] <sup>(])</sup> <sub>16:01, 29 August 2006</sub> | |||
:Why would you not want to change links like that? they will also need to be changed surely? ] 16:42, 29 August 2006 (UTC) | |||
::Well not if they're in discussion, changing someone else's comments. — ] <sup>(])</sup> <sub>16:44, 29 August 2006</sub> | |||
:::I don't see a problem with changing a link in someone's comments if leaving it would result in the link being incorrect. This ''doesn't'' apply to automated bots of course, as a human needs to check the context. --] 10:19, 30 August 2006 (UTC) | |||
::::Well I am mainly talking about an automated bot anyway, ], so an option here would be helpful. — ] <sup>(])</sup> <sub>13:00, 30 August 2006</sub> | |||
::::Not if I say "blah blah look at ] it should be a sub cat of ] blah blah.." and someone recats trees to graphs. ''] ]'' 21:40 ] ] (GMT). | |||
:::::Hence the "a human needs to check the context". A bot would get that edit wrong, a human shouldn't. --] 10:00, 31 August 2006 (UTC) | |||
==Another small categorisation issue== | |||
When categories have been foolishly placed in the middle of text it seems that their removal leads to extra carriage returns being inserted. If followed by spaces this can change formatting, e.g. . Regards, ''] ]'' 10:01 ] ] (GMT). | |||
== new feature == | |||
1) can this be implemented: change from i.e. to y ? to simplify the links | |||
2) why can't general fixes/unicodify (like removing underscores) be done (by AWB) before "find and replace" ? this causes a need of consideration all unicode/special characters into the regexes used into "find and replace" (special wikipedia's characters can't be matched by "find" until you know them exactly, but it's hard to cosider them everywhere!) | |||
] | |||
:1) Already a feature under "Apply general fixes" '''''<font color="darkblue">]</font>''''' 12:53, 30 August 2006 (UTC) | |||
:: read again, it's done after "find and replace" so in fact it wont work when matched string contains those characeters, general fixes will change to and not more] | |||
:::It has to be done after the find+replace or it can cause some complications. Links aren't simplified any more than they currently are because this can often lead to some strange looking links. ] 13:14, 30 August 2006 (UTC) | |||
:::Actually, in this case it is fairly easy to allow an option to apply before or after the general fixes. ] 13:27, 30 August 2006 (UTC) | |||
3) what about moving pages, can this be done ? (if it isn't already included) ] | |||
4) can erasing redundant spaces be included into general fixes ? for example into expressions like this "blblb ", " blabll", <nowiki>'' ddd '', ''' eee '''</nowiki> etc., also multiple spaces between the words would be erased as they aren't visible if more than 1 ] | |||
== Buglet == | |||
I thought my plugin had a bug, because it was skipping red-link talk pages. However, in debugging mode I found nothing wrong... then noticed that when I click "Auto save" AWB then automatically checks "Skip articles when no change made", which for some reason is causing those pages to skip. | |||
I've no idea why AWB would auto-check "Skip articles when no change made" but if there's no vital reason for doing so could you turn that off pse Martin? :) --] 13:43, 30 August 2006 (UTC) | |||
:Unfortunately it's not a bug, it's a defensive feature, otherwise some people have a nasty habit of setting a bot loose without it on and making a series of trivial edits. ] 13:48, 30 August 2006 (UTC) | |||
::lol, OK. Do any of the objects I get passed in Initialise() give me access to that checkbox? --] 13:53, 30 August 2006 (UTC) | |||
:::Answer to self, they ought to now I have access to the tabcontrol. --] 20:37, 31 August 2006 (UTC) | |||
==Skipping bug?== | |||
I just downloaded the new version and it seems to be skipping nearly everything, without regard to any setting as far as I can tell... Anyone know what I could be doing wrong, or if this could be a bug? --] 01:36, 31 August 2006 (UTC) | |||
:Probably you have "skip articles when no change made" selected? --] 10:09, 31 August 2006 (UTC) | |||
::Nope... argh! --] 14:31, 31 August 2006 (UTC) | |||
:::Try the latest vesion. thanks ] 14:56, 31 August 2006 (UTC) | |||
::::Working. Thank you! --] 16:33, 31 August 2006 (UTC) | |||
==Categories - for discussion== | |||
''Copied from my talk page, ''] ]'' 09:46 ] ] (GMT).'' | |||
:Could your bot also stop being 'helpful' with categories? I am getting sick of reverting that three or four times a day. To get categories sorted alphabetically you often need to place them above templates, but your bot keeps moving them back to the bottom (which ends up giving me a mish-mashed order). ] 23:04, 30 August 2006 (UTC) | |||
::Well it's done for now. I understand what your saying about categories, it raises two interesting points a. how should categories be ordered, and b. what to do about transcluded categories. The first has been thrashed out, and the conclusion reaced that alpahbetical order is not necessarily best (AWB used to order categories). The second is more probelmatical, I believe, for example that ''in general'' maintainance categories (and hence templates) should go after normal categories - and I thnk this is widely supported in principle. On the other hand it is common practice to put cleanup and wikify right at the top of articles. One off the things that AWB does in its general options is to put interwiki at the end, and non-trancluded categories immediately before, so I will copy part of your comment and this reply onto the AWB talk page for discussion. ''] ]'' 09:43 ] ] (GMT). | |||
::Alphabetical sorting isn't best in my opinion, sorting by relevance/priority is better. Yes, that does raise the point that transcluded maintenace categories will be first if the template is at the top of the page but c'est la vie... convention is to put those templates at the top of the page and that's not AWB's fault. It's quite simply not an AWB issue. --] 10:03, 31 August 2006 (UTC) | |||
:::It's not ''solely'' and AWB issue <grin>. I definatley don't think it's an AWP ''problem'', nor do I want to go 'round the mulberry bush we've been round before - just invite new ideas. I liked your Freudean slip "at the top of the fault." <second grin> ''] ]'' 10:11 ] ] (GMT). | |||
==Bolding first occurance of title in Image: name== | |||
I added formatting to an image filename. Whoops! ''] ]'' 10:06 ] ] (GMT). | |||
==One more oddity== | |||
See SmackBot's to ]. It blanked most of the article. Obiously unusual characters in the page, but apart from that no idea why. ''] ]'' 12:16 ] ] (GMT). | |||
:You've certainly been very unluckly with bugs today Rich! I'll have this sorted (well, I'll work around the problems in the .NET HTML decoder anyway). ] 16:27, 31 August 2006 (UTC) | |||
::Shotgun effect, there's a lot of them thar ISBNs. Thanks for your hard work. Can you let me know if there's something I can scan for to spot where else this might have happened? ''] ]'' 20:43 ] ] (GMT). | |||
== Edit summary link == | |||
Just a minor niggle, but can AWB insert w:WP:AWB when not working on wikipedia? I get redlinks in the edit summary using it on Wikinews. --] 16:03, 31 August 2006 (UTC) | |||
:Could you create a soft redirect to this page instead, as some projects have their own AWB page, and others just soft redirect to here. thanks. ] 16:25, 31 August 2006 (UTC) | |||
::Okay, Wikinews now has ] referring people to the project page. --] 10:48, 1 September 2006 (UTC) | |||
== Autonomous mode on other projects == | |||
Hi, I juse AWB on nl.wikipedia. It used to be possible to use my AWB on autonomous mode (bot-mode) on my bot-account , but in the newer versions of the software it's not possible anymore, which is a problem for me. I see that someone else has the same problem (see ]). Can anyone help and fix this, or is it not possible? NielsF<small>]</small> 19:28, 31 August 2006 (UTC) | |||
:The automode only becomes available when it has logged in (there is a log in button on the file menu, or it does it automatically when you start editing). Also make sure you have the newest version. ] 19:38, 31 August 2006 (UTC) | |||
::Ah thanks, upgrading to the latest version did the trick! Thanks for your quick response. NielsF<small>]</small> 20:06, 31 August 2006 (UTC) | |||
== Plugin stuff == | |||
Besides getting notification of start/stop/exit, could the plugin get access to txtEdit please Martin? The ContextMenuStrip isn't very useful if I can't put text into the box (or, is there a routine to call to do that?) --] 20:39, 31 August 2006 (UTC) | |||
== Timer == | |||
I'm using 3.0.2.8 and I can't seem to get the timer to appear. I have tried turning it off and on again with no success. --] <sup><small>]</small></sup> 22:20, 31 August 2006 (UTC) | |||
*Sorry, my bad. I didn't see that it moved.... BTW, can someone remind me of the limit for "too fast". I can't remember where that is documented. --] <sup><small>]</small></sup> 22:24, 31 August 2006 (UTC) | |||
== Hiding tabs from plugin == | |||
Any idea why the following code doesn't work Martin? | |||
<pre> | |||
Friend Shared Sub HideTabs() | |||
For Each tabp As TabPage In SettingsTabs | |||
tabp.Hide() | |||
Next | |||
End Sub | |||
</pre> | |||
When this code ''does'' work: | |||
<pre> | |||
Friend Shared Sub HideTabs() | |||
For Each tabp As TabPage In SettingsTabs | |||
tabp.Text = "I love Bluemoose!" | |||
Next | |||
End Sub | |||
</pre> | |||
--] 12:26, 1 September 2006 (UTC) | |||
:You can't use .Hide(), you have to remove the tabpage from the tabcontrol, then add it back if you want to show it, careful to add back in same order though. ] 12:40, 1 September 2006 (UTC) | |||
::Would that break AWB in any way? Do you ever reference the controls through the tabcontrol.tabpages() collection or only by name? --] 12:46, 1 September 2006 (UTC) | |||
:::It ''should'' be fine. Only one way to be sure though. ] 12:53, 1 September 2006 (UTC) | |||
::::Hehe, yep! --] 12:54, 1 September 2006 (UTC) | |||
== Possible bug in Categorisation == | |||
I am trying to remove ] and AWB will not recognize the string. Is it possible that it has something to do with the parenthesis? --] <sup><small>]</small></sup> 21:16, 1 September 2006 (UTC) | |||
:Ok, I see whats wrong, fix will be in next release. thanks ] 10:27, 2 September 2006 (UTC) | |||
:Ooh, small world - that's a CFD nomination of mine :) --] 10:46, 2 September 2006 (UTC) | |||
::New release 3.0.2.9 did the job. Thanks for the fix. --] <sup><small>]</small></sup> 21:23, 3 September 2006 (UTC) | |||
== Notification of new messages == | |||
When working on Wikinews and I get a new message on my talk page, AWB is trying to load up a Misplaced Pages page instead of a diff of my talk page on Wikinews. --] 07:27, 2 September 2006 (UTC) | |||
:Oh yeah, what a stupid error, fix in next release. thanks ] 10:27, 2 September 2006 (UTC) | |||
::Found when someone went "where the $#@% did you get AWB for Wikinews?" :-) --] 15:30, 2 September 2006 (UTC) | |||
== Image bug == | |||
Hi Martin, I encountered a little image commenting out bug, see . A random line break was previously added, so the formatting was | |||
<pre>[[Image:Delmarquis1.jpg|thumb|250px|right| | |||
Del Marquis in concert with ].]]'''Del Marquis''' (born '''Derek Gruen''' ] ], ]) is ...</pre> | |||
Just letting you know. —<span style="font: small-caps 14px times; color: red;">] (])</span> 16:55, 2 September 2006 (UTC) | |||
== Problems == | |||
Hi, I have some small problems (my browser is Firefox): | |||
*I try to work with the catalan version and I receive a message. It appears a window telling me I received a new message. Cool, but now I already read it and I can't close it. I read it with both Firefox and IE and the message is still there. I'll have to kill AWB process. Done. | |||
*Now, I still have the ] as some months before. My text editor is wordpad or notebook (I don't have word). I didn't answer then because I was going to change my PC and I decided waiting what's up with the new one, but still can not load the settings (it finds problems even after a <nowiki><!--</nowiki> !!!). The supposed working ] you corrected, doesen't work. Therefore I was thinking on: | |||
*Since the important part of the file are catalan typos, I wanted to put them in a page (it could be ]) as your Typos page, but I can not load them in AWB as you do in the english version. So, could you please add an option in wich either you can choose from wich page you want to dowload the typos list, or change the page deppending on the language... | |||
Thank you!--] - (]) - (]) 16:01, 3 September 2006 (UTC) | |||
*:Probably a caching issue in IE, don't think there is much that can be done. | |||
*:The settings file is not designed to be edited manually, though it can be done, I don't recommend it. | |||
*:I'll make it so it changes the typos page depending on the language. The ca: page will be http://ca.wikipedia.org/Viquip%C3%A8dia:AutoWikiBrowser/Typos (I have copied your settings into it), note that it will detect any problems with the regex syntax and alert you to it, and is also sensitive to using the correct syntax <Typo word="" find="" replace="" /> ] 19:03, 3 September 2006 (UTC) | |||
::Great! Thank you very much!--] - (]) - (]) 19:10, 3 September 2006 (UTC) | |||
== Redirects == | |||
I want to replace ''#REDIRECT <nowiki>]</nowiki>'' (of course with the \ tags for regular expressions) with, e.g. ''car'' (just an example). If I enter this in "Find and Replace - Normal", that doesn't work. The problem is, AWB doesn't take care of this page and follows the redirect. On the redirected page ABW of course doesn't replace anything. Does anyone know how to make this work? Many thanks in advance, ] 18:04, 3 September 2006 (UTC) | |||
:Turn off the "Bypass redirects" option in the "General" menu. ] 18:26, 3 September 2006 (UTC) | |||
== Feature request == | |||
Could we have an option under the "Make from" setting which fetches articles from a category '''and''' all it's subcategories? So for example, if you entered a category such as ], it would fetch all articles from that category, and then fetch all articles from the sub-categories too, etc etc until there were no more sub-categories. This would be very helpful for certain bot tasks. Thanks, — ] <sup>(])</sup> <sub>10:21, 04 September 2006</sub> | |||
:I have already coded this, but not implemented as it is a little scary, apart from cyclical categorisation (which can be dealt with), it could be possible to have many hundreds of categories. I guess I would have to implement a limit on the number of results. ] 11:13, 4 September 2006 (UTC) | |||
::Ah, ok :) — ] <sup>(])</sup> <sub>11:17, 04 September 2006</sub> | |||
:::You could write a plugin FF ;) --] 21:01, 4 September 2006 (UTC) | |||
== International issue == | |||
Even on fully localized projects, English names of special namespaces still could be used. For example, on de: you can use <tt><nowiki>]</nowiki></tt> instead of <tt><nowiki>]</nowiki></tt>. But AWB does not support such ambiguousness and as result, some pages cannot be recategorized and some images cannot be removed. Of course, you still can use regexps, but it's kinda pain in the ass... ] 14:51, 5 September 2006 (UTC) | |||
== Auto tag - Uncategorised == | |||
I was doing some removing of categories from articles, per ] and in some cases, the only category was removed, but in one case the <nowiki>{{Uncategorised}}</nowiki> tag was added and the other was not (I added it manually, so the diff shows it in there). They both had iw links, but the 2nd also had a stub tag. Does the stub tag keep the Uncategorised tag from getting applied, and if so, is that a possible upgrade opportunity? --] <sup><small>]</small></sup> 01:37, 6 September 2006 (UTC) | |||
:Yes, a stub tag will stop the uncategorised tag being applied, as AWB is just being ultra cautious (as often tags give an article a category). But I will tweak this at some point to ignore stub tags. ] 08:35, 6 September 2006 (UTC) | |||
== Rats? == | |||
Any ideas: | |||
From the Article ] | |||
*⌊⌊⌊⌊rats0rats⌋⌋⌋⌋{{ratsinfoboxrats ratsAircraftrats | |||
|ratsnamerats =ratsATRrats rats42rats & ratsATRrats rats72rats | |||
|ratstyperats =] | |||
|ratsmanufacturerrats =ratsATRrats (]: rats50rats%, ]: rats50rats%) | |||
|ratsimagerats =ratsimagerats:ratsAerrats.ratsarannrats.ratsatr72rats.ratseirats-ratsredrats.ratsarprats.ratsjpgrats | |||
|ratscaptionrats =ratsATRrats rats72rats ratsofrats ratsAerrats ratsArannrats ratsatrats ratstakerats ratsoffrats | |||
|ratsdesignerrats = | |||
|ratsfirstrats ratsflightrats =rats1984rats | |||
|ratsintroducedrats =rats1985rats | |||
|ratsretiredrats = | |||
|ratsstatusrats =ratsInrats ratsrevenuerats ratsservicerats | |||
|ratsprimaryrats ratsuserrats =] | |||
|ratsmorerats ratsusersrats = | |||
|ratsproducedrats = | |||
|ratsnumberrats ratsbuiltrats = | |||
|ratsunitrats ratscostrats = | |||
|ratsvariantsrats ratswithrats ratstheirrats ratsownrats ratsarticlesrats = | |||
}}{{ratsalternateusesrats}} | |||
ratsTherats ]-] ratsbasedrats ] ratsmanufacturerrats '''ratsAereirats ratsdarats ratsTrasportorats ratsRegionalerats''' ratsorrats '''ratsAvionsrats ratsderats ratsTransportrats ratsRégionalrats''' ('''ratsATRrats''') ratswasrats ratsformedrats ratsinrats rats1981rats, ratsfromrats ratstherats ratsconsortiumrats ratsformedrats ratsbyrats ''']''' ratsofrats ratsFrancerats (ratsnowrats ]) ratsandrats ''']''' (ratsnowrats ''']'''), ratsofrats ]. | |||
Does this on any article.... Restarted AWB | |||
Same all the way through..... | |||
] 17:31, 6 September 2006 (UTC) | |||
:What settings are you using? ] 17:51, 6 September 2006 (UTC) | |||
::I would guess replace \b with rats. ATR is an anagram of RAT, so I smell a rat. ''] ]'', 19:26 ] ] (GMT). | |||
:::Perhapsly someone vandalised the RETF page? ''] ]'', 19:28 ] ] (GMT). | |||
::::Good thinking, there was a regex mistake on the typo page. thanks ] 19:40, 6 September 2006 (UTC) | |||
== Request to add Chinese Project Support == | |||
Hello, I want to use AWB in Chinese Misplaced Pages, so I request to add Chinese (zh) project support. The namespace in Chinese project is in English (due to two different characters in the same project) , so I think Chinese project might be easier to be supported. You can contact me if needed. :) --] 17:45, 6 September 2006 (UTC) | |||
:Ok, will do. ] 19:40, 6 September 2006 (UTC) | |||
== Add to selected list from category == | |||
Hey, | |||
How come sometimes when you select multiple categories, right click, the 'Add to selected list from category' is greyed out and requires multiple re-highlights and or re-right clicking. | |||
Anyone else noticed this? Would you be able to look into it please martin? | |||
Cheers | |||
] 19:00, 7 September 2006 (UTC) | |||
:I've noticed it on occasion, yes. IIRC it usually happens when some of the selected category names contain numbers?? --] 09:05, 8 September 2006 (UTC) | |||
== ] Random Edit == | |||
AWB wants to change | |||
** ]: ({{IPA|}}) '''Y'''oung '''R'''eligious '''U'''nitarian '''U'''niversalists | |||
into | |||
** ]: ({{IPA|}}) '''Y'''oung '''R'''iousacrileg$2 '''U'''nitarian '''U'''niversalists | |||
Mistake on the typo list? | |||
] 19:05, 7 September 2006 (UTC) | |||
:Yes, it was a typo list error, I've fixed it now. ] 19:26, 7 September 2006 (UTC) | |||
== minor edits == | |||
Is there an option to mark the edits done through AWB such as spelling mistakes etc. as minor? I am unable to find it. Thanks -- ]] 06:24, 8 September 2006 (UTC) | |||
:In the "general" menu, there is an option. I think I will move it as some point to somewhere more logical. thanks ] 08:20, 8 September 2006 (UTC) | |||
::Thank you. Found it -- ]] 09:02, 8 September 2006 (UTC) | |||
== Plugin example code == | |||
Martin, I think the example code needs a slight change regarding the ReadXML event. It needs to test the return value of MoveToAttribute() and if False, use the plugin's default value. In VB code like this is giving strange results: | |||
<pre> | |||
Friend Function XMLReadBoolean(ByVal reader As System.Xml.XmlTextReader, ByVal param As String) As Boolean | |||
reader.MoveToAttribute(param) | |||
Return Boolean.Parse(reader.Value) | |||
End Function | |||
Friend Function XMLReadString(ByVal reader As System.Xml.XmlTextReader, ByVal param As String) As String | |||
reader.MoveToAttribute(param) | |||
Return reader.Value | |||
End Function | |||
</pre> | |||
So I'll be changing my functions to take an ExistingValue argument, and return that if MoveToAttribute() returns false. HTH HAND :) --] 10:29, 8 September 2006 (UTC) | |||
In VB this code does the trick: | |||
<pre> | |||
Friend Module XMLUtils | |||
Friend Function XMLReadBoolean(ByVal reader As System.Xml.XmlTextReader, ByVal param As String, _ | |||
ByVal ExistingValue As Boolean) As Boolean | |||
If reader.MoveToAttribute(param) Then Return Boolean.Parse(reader.Value) Else Return ExistingValue | |||
End Function | |||
Friend Function XMLReadString(ByVal reader As System.Xml.XmlTextReader, ByVal param As String, _ | |||
ByVal ExistingValue As String) As String | |||
If reader.MoveToAttribute(param) Then Return reader.Value Else Return ExistingValue | |||
End Function | |||
End Module | |||
</pre> | |||
--] 10:41, 8 September 2006 (UTC) | |||
== AWB process won't die == | |||
I've noticed that if I exit AWB (by closing the form) while it's trying to verify that I'm logged in, the process keeps on running for several minutes (until I kill it). Note that this is with no plugins installed, just plain AWB. --] 14:55, 8 September 2006 (UTC) | |||
:Alos, ff you don't notice this, you will be unable to extract the upgraded version.... ''] ]'', 12:33 ] ] (GMT). | |||
== hndis and surname == | |||
I recently separated the surname template so it no longer redirects to hndis. One of the "general fixes" for AWB appears to replace {{]}} with {{]}}. How can that fix be disabled for AWB in general? -- ] 17:23, 9 September 2006 (UTC) | |||
:I've changed it in the next version, I'll release it now. ] 18:36, 9 September 2006 (UTC) | |||
==Watchlist problem== | |||
Even when I have "Add all to watchlist" ''un''checked in the menu, all of the articles that I edit are still getting added to my watchlist. Is there a toggle somewhere that I've missed? --] 21:06, 9 September 2006 (UTC) | |||
:Are you sure you have the option unchecked in ] too? — ] <sup>(])</sup> <sub>21:12, 09 September 2006</sub> | |||
:: I thought that the AWB options were independent of individual Misplaced Pages preferences? --] 22:48, 9 September 2006 (UTC) | |||
:::Yes they are, but once AWB has submitted a page for saving to Misplaced Pages your Misplaced Pages preferences take over. If they say "add all pages to the watchlist" that's what Mediawiki will do. AWB and Mediawiki are independent products and they don't share their settings in any way. --] 12:37, 10 September 2006 (UTC) | |||
:::: Okay, I may be mis-remembering, but I thought that the way it used to work was that they were entirely independent. In other words, I could have AWB working in the background, and any changes it made could be flagged to ''not'' show up on my watchlist. But at the same time, I could be normal editing in another window, and those changes automatically ''would'' show up in my watchlist. Otherwise I have to keep remembering to check or uncheck the watch box depending on which window that I'm in. In other words, what I would like (and the way that I thought it used to work) was that I could keep my normal Misplaced Pages preferences set as "add to watchlist", but if I keep the menu option unselected on AWB, that it's able to keep the watch box ''un''checked. AWB seems to be able to toggle the watch and "minor" boxes on... Isn't there a way that it can also turn them off? --] 18:43, 10 September 2006 (UTC) | |||
:::::My experience has been similar to what Elonka seems to expect. In the past, when I have had the AWB preference set to "not add to watch list", it didn't add them, even though my standard setting outside of AWB was to add them. --] <sup><small>]</small></sup> 18:58, 10 September 2006 (UTC) | |||
::::::It can't be done, because if AWB unchecks the "add to watchlist" box, it unwatches stuff that was ''already'' in your watchlist, there is no way to discriminate between what is already in your watchlist and what isn't. ] 19:15, 10 September 2006 (UTC) | |||
::::::: Ah, good point, I see the problem. Hmmm. Well, to be honest, I'd be willing to take that risk. Could the option of "do not add edited pages to watchlist" be added to AWB, perhaps with a clear disclaimer, like, "Warning! Changing this will affect all articles that you edit with AWB, and could have the unintended consequence of inadvertently unwatching an article that was already on your watchlist. Please use with care." --] 20:55, 10 September 2006 (UTC) | |||
==Bolding article name in Image bug== | |||
This seems to be a problem in the latest version (3.0.3.0) with this article ], at least. ''] ]'', 12:34 ] ] (GMT). | |||
==Not enabled to use this?== | |||
I just tried running AWB for the first time. After having set up my procedure, when I press "Start the process", I keep getting the error message "You are not enabled to use this." It then opens a window to ], on which I am clearly listed as a registered/enabled user. Did I set up my procedure wrong? --] <sup>] · <font color="green">]</font></sup> 22:21, 10 September 2006 (UTC) | |||
== Problem with Special:Log/Newusers == | |||
I'm trying to make a list from Special:Log/Newusers, but I'm not getting any users whose talk pages don't yet exist, even if I uncheck "Ignore existing pages" in the "Skip articles" section. I deduce that this is because the code added or tweaked per the request at ] assumes that one would only want users with ''live'' talk pages. But I want to find users ''without'' talk pages, and I suspect that the unchecked "Ignore" option never comes into play because the generated list must ''first'' include the desired pages. If so, could this be fixed so that ''all'' users in the desired portion of the log are represented? If this is done, the default behavior of skipping non-existing pages should automatically provide the current functionality, and folks in my situation will be accomodated as well. Thanks. ~ ] ] 00:29, 12 September 2006 (UTC) | |||
: Any thoughts on this problem yet, folks? Am I being dense, perhaps? ~ ] ] 22:33, 16 September 2006 (UTC) | |||
== Creating page list by filtering on content == | |||
I would like to use AWB's excellent mechanisms for fetching pages and examining their content to generate a list of pages with challenging editing problems. The idea is that AWB can find problem pages matching a specific pattern, but the fix to each page may take some research, so it would be nice to simply generate a list for offline work. However, I haven't come up with a decent way to do this. The "Make list" filter only works on page names, as I understand it. The skip articles can ''identify'' target articles (or filter out non-targets), but only to perform an operation on them — they toss the page off the page list whether or not they perform the operation. (Tagging the articles for attention is an option, but I'd prefer to create an offline list rather than edit each article twice, once to tag and once to fix.) Nor can I see how to use the "Find and Replace" options, even the "Advanced" rules, to manipulate either the page list or a separate file (like a log). Do the experienced AWB users here have any advice for this AWB newbie? Thanks. ~ ] ] 00:51, 12 September 2006 (UTC) | |||
:If you can program in C# or VB.NET your best bet would be to make a plugin. It would be ''very'' simple to implement. You'd build your list, AWB would send the text of each article to the plugin, the plugin would analyse the content and write it out to a log and just tell AWB to skip the page (so AWB wouldn't actually do ''any'' edits). You wouldn't need a fancy user interface or anything so you could do that with a few lines of code and some regular expressions. --] 10:10, 12 September 2006 (UTC) | |||
:: Sounds like fun. You don't happen to know of any cheap (and legal!) C# or VB.NET programming tools, do you? I can't even afford to upgrade my Windows OS with Microsoft's monopoly-enabled fees. ~ ] ] 21:12, 12 September 2006 (UTC) | |||
:::]. The bees knees. AWB is developed in the C# version. My plugin uses VB.NET (which, of course, all the best programmers use - isn't that right Martin? ;)) --] 21:17, 12 September 2006 (UTC) PS There are Java and C++ versions too, but I can't vouch for either of them as I haven't used them. --] 21:19, 12 September 2006 (UTC) | |||
:::: Cool! I've been wanting to try out C# after having read an article about it a few years back that made it look better designed for OOP than than C++'s grafting of OO onto C. (Ugh, what geeky alphabet soup.) Thanks for the info. ~ ] ] 00:38, 13 September 2006 (UTC) | |||
:::::I'm as happy to bash MS as the next guy (my first PC had Linux on it over 10 years ago), but dotnet is OOP heaven. When I first read a massive tome on it every page was "wow, it does that?" and "that's clever". It's first rate. Definitely as a C++ programmer you want to use C#. I'm using VB.NET as I have a lot of experience with VBA in Access, and VB6. They all compile to the same Intermediate Language so, with a very small number of exceptions, they all do pretty much the same thing. Good luck and let us know how you get on! --] 09:31, 13 September 2006 (UTC) | |||
::::::Oh noez! A programming language thread ;) ! C++ does have it's merits. But not for those using old fashioned C programming paradigms (read a decent book that explains things like ]). I admit, average joe programmer is quick at achieving progress in C#, as such it isn't a bad language. It's also cool for rapid prototyping. --] 09:51, 13 September 2006 (UTC) | |||
:::::::Have you tried C++.NET? Is it any good? Or are they incompatible bedfellows? | |||
:::::::Horses for courses. I'm into rapid application development. I have no desire to write device drivers, no ability with art so no interest in creating fancy graphics etc etc. I also think there's a certain amount of snobbery about low vs high level languages. Indeed take C# vs VB.NET - VB can do almost everything that C# can do, but it's a higher level language. Surely that makes it ''better''? (unless coming from a C background). --] 10:16, 13 September 2006 (UTC) | |||
== bypass wikilinks while scanning database == | |||
1) could there by an option "ignore wikilinks" (into wiki database scanner) ? | |||
2) Feature, automated searching the list of articles (for a MISTAKE) – i.e. articles are created from database but many of them are already fixed (database gets out-of-date soon). To eliminate those "fixed" articles I load a new settings with only one regex/string matching MISTAKE, then set "skip when no change/replacement made" and push "start the process" – if it find "no change/replacement" those "no needed" (I don't want to apply general/other fixes for them etc. if no MISTAKE is available anymore) articles are removed from the list, but the process stops when MISTAKE is founded. The thing is to check all articles automatically in this case (similar to "auto save"), like auto ignore (remove from list) if there's no MISTAKE, leave the article on the list if MISTAKE is founded, and check consecutive articles, could this be implemented in the future version? ] | |||
:I'm not sure what you mean by ignore wikilinks? As for the second idea, if I understand correctly, this has been suggested before, but I refused on the grounds that it would be a large drain on servers to have people crawling through thousands of pages. ] 13:55, 12 September 2006 (UTC) | |||
::2) I will be doing this by switching articles by hand anyway, this's for not doing redundant edits which will be included into database | |||
::1) Not to search into <nowiki> ] ]</nowiki> etc. (it's called "ignore interwiki links" as in "find and replace") --] | |||
::: now if i search through database – sometimes there's nothing to change because im ignoring interwiki into "find and replace" ] | |||
== 404 on startup with nonstandard Default.xml == | |||
Once again, this is using AWB with en.wikinews, I've overwritten the default config .xml file with that detail, plus setting the EnableRegexTypoFix option. Now, whenever I start up AWB I get a 404 error, my guess it is perhaps looking for a page of regexes on Wikinews. ''If'' this is the case, can you let me know what I'd need to create on wikinews, and where I'd need to copy from? I'd love to be able to include fixes to change quotes from MS Office into plain quotes - they break our PDF/print edition. | |||
Steps to reproduce are, File->User and project preferences, set project to Wikinews, select make from Category, enter a recent date (eg September 1, 2006), click on the More options tag and select Enable RegexTypo Fix, uncheck Skip article, click Make list, select File->Save settings, overwrite Default.xml, quit AWB, restart and observe the error, should be: '''The remote server returned an error: (404) Not Found. --] 17:35, 12 September 2006 (UTC) | |||
:The page is http://en.wikinews.org/Wikinews:AutoWikiBrowser/Typos now I have created the page it works ok. ] 19:32, 12 September 2006 (UTC) | |||
::Thank you for this, I've copied the typo list from wikipedia and added it to my watchlist so I spot updates. I really appreciate this tool and have made some significant changes on Wikinews with its help. --] 20:14, 12 September 2006 (UTC) | |||
== Newest version crashes == | |||
The current version of AWB always crashes on the first or second edit. Does anyone else have this problem?--] <sup>(])</sup> 21:46, 12 September 2006 (UTC) | |||
:No. It's quite usual, alas, for it crash after a thousand or more edits, but I've never had it crash after one or two. --] 22:33, 12 September 2006 (UTC) | |||
::No problem for me either, doesn't crash after 100+ edits. ] 03:37, 13 September 2006 (UTC) | |||
:::Maybe I had a bad download. I could reinstall...--] <sup>(])</sup> 11:10, 14 September 2006 (UTC) | |||
:::::No such luck. I guess I can wait for the next release and see what happens.--] <sup>(])</sup> 17:42, 14 September 2006 (UTC) | |||
::::::Hmm.. I also have the same problem, I tried both 3.0.2.9 and 3.0.3.0 and they crashed on my first and second edit. Dunno why though. --] <sup>(])</sup> 01:25, 15 September 2006 (UTC) | |||
== XML settings bug? == | |||
Loading these settings (make list from category) I get an error at | |||
<pre> | |||
if (reader.MoveToAttribute("index")) | |||
listMaker1.SelectedSource = (WikiFunctions.Lists.SourceType)int.Parse(reader.Value); | |||
</pre> | |||
in UserSettings.cs. | |||
Settings (tested with plugin deleted, it's not a plugin issue) - | |||
<pre> | |||
<?xml version="1.0" encoding="utf-8"?> | |||
<Settings program="AWB" schema="2"> | |||
<Project> | |||
<projectlang proj="wikipedia" lang="en" /> | |||
</Project> | |||
<Options> | |||
<selectsource index="Category" text="Mexican politician stubs" /> | |||
<general general="True" tagger="True" unicodifyer="True" /> | |||
<categorisation index="0" text="" /> | |||
<skip does="False" doesnot="False" regex="False" casesensitive="False" doestext="" doesnottext="" moreindex="0" /> | |||
<message enabled="False" text="" append="True" /> | |||
<automode delay="15" quicksave="False" suppresstag="True" /> | |||
<imager index="0" replace="" with="" /> | |||
</Options> | |||
<regextypofix> | |||
<regextypofixproperties enabled="False" skipnofixed="False" /> | |||
</regextypofix> | |||
<FindAndReplaceSettings> | |||
<findandreplacesettings enabled="False" ignorenofar="True" ignoretext="False" appendsummary="True" afterotherfixes="False" /> | |||
</FindAndReplaceSettings> | |||
<FindAndReplace> | |||
<replacerules enabled="False"> | |||
<rule name="Rule" type="0" enabled="True" /> | |||
</replacerules> | |||
</FindAndReplace> | |||
<startoptions> | |||
<summary text="clean up" /> | |||
<summaryindex index="clean up" /> | |||
<find text="" regex="False" casesensitive="False" /> | |||
<menu> | |||
<wordwrap enabled="True" /> | |||
<toolbar enabled="False" /> | |||
<bypass enabled="True" /> | |||
<ingnorenonexistent enabled="True" /> | |||
<noautochanges enabled="False" /> | |||
<skipnochanges enabled="False" /> | |||
<preview enabled="False" /> | |||
<minor enabled="False" /> | |||
<watch enabled="False" /> | |||
<timer enabled="False" /> | |||
<sortinterwiki enabled="True" /> | |||
<addignoredtolog enabled="False" /> | |||
</menu> | |||
<plugins /> | |||
</startoptions> | |||
<pastemore> | |||
<pastemore1 text="" /> | |||
<pastemore2 text="" /> | |||
<pastemore3 text="" /> | |||
<pastemore4 text="" /> | |||
<pastemore5 text="" /> | |||
<pastemore6 text="" /> | |||
<pastemore7 text="" /> | |||
<pastemore8 text="" /> | |||
<pastemore9 text="" /> | |||
<pastemore10 text="" /> | |||
</pastemore> | |||
<preferences> | |||
<preferencevalues enhancediff="True" scrolldown="True" difffontsize="150" textboxfontsize="10" textboxfont="Courier New" lowthreadpriority="False" flashandbeep="True" /> | |||
</preferences> | |||
</Settings> | |||
</pre> | |||
--] 14:31, 13 September 2006 (UTC) | |||
:On further inspection I think the settings are getting ''saved'' incorrectly, and "selectsource index" should be "0", not "category"? --] 14:38, 13 September 2006 (UTC) | |||
::This is only in SVN, I've changed it now. ] 15:28, 13 September 2006 (UTC) | |||
== Making lists == | |||
Martin, any chance we could get these? | |||
*Make list from category - first 200 articles. Sometimes I want to sample the category and not get the entire thing (especially if contains 100,000 articles!). Links on page for a category page doesn't currently work; an alternative to my request might be to makle links on page for a category page work i.e. it returns the listing on the first page. | |||
*What redirects here. | |||
--] 14:55, 14 September 2006 (UTC) | |||
:I suppose I can put an optional limit in the category, but the other things would need a change in ]. ] 15:37, 14 September 2006 (UTC) | |||
::Blimey. I didn't know about that. Never heard of it. (rolls eyes). --] 17:45, 14 September 2006 (UTC) | |||
== Login problem == | |||
] | |||
My Auto Wiki Browser refuses to believe that I'm logged in, even though I very obviously am. As you can see at the screenshot to the left, I had logged in successfully, yet it was still prompting me to log in again. What on earth is the matter? ] ] 04:48, 16 September 2006 (UTC) | |||
:Most likely that you are not using the monobook skin. ] 10:12, 16 September 2006 (UTC) | |||
::I'll look into making it work though. ] 15:21, 16 September 2006 (UTC) | |||
== find and replace - ignore external/interwiki links, images, nowiki ... == | |||
when this option is set, regex: | |||
<nowiki>('''.*?'''( \(.*?\))?) ?? ?(jest )?to(^:| ) </nowiki> | |||
won't catch: <nowiki>'''Bielefeld''' to</nowiki> | |||
in ] | |||
– everything it's ok when I unset that option, regex checker tells it's true anyway, so it might be bug ] | |||
:It was a problem with an internal regex being too greedy, I've fixed it. ] 15:21, 16 September 2006 (UTC) | |||
== Lists of large categories == | |||
I'm finding that when I create a list of articles from multiple large categories, AWB omits a substantial number of the articles. Specifically, the subcats of ], there are about 17,000 articles listed, and when I create a list from them, many articles are left out of the list (several hundred at least), even if I try it twice. So um... any ideas? If this is a known bug, is there any reliable tool to generate a list of ''all'' articles in a large category? --] 14:29, 16 September 2006 (UTC) | |||
:Ah I'd glad you posted this. When I build a listing of ] I get 120,000 or so articles. If I build a list of ] (talk pages tagged with living=yes) I only get 101,000. I do a bot run and discover that thousands of my remaining 20,000 or so articles ''already have living=yes''. Mediawiki hasn't updated the category properly (unlikely, because the job queue runs often enough on WPBiography); the list comparer is broken (possible but I don't think it's this); or there's something wrong with the list grabbing from large cats. --] 14:37, 16 September 2006 (UTC) PS My plugin keeps a log so I can furnish a skipped list if need be.--] 14:37, 16 September 2006 (UTC) | |||
:I've noticed this, as it only occurs on very large categories, I half suspect it is the queri API rather than AWB, but I'll find out for sure soon. ] 15:21, 16 September 2006 (UTC) | |||
== Adding wikiproject banner to talk pages == | |||
Hi, is it possible to add a wikiproject banner to the talk pages of articles using AWB? I clicked on more options, clicked on append message and wrote down the banner of the project {{tl|WP India}}. And then I saved. But nothing happened. Please suggest -- ] 19:50, 16 September 2006 (UTC) | |||
:You have to set all the settings, then start the process. ] 21:17, 16 September 2006 (UTC) | |||
::Thanks, I did set all settings as far as I could gather. But not able to do it. Help would be greatly appreciated. -- ] 04:52, 17 September 2006 (UTC) | |||
== Purpose of AWB? == | |||
I'm sorry if this is a stupid question, but what is the actual purpose of AWB and/or what is the main function of it that makes it superior to simply going around in IE and editing pages? The article doesn't exactly make it clear (to me). I tend to see mainly spelling and grammar errors corrected with AWB tags in the change-log. What exactly does AWB allow you to do? ] 00:42, 17 September 2006 (UTC) | |||
:AWB can be used for repeating the same task over and over and over and over again. Like adding a template to every page in a category, (even hundreds of them). It can also be used to do tasks like update a link or image on pages. It is not designed to replace your normal browser, or to be your primary editor. Some ] run solely using the find and replace utility of AWB. — ] <sup>]</sup> 02:02, 17 September 2006 (UTC) | |||
== User login == | |||
Hi, I'm using AWB on Swedish Misplaced Pages, and it works well. To my knowledge, I have not entered my username in AWB or its config files, still AWB is logging in with my standard login. How can it work? Magic? I'm clogging down the RC with my edits though, how do I make AWB login as my bot account? //] 06:22, 17 September 2006 (UTC) | |||
:Yes, it's magic :) Actually, no, it's because AWB uses the Internet Explorer engine. You'll have to log out of Misplaced Pages in IE and log back in as your bot. If you want to run AWB and do manual edits at the same time using 2 different accounts, use IE for your bot and do your manual edits in another browser like Firefox or Opera. --] 09:13, 17 September 2006 (UTC) | |||
::Aha! I added this in the User manual on the front AWB page. Thanks! //] 11:24, 17 September 2006 (UTC) | |||
== "Correcting the misspelling" of a direct quote == | |||
Hi there. Occasionally, people will come across the ] page using AWB and "correct" the spelling of a person being quoted, even when the spelling is in their exact words. Is there a way to prevent this? Thanks. ] 06:28, 17 September 2006 (UTC) | |||
:Put "(sic)" or "" next to the intentional spelling mistake, and any AWB user with any wits about them will know it's as quoted and leave it? --] 09:10, 17 September 2006 (UTC) | |||
== What to do, what to do? == | |||
I have just downloaded and been registered for AWB and looked at the Terms and Conditions. Does spell-checking - the task I plan to complete with it - class as unecessarily minor edits? Thanks. ]<font color="green">]</font>]<font color="green">]</font>]|<sup>]</sup>|<sub>]</sub> 17:25, 17 September 2006 (UTC) | |||
== AutoWikiBrowser and searching for pages with capture regex. == | |||
:No, spell checking is not minor. thanks ] 17:37, 17 September 2006 (UTC) | |||
I'm looking for pages that have strings like <nowiki>]</nowiki> So I'd like to search for these with something like <nowiki>\* * *\|\1</nowiki> and while AWB does capture, it looks like that is only for internal, not for looking for them in the first place, is that something that wikipedia or AWB can do, or is this something where I need Cirrus or something else more powerful? ] (]) 22:50, 24 November 2024 (UTC) | |||
:: Are you sure? I'd been under the impression from ] that simple spelling corrections were minor edits. Where I'm still fuzzy though, is whether the addition of a {{tl|stub}} template counts as minor or not. --] 18:15, 18 September 2006 (UTC) | |||
:Not sure what you are after. You have a specific string 'like' then use a generic form of string search entry. Just plain old search is reasonably powerful. If I search for (articles only) <i>~"Alpha Phi Alpha"</i> I get 799 entries, for <i>~"Alpha Phi Alpha" insource:/\] (]) 23:57, 24 November 2024 (UTC) | |||
:::Err... no, this is a seperate issue. You're talking about "do I tick the Misplaced Pages 'minor edit' box or not?". The original question was about the terms and conditions of AWB and not making "''unneccessary'' minor edits". --] 18:19, 18 September 2006 (UTC) | |||
:You have to put round brackets into the search string to tell the regex code what <code>\1</code> is intended to match. I began a database scan for <code><nowiki>\* * *)\|\1</nowiki></code> but quickly aborted it; there are tens of thousands of matches. Typical examples are <code><nowiki>]</nowiki></code> and <code><nowiki>]</nowiki></code>. I restricted the search to the names of Greek letters and dumped the results at ] (]) -- ] (]) 08:10, 25 November 2024 (UTC) | |||
::Thank you both. While I expect the majority of occurances to be from a group of 9 Fraternites and Sororities, there are hundreds that are possibilities. John, that is exactly what I wanted, I expected to have to trim down some. What software is needed for that Database scan and is that something a non-admin user will have access to?] (]) 13:50, 25 November 2024 (UTC) | |||
:::{{Re|Naraht}} I used AWB's "]". AWB normally needs ] but if you're only using it to create a list of articles, you can use it without logging in. BUT to use the database scanner, you'll need a copy of the text of Misplaced Pages on your hard drive - the file <code>enwiki-20241120-pages-articles.xml.bz2</code> is a 20 Gigabyte download from , and that has to be uncompressed to 102 Gigabytes before AWB can use it. | |||
:::If you post search requests on this page, I or someone else with a recent database dump will probably respond. -- ] (]) 14:07, 25 November 2024 (UTC) | |||
::::Right now, only 10 GB free on my personal hard drive. Maybe when I buy my next one. :) again thank you.] (]) 18:06, 25 November 2024 (UTC) | |||
== |
== Redlinks == | ||
Hello. | |||
Again, this is something I've observed before I started using a plugin, so the problem is within AWB itself. I've found that AWB memory usage can increase steadily throughout a session until it's at 400MB or more of physical RAM. Also in the past it's been normal for me to wake up in the morning and find that AWB stalled throughout the night. To counter the second problem, I've added a feature to my plugin to stop and restart AWB if the list isn't empty and it doesn't send any articles to the plugin in 10 minutes. Unfortunately that has the side effect of trying to keep AWB running if it's struggling for memory. | |||
I don't know if it's technically possible. | |||
My machine has 1GB of memory, but this morning when I got up both of my AWB processes had crashed as out of memory and, rather annoyingly, they'd taken my Firefox with umpteem open tabs up down with them. I can only imagine that certain resources aren't being disposed of correctly or objects are somehow kept alive when no longer needed. Any ideas Martin and has anyone else doing thousands of automated edits noticed this? --] 10:04, 18 September 2006 (UTC) | |||
It's for the French wiki, but I think I have more help here. | |||
:It's the IE control, it seems to want to cache pages. I have never had any problems with it, even on large runs, maybe your IE has a different option set to cache pages in a different way or something. ] 10:08, 18 September 2006 (UTC) | |||
Is it possible with this tool to remove all red links on a specific page, because red links are not admissible and never will be? For example | |||
::Ah. Well, remember the issue I had with gigs of pages being cached? Also, since I zapped that cache my MSDN help viewer has been f*cked too. My version of IE must have problems. Any registry settings or owt you know of to help fix it? --] 10:20, 18 September 2006 (UTC) | |||
If it's possible, can I have help with the process? ] (]) 13:01, 2 December 2024 (UTC) | |||
:My Pc's got 2GB of ram in, and 1GB of page file. I've had AWB running for long enough to have to close it due to using all the page file. I havent really done AWB runs recently, or any large ones, but it seems to be a bit better in the newer version. | |||
:I'll have to dig through the , but I don't think so... on the other hand, there's an "if template exists" function so there might be an "ifexists" in general. ] (]) 21:00, 3 December 2024 (UTC) | |||
:Ive noticed it during any .NET app that i've created, whenever you open or close forms, and press buttons and the memory usage just keeps increasing! I know people run AWB bots and stuff, with i think mboverlord running one quite a lot... And martin, you have bluebot don't you? ] 10:41, 18 September 2006 (UTC) | |||
::There is an API that lists the pages linked from a page, and another that will report whether pages in that list exist. AWB already has a mechanism for detecting and unlinking links (the de-duplication function). So all the main pieces are in place, although it would need a new option flag if you wanted to make it happen without interaction. Still, it would be a project, and couldn't be classed as maintenance. ] (]) 22:48, 4 December 2024 (UTC) | |||
:::Thinking about what I wrote: it should be feasible to do the edits in a Module (or possibly a Plugin). Stand by... ] (]) 15:28, 5 December 2024 (UTC) | |||
::::Thank you. ] (]) 09:46, 6 December 2024 (UTC) | |||
:::::Daily stand-up: a module is working on some relatively short articles. Running into an undocumented limit in a MediaWiki API. Higher-priority commitments rn. ] (]) 19:33, 6 December 2024 (UTC) | |||
::::::]; give it a spin. But it's getting late here so I may not respond right away. ] (]) 02:35, 7 December 2024 (UTC) | |||
:::::::If you've read it: yes, I know. I wrote it in a hurry so I'll probably be tweaking it for efficiency and robustness (and bugs :-O ). If you are interested in it, you may want to add the module to your watchlist. ] (]) 14:32, 7 December 2024 (UTC) | |||
:::::::: Wow! You're a genius! ] (]) 14:26, 10 December 2024 (UTC) | |||
::::::::: Thank you! And, of course, I just found a bug (a link with a ' character in it, and possibly other punctuations). Please copy and reload the module. ] (]) 21:07, 10 December 2024 (UTC) | |||
:::::::::: Yes, I had noticed this bug. Thanks ] (]) 08:57, 11 December 2024 (UTC) | |||
== Stub spacing == | |||
::I regularly do runs of multiple thousands of edits without any problem, the memory usage does get quite high after a couple of thousand, but it seems to reach a ceiling eventually. Historically there was a problem with stalling occasionally, but that particular problem has been solved. ] 13:10, 18 September 2006 (UTC) | |||
{{Tracked|T382578}} | |||
The requirement for two blank lines before stubs has now been removed. See ] — ] <sup>]</sup> 08:54, 20 December 2024 (UTC) | |||
:Does this mean there is or will be a new version of AWB? I'm currently using version 6.3.1.1. ] (]) 22:26, 20 December 2024 (UTC) | |||
:::I made ~3000 edits today, I noticed memory usage went up to ~300mb, then IE seemed to purge itself and it went right back down. ] 18:27, 18 September 2006 (UTC) | |||
:: Due to (19 December 2024) changes to CSS on enwiki, output is now OK with two blank lines, one blank line or no blank line before a stub. AWB no longer ''needs'' to force two lines before a stub for enwiki. No idea about other wikis — ] <sup>]</sup> 23:50, 20 December 2024 (UTC) | |||
== |
== Bot saving blank pages == | ||
As a feature request, would it be possible to have AWB pre-load a page? Sort of like running a tabbed browser? I notice that when I'm running through a long list (such as ]), that I usually only need a few seconds to actually decide what to do with a particular page, but that it takes just as long to wait for the next page to load after I hit "save." If AWB could be pre-loading the next page in the list, while I'm making the decision on the current one, that would speed things up considerably, as I wouldn't have the "wait for page load" delays. --] 18:20, 18 September 2006 (UTC) | |||
Since there's not much information on this that I'm aware of, I think it's important to keep track of the circumstances that this bug presents itself. Tom.Bot was running on Wikispecies nearly continuously for 2 weeks, from Dec 7 to Dec 21, after 492,026 successful saves in the same instance of AWB before it started intermittently saving blank pages, despite failing a "Skip if doesn't contain" check that I thought would help prevent this problem. Very shortly prior to that, I "Reset saved/skipped counts", which usually produces a large negative "Edits/min" value, which may or may not be related. Before restarting the AWB instance, I reran the bot on some of the blanked pages and they were not blanked again. Restarting the instance fixed the problem. <b>~</b> <span style="font-family:Monotype Corsiva; font-size:16px;">] (] ⋅])</span> 19:40, 22 December 2024 (UTC) | |||
:It would be pretty difficult to implement. Normally the delay is fairly insignificant, but the servers have been slow for the last couple of days. ] 18:27, 18 September 2006 (UTC) | |||
:See also {{slink|User_talk:Primefac/Archive_21#Blanking}}, slightly different setup for skip checks. ] (]) 12:56, 23 December 2024 (UTC) |
Latest revision as of 15:32, 25 December 2024
AutoWikiBrowser 6.3.1.1- Home
Introduction and rules - User manual
How to use AWB - Discussion
Discuss AWB, report errors, and request features - User tasks
Request or help with AWB-able tasks - Technical
Technical documentation
- Changelog
- Developer discussion
- Modules
- Regular expression
- Sandbox
- Template redirects
- Typos
- Usage stats
- Userbox
This is the discussion page for the AutoWikiBrowser (AWB) project. It is also the place to discuss using the AWB program (for help, questions, or general inquiries about AWB). Specific guidelines on where to make particular reports or requests are provided in the § Before you post section below. Before asking a question, please refer to the read the § Frequently asked questions below.
Archives |
Index 1, 2, 3, 4, 5, 6, 7, 8, 9, 10 |
This page has archives. Sections older than 30 days may be automatically archived by Lowercase sigmabot III. |
Before you post
Do you want to ... | Please use | ||||
---|---|---|---|---|---|
Report a bug or request a feature in AWB? | Check reported bugs on Phabricator before filing a new bug report. You do not need to create another account there; just log in with your global Wikimedia account. See this MediaWiki wiki page on how to report bugs and request features on Phabricator.
| ||||
Report an incorrectly fixed typo? | Misplaced Pages talk:AutoWikiBrowser/Typos | ||||
Request approval to use AWB? | Misplaced Pages:Requests for permissions/AutoWikiBrowser | ||||
Ask a question about AWB or ask for help? | This page |
Frequently asked questions
Frequently asked questions |
---|
//Detect IE5.5+ if (navigator.appVersion.indexOf("MSIE")==-1) { // Previous contents go here .... }
|
Discussion
AutoWikiBrowser and searching for pages with capture regex.
I'm looking for pages that have strings like ] So I'd like to search for these with something like \* * *\|\1 and while AWB does capture, it looks like that is only for internal, not for looking for them in the first place, is that something that wikipedia or AWB can do, or is this something where I need Cirrus or something else more powerful? Naraht (talk) 22:50, 24 November 2024 (UTC)
- Not sure what you are after. You have a specific string 'like' then use a generic form of string search entry. Just plain old search is reasonably powerful. If I search for (articles only) ~"Alpha Phi Alpha" I get 799 entries, for ~"Alpha Phi Alpha" insource:/\
- You have to put round brackets into the search string to tell the regex code what
\1
is intended to match. I began a database scan for\* * *)\|\1
but quickly aborted it; there are tens of thousands of matches. Typical examples are]
and]
. I restricted the search to the names of Greek letters and dumped the results at User:John of Reading/X2 (permalink) -- John of Reading (talk) 08:10, 25 November 2024 (UTC)- Thank you both. While I expect the majority of occurances to be from a group of 9 Fraternites and Sororities, there are hundreds that are possibilities. John, that is exactly what I wanted, I expected to have to trim down some. What software is needed for that Database scan and is that something a non-admin user will have access to?Naraht (talk) 13:50, 25 November 2024 (UTC)
- @Naraht: I used AWB's "Database Scanner". AWB normally needs this permission but if you're only using it to create a list of articles, you can use it without logging in. BUT to use the database scanner, you'll need a copy of the text of Misplaced Pages on your hard drive - the file
enwiki-20241120-pages-articles.xml.bz2
is a 20 Gigabyte download from partway down this page, and that has to be uncompressed to 102 Gigabytes before AWB can use it. - If you post search requests on this page, I or someone else with a recent database dump will probably respond. -- John of Reading (talk) 14:07, 25 November 2024 (UTC)
- Right now, only 10 GB free on my personal hard drive. Maybe when I buy my next one. :) again thank you.Naraht (talk) 18:06, 25 November 2024 (UTC)
- @Naraht: I used AWB's "Database Scanner". AWB normally needs this permission but if you're only using it to create a list of articles, you can use it without logging in. BUT to use the database scanner, you'll need a copy of the text of Misplaced Pages on your hard drive - the file
- Thank you both. While I expect the majority of occurances to be from a group of 9 Fraternites and Sororities, there are hundreds that are possibilities. John, that is exactly what I wanted, I expected to have to trim down some. What software is needed for that Database scan and is that something a non-admin user will have access to?Naraht (talk) 13:50, 25 November 2024 (UTC)
Redlinks
Hello.
I don't know if it's technically possible.
It's for the French wiki, but I think I have more help here.
Is it possible with this tool to remove all red links on a specific page, because red links are not admissible and never will be? For example here
If it's possible, can I have help with the process? Bordurie (talk) 13:01, 2 December 2024 (UTC)
- I'll have to dig through the code, but I don't think so... on the other hand, there's an "if template exists" function so there might be an "ifexists" in general. Primefac (talk) 21:00, 3 December 2024 (UTC)
- There is an API that lists the pages linked from a page, and another that will report whether pages in that list exist. AWB already has a mechanism for detecting and unlinking links (the de-duplication function). So all the main pieces are in place, although it would need a new option flag if you wanted to make it happen without interaction. Still, it would be a project, and couldn't be classed as maintenance. David Brooks (talk) 22:48, 4 December 2024 (UTC)
- Thinking about what I wrote: it should be feasible to do the edits in a Module (or possibly a Plugin). Stand by... David Brooks (talk) 15:28, 5 December 2024 (UTC)
- Thank you. Bordurie (talk) 09:46, 6 December 2024 (UTC)
- Daily stand-up: a module is working on some relatively short articles. Running into an undocumented limit in a MediaWiki API. Higher-priority commitments rn. David Brooks (talk) 19:33, 6 December 2024 (UTC)
- User:DavidBrooks/UndoRelinksModule; give it a spin. But it's getting late here so I may not respond right away. David Brooks (talk) 02:35, 7 December 2024 (UTC)
- If you've read it: yes, I know. I wrote it in a hurry so I'll probably be tweaking it for efficiency and robustness (and bugs :-O ). If you are interested in it, you may want to add the module to your watchlist. David Brooks (talk) 14:32, 7 December 2024 (UTC)
- Wow! You're a genius! It works! Bordurie (talk) 14:26, 10 December 2024 (UTC)
- Thank you! And, of course, I just found a bug (a link with a ' character in it, and possibly other punctuations). Please copy and reload the module. David Brooks (talk) 21:07, 10 December 2024 (UTC)
- Yes, I had noticed this bug. Thanks Bordurie (talk) 08:57, 11 December 2024 (UTC)
- Thank you! And, of course, I just found a bug (a link with a ' character in it, and possibly other punctuations). Please copy and reload the module. David Brooks (talk) 21:07, 10 December 2024 (UTC)
- Wow! You're a genius! It works! Bordurie (talk) 14:26, 10 December 2024 (UTC)
- If you've read it: yes, I know. I wrote it in a hurry so I'll probably be tweaking it for efficiency and robustness (and bugs :-O ). If you are interested in it, you may want to add the module to your watchlist. David Brooks (talk) 14:32, 7 December 2024 (UTC)
- User:DavidBrooks/UndoRelinksModule; give it a spin. But it's getting late here so I may not respond right away. David Brooks (talk) 02:35, 7 December 2024 (UTC)
- Daily stand-up: a module is working on some relatively short articles. Running into an undocumented limit in a MediaWiki API. Higher-priority commitments rn. David Brooks (talk) 19:33, 6 December 2024 (UTC)
- Thank you. Bordurie (talk) 09:46, 6 December 2024 (UTC)
- Thinking about what I wrote: it should be feasible to do the edits in a Module (or possibly a Plugin). Stand by... David Brooks (talk) 15:28, 5 December 2024 (UTC)
- There is an API that lists the pages linked from a page, and another that will report whether pages in that list exist. AWB already has a mechanism for detecting and unlinking links (the de-duplication function). So all the main pieces are in place, although it would need a new option flag if you wanted to make it happen without interaction. Still, it would be a project, and couldn't be classed as maintenance. David Brooks (talk) 22:48, 4 December 2024 (UTC)
Stub spacing
Tracked in PhabricatorTask T382578
The requirement for two blank lines before stubs has now been removed. See WP:STUBSPACING — GhostInTheMachine 08:54, 20 December 2024 (UTC)
- Does this mean there is or will be a new version of AWB? I'm currently using version 6.3.1.1. Kiwipete (talk) 22:26, 20 December 2024 (UTC)
- Due to (19 December 2024) changes to CSS on enwiki, output is now OK with two blank lines, one blank line or no blank line before a stub. AWB no longer needs to force two lines before a stub for enwiki. No idea about other wikis — GhostInTheMachine 23:50, 20 December 2024 (UTC)
Bot saving blank pages
Since there's not much information on this that I'm aware of, I think it's important to keep track of the circumstances that this bug presents itself. Tom.Bot was running on Wikispecies nearly continuously for 2 weeks, from Dec 7 to Dec 21, after 492,026 successful saves in the same instance of AWB before it started intermittently saving blank pages, despite failing a "Skip if doesn't contain" check that I thought would help prevent this problem. Very shortly prior to that, I "Reset saved/skipped counts", which usually produces a large negative "Edits/min" value, which may or may not be related. Before restarting the AWB instance, I reran the bot on some of the blanked pages and they were not blanked again. Restarting the instance fixed the problem. ~ Tom.Reding (talk ⋅dgaf) 19:40, 22 December 2024 (UTC)
- See also User talk:Primefac/Archive 21 § Blanking, slightly different setup for skip checks. Primefac (talk) 12:56, 23 December 2024 (UTC)