Revision as of 20:21, 8 December 2005 editBluemoose (talk | contribs)29,151 editsm moved User talk:Bluemoose/Robot to User talk:Bluemoose/AutoWikiBrowser← Previous edit | Latest revision as of 15:32, 25 December 2024 edit undoLowercase sigmabot III (talk | contribs)Bots, Template editors2,293,067 editsm Archiving 2 discussion(s) to Misplaced Pages talk:AutoWikiBrowser/Archive 34) (bot | ||
Line 1: | Line 1: | ||
{{AWB|notes={{Clickable button 2|Start a new discussion|url={{fullurl:Misplaced Pages talk:AutoWikiBrowser|action=edit§ion=new&dtenable=1}}|class=mw-ui-progressive|style=border-radius:4px; font-size:150%;}}}} | |||
This is the discussion page for the AutoWikiBrowser (AWB) project. It is also the place to discuss using the AWB program (for help, questions, or general inquiries about AWB). Specific guidelines on where to make particular reports or requests are provided in the ''']''' section below. Before asking a question, please refer to the '''read the ]''' below. | |||
{{archive box|auto=yes|search=yes|bot=lowercase sigmabot III|age=30}} | |||
{{User:HBC Archive Indexerbot/OptIn|target=Misplaced Pages talk:AutoWikiBrowser/Archive index|mask=Misplaced Pages talk:AutoWikiBrowser/Archive <#>|leading_zeros=0|indexhere=yes}} | |||
{{User:MiszaBot/config | |||
|maxarchivesize = 250K | |||
|counter = 34 | |||
|algo = old(30d) | |||
|archive = Misplaced Pages talk:AutoWikiBrowser/Archive %(counter)d | |||
| archiveheader = {{Talk archive navigation}} | |||
}} | |||
{{toc left}} | |||
{{clear}} | |||
= Before you post = | |||
{| class=wikitable | |||
|- | |||
! Do you want to ... !! Please use | |||
|- | |||
| Report a bug or request a feature in AWB? || before . You do not need to create another account there; just log in with your global Wikimedia account. ] on how to report bugs and request features on Phabricator. | |||
{{collapse top|1=Report a bug}} | |||
Try to report bugs in the {{em|current}} version of the software. Update to the most recent version and check to make sure your bug has not already been reported on Phabricator. See for advice on how to write bug reports. | |||
Before posting anything related to non-], verify that the site is running a ''recent version'' of MediaWiki with enabled ]. Older versions of MediaWiki or without the Bot API are <em style="color:red;">not supported</em>. Be sure to mention the ''exact'' URL of your wiki. | |||
{{collapse bottom}} | |||
{{collapse top|Request a feature details}} | |||
Please use the feature request button to submit new feature requests. This format helps the developers track and manage requests efficiently. Before submitting, take a moment to search the archives—both and —to see if a similar request has already been discussed. | |||
{{collapse bottom}} | |||
|- | |||
| Report an incorrectly fixed typo? || ] | |||
|- | |||
| Request approval to use AWB? || ] | |||
|- | |||
| Ask a question about AWB or ask for help? || This page | |||
|}<!-- ] 00:55, 4 April 2114 (UTC) --> | |||
= Frequently asked questions = | |||
{{anchor|FAQ}}<!-- ] 00:55, 4 April 2114 (UTC) --> | |||
{{collapse top|title=Frequently asked questions}} | |||
{{Misplaced Pages talk:AutoWikiBrowser/FAQ}} | |||
{{collapse bottom}} | |||
=Discussion= | |||
== AutoWikiBrowser and searching for pages with capture regex. == | |||
I'm looking for pages that have strings like <nowiki>]</nowiki> So I'd like to search for these with something like <nowiki>\* * *\|\1</nowiki> and while AWB does capture, it looks like that is only for internal, not for looking for them in the first place, is that something that wikipedia or AWB can do, or is this something where I need Cirrus or something else more powerful? ] (]) 22:50, 24 November 2024 (UTC) | |||
:Not sure what you are after. You have a specific string 'like' then use a generic form of string search entry. Just plain old search is reasonably powerful. If I search for (articles only) <i>~"Alpha Phi Alpha"</i> I get 799 entries, for <i>~"Alpha Phi Alpha" insource:/\] (]) 23:57, 24 November 2024 (UTC) | |||
:You have to put round brackets into the search string to tell the regex code what <code>\1</code> is intended to match. I began a database scan for <code><nowiki>\* * *)\|\1</nowiki></code> but quickly aborted it; there are tens of thousands of matches. Typical examples are <code><nowiki>]</nowiki></code> and <code><nowiki>]</nowiki></code>. I restricted the search to the names of Greek letters and dumped the results at ] (]) -- ] (]) 08:10, 25 November 2024 (UTC) | |||
::Thank you both. While I expect the majority of occurances to be from a group of 9 Fraternites and Sororities, there are hundreds that are possibilities. John, that is exactly what I wanted, I expected to have to trim down some. What software is needed for that Database scan and is that something a non-admin user will have access to?] (]) 13:50, 25 November 2024 (UTC) | |||
:::{{Re|Naraht}} I used AWB's "]". AWB normally needs ] but if you're only using it to create a list of articles, you can use it without logging in. BUT to use the database scanner, you'll need a copy of the text of Misplaced Pages on your hard drive - the file <code>enwiki-20241120-pages-articles.xml.bz2</code> is a 20 Gigabyte download from , and that has to be uncompressed to 102 Gigabytes before AWB can use it. | |||
:::If you post search requests on this page, I or someone else with a recent database dump will probably respond. -- ] (]) 14:07, 25 November 2024 (UTC) | |||
::::Right now, only 10 GB free on my personal hard drive. Maybe when I buy my next one. :) again thank you.] (]) 18:06, 25 November 2024 (UTC) | |||
== Redlinks == | |||
Hello. | |||
I don't know if it's technically possible. | |||
It's for the French wiki, but I think I have more help here. | |||
Is it possible with this tool to remove all red links on a specific page, because red links are not admissible and never will be? For example | |||
If it's possible, can I have help with the process? ] (]) 13:01, 2 December 2024 (UTC) | |||
:I'll have to dig through the , but I don't think so... on the other hand, there's an "if template exists" function so there might be an "ifexists" in general. ] (]) 21:00, 3 December 2024 (UTC) | |||
::There is an API that lists the pages linked from a page, and another that will report whether pages in that list exist. AWB already has a mechanism for detecting and unlinking links (the de-duplication function). So all the main pieces are in place, although it would need a new option flag if you wanted to make it happen without interaction. Still, it would be a project, and couldn't be classed as maintenance. ] (]) 22:48, 4 December 2024 (UTC) | |||
:::Thinking about what I wrote: it should be feasible to do the edits in a Module (or possibly a Plugin). Stand by... ] (]) 15:28, 5 December 2024 (UTC) | |||
::::Thank you. ] (]) 09:46, 6 December 2024 (UTC) | |||
:::::Daily stand-up: a module is working on some relatively short articles. Running into an undocumented limit in a MediaWiki API. Higher-priority commitments rn. ] (]) 19:33, 6 December 2024 (UTC) | |||
::::::]; give it a spin. But it's getting late here so I may not respond right away. ] (]) 02:35, 7 December 2024 (UTC) | |||
:::::::If you've read it: yes, I know. I wrote it in a hurry so I'll probably be tweaking it for efficiency and robustness (and bugs :-O ). If you are interested in it, you may want to add the module to your watchlist. ] (]) 14:32, 7 December 2024 (UTC) | |||
:::::::: Wow! You're a genius! ] (]) 14:26, 10 December 2024 (UTC) | |||
::::::::: Thank you! And, of course, I just found a bug (a link with a ' character in it, and possibly other punctuations). Please copy and reload the module. ] (]) 21:07, 10 December 2024 (UTC) | |||
:::::::::: Yes, I had noticed this bug. Thanks ] (]) 08:57, 11 December 2024 (UTC) | |||
== Stub spacing == | |||
{{Tracked|T382578}} | |||
The requirement for two blank lines before stubs has now been removed. See ] — ] <sup>]</sup> 08:54, 20 December 2024 (UTC) | |||
:Does this mean there is or will be a new version of AWB? I'm currently using version 6.3.1.1. ] (]) 22:26, 20 December 2024 (UTC) | |||
:: Due to (19 December 2024) changes to CSS on enwiki, output is now OK with two blank lines, one blank line or no blank line before a stub. AWB no longer ''needs'' to force two lines before a stub for enwiki. No idea about other wikis — ] <sup>]</sup> 23:50, 20 December 2024 (UTC) | |||
== Bot saving blank pages == | |||
Since there's not much information on this that I'm aware of, I think it's important to keep track of the circumstances that this bug presents itself. Tom.Bot was running on Wikispecies nearly continuously for 2 weeks, from Dec 7 to Dec 21, after 492,026 successful saves in the same instance of AWB before it started intermittently saving blank pages, despite failing a "Skip if doesn't contain" check that I thought would help prevent this problem. Very shortly prior to that, I "Reset saved/skipped counts", which usually produces a large negative "Edits/min" value, which may or may not be related. Before restarting the AWB instance, I reran the bot on some of the blanked pages and they were not blanked again. Restarting the instance fixed the problem. <b>~</b> <span style="font-family:Monotype Corsiva; font-size:16px;">] (] ⋅])</span> 19:40, 22 December 2024 (UTC) | |||
:See also {{slink|User_talk:Primefac/Archive_21#Blanking}}, slightly different setup for skip checks. ] (]) 12:56, 23 December 2024 (UTC) |
Latest revision as of 15:32, 25 December 2024
AutoWikiBrowser 6.3.1.1- Home
Introduction and rules - User manual
How to use AWB - Discussion
Discuss AWB, report errors, and request features - User tasks
Request or help with AWB-able tasks - Technical
Technical documentation
- Changelog
- Developer discussion
- Modules
- Regular expression
- Sandbox
- Template redirects
- Typos
- Usage stats
- Userbox
This is the discussion page for the AutoWikiBrowser (AWB) project. It is also the place to discuss using the AWB program (for help, questions, or general inquiries about AWB). Specific guidelines on where to make particular reports or requests are provided in the § Before you post section below. Before asking a question, please refer to the read the § Frequently asked questions below.
Archives |
Index 1, 2, 3, 4, 5, 6, 7, 8, 9, 10 |
This page has archives. Sections older than 30 days may be automatically archived by Lowercase sigmabot III. |
Before you post
Do you want to ... | Please use | ||||
---|---|---|---|---|---|
Report a bug or request a feature in AWB? | Check reported bugs on Phabricator before filing a new bug report. You do not need to create another account there; just log in with your global Wikimedia account. See this MediaWiki wiki page on how to report bugs and request features on Phabricator.
| ||||
Report an incorrectly fixed typo? | Misplaced Pages talk:AutoWikiBrowser/Typos | ||||
Request approval to use AWB? | Misplaced Pages:Requests for permissions/AutoWikiBrowser | ||||
Ask a question about AWB or ask for help? | This page |
Frequently asked questions
Frequently asked questions |
---|
//Detect IE5.5+ if (navigator.appVersion.indexOf("MSIE")==-1) { // Previous contents go here .... }
|
Discussion
AutoWikiBrowser and searching for pages with capture regex.
I'm looking for pages that have strings like ] So I'd like to search for these with something like \* * *\|\1 and while AWB does capture, it looks like that is only for internal, not for looking for them in the first place, is that something that wikipedia or AWB can do, or is this something where I need Cirrus or something else more powerful? Naraht (talk) 22:50, 24 November 2024 (UTC)
- Not sure what you are after. You have a specific string 'like' then use a generic form of string search entry. Just plain old search is reasonably powerful. If I search for (articles only) ~"Alpha Phi Alpha" I get 799 entries, for ~"Alpha Phi Alpha" insource:/\
- You have to put round brackets into the search string to tell the regex code what
\1
is intended to match. I began a database scan for\* * *)\|\1
but quickly aborted it; there are tens of thousands of matches. Typical examples are]
and]
. I restricted the search to the names of Greek letters and dumped the results at User:John of Reading/X2 (permalink) -- John of Reading (talk) 08:10, 25 November 2024 (UTC)- Thank you both. While I expect the majority of occurances to be from a group of 9 Fraternites and Sororities, there are hundreds that are possibilities. John, that is exactly what I wanted, I expected to have to trim down some. What software is needed for that Database scan and is that something a non-admin user will have access to?Naraht (talk) 13:50, 25 November 2024 (UTC)
- @Naraht: I used AWB's "Database Scanner". AWB normally needs this permission but if you're only using it to create a list of articles, you can use it without logging in. BUT to use the database scanner, you'll need a copy of the text of Misplaced Pages on your hard drive - the file
enwiki-20241120-pages-articles.xml.bz2
is a 20 Gigabyte download from partway down this page, and that has to be uncompressed to 102 Gigabytes before AWB can use it. - If you post search requests on this page, I or someone else with a recent database dump will probably respond. -- John of Reading (talk) 14:07, 25 November 2024 (UTC)
- Right now, only 10 GB free on my personal hard drive. Maybe when I buy my next one. :) again thank you.Naraht (talk) 18:06, 25 November 2024 (UTC)
- @Naraht: I used AWB's "Database Scanner". AWB normally needs this permission but if you're only using it to create a list of articles, you can use it without logging in. BUT to use the database scanner, you'll need a copy of the text of Misplaced Pages on your hard drive - the file
- Thank you both. While I expect the majority of occurances to be from a group of 9 Fraternites and Sororities, there are hundreds that are possibilities. John, that is exactly what I wanted, I expected to have to trim down some. What software is needed for that Database scan and is that something a non-admin user will have access to?Naraht (talk) 13:50, 25 November 2024 (UTC)
Redlinks
Hello.
I don't know if it's technically possible.
It's for the French wiki, but I think I have more help here.
Is it possible with this tool to remove all red links on a specific page, because red links are not admissible and never will be? For example here
If it's possible, can I have help with the process? Bordurie (talk) 13:01, 2 December 2024 (UTC)
- I'll have to dig through the code, but I don't think so... on the other hand, there's an "if template exists" function so there might be an "ifexists" in general. Primefac (talk) 21:00, 3 December 2024 (UTC)
- There is an API that lists the pages linked from a page, and another that will report whether pages in that list exist. AWB already has a mechanism for detecting and unlinking links (the de-duplication function). So all the main pieces are in place, although it would need a new option flag if you wanted to make it happen without interaction. Still, it would be a project, and couldn't be classed as maintenance. David Brooks (talk) 22:48, 4 December 2024 (UTC)
- Thinking about what I wrote: it should be feasible to do the edits in a Module (or possibly a Plugin). Stand by... David Brooks (talk) 15:28, 5 December 2024 (UTC)
- Thank you. Bordurie (talk) 09:46, 6 December 2024 (UTC)
- Daily stand-up: a module is working on some relatively short articles. Running into an undocumented limit in a MediaWiki API. Higher-priority commitments rn. David Brooks (talk) 19:33, 6 December 2024 (UTC)
- User:DavidBrooks/UndoRelinksModule; give it a spin. But it's getting late here so I may not respond right away. David Brooks (talk) 02:35, 7 December 2024 (UTC)
- If you've read it: yes, I know. I wrote it in a hurry so I'll probably be tweaking it for efficiency and robustness (and bugs :-O ). If you are interested in it, you may want to add the module to your watchlist. David Brooks (talk) 14:32, 7 December 2024 (UTC)
- Wow! You're a genius! It works! Bordurie (talk) 14:26, 10 December 2024 (UTC)
- Thank you! And, of course, I just found a bug (a link with a ' character in it, and possibly other punctuations). Please copy and reload the module. David Brooks (talk) 21:07, 10 December 2024 (UTC)
- Yes, I had noticed this bug. Thanks Bordurie (talk) 08:57, 11 December 2024 (UTC)
- Thank you! And, of course, I just found a bug (a link with a ' character in it, and possibly other punctuations). Please copy and reload the module. David Brooks (talk) 21:07, 10 December 2024 (UTC)
- Wow! You're a genius! It works! Bordurie (talk) 14:26, 10 December 2024 (UTC)
- If you've read it: yes, I know. I wrote it in a hurry so I'll probably be tweaking it for efficiency and robustness (and bugs :-O ). If you are interested in it, you may want to add the module to your watchlist. David Brooks (talk) 14:32, 7 December 2024 (UTC)
- User:DavidBrooks/UndoRelinksModule; give it a spin. But it's getting late here so I may not respond right away. David Brooks (talk) 02:35, 7 December 2024 (UTC)
- Daily stand-up: a module is working on some relatively short articles. Running into an undocumented limit in a MediaWiki API. Higher-priority commitments rn. David Brooks (talk) 19:33, 6 December 2024 (UTC)
- Thank you. Bordurie (talk) 09:46, 6 December 2024 (UTC)
- Thinking about what I wrote: it should be feasible to do the edits in a Module (or possibly a Plugin). Stand by... David Brooks (talk) 15:28, 5 December 2024 (UTC)
- There is an API that lists the pages linked from a page, and another that will report whether pages in that list exist. AWB already has a mechanism for detecting and unlinking links (the de-duplication function). So all the main pieces are in place, although it would need a new option flag if you wanted to make it happen without interaction. Still, it would be a project, and couldn't be classed as maintenance. David Brooks (talk) 22:48, 4 December 2024 (UTC)
Stub spacing
Tracked in PhabricatorTask T382578
The requirement for two blank lines before stubs has now been removed. See WP:STUBSPACING — GhostInTheMachine 08:54, 20 December 2024 (UTC)
- Does this mean there is or will be a new version of AWB? I'm currently using version 6.3.1.1. Kiwipete (talk) 22:26, 20 December 2024 (UTC)
- Due to (19 December 2024) changes to CSS on enwiki, output is now OK with two blank lines, one blank line or no blank line before a stub. AWB no longer needs to force two lines before a stub for enwiki. No idea about other wikis — GhostInTheMachine 23:50, 20 December 2024 (UTC)
Bot saving blank pages
Since there's not much information on this that I'm aware of, I think it's important to keep track of the circumstances that this bug presents itself. Tom.Bot was running on Wikispecies nearly continuously for 2 weeks, from Dec 7 to Dec 21, after 492,026 successful saves in the same instance of AWB before it started intermittently saving blank pages, despite failing a "Skip if doesn't contain" check that I thought would help prevent this problem. Very shortly prior to that, I "Reset saved/skipped counts", which usually produces a large negative "Edits/min" value, which may or may not be related. Before restarting the AWB instance, I reran the bot on some of the blanked pages and they were not blanked again. Restarting the instance fixed the problem. ~ Tom.Reding (talk ⋅dgaf) 19:40, 22 December 2024 (UTC)
- See also User talk:Primefac/Archive 21 § Blanking, slightly different setup for skip checks. Primefac (talk) 12:56, 23 December 2024 (UTC)