Latest revision as of 15:32, 25 December 2024
This is the discussion page for the AutoWikiBrowser (AWB) project. It is also the place to discuss using the AWB program (for help, questions, or general inquiries about AWB). Specific guidelines on where to make particular reports or requests are provided in the § Before you post section below. Before asking a question, please read the § Frequently asked questions below.
Archives: Index 1, 2, 3, 4, 5, 6, 7, 8, 9, 10
This page has archives. Sections older than 30 days may be automatically archived by Lowercase sigmabot III.
Before you post
Do you want to ... | Please use
---|---
Report a bug or request a feature in AWB? | Check reported bugs on Phabricator before filing a new bug report. You do not need to create another account there; just log in with your global Wikimedia account. See this MediaWiki wiki page on how to report bugs and request features on Phabricator.
Report an incorrectly fixed typo? | Misplaced Pages talk:AutoWikiBrowser/Typos
Request approval to use AWB? | Misplaced Pages:Requests for permissions/AutoWikiBrowser
Ask a question about AWB or ask for help? | This page
Frequently asked questions
Discussion
AutoWikiBrowser and searching for pages with capture regex.
I'm looking for pages that have strings like ] So I'd like to search for these with something like \* * *\|\1 and while AWB does capture, it looks like that is only for internal, not for looking for them in the first place, is that something that wikipedia or AWB can do, or is this something where I need Cirrus or something else more powerful? Naraht (talk) 22:50, 24 November 2024 (UTC)
- Not sure what you are after. You have a specific string 'like' then use a generic form of string search entry. Just plain old search is reasonably powerful. If I search for (articles only) ~"Alpha Phi Alpha" I get 799 entries, for ~"Alpha Phi Alpha" insource:/\
- You have to put round brackets into the search string to tell the regex code what \1 is intended to match. I began a database scan for \* * *)\|\1 but quickly aborted it; there are tens of thousands of matches. Typical examples are ] and ]. I restricted the search to the names of Greek letters and dumped the results at User:John of Reading/X2 (permalink) -- John of Reading (talk) 08:10, 25 November 2024 (UTC)
- Thank you both. While I expect the majority of occurrences to be from a group of 9 fraternities and sororities, there are hundreds that are possibilities. John, that is exactly what I wanted; I expected to have to trim down some. What software is needed for that database scan, and is that something a non-admin user will have access to? Naraht (talk) 13:50, 25 November 2024 (UTC)
- @Naraht: I used AWB's "Database Scanner". AWB normally needs this permission but if you're only using it to create a list of articles, you can use it without logging in. BUT to use the database scanner, you'll need a copy of the text of Misplaced Pages on your hard drive - the file enwiki-20241120-pages-articles.xml.bz2 is a 20 Gigabyte download from partway down this page, and that has to be uncompressed to 102 Gigabytes before AWB can use it.
- If you post search requests on this page, I or someone else with a recent database dump will probably respond. -- John of Reading (talk) 14:07, 25 November 2024 (UTC)
- Right now, only 10 GB free on my personal hard drive. Maybe when I buy my next one. :) Again, thank you. Naraht (talk) 18:06, 25 November 2024 (UTC)
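For readers following the capture-group point above, here is a minimal Python sketch of a pattern where the round brackets define group 1 and `\1` must match the same text again. The exact pattern used in the thread is not recoverable from this page, so `REPEATED_LABEL` is a hypothetical stand-in that finds piped links whose label repeats the first word of the target (for example `[[Alpha Phi Alpha|Alpha]]`). The `scan_dump` helper is likewise an assumption, not how AWB's Database Scanner works: it streams a `.xml.bz2` dump directly with Python's standard library, so this script (unlike AWB) would not need the uncompressed copy on disk.

```python
import bz2
import re
import xml.etree.ElementTree as ET

# Hypothetical pattern: (\w+) is capture group 1, and \1 requires the
# link label to be exactly the text that group 1 matched.
REPEATED_LABEL = re.compile(r"\[\[(\w+) [^\]|]*\|\1\]\]")


def find_repeated_label_links(wikitext):
    """Return every piped link whose label equals the first word of its target."""
    return [m.group(0) for m in REPEATED_LABEL.finditer(wikitext)]


def scan_dump(path):
    """Yield (title, matches) for each page in a bz2-compressed XML dump.

    The file is decompressed on the fly while it is parsed.
    """
    with bz2.open(path, "rb") as stream:
        title = None
        for _event, elem in ET.iterparse(stream):
            tag = elem.tag.rsplit("}", 1)[-1]  # drop the XML namespace prefix
            if tag == "title":
                title = elem.text
            elif tag == "text":
                matches = find_repeated_label_links(elem.text or "")
                if matches:
                    yield title, matches
            elif tag == "page":
                elem.clear()  # release the page's text once it has been checked
```

Without the parentheses, `\1` has nothing to refer to, which is the problem described above. Note that the 20 Gigabyte download is still required, and a full pass over the dump this way would be slow; treat it as a sketch rather than a replacement for the Database Scanner.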
Redlinks
Hello.
I don't know if it's technically possible.
It's for the French wiki, but I think I have more help here.
Is it possible with this tool to remove all red links on a specific page, because red links are not admissible and never will be? For example here
If it's possible, can I have help with the process? Bordurie (talk) 13:01, 2 December 2024 (UTC)
- I'll have to dig through the code, but I don't think so... on the other hand, there's an "if template exists" function so there might be an "ifexists" in general. Primefac (talk) 21:00, 3 December 2024 (UTC)
- There is an API that lists the pages linked from a page, and another that will report whether pages in that list exist. AWB already has a mechanism for detecting and unlinking links (the de-duplication function). So all the main pieces are in place, although it would need a new option flag if you wanted to make it happen without interaction. Still, it would be a project, and couldn't be classed as maintenance. David Brooks (talk) 22:48, 4 December 2024 (UTC)
- Thinking about what I wrote: it should be feasible to do the edits in a Module (or possibly a Plugin). Stand by... David Brooks (talk) 15:28, 5 December 2024 (UTC)
- Thank you. Bordurie (talk) 09:46, 6 December 2024 (UTC)
- Daily stand-up: a module is working on some relatively short articles. Running into an undocumented limit in a MediaWiki API. Higher-priority commitments rn. David Brooks (talk) 19:33, 6 December 2024 (UTC)
- User:DavidBrooks/UndoRelinksModule; give it a spin. But it's getting late here so I may not respond right away. David Brooks (talk) 02:35, 7 December 2024 (UTC)
- If you've read it: yes, I know. I wrote it in a hurry so I'll probably be tweaking it for efficiency and robustness (and bugs :-O ). If you are interested in it, you may want to add the module to your watchlist. David Brooks (talk) 14:32, 7 December 2024 (UTC)
- Wow! You're a genius! It works! Bordurie (talk) 14:26, 10 December 2024 (UTC)
- Thank you! And, of course, I just found a bug (a link with a ' character in it, and possibly other punctuation). Please copy and reload the module. David Brooks (talk) 21:07, 10 December 2024 (UTC)
- Yes, I had noticed this bug. Thanks Bordurie (talk) 08:57, 11 December 2024 (UTC)
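The approach outlined in this thread (list the links on a page, find out which targets do not exist, then unlink those) can be sketched in Python as follows. This is not the AWB module linked above, and it makes several assumptions: `unlink_redlinks` and `normalize` are hypothetical names, the set of missing titles is assumed to have been obtained already (for example from the MediaWiki API), and the title normalization only covers underscores and first-letter capitalization. Namespaced links such as files and categories are not treated specially.

```python
import re

# Matches [[Target]] and [[Target|label]]; group 1 is the target, group 2 the label.
LINK = re.compile(r"\[\[([^\[\]|]+)(?:\|([^\[\]]*))?\]\]")


def normalize(title):
    """Approximate MediaWiki title normalization (assumption: first letter is case-insensitive)."""
    title = title.replace("_", " ").strip()
    return title[:1].upper() + title[1:]


def unlink_redlinks(wikitext, missing_titles):
    """Replace links to missing pages with their plain display text."""
    missing = {normalize(t) for t in missing_titles}

    def replace(match):
        target, label = match.group(1), match.group(2)
        page = target.split("#", 1)[0]  # ignore any section anchor
        if normalize(page) in missing:
            return label if label else target
        return match.group(0)  # existing page: leave the link untouched

    return LINK.sub(replace, wikitext)
```

Because the character class only excludes brackets and the pipe, titles containing an apostrophe or other punctuation go through the same path as any other title.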
Stub spacing
Tracked in Phabricator: Task T382578
The requirement for two blank lines before stubs has now been removed. See WP:STUBSPACING — GhostInTheMachine 08:54, 20 December 2024 (UTC)
- Does this mean there is or will be a new version of AWB? I'm currently using version 6.3.1.1. Kiwipete (talk) 22:26, 20 December 2024 (UTC)
- Due to (19 December 2024) changes to CSS on enwiki, output is now OK with two blank lines, one blank line or no blank line before a stub. AWB no longer needs to force two lines before a stub for enwiki. No idea about other wikis — GhostInTheMachine 23:50, 20 December 2024 (UTC)
Bot saving blank pages
Since there's not much information on this that I'm aware of, I think it's important to keep track of the circumstances under which this bug presents itself. Tom.Bot was running on Wikispecies nearly continuously for 2 weeks, from Dec 7 to Dec 21, and had made 492,026 successful saves in the same instance of AWB before it started intermittently saving blank pages, despite failing a "Skip if doesn't contain" check that I thought would help prevent this problem. Very shortly prior to that, I "Reset saved/skipped counts", which usually produces a large negative "Edits/min" value, which may or may not be related. Before restarting the AWB instance, I reran the bot on some of the blanked pages and they were not blanked again. Restarting the instance fixed the problem. ~ Tom.Reding (talk ⋅dgaf) 19:40, 22 December 2024 (UTC)
- See also User talk:Primefac/Archive 21 § Blanking, slightly different setup for skip checks. Primefac (talk) 12:56, 23 December 2024 (UTC)