=Discussion=
== Username change ==
Hello. This is Sam S, previously known as The Sharpest Lives. I had a username change and am requesting that this be changed in the AWB user list. Thanks. ] (]) 05:24, 8 November 2024 (UTC)
:{{done}}, noting that these sorts of requests are best handled at ]. ] (]) 12:50, 8 November 2024 (UTC)
== Re-up ARM64 build ==
I recently offered an unofficial build of AWB optimized for ARM64 PCs, like the new Microsoft Surface. I have a new build that will also run on a "normal" x86-64 PC (but see below). It is built using the latest source tree.
I think it runs faster than the official build, not only because it avoids the CPU's emulation overhead, which is less on Windows 11 than 10, but also because it does not have the emulation memory overhead, which is pretty extensive. For example, in my tests on Windows 11 the native ARM64 build uses 29.5 MB on startup, and 56.5 MB after loading the first article in a moderate-size list. The official build, x86 running emulated, uses 49 MB and 74.5 MB respectively.
Running on an x86-64 PC, this build runs in x64 mode, so it uses more memory than the official x86 build, but not by a huge amount (27.7 MB versus 22.3 MB on startup). Anyway, if you have an ARM64 box and want to check it out, it's on GitHub. ] (]) 03:57, 12 November 2024 (UTC)
== Black background on start? ==
Is there a CSS way to make the diff window background black when starting AWB? ] works well for diffs, but on startup and when switching settings/projects, it defaults to white. <b>~</b> <span style="font-family:Monotype Corsiva; font-size:16px;">] (] ⋅])</span> 10:58, 17 November 2024 (UTC)
== Is adding links cosmetic? ==
I've been adding links to citations (for example, changing <nowiki>{{cite news ... |work=RNZ ... }}</nowiki> to <nowiki>{{cite news ... |work=] ... }}</nowiki>), but someone has opposed this and suggested that it breaks number four of ], saying that it's a ]. As adding links to citations modifies the HTML of the page, I believe that it is not cosmetic, but I'd like a third opinion please to clear this up. ―<span style="font-family:Poppins, Helvetica, Sans-serif;">]</span> ] 03:58, 23 November 2024 (UTC)
:It's not cosmetic, but per rule 3, if someone is contesting it, it's on you to demonstrate consensus for it. ] (]) 04:15, 23 November 2024 (UTC)
::Not cosmetic, but I do agree that if a single page is being edited multiple times, that is a bit of an issue. ] (]) 13:17, 23 November 2024 (UTC)
: For the record, if consensus is needed for this, I endorse and support it. ] ] 15:34, 23 November 2024 (UTC)
::I'm fine with this; my primary suggestion is to stack these together in one AWB run so that RNZ and ... and ... and ... are all done at once. ] (]) 19:20, 23 November 2024 (UTC)
:Based on my interpretation of the rules, this seems totally acceptable, but again, you could consider running multiple tasks simultaneously or only adding links when typos are found. Have fun doing it. ] 19:49, 23 November 2024 (UTC)
:Personally I think it adds consistency and I'm a huge fan of it. It's an absolute improvement from my point of view. However, I do agree that it shouldn't be one edit at a time per page; all instances within the references should be added at once. ] (]) 18:49, 24 November 2024 (UTC)
::Thanks for the suggestions, everyone. I have started adding links to multiple different newspapers at once. ―<span style="font-family:Poppins, Helvetica, Sans-serif;">]</span> ] 08:14, 25 November 2024 (UTC)
Latest revision as of 15:32, 25 December 2024
AutoWikiBrowser 6.3.1.1
This is the discussion page for the AutoWikiBrowser (AWB) project. It is also the place to discuss using the AWB program (for help, questions, or general inquiries about AWB). Specific guidelines on where to make particular reports or requests are provided in the § Before you post section below. Before asking a question, please read the § Frequently asked questions below.
Archives
Index 1, 2, 3, 4, 5, 6, 7, 8, 9, 10
This page has archives. Sections older than 30 days may be automatically archived by Lowercase sigmabot III.
Before you post
Do you want to ... | Please use
---|---
Report a bug or request a feature in AWB? | Check reported bugs on Phabricator before filing a new bug report. You do not need to create another account there; just log in with your global Wikimedia account. See this MediaWiki wiki page on how to report bugs and request features on Phabricator.
Report an incorrectly fixed typo? | Misplaced Pages talk:AutoWikiBrowser/Typos
Request approval to use AWB? | Misplaced Pages:Requests for permissions/AutoWikiBrowser
Ask a question about AWB or ask for help? | This page
Frequently asked questions
Discussion
AutoWikiBrowser and searching for pages with capture regex.
I'm looking for pages that have strings like ] So I'd like to search for these with something like \* * *\|\1 and while AWB does capture, it looks like that is only for internal, not for looking for them in the first place, is that something that wikipedia or AWB can do, or is this something where I need Cirrus or something else more powerful? Naraht (talk) 22:50, 24 November 2024 (UTC)
- Not sure what you are after. You have a specific string 'like' then use a generic form of string search entry. Just plain old search is reasonably powerful. If I search for (articles only) ~"Alpha Phi Alpha" I get 799 entries, for ~"Alpha Phi Alpha" insource:/\
- You have to put round brackets into the search string to tell the regex code what \1 is intended to match. I began a database scan for \* * *)\|\1 but quickly aborted it; there are tens of thousands of matches. Typical examples are ] and ]. I restricted the search to the names of Greek letters and dumped the results at User:John of Reading/X2 (permalink) -- John of Reading (talk) 08:10, 25 November 2024 (UTC)
- Thank you both. While I expect the majority of occurrences to be from a group of 9 fraternities and sororities, there are hundreds that are possibilities. John, that is exactly what I wanted; I expected to have to trim down some. What software is needed for that database scan, and is that something a non-admin user will have access to? Naraht (talk) 13:50, 25 November 2024 (UTC)
- @Naraht: I used AWB's "Database Scanner". AWB normally needs this permission but if you're only using it to create a list of articles, you can use it without logging in. BUT to use the database scanner, you'll need a copy of the text of Misplaced Pages on your hard drive - the file enwiki-20241120-pages-articles.xml.bz2 is a 20 Gigabyte download from partway down this page, and that has to be uncompressed to 102 Gigabytes before AWB can use it. If you post search requests on this page, I or someone else with a recent database dump will probably respond. -- John of Reading (talk) 14:07, 25 November 2024 (UTC)
- Right now, only 10 GB free on my personal hard drive. Maybe when I buy my next one. :) again thank you.Naraht (talk) 18:06, 25 November 2024 (UTC)
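The point about round brackets and \1 can be illustrated with a short Python sketch. The pattern below is an assumption chosen for illustration (a piped wikilink whose label repeats its target), not the pattern used in the thread, and the sample text is made up.

```python
import re

# Hypothetical pattern, for illustration only: a piped wikilink whose label
# repeats its target, e.g. [[Alpha Phi|Alpha Phi]].
# The round brackets create group 1; \1 then requires the same text again.
REDUNDANT_PIPE = re.compile(r"\[\[\s*([^\[\]|]+?)\s*\|\s*\1\s*\]\]")


def find_redundant_pipes(wikitext: str) -> list[str]:
    """Return the targets of links written as [[X|X]]."""
    return [m.group(1) for m in REDUNDANT_PIPE.finditer(wikitext)]


# Made-up sample text: only the first link repeats its target as the label.
sample = "See [[Alpha Phi|Alpha Phi]], [[Beta Chi|the chapter]] and [[Gamma]]."
print(find_redundant_pipes(sample))
```

Without the capturing group there is nothing for \1 to refer to, which is why the brackets are required.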
Redlinks
Hello.
I don't know if it's technically possible.
It's for the French wiki, but I think I have more help here.
Is it possible with this tool to remove all red links on a specific page, because red links are not admissible and never will be? For example here
If it's possible, can I have help with the process? Bordurie (talk) 13:01, 2 December 2024 (UTC)
- I'll have to dig through the code, but I don't think so... on the other hand, there's an "if template exists" function so there might be an "ifexists" in general. Primefac (talk) 21:00, 3 December 2024 (UTC)
- There is an API that lists the pages linked from a page, and another that will report whether pages in that list exist. AWB already has a mechanism for detecting and unlinking links (the de-duplication function). So all the main pieces are in place, although it would need a new option flag if you wanted to make it happen without interaction. Still, it would be a project, and couldn't be classed as maintenance. David Brooks (talk) 22:48, 4 December 2024 (UTC)
- Thinking about what I wrote: it should be feasible to do the edits in a Module (or possibly a Plugin). Stand by... David Brooks (talk) 15:28, 5 December 2024 (UTC)
- Thank you. Bordurie (talk) 09:46, 6 December 2024 (UTC)
- Daily stand-up: a module is working on some relatively short articles. Running into an undocumented limit in a MediaWiki API. Higher-priority commitments rn. David Brooks (talk) 19:33, 6 December 2024 (UTC)
- User:DavidBrooks/UndoRelinksModule; give it a spin. But it's getting late here so I may not respond right away. David Brooks (talk) 02:35, 7 December 2024 (UTC)
- If you've read it: yes, I know. I wrote it in a hurry so I'll probably be tweaking it for efficiency and robustness (and bugs :-O ). If you are interested in it, you may want to add the module to your watchlist. David Brooks (talk) 14:32, 7 December 2024 (UTC)
- Wow! You're a genius! It works! Bordurie (talk) 14:26, 10 December 2024 (UTC)
- Thank you! And, of course, I just found a bug (a link with a ' character in it, and possibly other punctuations). Please copy and reload the module. David Brooks (talk) 21:07, 10 December 2024 (UTC)
- Yes, I had noticed this bug. Thanks Bordurie (talk) 08:57, 11 December 2024 (UTC)
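The approach described in this thread (list a page's links, find which targets do not exist, unlink them) can be sketched in Python. This is a rough illustration, not the code of the module mentioned above. It assumes the link list was fetched from the MediaWiki Action API with something like `action=query&generator=links&formatversion=2`, where non-existent pages carry a `missing` flag; the helper names and sample data are hypothetical.

```python
import re

# [[Target]] or [[Target|label]]; targets may contain ' and other punctuation.
LINK = re.compile(r"\[\[([^\[\]|]+)(?:\|([^\[\]]*))?\]\]")


def missing_titles(response: dict) -> set[str]:
    """Collect titles flagged as missing in a formatversion=2 query response."""
    pages = response.get("query", {}).get("pages", [])
    return {p["title"] for p in pages if p.get("missing")}


def normalize(title: str) -> str:
    """Simplified title normalization: underscores, whitespace, first letter."""
    t = title.replace("_", " ").strip()
    return t[:1].upper() + t[1:]


def unlink_red(wikitext: str, red: set[str]) -> str:
    """Replace links to missing pages with their visible text."""
    red_norm = {normalize(t) for t in red}

    def repl(m: re.Match) -> str:
        target, label = m.group(1), m.group(2)
        if normalize(target) in red_norm:
            return label if label else target.strip()
        return m.group(0)  # leave existing pages linked

    return LINK.sub(repl, wikitext)


# Made-up response and text, shaped like a formatversion=2 result.
response = {"query": {"pages": [
    {"pageid": 12, "title": "Real page"},
    {"title": "No such page", "missing": True},
    {"title": "L'Exemple", "missing": True},
]}}
text = "A [[Real page]], a [[no such page|red link]] and [[L'Exemple]]."
print(unlink_red(text, missing_titles(response)))
```

The normalization here is deliberately minimal; real titles also involve namespaces and interwiki prefixes, which this sketch does not handle.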
Stub spacing
Tracked in Phabricator: Task T382578
The requirement for two blank lines before stubs has now been removed. See WP:STUBSPACING — GhostInTheMachine 08:54, 20 December 2024 (UTC)
- Does this mean there is or will be a new version of AWB? I'm currently using version 6.3.1.1. Kiwipete (talk) 22:26, 20 December 2024 (UTC)
- Due to (19 December 2024) changes to CSS on enwiki, output is now OK with two blank lines, one blank line or no blank line before a stub. AWB no longer needs to force two lines before a stub for enwiki. No idea about other wikis — GhostInTheMachine 23:50, 20 December 2024 (UTC)
Bot saving blank pages
Since there's not much information on this that I'm aware of, I think it's important to keep track of the circumstances under which this bug presents itself. Tom.Bot was running on Wikispecies nearly continuously for 2 weeks, from Dec 7 to Dec 21, and made 492,026 successful saves in the same instance of AWB before it started intermittently saving blank pages, despite failing a "Skip if doesn't contain" check that I thought would help prevent this problem. Very shortly before that, I used "Reset saved/skipped counts", which usually produces a large negative "Edits/min" value; this may or may not be related. Before restarting the AWB instance, I reran the bot on some of the blanked pages and they were not blanked again. Restarting the instance fixed the problem. ~ Tom.Reding (talk ⋅dgaf) 19:40, 22 December 2024 (UTC)
- See also User talk:Primefac/Archive 21 § Blanking, slightly different setup for skip checks. Primefac (talk) 12:56, 23 December 2024 (UTC)
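For anyone scripting their own bot outside AWB, a defensive pre-save check along these lines might catch blank saves regardless of the cause. This is a hypothetical guard, not AWB's actual skip logic; the function and parameter names are made up.

```python
def should_save(old_text: str, new_text: str, must_contain: str = "") -> bool:
    """Hypothetical pre-save guard; not AWB's actual logic."""
    if not new_text.strip():
        return False  # never save an empty or whitespace-only page
    if must_contain and must_contain not in new_text:
        return False  # mirrors the idea of a "skip if doesn't contain" check
    return new_text != old_text  # skip null edits
```

Checking the text that is about to be saved, rather than the text that was loaded, is the design choice that matters here: a blank result is rejected even if the original page passed every skip check.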