Wikimedia Foundation's plans to introduce AI-generated article summaries to Wikipedia

antonim@lemmy.dbzer0.com · edit-2 9 days ago

Wikimedia Foundation's plans to introduce AI-generated article summaries to Wikipedia

RaoulDook@lemmy.world · 15 days ago

If people use AI to summarize passages of written words to be simpler for those with poor reading skills to be able to more easily comprehend the words, then how are those readers going to improve their poor reading skills?

Dumbing things down with AI isn’t going to make people smarter I bet. This seems like accelerating into Idiocracy

FooBarrington@lemmy.world · 15 days ago

[…] then how are those readers going to improve their poor reading skills?

By becoming interested in improving their poor reading skills. You won’t make people become interested in that by having everything available only in complex language, it’s just going to make them skip over your content. Otherwise there shouldn’t be people with poor reading skills, since complex language is already everywhere in life.

RaoulDook@lemmy.world · 15 days ago

Nope. Reading skills are improved by being challenged by complex language, and the effort required to learn new words to comprehend it. If the reader is interested in the content, they aren’t going to skip it. Dumbing things down only leads to dumbing things down.

For example, look at all the iPad kids who can’t use a computer for shit. Kids who grew up with computers HAD to learn the more complex interface of computers to be able to do the cool things they wanted to do on the computer. Now they don’t because they don’t have to. Therefore if you get everything dumbed down to 5th Grade reading level, that’s where the common denominator will settle. Overcoming that apathy requires a challenge to be a barrier to entry.

FooBarrington@lemmy.world · edit-2 15 days ago

If the reader is interested in the content, they aren’t going to skip it.

But they aren’t interested in the content because of the complexity. You may wish that humans work like you describe, but we literally see that they don’t.

What you can do is provide a simplified summary to make people interested, so they’re willing to engage with the more complex language to get deeper knowledge around the topic.

For example, look at all the iPad kids who can’t use a computer for shit. Kids who grew up with computers HAD to learn the more complex interface of computers to be able to do the cool things they wanted to do on the computer.

You’re underestimating how many people before the iPad generation also can’t use computers because they never developed an interest to engage with the complexity.

vermaterc@lemmy.ml · 15 days ago

Wikipedia is not made to teach people how to read, it is meant to share knowledge. For me, they could even make Wikipedia version with hieroglyphics if that would make understanding content easier

RaoulDook@lemmy.world · 15 days ago

Novels are also not made to teach people how to read, but reading them does help the reader practice their reading skills. Beside that point, Wikipedia is not hard to understand in the first place.

FooBarrington@lemmy.world · 15 days ago

Sorry, but that’s absolutely wrong - the complexity of articles can vary wildly. Many are easily understandable, while many others are not understandable without a lot of prerequisite knowledge in the domain (e.g. mathematics stuff).

A Wild Mimic appears!@lemmy.dbzer0.com · 14 days ago

I am not a native speaker, but my knowledge of the english language is better than most people i know, having no issues reading scientific papers and similar complex documents. Some wikipedia article intros, especially in the mathematics, are not comprehensible for anyone but mathematicians, and therefore fail the objective to give the average person an overview of the material.

It’s fine for me if i am not able to grasp the details of the article because of missing prerequisite knowledge (and i know how to work with integrals and complex numbers!), but the intro should at least not leave me wondering what the article is about.

Dr. Moose@lemmy.world · 14 days ago

Do you give toddlers post-grad books to read too? This is such an idiotic slippery slope fallacy that it just reeks of white people privilege.

Matriks404@lemmy.world · edit-2 14 days ago

TIL: Wikipedia uses complex language.

It might just be me, but I find articles written on Wikipedia much more easier to read than shit sometimes people write or speak to me. Sometimes it is incomprehensible garbage, or without much sense.

blarghly@lemmy.world · 13 days ago

You’ve clearly never tried to use Wikipedia to help with your math homework

Matriks404@lemmy.world · 13 days ago

I never did any homework unless absolutely necessary.

Now I understand that I should have done it, because I am not good at learning shit in classrooms where there is bunch of people who distract me and I don’t learn anything that way. Only many years later I found out that for most things it’s best for me to study alone.

That said, you are most probably right, because I have opened some math-related Wikipedia articles at some point, and they were pretty incomprehensible to me.

baatliwala@lemmy.world · 14 days ago

I’m from a country where English isn’t the primary language, people tend to find many aspects of English complex

Matriks404@lemmy.world · 14 days ago

I am also from a country that English is not widely spoken, in fact most people are not able to make a simple conversation (they will tell you they know ““basic English”” though).

I still find it easier to read Wikipedia articles in English, than than understand some relatives, because they never precisely say what the fuck they want from me. One person even say such incomprehensible shit, that I am thinking their brain is barely functional.

ace_garp@lemmy.world · 15 days ago

These LLM-page-summaries need to be contained and linked, completely separately, in something like llm.wikipedia.org or ai.wikipedia.org.

In a possible future case, that a few LLM hallucinations have been uncovered in these summaries, it would cast doubts about the accuracy of all page content in the project.

Keep the generated-summaries visibly distinct from user created content.

coolmojo@lemmy.world · 15 days ago

Is this the same WiliMedia Foundation who was complaining about AI scrapers in April?

AbouBenAdhem@lemmy.world · 15 days ago

IIRC, they weren’t trying to stop them—they were trying to get the scrapers to pull the content in a more efficient format that would reduce the overhead on their web servers.

Lv_InSaNe_vL@lemmy.world · 15 days ago

You can literally just download all of Wikipedia in one go from one URL. They would rather people just do that instead of crawling their entire website because that puts a huge load on their servers.

vermaterc@lemmy.ml · 15 days ago

I’m ok with auto generated content, but only if it is clearly separated from human generated content, can be disabled at any time and writing main articles with AI is forbidden

markovs_gun@lemmy.world · 14 days ago

Wikipedia articles are already quite simplified down overviews for most topics. I really don’t like the direction of the world where people are reading summaries of summaries and mistaking that for knowledge. The only time I have ever found AI summaries useful is for complex legal documents and low-importance articles where it is clear the author’s main goal was SEO rather than concise and clear information transfer.

KnitWit@lemmy.world · 15 days ago

Never thought I’d cancel my recurring donation for them, but just sent the email. I hope they change their mind on this, but as I told them, I will not support this.

BrianTheeBiscuiteer@lemmy.world · 15 days ago

I do have concerns about this but it’s really all about the usage, not the AI itself. Would the AI version be the only version allowed? Would the summaries get created on the fly for every visitor? Would edits to an AI summary be allowed? Would this get applied to and alter existing summaries?

I’m totally fine with LLMs and AI as a stop-gap for missing info or a way to coach or critique a human-written summary, but generally I haven’t seen good results when AI is allowed to do its thing without a human reviewing or guiding the outputs.

Deflated0ne@lemmy.world · 15 days ago

wpb@lemmy.world · 14 days ago

It’s kind of indirectly related, but adding a query parameter udm=14 to the url of your Google searches removes the AI summary at the top, and there are plugins for Firefox that do this for you. My hopes for this WM project are that similar plugins will be possible for Wikipedia.

The annoying thing about these summaries is that even for someone who cares about the truth, and gathering actual information, rather than the fancy autocomplete word salad that LLMs generate, it is easy to “fall for it” and end up reading the LLM summary. Usually I catch myself, but I often end up wasting some time reading the summary. Recently the non-information was so egregiously wrong (it called a certain city in Israel non-apartheid), that I ended up installing the udm 14 plugin.

In general, I think the only use cases for fancy autocomplete are where you have a way to verify the answer. For example, if you need to write an email and can’t quite find the words, if an LLM generates something, you will be able to tell whether it conveys what you’re trying to say by reading it. Or in case of writing code, if you’ve written a bunch of tests beforehand expressing what the code needs to do, you can run those on the code the LLM generates and see if it works (if there’s a Dijkstra quote that comes to your mind reading this: high five, I’m thinking the same thing).

I think it can be argued that Wikipedia articles satisfy this criterion. All you need to do to verify the summary is read the article. Will people do this? I can only speak for myself, and I know that, despite my best intentions, sometimes I won’t. If that’s anything to go by, I think these summaries will make the world a worse place.

Redex@lemmy.world · edit-2 14 days ago

Honestly, I think it’s a good idea. As long as it’s clearly highlighted that “this is an AI generated summary”, it could be very useful. I feel like a lot of people here have never tried to e.g. read a maths article without having a PHD in mathematics. I would often find myself trying to remember what a term means or how it works in practice, only to be met by a giant article going into extreme technical detail that I for the life of me cannot understand, but if I were to ask ChatGPT to explain it I would immediately get it.

JandroDelSol@lemmy.world · 14 days ago

People will believe the AI summary without reading the article, and AI hallucinates constantly. Never trust an output from a LLM

Scotty_Trees@lemmy.world · 14 days ago

Relax, this is not the doom and gloom some of y’all think this is and that is pretty telling.

UnderpantsWeevil@lemmy.world · 14 days ago

Given the degree to which the modern day Wiki mods jump on to every edit and submission like a pack of starved lions, unleashing a computer to just pump out vaguely human-sounding word salad sounds like a bad enough idea on its face.

If the AI is being given priority over the editors and mods, it sounds even worse. All of that human labor, the endless back-and-forth in the Talk sections, arguing over the precise phrasing or the exact validity of sources or the relevancy of newly released information… and we’re going to occlude it with the half-wit remarks of a glorified chatbot?

Woof. Enshittification really coming for us all.

antonim@lemmy.dbzer0.com · 14 days ago

Yeah, the catastrophic comments do take it too far… WMF has already announced they’re putting it on hold, so at the very least there’s a lot of discussion with the editors and additional work that will have to happen before this launches - if it ever launches.

Dr. Moose@lemmy.world · edit-2 14 days ago

AI threads on lemmy are always such a disappointment.

Its ironic that people put so little thought into understanding this and complain about “ai slop”. The slop was in your heads all along.

To think that more accessibility for a project that is all about sharing information with people to whom information is least accessible is a bad thing is just an incredible lack of awareness.

Its literally the opposite of everything people might hate AI for:

RAG is very good and accurate these days that doesn’t invent stuff. Especially for short content like wiki articles. I work with RAG almost every day and never seen it hallucinate with big models.
it’s open and not run a “big scary tech”
it’s free for all and would save millions of editor hours and allow more accuracy and complexity in the articles themselves.

And to top it all you know this is a lost fight even if you’re right so instead of contributing to steering this societal ship these people cover their ears and “bla bla bla we don’t want it”. It’s so disappointingly irresponsible.

qevlarr@lemmy.world · 14 days ago

The point is they should be fighting AI, not open the door even an inch to AI on their site. Like so many other endeavors, it only works because the contributors are human. Not corpos, not AI, not marketing. AI kills Wikipedia if they let that slip. Look at StackOverflow, look at Reddit, look at Google search, look at many corporate social media. Dead internet theory is all around us.

Wikipedia is trusted because it’s all human. No other reason

antonim@lemmy.dbzer0.com · edit-2 14 days ago

RAG is very good and accurate these days that doesn’t invent stuff.

In the OP I linked a comment showing how the summary presented in the showcase video is not actually very accurate and it definitely does invent some elements that are not present in the article that is being summarised.

And in general the “accessibility” that primarily seems to work by expressing things in imprecise, unscientific or emotionally charged terms could well be more harmful than less immediately accessible but accurate and unambiguous content. You appeal to Wikipedia being “a project that is all about sharing information with people to whom information is least accessible”, but I don’t think this ever was that much of a goal - otherwise the editors would have always worked harder on keeping the articles easily accessible and comprehensible to laymen (in fact I’d say traditional encyclopedias are typically superior to Wikipedia in this regard).

and would save millions of editor hours and allow more accuracy and complexity in the articles themselves.

Sorry but you’re making things up here, not even the developers of the summaries are promising such massive consequences. The summaries weren’t meant to replace any of the usual editing work, they weren’t meant to replace the normal introductory paragraphs or anything else. How would they save these supposed “millions of editor hours” then? In fact, they themselves would have to be managed by the editors as well, so all I see is a bit of additional work.

phantomwise@lemmy.ml · 14 days ago

I don’t think the idea itself is awful, but everyone is so fed up with AI bullshit that any attempt to integrate even an iota of it will be received very poorly, so I’m not sure it’s worth it.

Dr. Moose@lemmy.world · 13 days ago

I don’t think it’s everyone either - just a very vocal minority.

jsomae@lemmy.ml · 14 days ago

ok, just so long as the articles themselves aren’t AI generated.

Blue_Morpho@lemmy.world · 15 days ago

I’m pro AI but absolutely fucking not.

The use case for AI is to summarize Wiki as an external tool. If Wikipedia starts using AI, it becomes AI eating its own tail.