Pokémon Go Players Have Unwittingly Trained AI to Navigate the World

Stopthatgirl7@lemmy.world · 5 days ago

Pokémon Go Players Have Unwittingly Trained AI to Navigate the World

Lvxferre@mander.xyz · 5 days ago

I’ll copypaste an interesting comment here:

[Stephen Smith] This article is a great example of a trend I don’t think companies realize they’ve started yet: They have killed the golden goose of user-generated content for short-term profit. // Who would willingly contribute to a modern-day YouTube, Reddit, StackOverflow, or Twitter knowing that they are just feeding the robots that will one day replace them?

You don’t even need robots replacing humans, or people believing so. All you need is people feeling that you’re profiting at their expense.

Also obligatory “If you’re not paying for the product, then you are the product”.

milicent_bystandr@lemm.ee · 4 days ago

Thing is, consider Google maps. It’s been harvesting data secretly and openly for a long time. I vaguely remember a time when Street View cars were found to be harvesting WiFi information in Australia and their response was, “oops, our engineers made a mistake.” Yeah, right.

But, Google maps is an amazing tool. All that traffic info? All those time estimates? Maybe it’s worth it. Maybe if people knew what they were providing, and the result they’d get, they’d still be happy to give all that “free” data to Google.

Putting aside the ethics of a company taking (stealing? or shall we call it, pirating?) all the ownership of that knowledge asset, if they make a really useful tool from it perhaps Pokémon players will be glad to have been part of such an epic achievement.

Danitos@reddthat.com · 4 days ago

The traffic data is not as good as it appears. It is completely closed, only given to police and goverment agencies. No API, no numerical values for speed (only 5 ‘color codes’ that are relative to location, so are almost useles) and numerical data is not given even to academics. I spent almost a whole month trying to get actual useful data for academic purposes, but Google really went out in their path to make it impossible.

It has the potential to be an excellent tool: crowsourced real-time data, access to historical data and it is incredibly fine-grained, improving over goverment data (at least in my city) by a 10 or 100x factor. But no, it had to be yet another Google’s tool for spying on people, not giving it away and sell it to police.

Glitterbomb@lemmy.world · 3 days ago

I worked for a company contracted by government agencies (city/county/state/fed) to gather traffic statistics. We were used because they were not able to use Google traffic data as a blanket rule.

milicent_bystandr@lemm.ee · 4 days ago

But for ordinary drivers, it’s great.

paraphrand@lemmy.world · 5 days ago

I’ve found myself thinking “well, you just helped teach the AI about that one…” various times when reading content online.

It’s a strange thing to know that a form of the basilisk is real. Things posted will help AI get better, if only my teeny tiny increments each time.

webghost0101@sopuli.xyz · 5 days ago

AI learning isn’t the issue, its not something we will be able to put a lid on either way. Either it destroys or saves the world. It doesn’t need to learn much to do so besides evolving actual self-agency and sovereign thought.

What is a huge issue is the secretive non-consentual mining of peoples identity and expressions.

And then acting all normal about It.

Bookmeat@lemmy.world · 5 days ago

The only down side, IMO, is that the models are proprietary and closed.

phoneymouse@lemmy.world · edit-2 5 days ago

I think people will still “contribute” because they also don’t care that their use of certain platforms leaks data used to target ads at them.

In the same vein though, once AI essentially destroys a site like Stack Overflow, where will AI companies source new training data with updated information? Also, we are likely to see something like 50% of content being AI generated. Are AI models then going to train on the content they themselves created? What is the impact of that? What is the use?

SlopppyEngineer@lemmy.world · 5 days ago

Are AI models then going to train on the content they themselves created? What is the impact of that?

It leads to model collapse. The second AI starts to focuses on certain patterns in the output of the first AI instead of the actual content and you get degraded output. They are pattern matching machines after all. Repeat the cycle a few times and all output becomes gibberish. Think of it as data incest.

So the AI companies are pretty desperate for more fresh user data. More data is the only way they have currently to push through the diminishing returns.

sem@lemmy.blahaj.zone · 4 days ago

Also Obligatory: “You can pay for something and still be the product”

whalebiologist@lemmy.world · 4 days ago

Thanks for sharing the Stephen Smith quote, I had not made that connection yet.

SlopppyEngineer@lemmy.world · 5 days ago

What are platforms going to do about it? Start to demonetize AI generated videos and ban AI written fan fiction?