This Week in AI: Midjourney bets it can beat the copyright police

Steve Blank as illustrated by MidJourney
Image Credits: Haje Kamps / MidJourney

Keeping up with an industry as fast-moving as AI is a tall order. So until an AI can do it for you, here’s a handy roundup of recent stories in the world of machine learning, along with notable research and experiments we didn’t cover on their own.

Last week, Midjourney, the AI startup building image (and soon video) generators, made a small, blink-and-you’ll-miss-it change to its terms of service related to the company’s policy around IP disputes. It mainly served to replace jokey language with more lawyerly, doubtless case law–grounded clauses. But the change can also be taken as a sign of Midjourney’s conviction that AI vendors like itself will emerge victorious in the courtroom battles with creators whose works make up vendors’ training data.

The change in Midjourney’s terms of service. Image Credits: Midjourney

Generative AI models like Midjourney’s are trained on an enormous number of examples, such as images and text, usually sourced from public websites and repositories around the web. Vendors assert that fair use, the legal doctrine that allows for the use of copyrighted works to make a secondary creation as long as it’s transformative, shields them where it concerns model training. But not all creators agree, particularly in light of a growing number of studies showing that models can, and do, “regurgitate” training data.

Some vendors have taken a proactive approach, inking licensing agreements with content creators and establishing “opt-out” schemes for training datasets. Others have promised that, if customers are implicated in a copyright lawsuit arising from their use of a vendor’s GenAI tools, they won’t be on the hook for legal fees.

Midjourney isn’t one of the proactive ones.

On the contrary, Midjourney has been somewhat brazen in its use of copyrighted works, at one point maintaining a list of thousands of artists — including illustrators and designers at major brands like Hasbro and Nintendo — whose works were, or would be, used to train Midjourney’s models. A study shows convincing evidence that Midjourney used TV shows and movie franchises in its training data as well, from “Toy Story” to “Star Wars” to “Dune” to “Avengers.”

Now, there’s a scenario in which courtroom decisions go Midjourney’s way in the end. Should the justice system decide fair use applies, nothing’s stopping the startup from continuing as it has been, scraping and training on copyrighted data old and new.

But it seems like a risky bet.

Midjourney is flying high at the moment, having reportedly reached around $200 million in revenue without a dime of outside investment. Lawyers are expensive, however. And if it’s decided fair use doesn’t apply in Midjourney’s case, it’d decimate the company overnight.

No reward without risk, eh?

Here are some other AI stories of note from the past few days:

AI-assisted ad attracts the wrong kind of attention: Creators on Instagram lashed out at a director whose commercial reused another’s (much more difficult and impressive) work without credit.

EU authorities are putting AI platforms on notice ahead of elections: They’re asking the biggest companies in tech to explain their approach to preventing electoral shenanigans.

Google DeepMind wants your co-op gaming partner to be their AI: Training an agent on many hours of 3D gameplay made it capable of performing simple tasks phrased in natural language.

The problem with benchmarks: Many, many AI vendors claim their models have the competition met or beaten by some objective metric. But the metrics they’re using are often flawed.

AI2 scores $200M: AI2 Incubator, spun out of the nonprofit Allen Institute for AI, has secured a windfall $200 million in compute that startups going through its program can take advantage of to accelerate early development.

India requires, then rolls back, gov approval for AI: India’s government can’t seem to decide what level of regulation is appropriate for the AI industry.

Anthropic launches new models: AI startup Anthropic has launched a new family of models, Claude 3, that it claims rivals OpenAI’s GPT-4. We put the flagship model (Claude 3 Opus) to the test, and found it impressive — but also lacking in areas like current events.

Political deepfakes: A study from the Center for Countering Digital Hate (CCDH), a British nonprofit, looks at the growing volume of AI-generated disinformation — specifically deepfake images pertaining to elections — on X (formerly Twitter) over the past year.

OpenAI versus Musk: OpenAI says that it intends to seek dismissal of all claims made by X owner Elon Musk in a recent lawsuit, and suggests that the billionaire entrepreneur, who was involved in the company’s co-founding, didn’t really have that much of an impact on OpenAI’s development and success.

Reviewing Rufus: Last month, Amazon announced that it would launch a new AI-powered chatbot, Rufus, inside the Amazon Shopping app for Android and iOS. We got early access — and were quickly disappointed by the lack of things Rufus can do (and do well).

More machine learnings

Molecules! How do they work? AI models have been helpful in our understanding and prediction of molecular dynamics, conformation, and other aspects of the nanoscopic world that may otherwise take expensive, complex methods to test. You still have to verify, of course, but things like AlphaFold are rapidly changing the field.

Microsoft has a new model called ViSNet, aimed at predicting what are called structure-activity relationships, complex relationships between molecules and biological activity. It’s still quite experimental and definitely for researchers only, but it’s always great to see hard science problems being addressed with cutting-edge technology.

Image Credits: Microsoft

University of Manchester researchers are looking specifically at identifying and predicting COVID-19 variants, less from pure structure like ViSNet and more by analysis of the very large genetic datasets pertaining to coronavirus evolution.

“The unprecedented amount of genetic data generated during the pandemic demands improvements to our methods to analyze it thoroughly,” said lead researcher Thomas House. His colleague Roberto Cahuantzi added: “Our analysis serves as a proof of concept, demonstrating the potential use of machine learning methods as an alert tool for the early discovery of emerging major variants.”

AI can design molecules, too, and a number of researchers have signed an initiative calling for safety and ethics in this field. Though as David Baker (among the foremost computational biophysicists in the world) notes, “The potential benefits of protein design far exceed the dangers at this point.” Well, as a designer of AI protein designers, he would say that. But all the same, we must be wary of regulation that misses the point and hinders legitimate research while allowing bad actors freedom.

Atmospheric scientists at the University of Washington (UW) have made an interesting assertion based on AI analysis of 25 years of satellite imagery over Turkmenistan. Essentially, the accepted understanding that the economic turmoil following the fall of the Soviet Union led to reduced emissions may not be true — in fact, the opposite may have occurred.

AI helped find and measure the methane leaks shown here. Image Credits: University of Washington/ He et al. / PNAS

“We find that the collapse of the Soviet Union seems to result, surprisingly, in an increase in methane emissions,” said UW professor Alex Turner. The large datasets and lack of time to sift through them made the topic a natural target for AI, which resulted in this unexpected reversal.

Large language models are largely trained on English source data, but this may affect more than their facility in using other languages. EPFL researchers looking at the “latent language” of Llama 2 found that the model seemingly reverts to English internally even when translating between French and Chinese. The researchers suggest, however, that this is more than a lazy translation process, and in fact the model has structured its whole conceptual latent space around English notions and representations. Does it matter? Probably. We should be diversifying their datasets anyway.
