So far, when AI companies have trained on YouTubeās invaluable stash of videos, captions, and other content, theyāve done so without permission. An AI-focused content licensing startup called Calliope Networks is hoping to change that with its new āLicense to Scrape,ā a program aimed directly at YouTube stars.
āThere’s obvious demand from AI companies to scrape YouTube content. We see that by their actions. So what we’re trying to do is to create a tool that makes it legal and simple for them,ā says Calliope Networks CEO Dave Davis. Unlike other big social platforms, like Reddit, YouTube hasnāt struck deals with AI bigwigs to scrape its videos. The appeal of the License to Scrape is that it sidesteps the company itself providing a large volume of YouTube content in one go by corralling a group of creators and negotiating a blanket license.
Davis has a background in traditional media licensing; he left a gig at the Motion Picture Licensing Corporation to launch Calliope, betting that the AI industry would eventually move away from permissionless scraping and toward licensing as a norm. Heās not alone in this belief; itās a boom time for AI data licensing startups. Calliope Networks is a founding member of the Datasets Providers Alliance, a trade group that requires all creators and rights holders to opt into scraping.
Hereās how Davis hopes itāll work: YouTube creators who want to license their data will enter into a contract with Calliope, which will then sublicense their work out for training generative AI foundational models. Itāll need a critical mass of content to make the deal attractive enough to the AI players first, so the program will need to get YouTubers on board before it can properly get up and running. Calliope would take a percentage of the licensing fees paid by the AI companies.
Although thereās nothing quite like this in the AI world yet, Davis modeled the scraping license format off other parts of the entertainment industry, like Broadcast Music Inc. (BMI) and the American Society of Composers, Authors, and Publishers (ASCAP), which both use blanket licenses for music.
āItās early in the recruitment process,ā Davis says. He estimates that Calliope will need to offer a minimum of 25,000 to 50,000 hours of YouTube content before itās taken seriously by the AI industry. That this volume of footage is the likely threshold for blanket licenses demonstrates why banding together could be some creatorsā best bet for making money for AI trainingāin this business, volume matters, and video generators are powered by a large amount of data.
There arenāt any marquee names endorsing the license yet, but Calliope has already drafted a few influencer marketing agencies like Viral Nation to get clients on board. āIāve been getting really good feedback from creators,ā says Bianca Serafini, Viral Nationās head of content licensing. She is confident that a large number of the companyās client rosterāwhich is close to 900 YouTubersāwill participate. āNo one has presented something like this to us before.ā