Lightspeed Ventures-backed audio platform Pocket FM announced it has partnered with voice-cloning company ElevenLabs to quickly convert text content, such as scripts, into audio series using AI.
Pocket FM, which raised $103 million in Series D funding in March, told TechCrunch at the time that it was already experimenting with the ability to convert text content into audio using ElevenLabs’ tech. Now, the India-based company has expanded the partnership to make the conversion tool available to all creators over the next few weeks.
In the test phase, Pocket FM already produced 30,000 hours of audio series using ElevenLabs’ AI tech. With the new rollout, the startup expects to triple its content library of over 100,000 hours of audio content this year. Pocket FM also said that during the experimental phase, the AI-powered tools helped it cut the cost of producing audio by 90%.
Pocket FM’s co-founder and CTO Prateek Dixit told TechCrunch over a call that with this partnership, the company wants to make it easier for writers to convert their writing into audio series.
“We have over 250,000 writers (including the ones on the company’s Pocket Novel writing platform) and this partnership decreases the cost of setting up and recording audio for them,” he said.
“Even with a good set up of recording tools and equipment, writers can produce roughly 30 minutes of high-quality audio content per day. With the AI tools, this output can be 10 times more,” he added.
Pocket FM has built a tool integrating ElevenLabs’ tech, through which it is offering 50 voices for writers who want to convert their content. ElevenLabs’ co-founder Mati Staniszewski said that his company’s tool understands the context of the writing and automatically infers emotions in the voice.
“Working with Pocket FM, we are deploying our newer models that understand the genre of writing and are emotionality better,” Staniszewski said.
Dixit noted that, based on data from users’ engagement with this kind of content, the platform also plans to suggest voices that work well for writers in a particular genre.
Pocket FM isn’t the only audio series platform experimenting with AI-powered tools. Google-backed Kuku FM is using GPT-4, Claude, BandLab and even ElevenLabs to help its writers with different stages of creation, including refining scripts, generating thumbnails, adding sound effects and converting text into audio.
Kuku FM told TechCrunch that it is also experimenting with visual generation tools such as Midjourney and Runway to create ads related to its content.
Quality of content and impact on artists
The promise of AI-powered tools is to generate more content faster, but that doesn’t mean the content is good. Pocket FM’s answer to aiding discovery and surfacing quality content is to make its discovery algorithm more sophisticated and to experiment with user engagement.
“If a writer publishes an audio series, we surface that content to a select number of users and observe engagement metrics. If these metrics are positive, we further propagate that,” Dixit said.
The use of AI might lead to quicker results and a bigger content library for these platforms, but it will also reduce the roles of voiceover artists working with them. India’s Association of Voiceover Artists (AVA) has expressed its concerns about AI taking over.
“If AI takes over, we are finished. As voice artists, we need to get some regulation in place so that our livelihood is protected,” Amarinder Singh Sodhi, the association’s general secretary, told Indian publication Scroll.
Sodhi also told Scroll about incidents in which voiceover artists were called into the studio to record samples used to train AI, without being informed or asked for their consent.
“On an emotional level, it scares me. By using AI, you are essentially diluting the human experience of storytelling. You lose out on an emotional connection,” Delhi-based voiceover artist Aditya Mattoo told TechCrunch.
He added that giving access to premium voices to people who don’t have the taste and skill to produce quality content will lead to the market being flooded with bad content.
When we asked about the impact of AI-powered voice generation on Pocket FM, the company didn’t directly answer the question. However, Dixit noted that engagement with AI-generated content in its experiments is “as good as human voiceover production.” Notably, the company is also working on technology to incorporate multiple voices in a single audio output.
Neither Pocket FM nor Kuku FM currently labels its content to indicate whether AI has been used in the creation process.