ElevenLabs Hits the Right Note: A.I. Songwriting for Language Learners

In case you missed it, A.I. text-to-speech leader ElevenLabs is the latest platform to join the generative music scene – so language learners and teachers have another choice for creating original learning songs.

ElevenLabs’ Creative Platform ElevenMusic takes a much more structured approach to music creation that other platforms I’ve tried. Enter your prompt (or full lyrics), and it will build a song from block components – verse, chorus, bridge – just as you might construct one as a human writer. It makes for a much more natural-sounding track.

ElevenLabs music creation

ElevenLabs music creation

As you’d expect from voice experts ElevenLabs, the service copes with a wide range of languages and the diction is very convincing. A tad more so, I think, than the current iteration of the first big name on the block, Suno AI. No doubt the latter will have some tricks up its sleeve to keep up the pace – but for now, ElevenLabs is the place to go for quick and catchy learning song.

Anyway, here’s one I made earlier – a rather natty French rock and roll song about the Moon landings. Get those blue suede Moon boots on!

It’s definitely worth having a play on the site to see what you can come up with for you or your classes. ElevenLabs has a free tier, of course, so you can try it out straight away. [Note: that’s my wee affiliate link, so if you do sign up and hop on a higher tier later, you’re helping keep Polyglossic going!]

Generative Images Locally : Running Models on Your Machine

I’ve written a fair bit about language models of late. This is a language blog, after all! But creating resources is about other visual elements, too. And just as you can run off text from local generative AI, you can create images locally, too.

For working on a computer, ComfyUI is a good bet. It’s a graphical dashboard for creating AI art with a huge array of customisation options. The fully-featuredness of it, admittedly, makes it a complex first intro to image generation. It’s interface, which takes a pipeline / modular format, takes a bit of getting used to. But it also comes with pre-defined workflows that mean you can just open it, prompt and go. There’s also a wide, active community that supports in online, so there’s plenty of help available.

Generate images locally - the ComfyUI interface

Generate images locally – the ComfyUI interface

At the more user-friendly end of it is Draw Things for Apple machines (unfortunately no Android yet). With a user interface much closer to art packages you’ll recognise, Draw Things allows you to download different models and prompt locally – and is available as an iOS app too. Obviously there’s a lot going on when you generate images, so it slugs along at quite a modest trot on my two-year-old iPad. But it gives you so much access to the buttons and knobs to tweak that it’s a great way to learn more about the generation process. Like ComfyUI, its complexity – once you get your head round it – actually teaches you a lot about image generation.

Of all the benefits of these apps, perhaps the greatest is again the environmental. You could fire up a browser and prompt one of the behemoths. But why crank up the heat on a distant data centre machine, when you can run locally? Many commercial generative models are far too powerful for what most people need.

Save power, and prompt locally. It’s more fun!

ChatGPT French travel poster

A Second Shot at Perfect Posters – ChatGPT’s Image Tweaker

The big ChatGPT news in recent weeks is about images, rather than words. The AI frontrunner has added a facility to selectively re-prompt for parts of an image, allowing us to tweak sections that don’t live up to prompt expectations.

In essence, this new facility gives us a second shot at saving otherwise perfect output from minor issues. And for language learning content, like posters and flashcards, the biggest ‘minor’ issue – the poor spellings that crop up in AI image generation – makes the difference between useful and useless material.

Rescuing ChatGPT Posters

Take this example. It’s a simple brief – a stylish, 1950s style travel poster for France. Here’s the prompt I used to generate it:

Create a vibrant, stylish 1950s style travel poster featuring Paris and the slogan “La France”.

I wanted the text “La France” at the top, but, as you can see, we’ve got a rogue M in there instead of an N.

ChatGPT generated image of a French travel poster

To target that, I tap the image in the ChatGPT app. It calls up the image in edit mode, where I can highlight the areas that need attention:

ChatGPT image editing window

Then, I press Next, and can re-prompt for that part of the image. I simply restate the slogan instructions:

The slogan should read “La France”.

The result – a correct spelling, this time!

ChatGPT French travel poster

It can take a few goes. Dodgy spelling hasn’t been fixed; we’ve just been given a way to try again without scrapping the entire image. Certain details also won’t be retained between versions, such as the font, in this example. Others may be added, like the highly stylised merging of the L and F in the slogan (a feature, rather than a bug, I think!).

But the overall result is good enough that our lovely 1950s style poster wasn’t a total write-off.

Another case of AI being highly imperfect on its own, but a great tool when enhanced by us human users. It still won’t replace us – just yet!

Image tweaking is currently only available in the ChatGPT app (iOS / Android).