Screenscot of Cell to Singularity, an immersive casual clicker game available on Steam.

Cell to Singularity : Casual Play for TL Immersion

Osmosis isn’t just for cells – it’s for language learners too! Soaking up target language simply by placing it in your everyday line of sight is one of the most effective strategies for fluency. From your instagram feed to cosy telly-watching, consolidation can be about throwing more of the things you love in your way.

Gaming is another entz stream that is really easy to target language-ify, since many titles have multiple language options. The Steam platform is a particular goldmine here – a huge multi-platform marketplace, with loads of free-to-play offerings. The trick is to find quite text-heavy games with dialogue and interactions, exposing you to as much content as possible in-play. There’s honestly something for everyone here, from word games to fully-fledged RPG.

This week, I chanced across a casual clicker on Steam that has been working its quiet way into the hearts of users since its inception in 2018. It’s Cell to Singularity, a game that simulates the blossoming of life on Earth, from eukaryotes, to jellyfish, to humans (and beyond). It’s the kind of game you can have running inconspicuously in the background while you work, slowly developing and growing like a bonsai that needs occasional tending. Very Zen.

Screenscot of Cell to Singularity, an immersive casual clicker game available on Steam.

As you can see from the screenshot, it’s also a great way to revise the building blocks of life. That’s the root educational application the game has been feted for, covering evolutionary biology in a fun, laddered way. Switching my interface to German gives me a ton of fun natural world vocab.

Beyond word level

But the game is also full of conversational exchanges you have with the ‘supercomputer’ running your life simulation, as well as Wikipedia-style descriptions of all your finds. In short, it supports word, sentence and text-level language skills in a rich, engaging environment. What more could you ask for?

Screenshot from Cell to Singularity showing dinosaurs

The range of languages available right now is already impressive. Not only the ‘mainstream’ school ones, but also Korean, Japanese, Polish and Portuguese, amongst others.

Screenscot of the language options in Cell to Singularity, an immersive casual clicker game available on Steam.

Cell to Singularity currently has an 89% positive rating from thousands of Steam users. I wonder how many of them are playing to improve their target language? Hopefully I’ve enticed a few more of you to do just that!

Escaping the Ecosystem : AI Edition

We live in such unexpected, shifting, fracturing geopolitical times just now. A stability taken for granted for decades no longer seems a given. So much so, that many have begun to question the global tech ecosystem we are embedded in, considering the safety of our data and workflows, and seeking less exposed, closer-to-home alternatives.

It’s something we can explore without straying into conspiracy territory, and it goes beyond data security. Tech writer Cory Doctorow has written at length on the downsides to walled garden platforms that make leaving costs high while degrading (or enshittifying – Macquarie Dictionary’s 2024 word of the year) their services. Linguaphiles should know – our own beloved Duo is one of them. It’s a compelling argument, and one that national consumer protection agencies are starting to incorporate into policy. The notion that we can take meaningful steps to decouple from tech monopolies is beginning to take hold.

Ecosystem creep : AI

This leads us to AI firms – arguably the fastest growing of tech behemoths, whose services nonetheless are working their way into many of our workflows. It’s not all doom and gloom here, though; Anthropic in particular has emerged as one US company willing to stand up for an ethical stance in the field.

That said, most European LLM traffic still goes down that American route, collecting on servers users’ states have no jurisdiction over. Users come to rely more and more on these services for key elements of their day-to-day, although have little control over their place in that ecosystem.

So what to do? LLMs are incredibly useful tools for a number of creative applications. For language teachers, they are particularly good at creating authentic-sounding materials for worksheets. In fact, I’ve often argued that LLMs are a tech almost tailor-made for language learning and teaching – in few other fields is the language structure more important than the actual content! They’re genuinely brilliant at creating copy, often highly nuanced, for learning.

AI Swaps

Well, one quick and easy swap is Le Chat by French AI company Mistral. It’s a ‘full fat’ LLM on a par with the big US names, running your prompts remotely on a multi-billion parameter model. Not so remote, though – their server activity remains within EU jurisdiction.

Then, of course, there is the ‘peak privacy’ option – running your own LLM. That’s a lot easier than it sounds, thanks to easy-setup software like LM Studio or Ollama (both US-based projects, but run locally on your own machine). Install, download a model, and prompt away. While few (to no) people will have the hardware to run full-sized LLMs, small models are getting better and better, rivalling the biggies for everyday use.

Google’s Gemma 4 is a case in point, a new small model (you can get a sub-20gb version) achieving some really impressive benchmark scores. Multi-language support is one of its strengths, and believe me, it does more than a good enough job of worksheet authoring and lesson planning. And it comes with an extra ‘externalities’ bonus, too – the only energy it’s using it your laptop battery, rather than spinning up some red-hot servers on a remote farm somewhere.

That has to be a win-win – using open source releases from the industry leads, without getting trapped inside the matrix.

We may have little control over geopolitics. But there are always choices when it comes to our exposure to it in the tech we use. I’m working on a list of these swaps as part of my own digital hygiene plan, and hope to share much more of this in coming weeks!

Speaking dialect in the mountains

AI and Dialect: Can You Nudge a Language Model Off the Standard Path?

One of my quiet worries about the AI revolution in language is standardisation. When a model trained on billions of documents produces text, it tends to sand everything down to some kind of weighted average – grammatically impeccable, register-appropriate, but utterly devoid of local colour. And where’s the fun in that? You know my thoughts on dialect and language learning.

But here’s the thing: despite the odds, the standard grey slop isn’t a mathematical inevitability. Large Language Models are trained on billions of words. And that material will include plenty of non-standard language, too, simply by dint of the extraordinarily broad sweep of the training brush. Dialect literature, forum posts, subtitles, transcripts – there’s room for non-standard, more colloquial language in every context of the model’s background.

Which raises an interesting question: can we nudge AI output away from the standard, and towards something more authentically regional?

I’ve been playing around with exactly this, and the short answer is: yes – and it’s surprisingly decent at it.

Stepping off the well-trodden path

I’ve said from the start that AI is built for language learning content creation. There aren’t many fields where content accuracy doesn’t matter too much – as long as the language is correct and authentic-sounding. One of my go-to moves is pushing the LLM for a more colloquial register. Standard written language is fine for reading comprehension, but if you’re teaching people how to actually speak, you want contractions, ellipsis, informal vocabulary – the stuff that sounds like a real person. AI handles this pretty well once you prompt for it explicitly.

Create a model German dialogue on the topic “Reality TV”. The language should be contemporary, colloquial and natural-sounding German.

Claude did a pretty good job of this:

  • Lena: Ey, hast du letzte Woche „Love Island” geschaut? Ich konnte gar nicht aufhören – ich war bis halb zwei wach.
  • Kai: Natürlich hab ich das geschaut – wär ich ja bescheuert, das zu verpassen. Diese Tiara und Marco… ich meine, komm schon. Die spielen da nur rum.
  • Lena: Genau! Marco tut total so, als wäre er an ihr interessiert, aber das ist doch alles für die Kamera. Ich find das so zum Fremdschämen.
  • Kai: Ja, aber genau deswegen schaust du’s doch! Gib’s zu. Du liebst den Drama.
  • Lena: Okay, okay, ich geb’s zu. Aber ich find’s trotzdem fragwürdig, wie die Leute da so bloßgestellt werden. Manche wirken echt überfordert.
  • Kai: Absolut. Ich glaub, vielen ist nicht bewusst, wie sehr die Produktion das alles lenkt. Die schneiden das so zusammen, dass du denkst, die Person ist voll die Zicke – aber vielleicht war das gar nicht so gemeint.
  • Lena: Stimmt schon. Und dann wird die Person im Netz fertiggemacht. Ich hab mal gelesen, dass einige danach richtig psychische Probleme hatten.
  • Kai: Das ist so traurig. Und trotzdem melden sich jedes Jahr tausende Leute an. Ich versteh’s irgendwie – du willst berühmt werden, schnelles Geld, Follower… Der Lockruf ist halt stark.
  • Lena: Stimmt. Und seien wir ehrlich – wir beide gucken’s ja auch. Also sind wir Teil des Problems, oder?
  • Kai: Pff… ich würd sagen: wir sind Teil des Unterhaltsystems. Klingt besser. Jetzt mach die nächste Folge an.

Some great non-standard, everyday features in there. Colloquial contractions like -e dropping on verbs, decent use of particles like halt – not a bad job at all. Claude even threw in a bunch of usage notes without asking, too.

But colloquial is still, broadly, floating just beneath a standard umbrella. What if we go further?

Adding Local Flavour

When I’m probing the edges of what AI can do, I like to start with varieties I can actually judge. Austria is my German-speaking region of choice in this regard, being my year abroad destination of old – I still have a huge soft spot for Kärnterisch. So here’s the tweak:

Create a model German dialogue on the topic “Reality TV” between two young Austrian speakers from Kärnten. The language should be contemporary, colloquial and natural-sounding Austrian German.

Here’s what came back:

  • Magdi: Oida, hast du gestern „Love Island” g’schaut? I kann nimma! Die Lea hat den Stefan komplett ausg’spielt, echt a Wahnsinn.
  • Tobi: Na freilich, i schau des jo seit Wochn! Aber geh, die Lea, die tuat doch nur so. Die weiß ganz genau, wos sie macht — des is alles g’spield.
  • Magdi: Ja eh, ober trotzdem — der Stefan is so a Trottel! Wie kann man so blöd sein, gell? Der glaubt ihr jo alles.
  • Tobi: Najo, i mein, de san jo auch so a bissi animiert zum Streiten, oder? Die Produzenten woin das jo so. Ohne Zoff läuft des ned.
  • Magdi: Des stimmt scho, ober i find des trotzdem scheißn. Die tean so, ois ob’s echt wär, und dann geht’s nur ums Follower kriegen. Danoch san alle auf Instagram mit Werbung zuagekleistert.
  • Tobi: Haha, stimmt! Drei Wochen nach der Sendung verkaufen’s alle irgendwelche Protein-Shakes. Gott sei Dank ned mia. [lacht]
  • Magdi: Ober weißt wos? I schau’s trotzdem weiter. I bin schon voll drin, i muss wissen, ob der Kevin die Kathi kriegt oder ned.
  • Tobi: Na sag i dir: die kriegen sich ned. Die Kathi is viel zu schlau für ihn. Die geht da raus, macht a paar Storys, und dann war’s des.
  • Magdi: Oida, du bist so a Zyniker! Vielleicht is es jo echt?
  • Tobi: Magdi. Des is Reality TV. Des is so echt wie mei Oma beim Wrestling. [grinst]
  • Magdi: Haha, okay okay. Ober schauen tust’s trotzdem, gell?
  • Tobi: … Ja. Natürli. Um neun bin i dabei. Sag’s ned weiter.

If you’ve spent any time in Austria, this is probably just as joyful to you as it is to me. It’s added in so much southern colour, from the tag particle ‘gell’ to the reduced i from ich and past participles in g’. There’s even some fun wordplay in there (as real as my gran at wrestling!), so it’s not only dialect, but character. It’s laid it on thick!

There’s an element of pastiche to this, for sure. Nobody will claim this is indistinguishable from an authentic speaker of Klagenfurt. But it’s genuinely not bad, especially if you’re after a solid introduction to local speech features.

And then there’s Trøndersk…

That’s all very well for ‘mainstream’ foreign languages like German. To push the envelope a little more, it’s worth playing around with some lesser-learnt languages – Norwegian is always a good choice.

For my Norwegian test, I went with Trøndersk, the dialect cluster associated with the Trondheim region and much of central Norway. It’s one of the most immediately recognisable Norwegian dialects, with distinctive pitch accent patterns, characteristic -a verb endings, and a few phonological quirks that mark it out clearly from the Oslo Bokmål you’ll meet in textbooks.

How good a job would Claude do of this challenge?

  • Marte: Oi, har du sett at det e ny sesong av Farmen nå? Æ begynte å se på den i går kveld og klarte itj å slutt.
  • Jonas: Åja? Æ har itte sett mye reality-greier på ei stund, men Farmen e jo noe anna. Det e liksom ordentlig underholdning, da.
  • Marte: Ja, det e noko med den serien! Og den nye dama fra Steinkjer – ho e jo helt vill. Ho klaga på absolutt alt og alle og gjor ingenting sjæl.
  • Jonas: Haha, klassisk. Det e alltid nåkka en sånn type på realitygreier. Produsenten e nødt til å ha med dem, skjønner du? Ellers e det jo kjedelig.
  • Marte: Jaja, men det e jo litt ekkelt å innrømme at man liker det, liksom. Æ satt der og ropte på skjermen som et idiot.
  • Jonas: Haha! Ka sa du da?
  • Marte: Noe sånt som «koffor gjør’u det, din tulling!» Men mora mi satt rett ved sia og hun så bare på mæ som om æ var gal.
  • Jonas: Uff da, haha. Ka syns ho om programmet da?
  • Marte: Ho e heilt avhengig sjæl! Det e liksom heime hos oss at hele familien samles rundt skjermen på tirsdager. Det har vorre sånn siden æ va lita.
  • Jonas: Okei men ka med all den andre reality-søpla da? Eks på stranda og sånt? Det e jo bare tull, syns æ.
  • Marte: Ja nei, det gidder itj æ. Det e jo bare folk som vil bli kjendis og som itte har nåkka å by på. Farmen e annleis fordi det faktisk krev noe.
  • Jonas: Sant nok. Kameraten min i Ålesund e helt hekta på Paradise Hotel derimot. Han ser på det heile tida.
  • Marte: Seriøst? Koss klare han det? Æ holdt ut to episoder og ga opp. Alle folkan der oppe e jo helt… æ veit itte. Tomme, liksom.
  • Jonas: Haha, han sier det e «sosialt eksperiment». Æ tror bare han har lyst på drama heime fra sofaen sjæl.
  • Marte: Hahaha! Ja det e nok det. Uansett — skal du se Farmen med mæ på fredag? Vi kan bestille pizza og le av alle.
  • Jonas: Ja, det høres bra ut! Men du bestemme pizza-topping. Sist gang valde du masse squash og det va itte greit.
  • Marte: Oi, squash e jo godt da! Men okei, du får velge. Bare itte ananas.
  • Jonas: Deal. Vi ses fredag!

As far as I can tell – and I’m not so well placed here, I must admit – it’s done a solid job. The k- question words (rather than hv-), the reduced æ for jeg… Even the cultural references, selecting two actual Norwegian shows for the exchange. I’d trust this as a bit of pre-Trondheim trip prep!

That said, if you speak a Trøndersk dialect, what’s the verdict? Does this read as a reasonable attempt, or is it giving you second-hand embarrassment? I’d genuinely love to know. (Drop a comment below, or find me wherever I am on the internet this week.)

AI and Dialect for Language Learning

AI platforms clearly have some representation of dialect. Awareness is the wrong word for it – anthropomorphising LLMs and all that – but you know what I mean. The patterns are in the training data; the question is whether a prompt can reliably surface them, or whether the model’s default pull towards the standard is too strong.

The two languages I explored here seem to work well. The pull to parody is always something to watch, particularly when written material in a dialect tends to be humorous, rather than in-community. And rarer or more exclusively spoken varieties will no doubt be much more troublesome. But the more linguistic scaffolding you give the model – the more you treat prompting as a kind of sociolinguistic briefing – the better the output is likely to get.

That standard grey slop, it seems, has more texture than it first appears – you just have to know where to poke.

The CEFR scale - a ladder to fluency in language learning

CEFR and Interactive Language Learning: Bringing the Threads Together

The Council of Europe’s CEFR (Common European Framework of Reference) scale is something we’re almost instinctively aware of as language learners. A1/2, B1/2 and C1/2 are the shorthand we regularly use for fluency. A recent paper by Gökhan Haldun Demirdöven in Frontiers in Education takes a long-overdue fresh look at the framework in light of new immersive, technology-mediated language learning. How does our common language ladder mesh with recent developments?

A lot of recent work on language learning tech focuses on platforms such as XR, AI-driven environments and conversational agents almost as a pedagogical end in themselves. Instead, this paper asks a more structured question: how might these technologies meaningfully align with what the CEFR already – with well-established practical application – conceptualises as language ability?

In other words, can we design immersive digital environments to support the kinds of communicative action, mediation, and interaction that CEFR descriptors actually describe?

CEFR : More Than Just Assessment

The author takes the 2020 revisions to the CEFR as a starting point, identifying features of the updated framework – particularly its emphasis on mediation, multiple competences, and real-world communicative tasks – and considers how these might map onto immersive learning contexts. Examples include simulated environments for task-based interaction, AI-supported conversational agents, and speech-recognition systems that are sensitive to sociolinguistic variation rather than enforcing a single normative model.

One enlightening conclusion is that the CEFR is not simply an assessment tool; it is also a design framework. New language learning technologies are useful as long as they support CEFR-aligned communicative action, rather than as standalone innovations. This is a helpful course correction in a space where language-learning technologies can sometimes prioritise novelty over pedagogical coherence. For me, as a language software developer, it’s certainly something that strikes home – language professionals should avoid developing new language tech in a vacuum, and instead build on previous work with a proven track record.

For teachers, curriculum designers, and language-learning app developers, it’s a really timely reminder that innovation in language learning does not have to sit outside existing standards. Frameworks like the CEFR can play an active role in shaping how new technologies are pedagogically grounded.

Diffuse squares

SingaKids: A Glimpse of Where Multimodal AI Tutoring May Be Headed

A recent pre-print on SingaKids, a multilingual multimodal tutoring system for young learners, offers an interesting look at how AI-supported language learning is evolving. You can read the paper here: SingaKids: A Multilingual Multimodal Dialogic Tutor for Language Learning.

Designed for early primary classrooms, SingaKids is an AI-based system that uses picture-description tasks as the basis for spoken interaction. It combines dense image captioning, multilingual speech recognition, a dialogue model tuned with pedagogical scaffolding, and child-friendly text-to-speech. The system works in English, Mandarin, Malay, and Tamil, with extra attention paid to the lower-resource languages to improve recognition and generation quality.

Flexible Scaffolding

Something that stood out to me in particular was the system’s focus on scaffolding rather than straightforward correction. That approach is flexible; depending on a child’s response, the system shifts between prompts, hints, explanations, and more structured guidance. Higher-performing learners are pushed towards fuller reasoning; less confident learners get clearer cues and more supportive turns. It’s a step away from the rigid “question–answer–score” pattern and closer to the texture of real classroom dialogue.

Although the work is aimed at children, several ideas have wider implications for the rest of us. Picture-guided dialogue isn’t new in ‘grown-up’ resources – think Rosetta Stone, for instance. But it could easily support adult learners practising free production in AI tools, too. Improved multilingual ASR – especially for hesitant, accented, or code-switched speech – would benefit almost every speaking-practice tool. And the flexible scaffolding approach hints at future e-tutors that adapt to the learner’s behaviour dynamically, rather than funnelling everyone down the same path.

The project sits firmly in the research space, but it points towards what the next generation of tools may look like: multimodal, context-aware systems that don’t just respond to learners but actively guide, prompt, and adjust. For anyone keeping an eye on developments in educational AI, it’s a nice indication of the direction of travel (and I’m probably a wee bit envious of those kids getting a chance to try it first!).

Perplexity Tasks for Language Learners

AI techniques to support language learning are pretty well-known now. From structured conversation partners to resource creators, LLM platforms have been embraced by the polyglot community.

Like many of us, I dip in and out of them almost unthinkingly now. Often, I’ll snap in a page from a chapter I’m working on with my Greek teacher, and it’ll help me prepare ahead of a lesson. Sometimes, I’ll get it to reel off a list of useful phrases on a topic I’m studying. LLMs can make great worksheet creators, too. In many ways, it’s simply a very interactive reference tool, giving (mostly) reliable answers but with a big nod to context.

I’d been pretty dogged in my choice of platform, sticking for the most part with ChatGPT Plus. Claude and Gemini were also in the mix, alongside some fun running local models. But for the most part, I thought my tool choices were pretty settled.

But then I gave Perplexity a whirl.

Perplexity – Task Master

Perplexity isn’t an LLM in the sense that ChatGPT, Gemini and Claude are. It uses LLM technology. But it’s actually more of an intelligent, context-sensitive search tool, that uses natural language APIs to turbo-boost its web-hunting activities.

I’d clearly not found that prospect very exciting, as I’d not gone near it until now. But thanks to a bundled free upgrade, I got to try the premium tier of late. And one particular feature stands out as potentially transformative for my learning habits: Perplexity Tasks.

Tasks are scheduled searches you set up with natural language instructions. And those instructions can be as rich as your usual LLM prompts in terms of requested formatting and such like, so in essence, you can build regular bulletins with up-to-date information in any language you like. Take one of mine, that runs daily:

Search the global news for the biggest world news story of the day. Summarise it in French, German, Modern Greek, Polish, Scottish Gaelic and Swahili at a level appropriate for an intermediate learner, ensuring that the translation is of the highest, native speaker standard quality, idiomatic and natural-sounding. Summaries should be 3-4 sentences long. Highlight key words in bold.

Accompany each summary text with a glossary / vocabulary list detailing all the key / difficult words from it in dictionary format (listing word class, irregular parts if applicable etc.). Hyperlink glossary items to Wiktionary entries where available with further information on them (use the English version en.wiktionary.com).

Lay it all out neatly to make it easy on the eye. Use plenty of emojis for impact too. Make this a fabulous resource for polyglot language learning! 🌍

Now, every morning, I get a wee news digest emailed straight to my inbox in multiple languages. It’s learner-friendly, includes vocab support, and gives me something to talk about in my language meets and lessons. I’ve done the same for academic paper searches in linguistics, and stories on dialect appearing in news outlets.

It feels like a proper game changer!

Tasking on Other Platforms

Now, you don’t need Perplexity to do this – it’s just one of the most user-friendly ways I’ve found to do it. If you have ChatGPT,  check out Scheduled Tasks. In Gemini, Scheduled Actions will do the trick for Pro members. Copilot is in on the game too. Others will no doubt follow suit shortly – clearly, task scheduling is becoming one of those features AI platforms are expected to have.

What I like about Perplexity, though, is that its whole raison d’être is the search – it feels particularly suited to web-based tasks like news digests. It’s also quite nice to keep the separation between my everyday LLM ramblings, and my more structured, scheduled items (use it for a few weeks and you’ll have clogged your timeline up with dozens of chats!).

If you’ve been looking for a way to make AI genuinely work for your learning rather than distract from it, try setting up a task or two – you might just find it becomes part of your morning ritual as well.

Image showing lots of document icons for a post on building a Zotero and Obsidian workflow

Zotero and Obsidian : A Workflow to Research Anything

If much of your study is electronic – e-books, PDF papers, worksheets and the like – you’ll face the same struggle I have: digital overwhelm. A clear workflow for dealing with mounds of virtual material is essential if you’re not to get lost.

I feel like I’ve tried them all, too. I’ve gone through the gamut of e-readers: GoodReader, PDF Expert, even trusty old Apple Preview (which has great annotation features). All very decent in their own way. On the file system side of things, though, it’s another story. I’ve cobbled together some sort of ‘folders on the Cloud’ system over the years, but it’s seriously creaky. I break my own rules half the time!

Bearing that in mind, I was chuffed to bits to chance upon a whole new system recently – one that’s passed me by completely. It seems to be a particularly big hit across North American universities. It also has a large, active community online, sharing performance tweaks. And best of all – it uses completely free software.

Zotero and Obsidian

Zotero is a publications manager that you simply drag your e-material into. The app retrieves bibliographical information, renames files sensibly and stores a copy online for working cross-device. Even better, it’s capable of generating full bibliographies, so is a file store, reader and referencing tool all in one.

Obsidian is the note-taking side of this – a sleek, markdown-driven text editor that is beautifully minimalistic. It excels in creating hyperlinked notes, allowing you to build your own Wiki-style knowledge bank. But it dovetails beautifully into Zotero thanks to community plugins that allow you to import your PDF annotations directly into bibliographically pigeon-holed notes.

After resisting the temptation to kick myself for not spotting it sooner, I did a deep-dive into Zotero + Obsidian workflow how-tos, and it’s an academic revelation. A couple of community content creators are real stand-outs here – so much so that it’s best I let them do the talking rather than waffle any more. I’m learning this as I go along, and these are great places to start.

Workflow Training

Here’s where I started, more by chance YouTube search than anything else. Girl in Blue Music namechecks a lot of the other big Z+O content creators here, so it’s a good jumping point for newcomers.

From there, it’s worth exploring morganeua‘s vast selection of content, including numerous how-to videos and worked examples.

Once you’ve worked through those, you can graduate to full geek mode! Bryan Jenks pushes the system well beyond anything else I’ve seen, and likewise has a huge back catalogue of training vids. He layers styling and advanced templating onto the base, making for a slick, colour-coded, optimally managed research system.

I feel very late indeed to this workflow party. But if you are too, join the club – and let me know if you’ve found this useful too!

ElevenLabs Hits the Right Note: A.I. Songwriting for Language Learners

In case you missed it, A.I. text-to-speech leader ElevenLabs is the latest platform to join the generative music scene – so language learners and teachers have another choice for creating original learning songs.

ElevenLabs’ Creative Platform ElevenMusic takes a much more structured approach to music creation that other platforms I’ve tried. Enter your prompt (or full lyrics), and it will build a song from block components – verse, chorus, bridge – just as you might construct one as a human writer. It makes for a much more natural-sounding track.

ElevenLabs music creation

ElevenLabs music creation

As you’d expect from voice experts ElevenLabs, the service copes with a wide range of languages and the diction is very convincing. A tad more so, I think, than the current iteration of the first big name on the block, Suno AI. No doubt the latter will have some tricks up its sleeve to keep up the pace – but for now, ElevenLabs is the place to go for quick and catchy learning song.

Anyway, here’s one I made earlier – a rather natty French rock and roll song about the Moon landings. Get those blue suede Moon boots on!

It’s definitely worth having a play on the site to see what you can come up with for you or your classes. ElevenLabs has a free tier, of course, so you can try it out straight away. [Note: that’s my wee affiliate link, so if you do sign up and hop on a higher tier later, you’re helping keep Polyglossic going!]

A swirl of IPA symbols in the ether. Do LLMs 'understand' phonology? And are they any good at translation?

Tencent’s Hunyuan-MT-7B, the Translation Whizz You Can Run Locally

There’s been a lot of talk this week about a brand new translation model, Tencent’s Hunyuan-MT-7B. It’s a Large Language Model (LLM) trained to perform machine translation. And it’s caused a big stir by beating heftier (and heavier) models by Google and OpenAI in a recent event.

This is all the more remarkable given that it’s really quite a small model by LLM standards. Hunyuan actually manages its translation-beating feat packed into just 7 billion parameters (the information nodes that models learn from). Now that might sound a lot. But fewer usually means weaker, and the behemoths are nearing post-trillion param levels already.

So Hunyuan is small. But in spite of that, it can translate accurately and reliably – market-leader beatingly so – between over 30 languages, including some low-resource ones like Tibetan and Kazakh. And its footprint is truly tiny in LLM terms – it’s lightweight enough to run locally on a computer or even tablet, using inference software like LMStudio or PocketPal.

The model is available in various GGUF formats at Hugging Face. The 4-bit quantised version comes in at just over 4 GB, making it iPad-runnable. If you want greater fidelity, then 8-bit quantised is still only around 8 GB, easily handleable in LMStudio with a decent laptop spec.

So is it any good?

Well, I ran a few deliberately tricky English to German tasks through it, trying to find a weak spot. And honestly, it’s excellent – it produces idiomatic, native-quality translations that don’t sound clunky. What I found particularly impressive was its ability to paraphrase where a literal translation wouldn’t work.

There are plenty of use cases, even if you’re not looking for a translation engine for a full-blown app. Pocketising it means you have a top-notch multi-language translator to use offline, anywhere. For language learners – particularly those struggling with the lower-resource languages the model can handle with ease – it’s another source of native-quality text to learn from.

Find out more about the model at Hugging Face, and check out last week’s post for details on loading it onto your device!

Ultra-Mobile LLMs : Getting the Most from PocketPal

If you were following along last week, I was deep into the territory of running open, small-scale Large Language Models (LLMs) locally on a laptop in the free LMStudio environment. There are lots of reasons you’d want to run these mini chatbots, including the educational, environmental, and security aspects.

I finished off with a very cursory mention of an even more mobile vehicle for these, PocketPal. This free, open source app (available on Google and iOS) allows for easy (no computer science degree required) searching, downloading and running LLMs on smartphones and tablets. And, despite the resource limitations of mobile devices compared with full computer hardware, they run surprisingly well.

PocketPal is such a powerful and unique tool, and definitely worth a spotlight of its own. So, this week, I thought I’d share some tips and tricks I’ve found for smooth running of these language models in your pocket.

Full-Fat LLMs?

First off, even small, compact models can be (as you’d expect) unwieldy and resource-heavy files. Compressed, self-contained LLM models are available as .gguf files from sources like Hugging Face, and they can be colossal. There’s a process you’ll hear mentioned a lot in the AI world called quantisation, which compresses models to varying degrees. Generally speaking, the more compression, the more poorly the model performs. But even the most highly compressed small models can weigh in at 2gb and above. After downloading them, these mammoth blobs then load into memory, ready to be prompted. That’s a lot of data for your system to be hanging onto!

That said, with disk space, a good internet connection, and decent RAM, it’s quite doable. On a newish MacBook, I was comfortably downloading and running .gguf files 8gb large and above in LMStudio. And you don’t need to downgrade your expectations too much to run models in PocketPal, either.

For reference, I’m using a 2023 iPad Pro with the M2 chip – quite a modest spec now – and a 2024 iPhone 16. On both of them, the sweet spot seems to be a .gguf size of around 4gb – you can go larger, but there’s a noticeable slowdown and sluggishness beyond that. A couple of the models I’ve been getting good, sensible and usable results from on mobile recently are:

  • Qwen3-4b-Instruct (8-bit quantised version) – 4.28gb
  • Llama-3.2-3B-Instruct (6-bit quantised version) – 3.26gb

The ‘instruct’ in those model names refers to the fact that they’ve been trained to follow instructions particularly keenly – one of the reasons they give such decent practical prompt responses with a small footprint.

Optimising PocketPal

Once you have them downloaded, there are a couple of things you can tweak in PocketPal to eke out even more performance.

The first is to head to the settings and switch on Metal, Apple’s hardware-accelerated API. Then, increase the “Layers on GPU” setting to around 80 or so – you can experiment with this to see what your system is happy with. But the performance improvement should be instantaneous, the LLM spitting out tokens at multiple times the default speed.

What’s happening with this change is that iOS is shifting some of the processing from the device’s CPU to the GPU, or graphical processing unit. That may seem odd, but modern graphics chips are capable of intense mathematical operations, and this small switch recruits them into doing some of the heavy work.

Additionally, on some recent devices, switching on “Flash Attention” can bring extra performance enhancements. This interacts with the way LLMs track how much weight to give certain tokens, and how that matrix is stored in memory during generation. It’s pot luck whether it will make a difference, depending on device spec, but I see a little boost.

Tweaking PocketPal’s settings to run LLMs more efficiently

Tweaking PocketPal’s settings to run LLMs more efficiently

Making Pals – Your Own Custom Bots

When you’re all up and running with your PocketPal LLMs, there’s another great feature you can play with to get very domain-specific results – “Pal” creation. Pals are just system prompts – instructions that set the boundaries and parameters for the conversation – in a nice wrapper. And you can be as specific as you want with them, instructing the LLM to behave as a language learning assistant, a nutrition expert, a habits coach, and such like – with as many rules and output notes as you see fit. It’s an easy way to turn a very generalised tool into something focused and with real-world application.

So that’s my PocketPal in-a-nutshell power guide. I hope you can see why it’s worth much more than just a cursory mention at the end of last week’s post! Tools like PocketPal and LMStudio put you right at the centre of LLM development, and I must admit it’s turned me into a models geek – I’m already looking forward to what new open LLMs will be unleashed next.

So what have you set your mobile models doing? Please share your tips and experiences in the comments!