AI Parallel Texts for Learning Two Similar Languages

I’ve seen a fair few social media posts recently about linguist Michael Petrunin’s series of Comparative Grammars for polyglots. They seem to have gone down a storm, not least because of the popularity of triangulation as a polyglot strategy.

They’re a great addition to the language learning bookshelf, since there’s still so little formal course material that uses this principle. Of course, you can triangulate by selecting course books in your base language, as many do with Assimil and other series like the Éditions Ellipse.

Parallel Texts à la LLM

But LLMs like ChatGPT, which already do a great job of the parallel text learning style, are pretty handy for creative comparative texts, too. Taking a story format, here’s a sample parallel text prompt for learners of German and Dutch. It treats each sentence as a mini lesson in highlighting differences between the languages.

I’m learning Dutch and German, two closely related languages. To help me learn them in parallel and distinguish them from each other, create a short story for me in Dutch, German and English in parallel text style. Each sentence should be given in Dutch, German and English. Purposefully use grammatical elements which highlight the differences between the languages, which a student of both does need to work hard to distinguish, in order to make the text more effective.

The language level should be lower intermediate, or B1 on the CEFR scale. Make the story engaging, with an interesting twist. Format the text so it is easy to read, grouping the story lines together with each separate sentence on a new line, and the English in italics.

You can tweak the formatting, as well as the premise – specify that the learner already speaks one of the languages more proficiently than the other, for example. You could also offer a scenario for the story to start with, so you don’t end up with “once upon a time” every run. But the result is quite a compact, step-by-step learning resource that builds on a comparative approach.
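If you find yourself swapping language pairs and premises a lot, it can be handy to fill the prompt in programmatically. Here's a minimal Python sketch of that idea. The function name `build_comparative_prompt` and its parameters are my own invention for illustration, and the wording of the two optional add-ons (stronger language, opening scenario) is just one guess at how you might phrase those tweaks.

```python
from typing import Optional


def build_comparative_prompt(
    lang_a: str,
    lang_b: str,
    level: str = "lower intermediate, or B1 on the CEFR scale",
    stronger: Optional[str] = None,
    scenario: Optional[str] = None,
) -> str:
    """Fill the comparative parallel-text prompt for two related languages."""
    paragraphs = [
        (
            f"I'm learning {lang_a} and {lang_b}, two closely related languages. "
            "To help me learn them in parallel and distinguish them from each "
            f"other, create a short story for me in {lang_a}, {lang_b} and "
            "English in parallel text style. Each sentence should be given in "
            f"{lang_a}, {lang_b} and English. Purposefully use grammatical "
            "elements which highlight the differences between the languages, "
            "which a student of both does need to work hard to distinguish, "
            "in order to make the text more effective."
        ),
        (
            f"The language level should be {level}. Make the story engaging, "
            "with an interesting twist. Format the text so it is easy to read, "
            "grouping the story lines together with each separate sentence on "
            "a new line, and the English in italics."
        ),
    ]
    # Optional tweaks. The wording below is an assumption, not part of the
    # original prompt, so adjust it to taste.
    if stronger:
        paragraphs.append(
            f"I already speak {stronger} more proficiently than the other "
            "language, so use it as the reference point for comparisons."
        )
    if scenario:
        paragraphs.append(f"Start the story with this scenario: {scenario}")
    return "\n\n".join(paragraphs)


if __name__ == "__main__":
    print(
        build_comparative_prompt(
            "Dutch",
            "German",
            stronger="German",
            scenario="a lost suitcase at a railway station",
        )
    )
```

Paste the printed text into your chat as usual. Passing a different scenario each time is a cheap way to dodge the "once upon a time" openings.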

ChatGPT creating parallel texts in German and Dutch with an English translation.


Variations and Limitations

I also tried prompting for explanatory notes:

Where the languages differ significantly in grammar / syntax, add an explanatory note (in English) to the sentences, giving details.

This was very hit and miss, with quite unhelpful notes in most runs. In fact, this exposes the biggest current limitation of LLMs: they’re excellent content creators, but still far off the mark in terms of logically appraising the language they create.

LLMs are, however, pretty good at embellishing the format of their output. The following variation is especially impressive on a platform that shows a preview of the code it generates:

I’m learning Spanish and Portuguese, two closely related languages. To help me learn them in parallel and distinguish them from each other, create a short story for me in Spanish, Portuguese and English in parallel text style. Each sentence should be given in Spanish, Portuguese and English. Purposefully use grammatical elements which highlight the differences between the languages, which a student of both does need to work hard to distinguish, in order to make the text more effective.

The language level should be lower intermediate, or B1 on the CEFR scale. Make the story engaging, with an interesting twist.

The output should be an attractively formatted HTML page, using a professional layout. Format the sentences so they are easy to read, grouping the story lines together with each separate sentence on a new line, and the English in italics. Hide the English sentences first – include a “toggle translation” button for the user.

Claude by Anthropic creating an HTML-formatted parallel story in Spanish and Portuguese.
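If your LLM of choice only gives you plain text, you can get a similar toggle effect by wrapping the sentences in a page yourself. The sketch below is my own rough approximation, not what Claude actually produced: `render_story` is a made-up helper, the layout is deliberately bare, and the two sample sentence triples are placeholders I wrote by hand rather than LLM output.

```python
from html import escape


def render_story(title, sentences):
    """Build a bare-bones HTML page from (language 1, language 2, English) triples.

    The English lines start hidden; the button flips their `hidden` attribute.
    """
    blocks = []
    for first, second, english in sentences:
        blocks.append(
            '<div class="line">'
            f"<p>{escape(first)}</p>"
            f"<p>{escape(second)}</p>"
            f'<p class="en" hidden><em>{escape(english)}</em></p>'
            "</div>"
        )
    body = "\n".join(blocks)
    return f"""<!DOCTYPE html>
<html lang="en">
<head><meta charset="utf-8"><title>{escape(title)}</title></head>
<body>
<h1>{escape(title)}</h1>
<button onclick="document.querySelectorAll('.en').forEach(p => p.hidden = !p.hidden)">Toggle translation</button>
{body}
</body>
</html>"""


if __name__ == "__main__":
    # Placeholder sentences for illustration only.
    story = [
        ("Me gusta el café.", "Eu gosto de café.", "I like coffee."),
        ("Tengo un perro.", "Tenho um cão.", "I have a dog."),
    ]
    with open("story.html", "w", encoding="utf-8") as f:
        f.write(render_story("Spanish and Portuguese", story))
```

Open the resulting `story.html` in a browser and the English stays out of sight until you click the button. An LLM will usually produce something far prettier; this just shows how little is needed for the basic mechanism.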


It’s another use case that highlights LLMs’ greatest strength: the creation of humanlike texts. For linguists, it matters not a jot how much (or little) deep understanding there is beneath that. With the language quality now almost indistinguishable from real people-speak, AI texts serve as brilliant ‘fake authentic’ language models.

e-Stories as parallel texts are yet another fun, useful flavour of that!

Parallel text style learning, like Assimil courses, can be a great way to improve your fluency.

DIY Assimil: Parallel Text Learning with ChatGPT

Assimil language learning books are hugely popular in our polyglot community. And for good reason – many of us learn really effectively with their parallel text method.

They’re especially useful when the base language is another of our stronger languages, adding an element of triangulation. I learned a heap of Greek vocabulary from the French edition Le Grec sans Peine, at the same time as strengthening my (ever slightly wobbly) French.

Now, Assimil is already available in a great range of language pairs. But it’s not always a perfect fit. For example, some editions are more up-to-date than others. More off-the-beaten-track languages still aren’t available. And at times, you can’t find the right base language – no use learning Breton through French, if you don’t have any French.

Enter ChatGPT (or your alternative LLM of choice – Bing also does a great job of these!).

DIY Assimil Prompting

Copy and paste this into your AI chat, changing the language (top), translation language (middle) and topic (bottom) to suit.

You are an expert creator of language learning resources. I want to create some text-based learning units for beginner Malay learners (level A0/A1 on the CEFR scale). The units follow the parallel text approach of the well-known Assimil language learning books.

Each unit has a text in the target language (about 250 words) on a specific vocabulary topic. It should be narrative, talking about how the topic relates to an everyday person. It should be divided into logical paragraphs. After each paragraph, there is an English translation of that paragraph in italics.

The text should be written in very clear, simple language. The language must read like a native speaker wrote it, and be error-free and natural-sounding. Source the info for the text from target language resources online, making it as up-to-date and authentic as possible. It should be completely original and not copied or lifted from any other source directly.

After the text, there is a glossary list of the key topic words from the text, sorted alphabetically and grouped by parts of speech (nouns, verbs, adjectives, adverbs etc.).

Are you ready to create some content? The first topic is: Mobile Technology

This prompt creates a prose-based parallel text unit. However, if you prefer dialogue-style texts, simply change the second paragraph of the prompt:

Each unit has a humorous dialogue in the target language (about 20 lines) on a specific vocabulary topic. The dialogue should relate the topic to everyday speakers through colloquial, idiomatic language.

The prompt works a treat in both ChatGPT Plus (paid) and Microsoft Bing (free). I also got very useable results in the free version of ChatGPT and Claude 2. It works so well as the focus is purely on what LLMs do best: spooling off creative text.
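If you plan to spool off a whole batch of units, a small script can do the swapping of language, translation language and topic for you. This is only a sketch: `build_unit_prompt` and its parameters are hypothetical names of mine, and I've lightly reworded the translation sentence so it reads correctly for any translation language. Note that the dialogue paragraph, as given above, doesn't ask for a translation, so the sketch leaves it that way.

```python
PROSE_UNIT = (
    "Each unit has a text in the target language (about 250 words) on a "
    "specific vocabulary topic. It should be narrative, talking about how the "
    "topic relates to an everyday person. It should be divided into logical "
    "paragraphs. After each paragraph, there is a translation of that "
    "paragraph into {translation_language} in italics."
)

# Taken as-is from the dialogue variant; it has no translation instruction.
DIALOGUE_UNIT = (
    "Each unit has a humorous dialogue in the target language (about 20 "
    "lines) on a specific vocabulary topic. The dialogue should relate the "
    "topic to everyday speakers through colloquial, idiomatic language."
)


def build_unit_prompt(language, translation_language, topic, dialogue=False):
    """Assemble the Assimil-style unit prompt for one language and topic."""
    unit = DIALOGUE_UNIT if dialogue else PROSE_UNIT.format(
        translation_language=translation_language
    )
    paragraphs = [
        "You are an expert creator of language learning resources. I want to "
        "create some text-based learning units for beginner "
        f"{language} learners (level A0/A1 on the CEFR scale). The units "
        "follow the parallel text approach of the well-known Assimil "
        "language learning books.",
        unit,
        "The text should be written in very clear, simple language. The "
        "language must read like a native speaker wrote it, and be error-free "
        "and natural-sounding. Source the info for the text from target "
        "language resources online, making it as up-to-date and authentic as "
        "possible. It should be completely original and not copied or lifted "
        "from any other source directly.",
        "After the text, there is a glossary list of the key topic words from "
        "the text, sorted alphabetically and grouped by parts of speech "
        "(nouns, verbs, adjectives, adverbs etc.).",
        f"Are you ready to create some content? The first topic is: {topic}",
    ]
    return "\n\n".join(paragraphs)


if __name__ == "__main__":
    for topic in ["Mobile Technology", "Public Transport"]:
        print(build_unit_prompt("Malay", "English", topic))
        print("\n" + "=" * 40 + "\n")
```

Set `dialogue=True` to switch to the dialogue-style second paragraph.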

How Do I Use Them?

So, with your shiny, new Assimil-style units spooled off, what do you do with them?

Personally, I like to copy and paste the output into the notes app on my phone. That way, they make nice potted units to browse through when I have some spare moments on the bus or train. They’re equally handy copy-pasted into PDF documents that you can annotate on your phone or tablet.

Parallel text for Malay language learning created by AI


In terms of real-world use, the self-contained, chatty texts typically created make perfect material for the islands approach to improving spoken fluency. Create some units in topics that are likely to come up in conversation. Then, spend some time memorising the phrases by heart. You’ll be able to draw on them whenever you need in real-life conversation.

Enjoy prompts like these? Check out my book AI for Language Learners, which lists even more fun ways to get results without paying hefty course book price tags!