Why your usual Wordle strategy isnʻt working today, according to a linguistic... - 0 views
-
Lara Cowell on 13 Mar 22TechRadar spoke to Dr Matthew Voice, an Assistant Professor in Applied Linguistics at the UK's University of Warwick, to find out the science behind the struggle to deduce Wordle Puzzle #256. "[In your live blog] you've already talked about _ATCH as a kind of trap. This is an example of an n-gram, i.e. a group of letters of a length (n) that commonly cluster together. So this is an n-gram with a length of four letters: a quadrigram," Professor Voice tells us. "Using [this] Project Gutenberg data, it's interesting to note that _ATCH isn't listed as one of the most common quadrigrams in English overall, but the [same] data considers words of all lengths, rather than just the five letters Wordle is limited to. I don't know of any corpus exclusively composed of common 5 letter words, but it might be the case that _ATCH happens to be particularly productive for that length." "The other thing to mention," Professor Voice adds, "would be that the quadrigram _ATCH is made up of smaller n-grams, like the bigram AT, which is extremely common in English. So we're seeing a lot of common building blocks in one word, which means that sorting individual letters might not be narrowing down people's guesses as much as it would with other words."