R – Austronesian Language Translation


Google Translate is awesome for most of the major languages but where do you even begin if you wanted to make your OWN translation engine? Let's say I want to create a very basic Cuyonon to English or even an English to Cuyonon phrase translator, where do I begin?

Best Solution

You read about 5,000 pages on the science of machine translation. Google uses statistical machine translation. They collect gigantic parallel corpora of text in the two languages. They they match up the sentences (this alignment problem is not trivial) and then they train a gigantic statistical model. There are open source kits that can build these models if you have all the data, but they won't work as well as Google's.

For example, this.