Kennigit
proud 2 boxer
So, we all know that in the future, translation software will only continue to get better. There are various goals that can be set--or at least imagined--and they may or may not happen sequentially, in parallel, or even at all.
translation vs interpretation
Generally, "translations" means 1:1 accuracy or very close to it, whereas "interpretation" generally means "getting the jist/approximate meaning of it". Some people like to be very detail oriented and point out that most real-time or live translations basically need to be interpretations, e.g. live translating an idiom or phrase unique in one language, rather either translating literally or having the time to really translate the idiom or phrase into a new phrase that entirely captures the meaning properly.
Constructed language
A constructed language would be something like esperanto, which has no historial basis in terms of culture and is constructed solely for the purpose of trying to get a "universal language" or "universal secondary language"
Rosetta stone
One text translated to many other languages.
Kennigit generated postulates (KGP)
The list goes on, in both scope (what are future goals) and timeframe (what will be achieved and when, and what first?). But, for instance, it may be pointless to try to develop a translator that uses the original speaker's voice and transmits it in some translated way by adjusting all the proper frequencies, etc, to reconstruct the same tone with other words (otherwise you have to have a prerecorded bank of the speaker listing out vocab words) if everyone is satisfied hearing a third party voice.
I have spent 2 minutes and 23 seconds learning about this subject (coincidentally, the length of this video
The process of google translate is to look at translated documents and try statistical tests to have pattern recognition. That is, the translation works by sampling larger and larger data sets, and is a monte carlo method (not deterministic, i.e. the computer does not know the rules of every language but guesses it). The video should explain better.
Thus, perhaps translating Swahili to Japanese may not be very effective, as there are probably not many translated texts between swahili and japanese. I'm sure google's pattern recognition may make use of secondary translations, e.g. perhaps there are many more english to japanese translations than spanish to japanese, so a step of spanish -> japanese may flow as spanish -> english -> japanese, or to make use of a "rosetta stone", let's say a chinese document all translated into english, spanish, and japanese, and use that document to compare the three together to construct rules for spanish -> japanese (rather than look for samples of spanish documents translated directly into japanese)
Pictures perhaps work better:
I assume (though will not google it) that most professional translators are supposedly very knowledgeable in 2-3 languages, and work by knowing the "rules" of those languages entirely.
1) timeframes? 10 years? 20? 50?
2) Is universal translation the final goal, or is it to make everyone a native speaker in a language (e.g. esperanto, a constructed language).
3) Will a secondary, constructed language, like esperanto, ever exist in a widespread sense once universal translation becomes more realistic?
3b) Will a secondary, constructed language, like esperanto, be human derived (such as esperanto) or exclusively in computer pattern recognition?
tl;dr: A general solution is not expected to me (i.e. no universal secondary language, either in computers or in humans), which brings me to
4) What will be the cutoff for "universal"?
Question 3 would establish true universal language. I, however, think it will just be done for the top 10 used languages, plus maybe 1 or 2 to try to help out w/ ancient texts (e.g. Greek), and call it a day.
5) How widespread will this be implemented? Only lofty UN meetings? The common man?
Terminology
Spoiler :
translation vs interpretation
Generally, "translations" means 1:1 accuracy or very close to it, whereas "interpretation" generally means "getting the jist/approximate meaning of it". Some people like to be very detail oriented and point out that most real-time or live translations basically need to be interpretations, e.g. live translating an idiom or phrase unique in one language, rather either translating literally or having the time to really translate the idiom or phrase into a new phrase that entirely captures the meaning properly.
Constructed language
A constructed language would be something like esperanto, which has no historial basis in terms of culture and is constructed solely for the purpose of trying to get a "universal language" or "universal secondary language"
Rosetta stone
One text translated to many other languages.
Kennigit generated postulates (KGP)
- A "universal translator", I would say, would be expected to occur before the ability to implant devices into the brain that can "install" a language to the human.
- A "universal translator working through text" would be expected before a cochlear implant or other hearing aid is able to recognize a human's speech and translate after a short real-time delay.
- A "unversal interpretor working by hearing a third party's voice" (i.e. the listener should hear the translation dominate the volume received--such as having a headset at the UN where you listen to your interpretor) would be expected before some voice modification where you hear the speaker talk to you in your own language, rather than hear a third party voice (in the future, will be entirely robotic) generate the spoken words. Or, of course, before an implant that allows the speaker to talk in your language.
The list goes on, in both scope (what are future goals) and timeframe (what will be achieved and when, and what first?). But, for instance, it may be pointless to try to develop a translator that uses the original speaker's voice and transmits it in some translated way by adjusting all the proper frequencies, etc, to reconstruct the same tone with other words (otherwise you have to have a prerecorded bank of the speaker listing out vocab words) if everyone is satisfied hearing a third party voice.
Current discussion and technologies:
I have spent 2 minutes and 23 seconds learning about this subject (coincidentally, the length of this video

The process of google translate is to look at translated documents and try statistical tests to have pattern recognition. That is, the translation works by sampling larger and larger data sets, and is a monte carlo method (not deterministic, i.e. the computer does not know the rules of every language but guesses it). The video should explain better.
Spoiler additional discussion :
Thus, perhaps translating Swahili to Japanese may not be very effective, as there are probably not many translated texts between swahili and japanese. I'm sure google's pattern recognition may make use of secondary translations, e.g. perhaps there are many more english to japanese translations than spanish to japanese, so a step of spanish -> japanese may flow as spanish -> english -> japanese, or to make use of a "rosetta stone", let's say a chinese document all translated into english, spanish, and japanese, and use that document to compare the three together to construct rules for spanish -> japanese (rather than look for samples of spanish documents translated directly into japanese)
Pictures perhaps work better:



I assume (though will not google it) that most professional translators are supposedly very knowledgeable in 2-3 languages, and work by knowing the "rules" of those languages entirely.
This thread and the future:
I'll ask a few questions to prompt discussion (my answers in spoilers):
1) timeframes? 10 years? 20? 50?
Spoiler :
I certainly expect in 25 years the equivalent of google translate to work very fluently and not be the semi-clunky mess it is now (similar to how subtitles on videos are clunky--e.g. youtube scripts, though I don't know how closed captioned seemed to have done things well and was from a long time ago).
2) Is universal translation the final goal, or is it to make everyone a native speaker in a language (e.g. esperanto, a constructed language).
Spoiler :
"Computationally" speaking, if we ever have the ability to "install" a language to a person, there is no reason to ever need to install chinese, korean, spanish, english, etc: just install everybody with 1 language. "Translation" would be a thing of the past. However, due to the cultural importances associated with language & the hope to preserve languages, I doubt this will ever be achieved
I also think that is a huge neurobiological task that I don't think will happen. There can be tools to help children or adults learn languages better, but I don't think we'll be able to just make a 6 year old polyglot
And why would we, since I think a hearing-aid like device would work better than making the user "learn" the language.
I also think that is a huge neurobiological task that I don't think will happen. There can be tools to help children or adults learn languages better, but I don't think we'll be able to just make a 6 year old polyglot
And why would we, since I think a hearing-aid like device would work better than making the user "learn" the language.
3) Will a secondary, constructed language, like esperanto, ever exist in a widespread sense once universal translation becomes more realistic?
3b) Will a secondary, constructed language, like esperanto, be human derived (such as esperanto) or exclusively in computer pattern recognition?
Spoiler :
3: I don't think so. Humans are lazy.
3b:
Esperanto was/is created trying to blend in mostly western languages afaik (e.g. french, spanish, english). Any human attempt at constructing a language also will be biased. A computer could make up a language today, that would perhaps be the most difficult language ever for a human to try to learn, but it could facilitate creating a "rosetta stone" for the computer to be able to quickly translate any language to any other language, so long as the language X <-> constructed language link is done accurately. This also would easily allow for any new language (aliens, old discovered languages, whatever) to be translated to any other language if the first step can be established.
I don't think this will happen personally. I think the research effort would go towards "patching" things where needed, e.g. say korean <-> chinese works well, the goal will just to get chinese working and then add in korean through that.
3b:
Esperanto was/is created trying to blend in mostly western languages afaik (e.g. french, spanish, english). Any human attempt at constructing a language also will be biased. A computer could make up a language today, that would perhaps be the most difficult language ever for a human to try to learn, but it could facilitate creating a "rosetta stone" for the computer to be able to quickly translate any language to any other language, so long as the language X <-> constructed language link is done accurately. This also would easily allow for any new language (aliens, old discovered languages, whatever) to be translated to any other language if the first step can be established.
I don't think this will happen personally. I think the research effort would go towards "patching" things where needed, e.g. say korean <-> chinese works well, the goal will just to get chinese working and then add in korean through that.
tl;dr: A general solution is not expected to me (i.e. no universal secondary language, either in computers or in humans), which brings me to
4) What will be the cutoff for "universal"?
Question 3 would establish true universal language. I, however, think it will just be done for the top 10 used languages, plus maybe 1 or 2 to try to help out w/ ancient texts (e.g. Greek), and call it a day.
5) How widespread will this be implemented? Only lofty UN meetings? The common man?
Spoiler :
obviously text translation is anyone with the internet. But for instance, if we had cochlear implants available, would people bother to get it? Only those that do foreign business? Only the rich? Is there counter-culture (i.e. "Speak American?")
What do you think are the goals of universal translation (scope), and what do you think will be achieved--either technologically or in actual implementation