Google’s machine-learning research team has unveiled MultiModel, a new neural network architecture that can handle multiple domains (image recognition, language translation, speech recognition, etc.) simultaneously. The team describes it as a first step towards the convergence of vision, audio, and language inside a single neural network, which makes it a notable achievement for an industry on the brink of transforming computing. The team notes that over the last decade the capabilities of deep learning have progressed at an astonishing rate. However, they add that “the current state of the field is that the neural network architectures are highly specialized to specific domains of application. An important question remains unanswered: will a convergence between these domains facilitate a unified model capable of…