IDG Contributor Network: Why Google might lose the enterprise AI wars


“Google has never understood enterprise,” asserts Chris Nicholson, CEO of Skymind. By contrast, Nicholson gets how businesses think. His company builds the leading open-source, enterprise-ready library for deep learning. “You can do anything you want in consumer, because people aren’t paying. In enterprise, your customers hand you eight-figure checks and expect top-notch professional service along with golf games and steak dinners.”

The three current leaders in cloud computing services – Amazon (AWS), Microsoft (Azure), and IBM (IBM Cloud) – understand this dynamic very well. While Google is widely acknowledged as the de facto leader in AI research with DeepMind and Google Brain, winning the “Machine Learning As A Service” (MLaaS) wars is much harder than simply releasing free tools like TensorFlow.

“TensorFlow is a loss leader for Google,” explains Nicholson. Google lags in share of the enterprise cloud computing market, so it hopes to make up the difference by offering machine learning tools for free. Indeed, TensorFlow rapidly overtook other popular deep learning libraries like Theano, Caffe, and Torch, as measured by number of forks on GitHub, but these numbers are misleading. “Google has a Udacity course on deep learning where every student is required to fork TensorFlow,” Nicholson points out. “These indie developers have no money and don’t represent true business usage.”

Two major obstacles stand in Google’s way to cloud AI dominance: data gravity and lack of backwards compatibility. To illustrate the issue of data gravity, look no further than Amazon’s physical data-transfer appliance. This chair-sized flash drive, capable of storing 80 TB of precious enterprise data, is physically shipped to a customer’s on-premises data center to be loaded with data and then shipped back to Amazon for upload to AWS servers. Ironically, manual transfer is significantly faster and cheaper for large data sets than any internet method. Enterprises with data-hungry AI applications will have an easier time running algorithms on-prem or on AWS and Azure, where their data already lives.
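
To put rough numbers on the data gravity argument, here is a back-of-the-envelope sketch in Python. The only figure taken from the article is the 80 TB capacity; the link speeds, the assumption of a fully utilised connection, and the helper name `transfer_days` are illustrative assumptions, not measurements from any provider.

```python
def transfer_days(terabytes: float, link_mbps: float) -> float:
    """Days needed to send `terabytes` over a link of `link_mbps`, fully utilised."""
    bits = terabytes * 1e12 * 8          # decimal terabytes -> bits
    seconds = bits / (link_mbps * 1e6)   # megabits per second -> bits per second
    return seconds / 86_400              # seconds -> days


# Hypothetical link speeds, not figures from the article.
for mbps in (100, 1_000, 10_000):
    print(f"80 TB over {mbps:>6} Mbps: {transfer_days(80, mbps):6.1f} days")
```

Under these assumptions, a dedicated 100 Mbps line would need roughly 74 days to move a single 80 TB load, and a 1 Gbps line about a week. The sketch does not model shipping time or bandwidth cost, but it suggests why putting a drive on a truck can beat the network once data sets reach this size.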