Processing AI at the Edge – Use-Cases and AI Hardware Considerations

Daniel Faggella

Daniel Faggella is Head of Research at Emerj. Called upon by the United Nations, World Bank, INTERPOL, and leading enterprises, Daniel is a globally sought-after expert on the competitive strategy implications of AI for business and government leaders.


We spoke to Moe Tanabian, General Manager of Intelligent Devices at Microsoft, who is speaking at the AI Edge Summit in Mountain View, California on September 17 and 18. Tanabian discusses how to think about and reframe business problems to make them more accessible for AI, as well as AI at the edge, which involves doing AI processing on individual devices rather than in the cloud. The edge could open up new potential for business problems to be solved with AI. Tanabian also provides representative use cases of intelligent devices.


Guest: Moe Tanabian, General Manager of Intelligent Devices – Microsoft

Expertise: AI and machine learning

Brief Recognition: Tanabian holds a Master’s degree in Systems and Computer Engineering from Carleton University. Prior to Microsoft, he was Android Engineering Lead at Amazon and Corporate Vice President, CDO (AI, SW, HW) at Samsung.

Interview Highlights

(03:00) What are the most important near-term impacts of AI hardware?

Moe Tanabian: There is a great book called Prediction Machines, and Ajay Agrawal from the University of Toronto builds a great model around this: it’s about a very important economic input becoming really cheap, and when something gets cheap, it goes everywhere. He gives some examples.

One of the examples that I like a lot is when we started to use silicon and software to do arithmetic. Arithmetic is a very important economic input. Once we were able to do that, we started to basically handle those jobs in silicon and software instead of asking humans to do it.

We were a lot more efficient, and humans could do a lot more complex things. But something really interesting happened beyond that point. When arithmetic driven by silicon and software became really cheap, we started to turn other things that were not arithmetic-based into arithmetic problems.

But because the arithmetic became so cheap, we used it as an economic input to other things like photography. We started to digitize photos. Photography was a chemistry problem. We turned it into an arithmetic problem because arithmetic was so cheap. The same thing is happening with AI and the economic input that AI is changing is prediction.

Humans have been the only system that could do prediction. We did prediction in a variety of places. When you went to get a loan, a loan officer would predict whether you’re going to pay the loan back or not and they would grant you the loan or not.

Or some people would figure out that the population would like these types of titles, and the New York Times bestseller list came about, or the recommended movies for next week came about. We started to leverage AI-based prediction and recommendation engines to replace those as an economic input.

That enabled a lot of new types of businesses, enabled business models to change the value chain, and is changing the value chain as we speak. It is disintermediating existing value chains, all of a sudden moving the integration point across the value chain and hence creating new businesses.

New businesses are coming into existence, and old ones are being sidelined. But that extra effect that we saw in arithmetic adoption, where we turned other non-arithmetic things into arithmetic, is the exciting part of AI. Now we’re turning problems that are not prediction problems in nature into prediction problems, because prediction is cheap now.

Self-driving is one example. Another is having a fairly sophisticated, shared understanding of how, in a business setting, different types of business entities impact our business results. I’ll give you an example. Now AI is sitting in cameras; there is computer vision.

Cameras are watching shelves and recognizing which items are moving fast; moving fast means customers like them. Then you hook that up to the store’s ERP system, and you see that for some reason the store manager or the merchandising people put a low-margin or slow-moving product on the end cap, while a fast-moving product is not on the end cap.

All of a sudden that whole thing is detected, there is a remedy, and there is a message to the store manager: “Hey, you might want to move this item to the end cap.” These are things we weren’t able to do before, and it’s impacting a lot of things.
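
As a rough sketch of the kind of shelf-to-ERP logic Tanabian describes (the data fields, thresholds, and SKUs below are hypothetical assumptions for illustration, not any retailer’s or Microsoft’s actual system), the join might look like this:

```python
# Hypothetical sketch: flag end-cap placement opportunities by joining
# camera-derived sell-through rates with ERP placement and margin data.
# All field names, SKUs, and thresholds are illustrative assumptions.
from dataclasses import dataclass

@dataclass
class ShelfObservation:
    sku: str
    units_moved_per_hour: float  # estimated by on-device computer vision

@dataclass
class ErpRecord:
    sku: str
    margin: float                # gross margin per unit
    on_end_cap: bool             # current placement from the ERP/planogram

def end_cap_recommendations(observations, erp, velocity_threshold=5.0):
    """Suggest moving fast-moving items that are not yet on the end cap."""
    erp_by_sku = {r.sku: r for r in erp}
    suggestions = []
    for obs in observations:
        record = erp_by_sku.get(obs.sku)
        if record is None:
            continue
        fast_moving = obs.units_moved_per_hour >= velocity_threshold
        if fast_moving and not record.on_end_cap:
            suggestions.append(
                f"Consider moving {obs.sku} to the end cap "
                f"({obs.units_moved_per_hour:.1f} units/hour, margin {record.margin:.2f})"
            )
    return suggestions

if __name__ == "__main__":
    observations = [ShelfObservation("SKU-123", 9.2), ShelfObservation("SKU-456", 1.1)]
    erp = [ErpRecord("SKU-123", 4.50, on_end_cap=False),
           ErpRecord("SKU-456", 0.80, on_end_cap=True)]
    for message in end_cap_recommendations(observations, erp):
        print(message)
```

The perception model running on the camera would supply the estimated units moved per hour; everything else is ordinary business logic against the ERP data.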

One more piece of context and then I’ll get into what’s happening with AI hardware. We have had AI in multiple waves, and these waves are not done; they’re ongoing, but new waves are coming on board. The first wave was internet AI. This wave of AI was driven by the training data we needed to train our models.

That data came from the phrases people searched, from mouse clicks, and from keyboard input. It enabled things like Netflix’s recommendation engine, based on what people viewed and watched, and things like Google Search, based on what people searched for.

The moment we learned that we can use deep neural networks effectively and economically, that we have enough hardware and data to train them, we realized that sensor-based AI is such a big deal. We saw the entry point of it with voice and language, things like virtual assistants and the devices that enabled those types of experiences.

But the moment you add cameras as a major sensor to the mix, all of a sudden you see that the economics of a lot of the business activities we do are going to change. And that, in my view, is the real push in the next decade or so: where AI is going to create value and make our lives better is where perception AI is being integrated into our lives.

(10:00) Is what happened with arithmetic happening with both prediction and perception, or do you think prediction is the umbrella for this technological shift?

MT: I’m glad you brought that up. Pretty much anything that AI does currently is prediction. We do practically two different things with AI and the AI toolchain we have at our disposal. One is classification, which is “this is a dog,” “this is a cat,” or “this is Moe,” or “this is Dan” for authentication purposes.

The other is forecasting: given the way things have evolved to this point in time, this is how we predict they’re going to go on. But…even classification is a prediction problem, because all you’re doing in classification is predicting the likelihood, with some probability, that the image you see is a dog or a cat.
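
To make that last point concrete, here is a minimal sketch of why classification is prediction: a classifier’s raw scores are turned into probabilities over labels, and the “answer” is just the most likely prediction (the scores below are made-up values, not from any real model):

```python
# Hypothetical sketch: classification as prediction. Raw scores (logits) are
# converted to probabilities with a softmax; the label is the most likely one.
import math

def softmax(logits):
    """Convert raw scores into probabilities that sum to 1."""
    m = max(logits)                      # subtract the max for numerical stability
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]

labels = ["dog", "cat"]
logits = [2.3, 0.4]                      # made-up scores from some image model
probs = softmax(logits)

for label, p in zip(labels, probs):
    print(f"P({label}) = {p:.2f}")       # e.g. P(dog) = 0.87, P(cat) = 0.13

prediction = labels[probs.index(max(probs))]
print(f"Predicted: {prediction}")
```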

(11:30) How does that shift impact AI hardware when it comes to perception being more broadly used?

MT: Let’s look again at where we’re turning other things into prediction problems, things that are generally perception and cognition, cognitive activities that humans do today. Look at driving, for example. We talk a lot about autonomous driving, autonomous drones, autonomous robots.

What is driving? When you look at it from a human’s perspective, it’s a fairly complicated cognitive task. But when you break it down, if you want to turn it into a prediction problem, and you look at a car that is driving, it has a limited number of actuations: “go slower,” “go faster,” “turn left,” or “turn right.”

Really, you can abstract the actuations that a car can take into these four. Maybe there are a few others, like using the turn signal or turning the lights on, but those are ancillary; they’re not core. And if you look at it from a prediction perspective, you’re training an AI engine, and you have sensors around the car.

So you have sensory perception of your environment as you’re moving through it. Your question is: what would a good human driver do at this point in time? You collect enough data to train that AI, that collective set of AI engines that take the sensory data in and translate it based on that question, which is a prediction question.

We are predicting what a good driver would do. If you translate that into one of those four actuations…all of a sudden driving becomes a prediction problem. It can’t happen unless you have the sensors, which is hardware, and you can process a significant amount of data in a relatively short amount of time, near real time, to answer that question.

All of a sudden, running a fairly efficient inference engine on the car becomes a key contributor to enabling that use case. You can scale that and apply it across the horizon of perception AI applications to other use cases, where the basic common components are…a sensor.

You have the ability to get enough data to train the model, and you have the ability to run the inference somewhere at the edge in near real time. Some applications don’t have real-time requirements, but a lot of them do, so that you can deliver on that prediction problem.
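
A toy sketch of that framing (not how any production autonomy stack works; the feature vector, placeholder weights, and class list are assumptions for illustration): sensor readings go in, and out comes a prediction of which of the four actuations a good driver would take.

```python
# Toy sketch: driving reduced to a prediction problem. A placeholder model maps
# a sensor feature vector to a probability over four actuations, answering
# "what would a good driver do right now?" The weights are random stand-ins for
# a model that would be trained on recorded good-driver behavior.
import numpy as np

ACTUATIONS = ["go_slower", "go_faster", "turn_left", "turn_right"]

rng = np.random.default_rng(0)
W = rng.normal(size=(4, 6))   # placeholder parameters: 6 example features -> 4 scores
b = np.zeros(4)

def predict_actuation(features):
    """Return a probability for each actuation given the current sensor features."""
    scores = W @ features + b
    exp = np.exp(scores - scores.max())          # softmax over the four actuations
    probs = exp / exp.sum()
    return dict(zip(ACTUATIONS, probs))

# Example (invented) sensor features: [speed, distance_to_lead_car, lane_offset,
#                                      heading_error, obstacle_left, obstacle_right]
features = np.array([22.0, 8.5, -0.3, 0.05, 0.0, 1.0])
for actuation, p in predict_actuation(features).items():
    print(f"{actuation}: {p:.2f}")
```

In a real system the placeholder weights would come from training on large volumes of sensor logs, and the forward pass would run on the efficient on-car inference engine Tanabian describes.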

(15:00) How is that shift impacting the need for new kinds of hardware?

MT: There are a variety of barriers: some are purely technical, some are economic, and some are regulatory. We need to be able to process these models, a lot of that being inference, doing the prediction on a trained model. In some cases we may actually want to train some parts of the model at the edge.

But we have to run ML models on the edge. The technical barrier: a typical car with its sensors, the lidar sensor and the cameras, generates a truckload of data on a daily basis. This is not something you could upload to the cloud even if you had all the bandwidth in the world. So there’s a technical barrier.

Even if you could, it’s so expensive that it kills the business viability, so there is an economic barrier. And then there are regulatory barriers: in some cases, you can’t transmit the camera data anywhere outside of the on-premises infrastructure. When you put all of that into the bucket, you realize that we need capabilities to run AI on the edge.

And again, I go back to my previous statement. To do that, the core requirement is efficiency, and by efficient I mean from an execution perspective, a processing perspective, and a power perspective. You need a power-efficient, high-compute-capacity way of running AI models on the edge.
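
As a minimal sketch of what “running the model at the edge” can look like in practice (the model file, input name, and input shape are hypothetical; ONNX Runtime is just one of several runtimes one might use on a device):

```python
# Minimal sketch of on-device inference with ONNX Runtime. The model path and
# input shape are hypothetical; the point is that the prediction runs locally,
# near real time, without shipping raw sensor data to the cloud.
import numpy as np
import onnxruntime as ort

# Hypothetical pre-trained, exported model deployed to the device.
session = ort.InferenceSession("edge_model.onnx", providers=["CPUExecutionProvider"])

input_name = session.get_inputs()[0].name
frame = np.random.rand(1, 3, 224, 224).astype(np.float32)  # stand-in camera frame

outputs = session.run(None, {input_name: frame})
print("prediction:", outputs[0])
```

Quantizing the model and delegating work to an on-device accelerator are the usual next steps toward the power and compute efficiency Tanabian describes.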

(17:00) What are some interesting use cases where these new capabilities, these new necessities when it comes to enabling AI, are beginning to emerge and make their way forward?

MT: One of my favorite people in Silicon Valley is a VC…He has a pretty sharp mind, and he has a very interesting saying: a technology that promises it can replace a chore typically done by humans, and can do it better at a much cheaper price, bring those business ideas to me.

He’s always after those business ideas. Now, if you…look at different industries: retail, manufacturing, logistics, whether it’s an autonomous robot in a warehouse or autonomous driving and the logistics of delivering things, like FedEx.

If you look at the learning industry and education, you can point to use cases in almost all of them where that lens applies directly once you put perception AI in the mix. When you have sensors and you have AI running on the edge, all of a sudden, like in manufacturing, there are people who are experts at looking at paint.

They have a trained eye to see whether the paint quality meets the bar when a car is leaving the production line. You can easily do that with a really good trained model and a set of cameras. I’ll give you another really interesting example from an industry completely removed from the typical industries we think about: the mining industry.

When you mine for gold, the way you design your blast determines the size of the rocks and particles you get after the explosion, and that determines how efficiently your mill runs. If they’re too big, you need to spend a lot more energy to break them down. If they’re too small, there are other issues. You want to make sure the distribution of particle sizes is in the right range.

Today there are human experts who go look at that and say, “Well, I think we used too much explosive there and too little explosive here. Let’s rearrange it.” Now there is a company in Vancouver, Canada…they’re turning that into a complete perception AI problem: they have cameras on the hydraulic shovels that do it automatically.

As the shovels load the material into the dump trucks that take it to the mill, they look at the distribution of particle sizes and calculate, with some trained models, “Hey, we used too much explosive today, let’s bring it down,” or “we used too little.”
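
A stripped-down sketch of the downstream check (the size band, threshold, and generated sizes are invented for illustration; the hard part, estimating fragment sizes from the shovel-camera images, is the perception model itself):

```python
# Hypothetical sketch: given particle sizes estimated by a vision model on the
# shovel cameras, check whether the post-blast size distribution sits in the
# band the mill wants, and suggest adjusting the explosive charge.
# All numbers below are illustrative assumptions.
import numpy as np

def blast_feedback(particle_sizes_mm, target_low=50, target_high=250, min_in_band=0.7):
    """Return a coarse recommendation based on the fragment size distribution."""
    sizes = np.asarray(particle_sizes_mm)
    in_band = np.mean((sizes >= target_low) & (sizes <= target_high))
    too_big = np.mean(sizes > target_high)
    too_small = np.mean(sizes < target_low)

    if in_band >= min_in_band:
        return "Distribution in range: keep the current blast design."
    if too_big > too_small:
        return "Too many oversized fragments: consider more explosive energy."
    return "Too many fines: consider less explosive energy."

# Example: fragment sizes (mm) estimated by the perception model for one truck load.
sizes = np.random.default_rng(1).normal(loc=300, scale=80, size=500)
print(blast_feedback(sizes))
```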

All of a sudden, something that was done by a very highly trained human is now done with a set of sensors and a trained machine learning model, and that human can go do far more complex tasks that we can’t yet do with AI. It’s incredible how almost all of our industries, healthcare being another really great example, are changing because of perception AI.

(21:00) Is there anything you might add to that in terms of helping people think through how their business might change?

MT: Here’s how I would encourage people to look at their businesses. Any business is part of a value chain. There is a chain of actors in the value chain that starts from some raw material or other input and eventually turns it into a product or a service delivered to a customer and consumer.

I encourage people to look at the role they play in their value chain and see where the integration points are. What part of the value chain is delivered by a single actor? What part is modularized, where one actor hands over its output to another actor on the way to the final product or service?

See if you can find areas where you can shift those modularity points, to integrate further or to modularize further. Obviously…when you’re focused on delivering a complex task, integration is key. When you modularize, that task is pretty mature and well understood; you can break it down and create interfaces for different actors in the industry to participate.

Look at where you can move the modularity and integration points. One good lens to apply in that exercise is: what are the tasks currently done by a human that a typical human can do in one to five seconds? Those are great candidates for getting AI, particularly perception AI, to participate in the value chain.


This article was sponsored by Kisaco Research and was written, edited and published in alignment with our transparent Emerj sponsored content guidelines. Learn more about reaching our AI-focused executive audience on our Emerj advertising page.

Header Image Credit: Techcentral.co.za

 
