[seopress_breadcrumbs]

Why Executives Should Keep Up with AI Trends in Business

•

March 24, 2019

Why Executives Should Keep Up with AI Trends in Business

I hope that by the end of this episode of the AI in Industry podcast, you’ll not only be able to hire better data scientists who will be a fit for your business problems and build better data science teams, but also pick the AI applications and use cases that you should bring into your business versus those that you shouldn’t.

I spend a lot of time being the business voice who talks to the technical people and then communicates that bridge across to the other business folks of the world who are interested in artificial intelligence. In this episode, I grab somebody from the other side of the pond, someone with a formal master’s degree-level focus on machine learning, who applies these technical skills in business and has been forced to learn how to speak business.

This episode, we interview Brooke Wenig, the machine learning practice lead at Databricks. Databricks was founded by the folks who created Apache Spark. Those of you who are technically savvy with AI will be familiar with Apache Spark as an open source language for artificial intelligence and distributed computing.

Wenig works with a lot of companies with Databricks. Databricks is now close to 700 folks and helps implement AI applications into, oftentimes, large enterprise environments. Wenig speaks with us this week about what to look for in an actual data scientist and how to find data science folks with the right skills to be able to communicate to business people, not just to work with models. What should people be capable of; how should they be capable of thinking? Hopefully, some of you will have better interview questions by the end of this podcast.

In addition, we ask Brooke about what the value of covering the cutting edge applications of AI is, looking at what’s working in industry. How does that help us in our own business make better decisions?

It’s not just opening up our minds to more possibilities of AI. There are also concerns about how we can make better decisions about what kind of AI projects to adopt. So not just seeing new shiny things, but actually making smarter calls. Brooke has a pretty educated perspective in this take, having seen the inside of a lot of different businesses. I thought she brought some good insights to bear.

This episode is brought to you by Databricks. Databricks is hosting their annual San Francisco Spark+AI Summit from April 3 to 25, which is a whole event intended to bring technical folks and business leaders together with what the cutting edge applications are of artificial intelligence today and how to bring them to life.

Subscribe to our AI in Industry Podcast with your favorite podcast service:

Guest: Brooke Wenig, Machine Learning Practice Lead – Databricks

Expertise: Apache Spark, enterprise adoption of AI

Brief Recognition: Wenig earned her MS in Computer Science from UCLA. Prior to joining Databricks, she was an intern at Google, Myfitness Pal, and Splunk.

Interview Highlights

(03:00) How do you like to conceptualize and explain what a data scientist is when you’re speaking to the enterprise?

BW: A data scientist is someone who has business-level domain knowledge of what the data is, can build predictive models on it, and can communicate the models and the importance of those models to business leaders to ultimately drive decisions for that company. It very much is a cross-functional role and a very interdisciplinary role. It involves both skills in engineering, math and statistics, as well as communication.

(03:30) What do you mean when you say communication?

BW: The first step is getting buy-in from the business leaders. For example, I know a lot of companies struggle to even get data science teams because they don’t understand the importance of data science. Once they’ve got buy-in, now they need to make models that will actually drive business decisions. There’s no point in me spending hours every day mucking around with scikit-learn in Keras to build models that nobody’s actually going to use at the end of the day, so that’s why communication is so important: so their work can be adopted by the rest of the company.

Another aspect of why communication is very important is to work with other teams and understand what the data even means. If a data scientist doesn’t understand their data, they’re not going to make a model that is best-fitted for that data.

The first question I’d ask before I even start looking at the data is, “what is the business problem that you’re trying to solve?” Is it demand forecasting; is it resource forecasting? Then when we talk about the problem we’re trying to solve, can machine learning even solve it, or could you simply solve that problem with better technology?

So as part of that, the very first thing I do when I start machine learning problems is understand, “what is a baseline metric?” For example, if I always predict the average, what would be my accuracy in the case of fraud detection, or what would be my precision and my recall? So establishing what success is is the most important thing.

Then, after we establish success, we can start looking at the data and see those constraints in play. For example, we never have more than 200 of [something]. Hey, what are these null values doing here? Sometimes they can be corrupted data, but sometimes they can be an indicator of the thing you’re trying to predict at the end.

(10:00) Why do you think that we see so much of that hands-on educational stuff from vendor companies to enterprises?

BW: Yeah, I think it very much depends on the company. Databricks is built on a very technical concept, which is Apache Spark, and it isn’t a technology that many people are familiar with when they’re in school. For example, I went to UCLA, and I only had one class taught using Apache Spark, and that’s because my advisor had helped create ML Lib, which is the machine learning library for Spark, back when he was a postdoc at the AMP lab at UC Berkeley.

So with these new technologies, it’s very hard to get people that are fresh out of school learning them, so when people are already in industry, they’re a lot more hesitant to adopt these new technologies, because they didn’t learn them in school. They’re unfamiliar; they can have a very high learning curve.

That’s why with these vendor companies, they have to come in and provide resources, not just for training, but also consulting and implementation, as well as educating the team members about what the technology is and when and where you should use it. Spark is fantastic, but it shouldn’t be used everywhere.

(11:30) Do you think that these very hands-on wings of vendor companies are going to shrink in the next five years as more of these companies are familiar with the lingo, have data scientists on board?

BW: I actually think it will increase rather than decrease. Throughout the US, there’s a huge shortage of qualified data scientists, so because of that and the fact that a lot of companies aren’t making the most out of data-driven decisions, those two paired together, I think, will drive a lot more need, actually, for help from vendor companies on “what is AI” and “what is ML.”

A lot of it is hype, and so understanding what can actually be done to solve the business problem versus what is hype, I think, is very important.

(14:30) Why it is important to know what’s working now in industry? What do you see as the primary value there?

BW: I think it’s very important to stay on the cutting edge of what’s happening in industry, and to some extent, research. The reason why I say industry is because there are tons of new open source projects out there. You can find them on GitHub, but if they have very few active developers, then that project might not be maintained, and then you rely on something that’s no longer going to be actively developed.

So I think it’s very important to stay current with what’s happening in industry. If you’re competing for the same space, for example, with online retail, if your model is 10% better, that could translate into millions or billions of additional dollars of revenue. So understanding what is out there and understanding the trade-offs of, “what happens if I switch to using this new approach? Do I spend two years of dev time, or is that two weeks of dev time?” So the investment effort, both in terms of human hours and in terms of cost, are very important.

At Databricks, we used an open source platform called Horovod, which Uber had open sourced, because Uber was using Horovod in production for distributed training of their deep learning models, and we wanted to use a framework that was already known and tested by one very reputable company.

The other thing that I would add to that is understanding the data that these different companies have applied those models to. For example, if you are trying to do some NLP problem and your textual language is very different than what the model was trained on (for example, the model was trained on Twitter data, which has a character limit, I think, of 140 characters), you can’t directly take that model and apply it to your Amazon reviews dataset, for example, which might have a much longer character description length for a review.

(18:00) My guess is, Brooke, if you want to understand that transferability, probably the people who should be looking at these new algorithms should be either the unicorn person that can do both or probably a data scientist and a purely business functional person because it sounds like both people might be able to find a reason why this doesn’t transfer to to a company, right?

Yeah, definitely. As a data scientist, I’m often paired up with a data engineer when we start a project with the customer. For example, they’ll help do the data preparation. I’ll build the model. Then, together, we work on the deployment considerations, because as you had said, “do we have the compute resources needed? Does this need to work in a streaming environment, or batch pre-processing?”

Having all of the different personas in the room can help you make a decision much better than if you have a single person working on that project, and then they spend six months on it, realizing at the end, the solution’s not suitable for the deployment considerations.

Subscribe to our AI in Industry Podcast with your favorite podcast service:

This article was sponsored by Databricks, and was written, edited and published in alignment with our transparent Emerj sponsored content guidelines. Learn more about reaching our AI-focused executive audience on our Emerj advertising page.

Header Image Credit: Resolution Media

Recommended from Emerj

Scaling AI with Storage Efficiency – Emerj AI Leader Insight

This article is sponsored by Pure Storage and was written, edited, and published in alignment with our Emerj sponsored content guidelines. Learn more about our thought leadership and content creation services on our Emerj Media Services page.As enterprises race to implement AI, most hit a bottleneck that's hiding in plain sight: inefficient storage infrastructure. While…

Riya Pahuja

•

May 29, 2025

The Evolving Role of Banks in Fraud Detection and AML Compliance – with Nick Lewis of Standard Chartered

Financial institutions are increasingly burdened with detecting and preventing financial crimes, leading to heightened operational costs and resource allocation challenges. According to the FBI's Internet Crime Report 2024, cybercrime continues to rise sharply in both frequency and financial impact. Last year alone, the FBI received 859,532 complaints related to cybercrime — a notable increase that…

Riya Pahuja

•

May 26, 2025

Paving the Way for Continuous Auditing Workflows in Financial Services with AI – with Leaders from MindBridge, Wells Fargo, Gulfport, Bank of China, and Citi

This article is sponsored by MindBridge and was written, edited, and published in alignment with our Emerj sponsored content guidelines. Learn more about our thought leadership and content creation services on our Emerj Media Services page. Traditional audit cycles — often conducted annually or quarterly across many different industries — are increasingly misaligned with the…

Riya Pahuja

•

May 23, 2025

The Future of IT Operations with Automation and Real-Time Insights – with Troy Felix of BigPanda

This interview analysis is sponsored by BigPanda and was written, edited, and published in alignment with our Emerj sponsored content guidelines. Learn more about our thought leadership and content creation services on our Emerj Media Services page. Modern IT operations are inundated with alerts from various monitoring tools, leading to alert fatigue among IT professionals.…

Riya Pahuja

•

May 22, 2025

Preparing Financial Services for Automation in the Era of Agentic AI – with Leaders from Automation Anywhere, Barclays, and Wells Fargo

This article is sponsored by Automation Anywhere and was written, edited, and published in alignment with our Emerj sponsored content guidelines. Learn more about our thought leadership and content creation services on our Emerj Media Services page. As artificial intelligence moves from buzzword to reality, leaders find that successful adoption requires more than deploying chatbots…

Riya Pahuja

•

May 21, 2025

Artificial Intelligence at Aviva

Aviva is a British multinational insurance company headquartered in London, England. Primarily recognized as the UK's leading diversified insurer, Aviva provides various products and services across insurance, wealth management, and retirement solutions. With 19.2 million customers spanning the UK, Ireland, and Canada, Aviva has positioned itself as a major player in the financial services industry.…

Ashwin Telang

•

May 19, 2025

Navigating Challenges and Solutions in Data Security with AI – with Dimitri Sirota of BigID

This interview analysis is sponsored by BigID and was written, edited, and published in alignment with our Emerj sponsored content guidelines. Learn more about our thought leadership and content creation services on our Emerj Media Services page. Find out more about how BigID can help your organization adopt AI safely and responsibly here. Uncontrolled AI…

Riya Pahuja

•

May 15, 2025

The Future of Customer Experience in Financial Services with Agentic AI – with Abhii Parakh of Prudential Financial and James Wood of Interactions

This article is sponsored by Interactions and was written, edited, and published in alignment with our Emerj sponsored content guidelines. Learn more about our thought leadership and content creation services on our Emerj Media Services page. Low customer engagement is a persistent challenge in the insurance sector, particularly with policies held for an extended period.…

Riya Pahuja

•

May 12, 2025

Artificial Intelligence at AbbVie – Two Use Cases

AbbVie is a global biopharmaceutical leader with approximately 55,000 employees in over 70 countries. In 2024, the company invested over $10.8 billion in research and development, supporting active immunology, oncology, and neuroscience clinical programs. To accelerate drug discovery, AbbVie is applying artificial intelligence (AI) to improve early-stage decision-making. The company aims to streamline target discovery…

Marilie Fouche

•

May 12, 2025

Emerj: Building Readiness for AI Agents in Healthcare Systems - Raheel Retiwalla

Building Readiness for AI Agents in Healthcare Systems – with Raheel Retiwalla of Productive Edge

This interview analysis is sponsored by Productive Edge and was written, edited, and published in alignment with our Emerj sponsored content guidelines. Learn more about our thought leadership and content creation services on our Emerj Media Services page. Burnout among hospital staff, particularly nurses and physicians, has reached critical levels. A report by the Center…

Riya Pahuja

•

May 8, 2025

Neurobiological and Cybernetic AI for Manufacturing, Part 2 – with Oleg Savin of Unilever

In our current technology-driven era, data is considered extremely valuable. Yet, data often goes unused or underutilized. The reasons vary, but it's certainly not a newly surfaced problem. An article initially published by Harvard Business Review highlights that organizations struggle with managing and analyzing existing data. This problem is more pronounced in manufacturing, where unused…

Sharon Moran

•

May 5, 2025

Artificial Intelligence at Charles Schwab – Two Use Cases

The Charles Schwab Corporation is a leading financial services firm, reporting $10.28 trillion in client assets as of February 2025, a 16% year-over-year increase. In Q4 2024, the company generated $5.3 billion in net revenues (up 20% year-over-year) and $1.8 billion in net income, resulting in $0.94 EPS. Core net new assets reached $114.8 billion…

Riya Pahuja

•

April 28, 2025

Search site

Search site

Why Executives Should Keep Up with AI Trends in Business

Interview Highlights

Recommended from Emerj

Scaling AI with Storage Efficiency – Emerj AI Leader Insight

The Evolving Role of Banks in Fraud Detection and AML Compliance – with Nick Lewis of Standard Chartered

Paving the Way for Continuous Auditing Workflows in Financial Services with AI – with Leaders from MindBridge, Wells Fargo, Gulfport, Bank of China, and Citi

The Future of IT Operations with Automation and Real-Time Insights – with Troy Felix of BigPanda

Preparing Financial Services for Automation in the Era of Agentic AI – with Leaders from Automation Anywhere, Barclays, and Wells Fargo

Artificial Intelligence at Aviva

Navigating Challenges and Solutions in Data Security with AI – with Dimitri Sirota of BigID

The Future of Customer Experience in Financial Services with Agentic AI – with Abhii Parakh of Prudential Financial and James Wood of Interactions

Artificial Intelligence at AbbVie – Two Use Cases

Building Readiness for AI Agents in Healthcare Systems – with Raheel Retiwalla of Productive Edge

Neurobiological and Cybernetic AI for Manufacturing, Part 2 – with Oleg Savin of Unilever

Artificial Intelligence at Charles Schwab – Two Use Cases

Customize Your Experience

Why Executives Should Keep Up with AI Trends in Business

Interview Highlights

Share article

Subscribe to updates

Recommended from Emerj

Scaling AI with Storage Efficiency – Emerj AI Leader Insight

The Evolving Role of Banks in Fraud Detection and AML Compliance – with Nick Lewis of Standard Chartered

Paving the Way for Continuous Auditing Workflows in Financial Services with AI – with Leaders from MindBridge, Wells Fargo, Gulfport, Bank of China, and Citi

The Future of IT Operations with Automation and Real-Time Insights – with Troy Felix of BigPanda

Preparing Financial Services for Automation in the Era of Agentic AI – with Leaders from Automation Anywhere, Barclays, and Wells Fargo

Artificial Intelligence at Aviva

Navigating Challenges and Solutions in Data Security with AI – with Dimitri Sirota of BigID

The Future of Customer Experience in Financial Services with Agentic AI – with Abhii Parakh of Prudential Financial and James Wood of Interactions

Artificial Intelligence at AbbVie – Two Use Cases

Building Readiness for AI Agents in Healthcare Systems – with Raheel Retiwalla of Productive Edge

Neurobiological and Cybernetic AI for Manufacturing, Part 2 – with Oleg Savin of Unilever

Artificial Intelligence at Charles Schwab – Two Use Cases

This Content is Exclusive to Emerj Plus Members

In-Depth Analysis

Exclusive AI Capabilities Matrix

Exclusive AI White Paper Library

Best Practices and executive guides

Register

Customize Your Experience