[seopress_breadcrumbs]

How to Get Started with AI in Healthcare – A 3-Phase Approach

•

October 2, 2019

How to Get Started with AI in Healthcare - A 3-Phase Approach

This article was written by Sergii Gorpynich, co-Founder and CTO at Star, co-written by Perry Simpson, Managing Director of Star, and was written, edited and published in alignment with our transparent Emerj sponsored content guidelines. Learn more about reaching our AI-focused executive audience on our Emerj advertising page.

As AI and data science make their way into more and more healthcare processes, from patient care to diagnosis and prognosis, healthcare firms are struggling to realize AI’s benefits and thereby master the use of their own data.

While cataloging the state of a healthcare firm’s data ecosystem can be challenging, data issues go beyond identifying the state of existing data assets. Business and technology leadership, therefore, must be able to identify the right, high-impact use cases for data science and AI within the organization.

Many leadership teams have an initial idea of how they’d like to use AI, they’re nevertheless unsure of how to execute the initiative end-to-end—from idea to production-grade system.

In this article, we break down the stages that are critical to bringing an AI project from brainstorm to launch. When it comes to unlocking healthcare data and selecting high-value AI initiatives, Star’s data science teams follow three distinct phases:

Data Research: This exploratory data analysis state involves a multidisciplinary dialogue between business stakeholders, subject-matter experts, and AI/data science experts to assess data assets and determine high-value applications
AI Model Prototyping and Testing: Initial experiments and applicable machine learning modeling to find the best potential solution
AI Implementation: Integrating an AI initiative or product with core systems, teams and processes

We’ll focus mostly on the first two phases in the process, breaking down their component parts in a framework that healthcare leaders should find useful and be able to use immediately.

For more on adopting AI in the healthcare industry, download the Emerj-Star white paper, Executive Guide to AI Adoption in Healthcare.

Data Research

Research Process and Objectives

The first step lies with business stakeholders determining what they want to achieve and what data assets they have access to.

This involves meeting with business leadership, subject matter experts, and experienced data scientists to assess the current situation and determine opportunity areas. The group works together to understand the business’s goals, its ecosystem, and existing data assets.

With regard to the data assets, teams should ask: are there meaningful business insights that we could derive from these data? Is there a way we could apply these insights to directly impact our business?

Company leaders are aware of AI and have ideas on the outcomes they want to create. Examples include determining the incidence of hospital acquired infections, mastering EHR management to relieve laborious data entry and interpretation, optimizing appointment and staffing, etc.

Subject-matter experts may also have their own ideas regarding implementing machine learning to drive value, and data scientists will be able to assess the technical viability of their ideas.

This mix of business leadership, subject matter expertise and data science is critical in the data research phase.

For instance, a healthcare company might hire a team of data scientists to leverage machine learning, but these data scientists aren’t going to necessarily know anything about healthcare data, let alone the specific business model, patient profiles, or data ecosystem of their company.

Data scientists need business and process context in order to know what data means and how data have historically been valuable in solving business problems.

A data scientist with access to appointment data, for instance, may not realize the different staff and personnel requirements for handling dialysis treatments versus chemotherapy. The data scientist won’t know the operational leaders’ “rules of thumb” for how appointments are booked and managed today and why.

Similarly, a business leader may see value in using medical imagery data to train a computer vision system to better diagnose lung cancer, but the same leader may have no idea how much data would be required to achieve such a goal or the lengthy extent of the data labeling needed to train a machine learning model.

Bringing these groups and perspectives together generates ideas that are both viable (technically) and valuable (economically or to the end user).

Data research is a collaborative exploratory data analysis process which may take several days to 2-4 weeks of team workshops. Data scientists would come out of this phase with a good understanding of the types of data available and where to find them, as well as whether or not the data need to be harmonized or if more data need to be collected first.

In addition, insights from subject-matter experts and stakeholders (radiologists, nurses, customer service managers, etc.) should leave them with a strong sense of the problem the business wants to solve. With this in mind, data scientists can execute on their plans and start prototyping a model.

The Importance of Combined Expertise

In the data research phase, subject-matter experts should help generate ideas and provide context for data scientists. If a pilot or prototype project is chosen within the subject-matter expert’s domain (for example: radiology, customer service, EHR, etc.), then that subject-matter expert and/or their team should have consistent communication with the data scientists.

Subject-matter experts are essential to this stage of the AI-building process because they steer data scientists in the right direction. A lead data scientist might think of a dozen types of data they could use to build a machine learning model for a particular use-case.

A subject-matter expert can then provide focus by confirming more than half of those data types are sparse and rare, are poorly organized or collected, or that using them could violate regulations.

Another important perspective subject-matter experts bring to the table is the ability to find value in the data.

Ultimately, what we’re trying to search for in the data is some meaningful business insights. So we’re not looking at the data from a technical perspective…That’s…a secondary question. The first question is: Might we be able to extract any meaningful insights which will be important for the business?

Subject-matter experts can help answer these critical questions, determining the most fruitful areas on which to focus the company’s AI efforts.

Nevertheless, the reliance goes both ways, and subject-matter experts need data scientists to ground their ideas in what is technically feasible. A radiologist may want to use a computer vision system to detect lung cancer and apply the same application to a variety of other oncology diagnostic tasks.

A data scientist could then determine if the data for these other diagnostic tasks are sufficient to train a model or if the data are similar enough to transfer from one diagnostic task to another. A data scientist who understands the business context, therefore, will be able to determine the idea’s viability.

AI Model Prototyping and Testing

Why the Prototyping Phase is Necessary

Once a project is determined and data can be accessed, a technical team will need to get to work on a model prototype in order to determine whether a machine learning model can deliver the results business leadership is looking for.

For example:

A system to predict optimal E&M codes. Data scientists and a project team may be tasked with applying a logistic model to past EHR and billing data to better predict an optimal evaluation & management codes. Leveraging NLP and a classification algorithm to medical records brings a new degree of automation to what is otherwise a very manual and laborious process. Finding the right combination of data features and models would involve technical iteration and open dialogue between data scientists and subject matter experts.
Detecting lung cancer using machine vision. Data scientists could work with radiologists and oncologists to determine the criteria that lead to an X-ray diagnosis, and to ensure that thousands of previous lung cancer X-rays were properly labeled and highlighted in order to train a computer vision model to detect lung cancer in new X-ray images.
Leveraging AI to predict the spread of illness in real time. Using millions of anonymous data inputs collected from a smart connected thermometer, an AI model can be built to test the efficacy of triaging illnesses based on severity, contagion and duration. Further, ensuring these data are labeled with geo-location coordinates, the model can test its ability to predict the spread of disease, ultimately enabling officials to take action and minimize its impact on an entire population.

Any AI-enabled healthcare capability would involve a period of data collection, model training, and iterative testing.

Like the exploratory data analysis process, this phase involves ongoing dialogue between the data scientists and subject-matter experts to help train the model and to ensure its outputs (optimizing E&M coding, detecting lung cancer, or predicting illness across populations) are accurate and useful.

While the C-suite may be involved in brainstorming and determining an initial project, they are less likely to be involved in the prototyping phase, where communication between data scientists and subject-matter experts becomes most important.

Outside ongoing dialogue with subject-matter experts, data scientists will spend their time on technical tasks. They’ll clean and harmonize the data (if need be) and work on training algorithms to deliver the desired result. They’ll determine which features in the data to use and which algorithms or statistical methods they’ll apply.

Goals of the Model Prototyping and Testing Phase

A Model Prototyping and Testing phase exists in order to explore and prove the value of an AI application before bringing it into production. As such, project managers and senior leaders usually aim to reach specific benchmarks and objectives with their prototypes.

The Viability of the Model with Limited Data

One of the goals of this prototyping phase is to test the machine learning hypothesis on a limited dataset. For example, a computer vision model built to detect the presence of cancer from images of patient CAT scans will first need to be tested on a sizable portion of the overall dataset (i.e. the healthcare network’s entire archive of CAT scans).

If the algorithm correctly detects cancer within these CAT scans within a percentage of time likely determined by stakeholders, the prototype may be considered a success. Determining these specific criteria for success upfront is critical.

The Fundamental User Interface

Another goal during this stage of the AI building process involves determining how data scientists are to build the machine learning product’s user interface. Subject-matter experts will need to know who is going to be using the product. Doctors? Surgeons? Patients?

The answer will greatly affect how the product is built to allow for the greatest efficiency and, in some cases, the least risk. An effective model by itself won’t drive business value. With a useable interface, stakeholders can begin to see how their AI initiative could go from experimentation to production and serve the needs of real users.

The Economic Viability of the Project or Application

If business leadership is happy with the output of the models, they’ll then assess the resources required to bring the model into production. Determinative questions might include:

What will it cost to train staff to use this new AI-based solution within our workflows?
What are the ongoing costs of acquiring, cleaning, and labeling data in order to keep this system working?
What other IT systems will need to be changed, replaced, or integrated with this new AI-based system?
What kind of results would we need to see from this AI system before approving the resources required to run it?

AI Implementation

Once the model is tested, an effective user interface is built and the data are made accessible, the system may be ready for deployment. Implementation occurs once a model has been validated, budgets have been approved, and the leadership team believes that data assets are sufficient to deliver the business result.

If a model shows promise, and business leadership believes that its value would be worth the implementation investment, then a technical team would be tasked with delivering a scalable AI initiative into production.

Healthcare companies looking to adopt machine learning need to understand that AI is an investment in their firm’s future. The lessons learned in applying initial AI projects will often bolster a company’s ability to deploy AI applications and tools going forward.

The multi-disciplinary meetings, changes to data infrastructure, and training of unique models will all be transferrable skills for other AI projects, preparing a company to be more capable of developing other AI initiatives. Beyond potential improvements to efficiencies or patient outcomes, these are additional benefits from deploying a first AI process.

Header image credit: Mashable

Recommended from Emerj

Emerj: Building Readiness for AI Agents in Healthcare Systems - Raheel Retiwalla

Building Readiness for AI Agents in Healthcare Systems – with Raheel Retiwalla of Productive Edge

This interview analysis is sponsored by Productive Edge and was written, edited, and published in alignment with our Emerj sponsored content guidelines. Learn more about our thought leadership and content creation services on our Emerj Media Services page. Burnout among hospital staff, particularly nurses and physicians, has reached critical levels. A report by the Center…

Riya Pahuja

•

May 8, 2025

Neurobiological and Cybernetic AI for Manufacturing, Part 2 – with Oleg Savin of Unilever

In our current technology-driven era, data is considered extremely valuable. Yet, data often goes unused or underutilized. The reasons vary, but it's certainly not a newly surfaced problem. An article initially published by Harvard Business Review highlights that organizations struggle with managing and analyzing existing data. This problem is more pronounced in manufacturing, where unused…

Sharon Moran

•

May 5, 2025

Artificial Intelligence at Charles Schwab – Two Use Cases

The Charles Schwab Corporation is a leading financial services firm, reporting $10.28 trillion in client assets as of February 2025, a 16% year-over-year increase. In Q4 2024, the company generated $5.3 billion in net revenues (up 20% year-over-year) and $1.8 billion in net income, resulting in $0.94 EPS. Core net new assets reached $114.8 billion…

Riya Pahuja

•

April 28, 2025

Driving Disease Risk Prediction and Preventative Healthcare with AI – with Dan Elton of the National Human Genome Research Institute

The landscape of preventative healthcare and genetic research is "awakening" with data, enabling earlier and more precise disease risk prediction. The evolution is particularly critical as the healthcare industry shifts from reactive treatment to proactive care. Integrating advanced capabilities with genomic data allows researchers and clinicians to analyze vast numbers of genetic variants, providing more…

Emily Smith

•

April 21, 2025

Artificial Intelligence at Foxconn – Two Use Cases

Foxconn, officially Hon Hai Precision Industry Co., Ltd., is a multinational electronics contract manufacturing company headquartered in Taiwan. Founded in 1974, it is renowned for producing consumer electronics for major companies like Apple, Microsoft, and Amazon. As of 2023, it employed approximately 90,221 people globally and reported an estimated annual revenue of $4.1 billion. The…

Riya Pahuja

•

April 14, 2025

Leveraging AI for Better Insurance Outcomes From Risk Management to Customer Care – with Mark McLaughlin of IBM

AI is transforming the insurance industry from reactive claims processing to proactive risk management. Rising competition from fintech and insurtechs and growing consumer demands for personalized, real-time experiences are driving such widespread industry adoption. Academic research highlights how insurers are increasingly using digital technologies and behavioral data to personalize services and influence customer behavior, underscoring…

Emily Smith

•

April 7, 2025

Managing End Point Storage in Hybrid Data Strategies for Financial Services – with Yonas Yohannes of Oracle

Transparency in AI is a major hurdle for businesses, particularly in the financial services industry. Generative AI (GenAI) models, particularly non-deterministic models, are often viewed as “black boxes,” making it difficult to understand the underlying decision-making processes. Due to this black box risk, banks can experience multiple types of AI incidents, including system glitches, data…

Sharon Moran

•

March 31, 2025

NLP Logix’s AI Collaborate 2024: A Look at the Future of GenAI Experiences from Sports to HR

This article is sponsored by NLP Logix and was written, edited, and published in alignment with our Emerj sponsored content guidelines. Learn more about our thought leadership and content creation services on our Emerj Media Services page. Event Title: AI Collaborate 2024 Event Host: NLP Logix Location: Florida, USA Date: November 19-20, 2024 What Happened…

Riya Pahuja

•

March 26, 2025

Artificial Intelligence at Aflac – Two Use Cases

Aflac is a global leader in supplemental health and life insurance, providing financial protection to over 50 million policyholders in the U.S. and Japan. In 2023, Aflac reported an annual revenue of $18.7 billion. With approximately 12,785 employees worldwide, Aflac continues to drive innovation in cancer and medical insurance. Although Aflac's total investment in AI…

Ashwin Telang

•

March 24, 2025

Neurobiological and Cybernetic AI for Manufacturing, Part 1 – with Oleg Savin of Unilever

Modern manufacturing stands to benefit from integrating AI. From improving efficiency and productivity by automating repetitive tasks to reducing unplanned downtime and cutting down on repair costs through predictive maintenance, the potential benefits are numerous. However, integrating AI in manufacturing is not without challenges. A 2019 article for the journal Engineering from the Chinese Academy of…

Sharon Moran

•

March 10, 2025

Understanding AI’s Expanding Role in Drug Discovery and Life Sciences R&D – Liran Belenzon of BenchSci

This interview analysis is sponsored by BenchSci and was written, edited, and published in alignment with our Emerj sponsored content guidelines. Learn more about our thought leadership and content creation services on our Emerj Media Services page. The intricate nature of biological systems significantly complicates drug development. Diseases often involve complex interactions among genes, proteins,…

Riya Pahuja

•

March 4, 2025

Long-Term ROI for GenAI in Healthcare – with Ylan Kazi of Blue Cross Blue Shield North Dakota

Mobile technology, including smartphones and wearable devices, collects health-related data such as physical activity metrics (e.g., step counts, heart rate), sleep patterns, and self-reported health information through surveys and applications. A 2022 research paper from Stanford University shows how these data enable AI systems to monitor health trends, predict potential health issues, and personalize healthcare…

Riya Pahuja

•

March 3, 2025

Search site

Search site

How to Get Started with AI in Healthcare – A 3-Phase Approach