Healthcare M&AI is written by Shawn Rothlis. If someone forwarded this to you and you want the full workflow access, subscribe at the link.
When I was the Director of Corporate Development at an oncology platform, we were a value-based care play. Capitated contracts with health plans and risk-bearing entities. Revenue was driven by how many attributed members we could bring under management - which meant network coverage wasn't optional. We needed to fill geographic gaps or we'd be leaving capitation dollars on the table.
Acquisitions were the primary engine. But acquisitions are bounded by where oncologists happen to be willing to sell. They don't sell in every market you need. So we partnered with CBRE and their healthcare real estate advisory team to scout de novo sites that could fill the gaps.
What that process looked like: evaluating medical office buildings, mapping 20-minute drive-time radii against census tract data for 65+ populations, estimating renovation costs, checking zoning and CON requirements. Then LoopNet. Crexi. Calling brokerage reps. Scheduling tours. Waiting on callbacks.
From first market screen to a shortlist worth touring, that process took weeks.
The free AI tools available now can compress the front-end analytical work by 60-70% before you ever engage CBRE, Buxton SCOUT, or Trilliant Health. Not replace them - but handle the work that used to burn analyst time for weeks. Here's what that workflow looks like.
The Four-Layer Framework
The de novo evaluation process has four distinct analytical layers. AI has different levels of leverage at each one.
Layer 1: Market screening - which geographies are worth looking at.
Layer 2: Site-level evaluation - once you have a specific address, rapid pre-tour scoring against your criteria.
Layer 3: Financial modeling - building the pro forma before deeper diligence.
Layer 4: Deal execution - lease negotiation, CON navigation, physician recruitment. AI has very limited utility here. More on that below.
Layer 1: Market Screening with Free Public Data
The foundational question in oncology site selection is: is there enough of a patient pool here, with the right payer mix, that a de novo clinic can reach break-even within a reasonable ramp period?
Three free public datasets, all queryable via API, answer roughly 70% of that question:
Census ACS API - Age distribution, income levels, and insurance coverage type down to the census tract level. For oncology, the operative variable is 65+ population density combined with commercial insurance penetration. High age concentration without commercial coverage means a Medicare-heavy payer mix - which matters because the spread between commercial and Medicare reimbursement on chemotherapy drugs can swing site-level EBITDA by 30-40%. ACS carries a 12-18 month lag, but it's the best publicly available payer mix proxy for initial screening.
SEER API - The NCI's Surveillance, Epidemiology, and End Results program. County-level cancer incidence rates by site (breast, lung, colorectal, prostate), overlaid against ACS age data to estimate raw patient pool size. SEER's reporting lag is improving - previously 22+ months from diagnosis to publication, they're now targeting a 2-month submission cycle using ML-based extraction from pathology reports.
CMS Provider and Utilization Data - The Medicare Physician and Other Supplier Public Use File maps service delivery volume by specialty and geography - useful for identifying existing oncology practice concentration and white space. The Provider of Services (POS) file adds facility characteristics for all Medicare-certified providers. Trilliant Health also launched a free chatbot called Oria that lets you query hospital price transparency files in natural language - ask "What is Cigna's negotiated rate for infusion administration (CPT 96413) at hospitals in Nashville, TN?" and get an actual answer.
Claude or ChatGPT can write Python scripts to query all three APIs, export structured dataframes, and analyze them for you. The only cost is your LLM subscription.
Layer 2: Real Estate Monitoring
This is where the CoStar-LoopNet-Crexi dynamic requires some navigation.
LoopNet is the largest commercial real estate marketplace in the U.S. It has native email alert functionality that works well - you can configure saved searches for MOB listings by geography, size range, and listing type. However, CoStar's terms of service explicitly prohibit AI scraping agents from programmatically reading their platforms, including LoopNet. Their terms specifically state that "no Information is exposed to an environment susceptible to access or use, directly or indirectly by any third party, including without limitation open artificial intelligence tools." That's a hard wall. The workaround: configure LoopNet's native alerts, then forward the emails to Claude or a GPT workspace. The AI can analyze the listing data from the email - it just can't scrape the site directly.
Crexi is more permissive and, in my experience, useful for medical/healthcare property filters. It offers real-time activity alerts and exposes more listing metadata at the property record level - zoning district descriptions, parcel numbers, CBSA classification, demographics data (1/3/5 mile radius). Same workflow: use the native alerts to populate your deal pipeline, then paste listing details into Claude with a structured evaluation prompt.
Once you have a specific address, geospatial analysis is where AI adds significant leverage. Google Maps Platform's Routes API - the successor to the legacy Distance Matrix API - runs $5 per 1,000 requests with 10,000 free requests per month. For a de novo project evaluating 50-100 candidate sites with 10/20/30-minute drive-time radius analysis, you're looking at under $100 in API costs total. Claude can write the Python code to run the analysis and generate isochrone maps - geographic polygons showing the catchment area reachable within a given drive time.
For bulk screening across hundreds of locations before you narrow to a shortlist, OSRM (Open Source Routing Machine) is free and surprisingly accurate for routes under 50 minutes. A Harvard study found it could process 32.8 million origin-destination pairs in under 6 minutes. That's plenty of horsepower for market-level screening.
Layer 3: Financial Modeling
The JLL 2026 benchmark for all-in MOB fit-out cost is $412 per square foot - covering hard construction (~$226/SF) plus soft costs, design fees, AV/IT, contingency, and FF&E. That's the warm white box baseline for standard outpatient. In my experience with oncology specifically, you need to adjust upward:
Moderate acuity (expanded imaging, specialized procedure rooms): roughly +10% to ~$453/SF
High acuity (infusion suites, radiation shielding, imaging with heavy structural requirements): add another ~20% over moderate, putting you at $544/SF or higher
Radiation vaulting is in a different category entirely - $1M+ per treatment room for concrete shielding alone, plus $2-5M for the linear accelerator. Most community-based de novo oncology sites stay in medical oncology (infusion/chemotherapy) and treat radiation as a Phase 2 expansion or a partnership arrangement, precisely because of those capital requirements.
AI can take those inputs - square footage, acuity level, payer mix estimate, CPT code reimbursement rates from CMS fee schedules, FTE ramp schedule - and build a structured pro forma with break-even analysis, sensitivity tables, and 5-year cash flow projections. The modeling is only as good as your input assumptions, and payer mix is the most consequential variable in oncology. But AI handles the modeling mechanics well once you feed it clean inputs.
The Paid Platforms Still Matter
What the free workflow cannot replicate:
Buxton SCOUT has 600+ proprietary datasets including psychographic segmentation across 128M+ U.S. households, covered lives by payer type at the zip code level, and automated site scoring models for 24 healthcare service lines. When you need to know not just who lives within 20 minutes of a site but what proportion of those households carry commercial insurance vs. Medicaid, and how that compares to your existing site performance benchmarks - that's Buxton. Enterprise pricing that runs six figures annually.
Trilliant Health addresses something important: traditional healthcare demand forecasting assumed perpetual growth based on demographic trends and national forecasts. Their platform uses actual claims data to avoid that bias. AdventHealth used it to evaluate net-new ambulatory site selection. Enterprise pricing as well.
Definitive Healthcare gives you KOL mapping, referral pattern data, and physician affiliation intelligence - things no public dataset can approximate. Critical for referral network analysis once you've identified a market.
CBRE Dimension is what we could have used. It's not a standalone software product you can purchase - it's deployed in conjunction with CBRE's advisory team. It incorporates their broker network, off-market deal flow, historical lease comps, and claims-based demand forecasting. The value is the combination of the platform with the relationship capital.
The free AI workflow is most useful as pre-work before engaging those platforms and advisors - so you arrive with a pre-ranked shortlist and sharper questions rather than starting from scratch.
What AI Cannot Do
From what I've seen, there are four areas where no amount of prompt engineering substitutes for human judgment and local relationships:
Physician recruitment - AI can identify that a market has oncology supply gaps. It cannot tell you whether qualified oncologists are recruitable into that market, whether the major platform consolidators (US Oncology, American Oncology Network, OneOncology) have already locked up the available talent pool, or how a physician's family situation affects their willingness to relocate. Physician search firms and your existing professional network are the only reliable intelligence sources here.
Referral network dynamics - AI can map the geographic concentration of primary care providers in a target market. It cannot assess contractual exclusivity arrangements between PCPs and competing oncology platforms, informal referral loyalties built over years of relationships, or whether a health system's employed physician network will refer outside their system. In oncology specifically, the referral pathway is the business.
CON navigation - 35 states plus DC still have active Certificate of Need laws. AI can tell you which states have CON and which service types trigger review. It cannot navigate the process. A recent North Carolina case involving WakeMed's radiation oncology CON application illustrates this well - approved in 2023, challenged by Duke and UNC Health, overturned by an administrative law judge in February 2025, then settled. Three years, hundreds of thousands in legal fees, delayed patient access. CON navigation requires specialized healthcare regulatory attorneys and state-specific political relationships.
Actual lease negotiation - Tenant improvement allowances directly offset the JLL $412/SF benchmark, and they are highly negotiated. The range depends on landlord credit quality, your covenant strength as a tenant, local market vacancy, and broker leverage. CBRE's advisors bring transaction history and comparable TI data that no public AI tool can access.
That's the framework. The free tools can do the heavy lifting on market screening and site-level quantitative analysis. The proprietary platforms and expert advisors earn their fees on the intelligence that isn't publicly available.
The prompts, API workflows, and step-by-step process to actually run this - including copy-paste prompts for each stage - are in the subscriber section below.
