<?xml version="1.0" encoding="utf-8" standalone="yes"?><rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Data-Miners on LegalRealist AI</title><link>https://legalrealist.ai/tags/data-miners/</link><description>Recent content in Data-Miners on LegalRealist AI</description><generator>Hugo -- gohugo.io</generator><language>en</language><managingEditor>hi@legalrealist.ai (LegalRealist AI)</managingEditor><webMaster>hi@legalrealist.ai (LegalRealist AI)</webMaster><copyright>© 2026 LegalRealist AI</copyright><lastBuildDate>Wed, 29 Apr 2026 00:00:00 +0000</lastBuildDate><atom:link href="https://legalrealist.ai/tags/data-miners/index.xml" rel="self" type="application/rss+xml"/><item><title>The Data Miner's Dilemma</title><link>https://legalrealist.ai/posts/24-data-miners-dilemma/</link><pubDate>Wed, 29 Apr 2026 00:00:00 +0000</pubDate><author>hi@legalrealist.ai (LegalRealist AI)</author><guid>https://legalrealist.ai/posts/24-data-miners-dilemma/</guid><description>DOJ&amp;rsquo;s new FOCUS initiative wants better data-driven fraud cases. But it keeps its two best enforcement channels — whistleblower tips and data miner analytics — in separate silos. The real opportunity is connecting them.</description><media:content xmlns:media="http://search.yahoo.com/mrss/" url="https://legalrealist.ai/posts/24-data-miners-dilemma/feature.png"/></item><item><title>From Kaggle to MCP: Open-Source Medicare Fraud Detection</title><link>https://legalrealist.ai/posts/40-open-source-fraud-detection/</link><pubDate>Sat, 20 Dec 2025 00:00:00 +0000</pubDate><author>hi@legalrealist.ai (LegalRealist AI)</author><guid>https://legalrealist.ai/posts/40-open-source-fraud-detection/</guid><description>The PPP fraud pipeline worked because the SBA released everything. Medicare&amp;rsquo;s public data is fragmented, de-identified, and missing the features detection needs. Here&amp;rsquo;s what exists on GitHub, where it falls short, and what CMS would need to release to let outside analysts do for healthcare fraud what one Python repo did for PPP.</description><media:content xmlns:media="http://search.yahoo.com/mrss/" url="https://legalrealist.ai/posts/40-open-source-fraud-detection/feature.png"/></item><item><title>Show Your Work</title><link>https://legalrealist.ai/posts/39-show-your-work/</link><pubDate>Wed, 10 Dec 2025 00:00:00 +0000</pubDate><author>hi@legalrealist.ai (LegalRealist AI)</author><guid>https://legalrealist.ai/posts/39-show-your-work/</guid><description>Public data can source prosecution leads. An open-source fraud-scoring system, run against the full SBA PPP dataset, identified the same lenders, geographies, and loan populations that DOJ prosecuted — using nothing but a downloadable CSV and a standard laptop.</description><media:content xmlns:media="http://search.yahoo.com/mrss/" url="https://legalrealist.ai/posts/39-show-your-work/feature.png"/></item></channel></rss>