Data-Access on LegalRealist AI

https://legalrealist.ai/tags/data-access/
Recent content in Data-Access on LegalRealist AI
hi@legalrealist.ai (LegalRealist AI)
© 2026 LegalRealist AI
Mon, 25 May 2026 00:00:00 +0000

Building a Medicare Fraud Backtest in One Claude Code Session
https://legalrealist.ai/posts/38-backtest-walkthrough/
Mon, 25 May 2026 00:00:00 +0000
A walkthrough of building a Medicare fraud backtest overnight in Claude Code, from a plain-English spec to 289 matched providers across 41 states, a predictive model with AUC 0.79, and out-of-sample validation. It also covers the three times the pipeline failed, the data duplication bug, and the engineering decisions that shaped the final design.

I Built the Backtest: What Excluded Medicare Providers Look Like Before They Get Caught
https://legalrealist.ai/posts/37-backtest-results/
Wed, 20 May 2026 00:00:00 +0000
The previous post described a Medicare fraud backtest nobody had built. I built it. 289 excluded providers across 41 states, matched to pre-exclusion billing data, compared against 3.39 million peers. Thirteen of fifteen features showed statistically significant differences, and the behavioral fingerprint is consistent enough to predict fraud in providers who were never excluded.

From Kaggle to MCP: Open-Source Medicare Fraud Detection
https://legalrealist.ai/posts/40-open-source-fraud-detection/
Sat, 20 Dec 2025 00:00:00 +0000
The PPP fraud pipeline worked because the SBA released everything. Medicare’s public data is fragmented, de-identified, and missing the features detection needs. Here’s what exists on GitHub, where it falls short, and what CMS would need to release to let outside analysts do for healthcare fraud what one Python repo did for PPP.