Project

13F Biotech Intelligence Tool

In production at Bluestar Global Management · Q1 2026 onward

A tool that turns WhaleWisdom 13F screenshots into a fully-formatted biotech hedge fund tracker, collapsing what was a multi-hour manual workflow into a single command. Twenty-six biotech-focused funds, automatically ingested, filtered, deduplicated, and cross-referenced for consensus signal — every quarter, in about an hour of wall-clock time.

What it does

Every quarter, biotech hedge funds file 13F disclosures revealing their long positions. Aggregating and analyzing these across a coverage universe is meaningful work — but the manual version of it is tedious, error-prone, and almost entirely transcription. The tool removes that layer.

Given a folder of screenshots from WhaleWisdom, it extracts every position, applies a configurable policy filter, handles edge cases (exited positions, sub-fund consolidation, pagination artifacts), and produces a polished Excel workbook with three tabs per fund plus a master list and a consensus view. A companion auditor verifies the output against the same source screenshots — a self-consistency check that currently passes for 26 of 26 funds on Q1 2026 data.

The pipeline

screenshots/ ↓ shared/extractor.py ← Claude Sonnet 4.6 vision extraction (cached by file hash) ↓ shared/policy_filter.py ← three-rule filter: is_equity, is_healthcare, meets_threshold ↓ ├── generator/ → polished Excel: MASTER LIST · CONSENSUS · per-fund tabs └── auditor/ → verification against source screenshots

By the numbers

26 / 26
Funds passing self-consistency check
~$2
Per-quarter API cost
~1 hr
End-to-end runtime

Engineering decisions worth surfacing

What it produces

Each quarter, the deliverable is a single Excel workbook with three sheet types: a master list of all positions across the coverage universe, a consensus tab with cross-fund aggregates (number of funds holding, average weight, signal counts), and one tab per fund. Every tab shares a common ten-column layout, with formulas for quarter-over-quarter change, action labeling (New / Added / Trimmed / Exited / No change), and threshold flagging.

Charts and analysis derived from a recent run are published in the Q1 2026 Biotech Positioning Brief.

Built with

Python, Claude Code, Anthropic API (Sonnet 4.6 for vision), openpyxl for Excel writing. Source remains internal to Bluestar; this page describes architecture and decisions, not implementation.