HOW APEX WORKS
APEX blends five independently weighted pillars into one 0–100 score. Each pillar captures a distinct dimension of player value. None can be derived from the others.
K-means archetype normalization (v3.0): Shot Quality, Creation & Playmaking, and Physical Contribution z-scored within 8 data-driven peer groups. Defensive Impact retains BBall-Index defRole grouping. Offensive Impact is full-pool. GP^0.75 availability curve and NOI/CV consistency modifier (±5% cap) applied. DWS, STL%, BLK% removed in v1.7 (Jewell et al., JQAS — null significance at high minutes loads).
THREE STEPS
VERSION HISTORY
| Version | Changes & Rationale |
|---|---|
| v4.5 |
Lab expansion adds exploratory tools that use the multi-season dataset without modifying the scoring engine. Six Degrees surfaces the team-context web that partially explains APEX's on/off metric limitations. Morning Briefing provides daily utility alongside the static reference content. |
| v4.4 |
Player hub now reflects all 10 seasons rather than hardcoding four profiles, making the section useful for any historical lookup rather than just current stars. |
| v4.3 |
Nicknames make implicit analytical claims. Testing them against the model surfaces APEX's own blind spots — Stifle Tower gets a B because deterrence is unscored; The Process gets a C because Embiid never reached #1 under a model that values availability. |
| v4.2 |
Ten years of data turns APEX from a single-season snapshot into a longitudinal record. The 2015-16 and 2016-17 divergences are the most analytically significant: both trace directly to the model's deliberate exclusion of win-total and counting-stats narrative from the scoring engine. |
| v4.1 |
The dark-mode design was developed before the full player pool and multi-season scope were established. Light mode better suits a reference tool used in reading contexts. Typography overhaul prioritizes data legibility over aesthetic novelty. |
| v4.0 |
Other leading models (EPM, LEBRON) qualify at 500–800 min. The 1,500-min floor was excluding injury-shortened starters and legitimate rotation contributors — introducing a selection bias into the normalization pool without any coverage benefit not already provided by the GP^0.75 modifier. |
| v3.8 |
The MVP Case Builder surfaces the same analytical reasoning used in backtesting — making it accessible interactively rather than requiring users to read the full backtest table. |
| v3.7 |
Single-season scores lose context without the multi-season arc. Giannis's V-shaped arc (2022-23 low, +13.2 pt recovery in 2023-24) and Embiid's two-season window are both invisible in the rankings table alone. |
| v3.6 |
The original vertical-stack layout required excessive scrolling to compare pillar breakdowns. The sticky radar + panels layout keeps the visual comparison in view while the metric table scrolls. |
| v3.5 |
The MVP-voters comparison shows where APEX diverges from human voters. The algorithmic consensus section shows where APEX aligns or diverges from other analytics models — a distinct and complementary data point. |
| v3.4 |
Before v3.4, pentagon radar SVG logic was duplicated across four pages. Centralizing it ensures all card renderings stay in sync as the model evolves. |
| v3.3 |
Basketball analytics is taught as much through podcasts as papers. Curating the highest-signal shows provides contextual depth alongside the model's numerical outputs. |
| v3.2 |
Proportional redistribution on pillar sliders removes the friction of manually re-balancing weights after every adjustment. The ascending default sort on rank columns reflects that rank 1 is the primary result of interest. |
| v3.1 |
K-means archetypes are central to how Shot Quality and Playmaking normalize — publishing the archetype definitions makes the normalization transparent and gives users a vocabulary for understanding why similar-looking players rank differently. |
| v3.0 |
Full two-way EPM, LEBRON, and BPM each internally encode scoring, playmaking, and defense. Using them in an Offensive Impact pillar while independently scoring those same domains caused effective triple-counting of offensive traits (~55% effective offensive weight vs. stated 39%). Switching to offensive-only variants makes each pillar genuinely independent per the Franks et al. (2016) independence criterion. The 2-point accuracy drop is the structural cost of eliminating the architectural flaw. |
| v2.7 |
Transparency requires not just publishing weights but explaining the evidence behind each decision. The removed metrics section addresses a reproducibility gap — readers could not previously verify why specific metrics were excluded. |
| v2.6 |
Scoring: Weighting raw TS% above difficulty-adjusted Relative TS+ was internally inconsistent — Rel TS+ specifically corrects for shot selection quality and opponent context that TS% cannot see. Equal weighting reflects equal methodological validity under current literature. |
| v2.5 |
D-LEBRON is susceptible to team-context inflation for anchor bigs (Gobert, Lopez) — players whose defensive reputation elevates team-level ratings without proportional individual impact. D-EPM's RAPM-based framework is more resistant to this assignment effect (Terner & Franks, 2021). Adding D-EPM reduces dependence on a single defensive estimator without disrupting existing rankings. |
| v2.4 |
Cross-season archives provide the longitudinal context needed to validate APEX accuracy (67.5% backtest) and identify systematic voter divergence patterns. Each season requires independent K-means re-fitting because archetype boundaries shift as the league evolves. |
| v2.3 |
BBall-Index offRole categories had inconsistent granularity — Barnes and Durant shared a group despite radically different playing styles. K-means derives empirically from actual performance markers, not editorial labels. The six-feature cluster space captures the axes most relevant to within-role scoring differences. |
| v2.2 |
Full-pool z-scoring made rim-running bigs appear elite by comparing their rebounding against perimeter players who structurally do not rebound. Role-based peer groups ensure comparison against players with similar structural responsibilities, isolating quality from role assignment. |
| v2.1 |
EPM's RAPM-based approach provides more stable estimates than box BPM, particularly for high-usage defensive players. The tighter ±2.5 clamp was compressing genuine multi-sigma outliers (Jokić 2021-22) inappropriately; ±3.5 allows the model to distinguish them from the pack without unbounded inflation. |
| v2.0 |
Moving to a full qualified pool was essential for competitive rigor — hand-curation introduced selection bias into the normalization pool. The rim-running big limitation surfaced immediately in v2.0 rankings and motivated the v2.2 archetype normalization overhaul. |
| v1.9 |
Documenting academic grounding strengthens reproducibility and clarifies which design decisions are evidence-based vs. heuristic. SHAP evidence for VORP's null contribution justified keeping it off the scoring stack while retaining it for reader familiarity. |
| v1.8 |
Voter divergence analysis provides external validity evidence and surfaces the narrative and availability biases that analytics-based models like APEX do not account for. |
| v1.7 |
Jewell et al. (JQAS) demonstrate null predictive significance for DWS, STL%, and BLK% at high minutes loads — these metrics primarily reflect opportunity and defensive assignment rather than individual impact. DefOnOff compensates. Playmaking increase reflects the underweighting identified in backtest miss analysis. Versatility reduction reflects that rebounding at guard/wing positions is structurally determined by role, not quality. |
| v1.6 |
Without the USG-efficiency interaction, volume scorers accumulated high Scoring pillar scores on below-average efficiency. The consistency modifier captures performance variance that season averages mask. The voter-adjusted output was specifically separated from APEX to preserve the model's descriptive integrity. |
| v1.5 |
Temporal blending reduces noise in partial or injury-shortened seasons. Optional rather than default to preserve single-season purity for MVP comparison. DARKO DPM offered better stabilization across different defensive role types than DRAYMOND. |
| v1.4 |
The binary threshold created a cliff effect — players one game short of the cutoff received zero credit. The fractional curve produces proportional credit aligned with actual availability (Deshpande & Jensen, 2016). FTA/FGA captures the ability to draw fouls, a repeatable skill that TS% does not directly reward. |
| v1.3 |
First attempt to include a dedicated defensive impact estimator beyond box-score DBPM. DRAYMOND was the most accessible non-box public option at the time but showed instability across small samples. |
| v1.2 |
Full-pool scoring structurally penalized guards for low rebounding and centers for low assist rates — positions differ in role, not in quality. Peer-group normalization isolates player impact from positional baselines. |
| v1.1 |
EPM adjusts for opponent strength and teammate context that raw BPM ignores. Z-scores provide cross-season comparability on a fixed scale. Removing the ceiling lets the math govern outliers rather than imposing an arbitrary cap. |
| v1.0 |
Establish a working baseline to validate pillar independence before introducing proprietary or scraped metrics. The hand-tuned ceiling was a temporary editorial guard against outlier inflation. |
Version numbers track structural changes to the scoring formula. Data refreshes within a version are not separately versioned.