Department

Business Analytics

Survival analysis, Bayesian experimentation, cohort economics, and the statistical infrastructure that turns raw data into decisions worth acting on.

22 essays

18 defined terms

The thesis

The gap between a dashboard and a decision system is where most analytics investment goes to die.

Descriptive metrics tell you what happened; they cannot tell you what to do next or why it matters.

This series bridges that gap with the quantitative methods that underpin genuinely data-driven organizations, survival models for churn prediction, Bayesian frameworks for experimentation under uncertainty, cohort-level unit economics, anomaly detection systems that distinguish signal from noise, and the metric ontology design that ensures everyone in the organization is optimizing for the same definition of success.

Core concepts in this department

Bayesian Inference
Bayesian inference updates prior beliefs about a parameter using observed data via Bayes' theorem to produce a posterior distribution. In A/B testing it directly answers 'what is the probability that B beats A?', the question product teams actually ask, unlike the indirect counterfactual framing of frequentist p-values.
Survival Analysis
Survival analysis models time-to-event data, how long until a customer churns, a subscription renews, a machine fails, accounting for censored observations where the event has not yet occurred. Cox proportional hazards is the standard semi-parametric model; deep recurrent survival models handle non-proportional hazards.
Cohort Analysis
Cohort analysis groups users by a shared origin event (acquisition month, first purchase, signup source) and tracks behavior over time for each group. It separates true retention from the compositional distortion caused by new-user dilution, and is the foundational unit of analysis for subscription economics.
Anomaly Detection
Anomaly detection identifies observations that deviate meaningfully from expected behavior, accounting for trend, seasonality, and variance. In revenue data it separates true incidents (payment outages, pricing bugs) from normal fluctuation. Isolation forests and Prophet-based decomposition are the practical workhorses.
Product-Market Fit
Product-market fit is the empirical condition where a cohort's retention curve flattens above zero, a group of users has found sufficient value to make the product a persistent part of their behavior. It is not a feeling; it is a quantifiable property of retention, NPS decomposition, and usage depth.
Analytics Engineering
Analytics engineering is the discipline of building reliable, tested, version-controlled transformations on top of a cloud warehouse, bridging data engineering and analysis. Tools like dbt, Dagster, and Airbyte formalize a software-engineering workflow for SQL transformations with tests, documentation, and lineage.
Metric Ontology
A metric ontology is a versioned, centrally-governed definition of every metric an organization uses, specifying the grain, filters, time-window, and source tables so that the same metric produces identical values regardless of tool, dashboard, or analyst. It prevents the drift that silently corrupts data-driven decisions.
Unit Economics
Unit economics is the financial performance of a single customer (or transaction) decomposed into acquisition cost, gross margin per period, retention, and payback period. Cohort-level unit economics, computed per acquisition cohort rather than rolled up, is the only form that survives growth-driven distortion.
Peeking Problem
The peeking problem is the inflation of false-positive rates that occurs when a frequentist A/B test is repeatedly evaluated before reaching its pre-registered sample size. A nominal 5% false-positive rate can become 20-30% under daily peeking. Bayesian testing and sequential-analysis methods eliminate the problem.
Cox Proportional Hazards Model
The Cox proportional hazards model is the semi-parametric workhorse of survival analysis: it estimates how covariates multiply the baseline hazard rate without requiring a parametric form for the baseline. It yields interpretable hazard ratios under the assumption that the ratio is constant over time.
Isolation Forest
Isolation Forest is a tree-based anomaly detection algorithm that scores observations by how easily they can be isolated via random recursive partitioning. Anomalies are isolated in few splits; normal points require many. The algorithm handles mixed feature types without density estimation or distance calculations.
Dashboards-to-Decisions Gap
The dashboards-to-decisions gap is the structural failure of analytics investment: teams produce more dashboards but decisions don't get better or faster. Closing the gap requires moving from descriptive reports to decision systems, pre-specified trigger thresholds, automated action routing, and outcome logging for calibration.
SKAdNetwork
SKAdNetwork is Apple's privacy-preserving attribution framework for iOS app install campaigns. It delivers aggregated, delayed, randomized-timing postbacks with a sparse conversion-value payload, and requires roughly 25 conversions in a privacy bucket before any data is released to the advertiser.
Identity Resolution
Identity resolution is the process of linking pseudonymous user signals (cookies, device IDs, IP/UA fingerprints) into a coherent person-level view. Logged-in deterministic matches are typically 8% to 18% of traffic; probabilistic matches fill the remainder with calibrated confidence scores rather than absolute IDs.
Event Taxonomy
Event taxonomy is the schema design discipline applied to product analytics: object-action-context naming, entity-event separation, PII boundaries baked into the schema, versioning rules, and validation at write time. A drifting taxonomy is the dominant cause of analytics debt at scale.
Server-Side Tagging
Server-side tagging routes analytics and ad-platform events through a first-party server endpoint rather than browser-direct calls. The benefits extend beyond compliance: latency reduction, ad-blocker resilience, CAPI-grade first-party context to Meta and Google, and event enrichment before forwarding.
Differential Privacy
Differential privacy is a mathematical framework (Dwork 2006) that bounds the privacy loss any individual incurs from a query by adding calibrated random noise. The privacy budget ε quantifies the trade-off: smaller ε gives stronger guarantees and noisier outputs; production deployments typically run with ε between 1 and 10.
A/B Testing
A/B testing is a randomized controlled experiment that splits users into a treatment and a control variant to estimate the causal effect of a change on a chosen metric. Statistical validity depends on randomization quality, sample size, novelty effect controls, and correction for multiple comparisons.

Essays in this department

Business Analytics30 min readApril 5, 2026
Data Warehouse to BI Layer Arbitration Patterns: Where the Semantic Layer Should Live
An analysis of the architectural debate between BI-tool-as-semantic-layer, warehouse-as-semantic-layer, and headless BI, with the knock-on effects on metric consistency, query cost, and analyst velocity.
Posted by
Murat Ova
Business Analytics29 min readFebruary 6, 2026
Anomaly Detection on Analytics Dashboards: When the Alert Fires
A 4% revenue drop on a Tuesday could be a payment outage, a pricing bug, or normal variance. The difference between sound monitoring and alert theatre is not the model. It is the loop the alert sits inside.
Posted by
Murat Ova
Business Analytics30 min readJanuary 17, 2026
North-Star Metric Construction and Revision Discipline
Constructing a north-star metric that informs decisions, the triggers that require revision, and the political economy of changing a metric the organization has built itself around. Goodhart, Campbell, surrogation.
Posted by
Murat Ova
Business Analytics28 min readJanuary 6, 2026
The GA4 Transition Forensics: What Universal Analytics Did Better
An honest post-mortem of the UA to GA4 migration. What broke, what is genuinely better, what remains unchanged, and the opportunity cost question that nobody at Google wants to discuss in public.
Posted by
Murat Ova
Business Analytics28 min readDecember 18, 2025
Funnel-vs-Flow Analysis Trade-Offs: When Each Tool Fits
When ordered-step funnel analysis misleads and when Markov-chain flow analysis is the right tool. Non-stationarity, the cycle-vs-funnel problem, and the compute trade-off in practice.
Posted by
Murat Ova
Business Analytics28 min readJuly 9, 2025
Cohort Analysis at the Action-Set Level (Not User-Level)
Sign-up-month cohorts confuse arrival with behavior. Action-set cohorts predict retention earlier and more honestly, at the cost of an event taxonomy, materialized views, and resolved identity discipline.
Posted by
Murat Ova
Business Analytics33 min readMay 24, 2025
The Analytics Engineering Manifesto: Why dbt Changed the Data Team Operating Model Forever
Before dbt, analysts wrote SQL that nobody reviewed, nobody tested, and nobody documented. The tool was simple, SQL templating with version control. The impact was structural: it created an entirely new discipline.
Posted by
Murat Ova
Business Analytics29 min readMarch 17, 2025
Mobile App SDK Overhead vs. Telemetry Value
Most mobile apps over-instrument. The cost shows up in binary size, cold start, battery, and privacy permissions. This essay maps the SDK trade-off honestly, with the question of what to drop and what to keep.
Posted by
Murat Ova
Business Analytics29 min readJanuary 15, 2025
Customer Journey Mapping from Raw Clickstream
The journey maps that hang on conference-room walls are workshop artifacts. The journeys that customers actually take live in clickstream logs. A field guide to building maps from the data instead of the whiteboard.
Posted by
Murat Ova
Business Analytics34 min readDecember 13, 2024
Causal Discovery in Business Data: Applying PC Algorithm and FCI to Find Revenue Drivers Without Experiments
Correlation tells you that feature usage and retention move together. It doesn't tell you which causes which, or whether a third factor drives both. Causal discovery algorithms can untangle this from observational data alone.
Posted by
Murat Ova
Business Analytics28 min readDecember 3, 2024
The Death of Last-Click in Mobile-App Attribution
Why SKAdNetwork 4 postback loss, IDFA opt-out rates, and the Apple privacy threshold have ended last-click attribution for mobile apps, and how incrementality testing has become the operational ground truth.
Posted by
Murat Ova
Business Analytics28 min readNovember 25, 2024
Identity Resolution in a Cookieless World: A Probabilistic Reality
The cookie was always probabilistic. Cookieless makes the probability legible. Operators who treat new identifiers as deterministic will misattribute spend and contaminate downstream measurement.
Posted by
Murat Ova
Business Analytics28 min readNovember 13, 2024
Event Taxonomy Design as Data Engineering
Event taxonomies are schema problems, not marketing problems. Teams that treat tracking plans as living documents (with versioning, validation, and PII boundaries) avoid the drift that quietly costs everyone else.
Posted by
Murat Ova
Business Analytics27 min readSeptember 28, 2024
Server-Side Tagging Beyond Compliance: The Operational Case
Privacy compliance is the entry point for server-side tagging. The operational case is broader: latency, ad-blocker resilience, data quality, and the cost model of running an event router at production scale.
Posted by
Murat Ova
Business Analytics29 min readSeptember 9, 2024
From Dashboards to Decision Systems: Embedding Prescriptive Analytics Into Operational Workflows
Your company has 47 dashboards. How many of them changed a decision last week? Dashboards describe what happened. Decision systems prescribe what to do next, and the gap between these two is where most analytics ROI evaporates.
Posted by
Murat Ova
Business Analytics28 min readJune 3, 2024
Privacy-Preserving Analytics: Differential Privacy in Practice
Differential privacy is a formal guarantee about what an analyst can learn from a dataset. The operational question is when the guarantee is worth its accuracy cost, and when a weaker model is the honest answer.
Posted by
Murat Ova
Business Analytics28 min readApril 29, 2024
Cohort-Based Unit Economics: Why Monthly Snapshots Lie and How to Build a True P&L by Acquisition Cohort
Your company's monthly revenue is growing 20% year-over-year. Your unit economics are deteriorating. Both statements are true simultaneously, and you'll never see the second one in an aggregate P&L.
Posted by
Murat Ova
Business Analytics32 min readJuly 1, 2023
Metric Ontology Design: Building a Self-Serve Analytics Layer That Doesn't Collapse Under Ambiguity
Ask five people in your company what 'revenue' means and you'll get five different numbers. The problem isn't the data warehouse, it's that nobody agreed on the definitions before building dashboards on top of them.
Posted by
Murat Ova
Business Analytics34 min readJanuary 1, 2023
Product-Market Fit Quantified: A Composite Score Using Retention Curves, NPS Decomposition, and Usage Depth
'You'll know product-market fit when you feel it' is advice that has burned through billions in venture capital. Here's a quantitative framework that replaces gut feeling with a composite score, and it starts with retention curves, not surveys.
Posted by
Murat Ova
Business Analytics39 min readDecember 10, 2022
Survival Analysis for Subscription Businesses: Cox Proportional Hazards vs. Deep Recurrent Models
Binary churn models answer the wrong question. 'Will this user churn?' matters less than 'When will this user churn?' Survival analysis models the timing - and the when determines whether intervention is profitable.
Posted by
Murat Ova
Business Analytics36 min readAugust 14, 2021
Anomaly Detection in Revenue Data: Isolation Forests vs. Prophet-Based Decomposition
A 4% revenue drop on a Tuesday could be a payment processor outage, a pricing bug, or just normal variance. The difference between these explanations is millions of dollars, and your monitoring system can't tell them apart.
Posted by
Murat Ova
Business Analytics38 min readFebruary 18, 2021
Bayesian A/B Testing in Practice: When to Stop Experiments and How to Communicate Results to Non-Technical Stakeholders
Frequentist A/B testing answers a question nobody asked: 'If the null hypothesis were true, how surprising is this data?' Bayesian testing answers the question that matters: 'Given this data, what's the probability that B is actually better?'
Posted by
Murat Ova

Comparisons in this department

Business Analytics

Bayesian Inference

Survival Analysis

Cohort Analysis

Anomaly Detection

Product-Market Fit

Analytics Engineering

Metric Ontology

Unit Economics

Peeking Problem

Cox Proportional Hazards Model

Isolation Forest

Dashboards-to-Decisions Gap

SKAdNetwork

Identity Resolution

Event Taxonomy

Server-Side Tagging

Differential Privacy

A/B Testing

Data Warehouse to BI Layer Arbitration Patterns: Where the Semantic Layer Should Live

Anomaly Detection on Analytics Dashboards: When the Alert Fires

North-Star Metric Construction and Revision Discipline

The GA4 Transition Forensics: What Universal Analytics Did Better

Funnel-vs-Flow Analysis Trade-Offs: When Each Tool Fits

Cohort Analysis at the Action-Set Level (Not User-Level)

The Analytics Engineering Manifesto: Why dbt Changed the Data Team Operating Model Forever

Mobile App SDK Overhead vs. Telemetry Value

Customer Journey Mapping from Raw Clickstream

Causal Discovery in Business Data: Applying PC Algorithm and FCI to Find Revenue Drivers Without Experiments

The Death of Last-Click in Mobile-App Attribution

Identity Resolution in a Cookieless World: A Probabilistic Reality

Event Taxonomy Design as Data Engineering

Server-Side Tagging Beyond Compliance: The Operational Case

From Dashboards to Decision Systems: Embedding Prescriptive Analytics Into Operational Workflows

Privacy-Preserving Analytics: Differential Privacy in Practice

Cohort-Based Unit Economics: Why Monthly Snapshots Lie and How to Build a True P&L by Acquisition Cohort

Metric Ontology Design: Building a Self-Serve Analytics Layer That Doesn't Collapse Under Ambiguity

Product-Market Fit Quantified: A Composite Score Using Retention Curves, NPS Decomposition, and Usage Depth

Survival Analysis for Subscription Businesses: Cox Proportional Hazards vs. Deep Recurrent Models

Anomaly Detection in Revenue Data: Isolation Forests vs. Prophet-Based Decomposition

Bayesian A/B Testing in Practice: When to Stop Experiments and How to Communicate Results to Non-Technical Stakeholders

Bayesian A/B Testing vs Frequentist A/B Testing

Cox Proportional Hazards vs Deep Recurrent Survival