Follow these steps to interpret a waterfall chart:
Waterfall charts excel at showing sequential financial changes and inventory movements.
Scenario: A company wants to show how they went from revenue to net profit
| Category | Amount | Type |
|---|---|---|
| Revenue | $500,000 | Start (Blue) |
| Cost of Goods Sold | -$200,000 | Negative (Red) |
| Operating Expenses | -$150,000 | Negative (Red) |
| Marketing Costs | -$50,000 | Negative (Red) |
| Taxes | -$30,000 | Negative (Red) |
| Net Profit | $70,000 | End (Blue) |
Visual flow:
Insight: "We started with $500K in revenue, but after all expenses, we have $70K in net profit—a 14% profit margin."
Scenario: Tracking monthly cash changes
| Activity | Amount | Running Total |
|---|---|---|
| Beginning Cash | $100,000 | $100,000 |
| Sales Revenue | +$85,000 | $185,000 |
| Payroll | -$45,000 | $140,000 |
| Rent & Utilities | -$12,000 | $128,000 |
| Equipment Purchase | -$20,000 | $108,000 |
| Loan Payment | -$8,000 | $100,000 |
| Ending Cash | $0 change | $100,000 |
Insight: "Despite $85K in revenue, we ended the month at the same cash level due to major expenses including equipment purchase."
Scenario: Warehouse tracking monthly inventory changes
| Event | Units | Total |
|---|---|---|
| Starting Inventory | 1,200 | 1,200 |
| Received Shipment A | +500 | 1,700 |
| Sales - Week 1 | -300 | 1,400 |
| Sales - Week 2 | -350 | 1,050 |
| Received Shipment B | +600 | 1,650 |
| Sales - Week 3 | -400 | 1,250 |
| Sales - Week 4 | -250 | 1,000 |
| Damaged Goods | -50 | 950 |
| Ending Inventory | -250 net | 950 |
Insight: "Started with 1,200 units, received 1,100 more, sold 1,300, lost 50 to damage = 950 units remaining (21% decrease)"
Think of a waterfall chart as a bridge connecting two islands (starting and ending values). Each bar is a support pillar—some lifting you up (positive), others bringing you down (negative). Following the path shows exactly how you traveled from start to finish.
A funnel chart visualizes stages in a sequential process where volume progressively decreases. The width of each stage represents the quantity or percentage, and the narrowing shows drop-off at each step.
When you make coffee with a pour-over filter:
At each stage, some volume is lost. The funnel shape shows the progressive reduction.
Analytics parallel: 1,000 website visitors → 300 sign up for trial → 80 complete onboarding → 25 become paying customers
Visual: The widest section representing the largest volume
Example: "10,000 website visitors"
Represents: Everyone who enters the process
Visual: Progressively narrower sections
Example: "2,000 create account" → "500 start trial" → "150 complete setup"
Represents: Filtering at each qualification step
Visual: The smallest section representing final conversions
Example: "80 paid customers"
Represents: Successful completions of the entire process
Scenario: B2B company tracking deals from lead to closed customer
| Stage | Count | % of Previous | % of Total |
|---|---|---|---|
| Leads Generated | 1,000 | - | 100% |
| Qualified Leads | 400 | 40% | 40% |
| Demo Scheduled | 150 | 37.5% | 15% |
| Proposal Sent | 80 | 53.3% | 8% |
| Negotiation | 50 | 62.5% | 5% |
| Closed Won | 25 | 50% | 2.5% |
Key insights:
Calculation: To convert 100 customers, you need: 100 ÷ 0.025 = 4,000 leads at the top
Scenario: Online store tracking visitor-to-purchase journey
| Stage | Users | Conversion Rate | Drop-off |
|---|---|---|---|
| Website Visitors | 50,000 | - | - |
| Product Page Views | 15,000 | 30% | 35,000 left |
| Add to Cart | 3,000 | 20% | 12,000 left |
| Begin Checkout | 1,500 | 50% | 1,500 abandoned cart |
| Enter Payment Info | 1,200 | 80% | 300 left |
| Purchase Complete | 1,000 | 83.3% | 200 left |
Key insights:
Scenario: HR tracking hiring process from application to hire
| Stage | Candidates | % Advancing |
|---|---|---|
| Applications Received | 500 | - |
| Resume Screening Pass | 150 | 30% |
| Phone Screen Complete | 100 | 67% |
| First Interview | 40 | 40% |
| Second Interview | 20 | 50% |
| Offer Extended | 8 | 40% |
| Offer Accepted (Hired) | 6 | 75% |
Key insights:
Conversion rates measure the percentage of people who advance from one stage to the next.
Conversion Rate = (Next Stage Count / Current Stage Count) × 100
Data:
Step-by-step calculations:
Insights:
Focus your improvement efforts on:
Example: Improving Stage 1 from 30% to 35% gives you 50 more signups, which cascade through the entire funnel. Improving Stage 3 from 33% to 38% only gives you 15 more trial users.
Think of a funnel chart like a series of increasingly fine sieves or filters. Each stage lets through only what meets the criteria. The narrowing shape instantly shows where your process is leaky and needs attention.
A heatmap uses color intensity to show patterns across two dimensions. Instead of reading numbers in a table, you instantly see hot spots (high values) and cold spots (low values) through color.
A weather temperature map shows a region with colors:
At a glance, you see temperature patterns across the entire region without reading every number.
Analytics parallel: A sales heatmap shows which product-region combinations are hot (high sales) or cold (low sales) using color instead of requiring you to read 50+ numbers.
Visual: Horizontal categories
Example: Days of the week, products, regions
Visual: Vertical categories
Example: Hours of the day, sales channels, customer segments
Visual: Color-coded rectangles where row meets column
Example: Dark blue = high value, light blue = low value
Visual: Scale showing what each color represents
Example: "0-100: White, 100-500: Light Blue, 500+: Dark Blue"
Scenario: Marketing team wants to find the best times to post content
| Day / Hour | 9 AM | 12 PM | 3 PM | 6 PM | 9 PM |
|---|---|---|---|---|---|
| Monday | 450 | 1,200 | 850 | 600 | 350 |
| Tuesday | 500 | 1,400 | 1,100 | 650 | 400 |
| Wednesday | 550 | 1,350 | 1,050 | 700 | 450 |
| Thursday | 580 | 1,500 | 1,150 | 800 | 500 |
| Friday | 600 | 1,100 | 900 | 1,600 | 1,250 |
| Saturday | 300 | 650 | 850 | 1,200 | 1,300 |
| Sunday | 250 | 550 | 700 | 950 | 1,100 |
Color scale: Light blue (low traffic) → Dark blue (high traffic)
Key insights:
Scenario: Company tracking which products sell best in each region
| Product / Region | North | South | East | West |
|---|---|---|---|---|
| Product A | $85K | $45K | $62K | $78K |
| Product B | $42K | $95K | $38K | $55K |
| Product C | $58K | $48K | $88K | $72K |
| Product D | $35K | $28K | $42K | $98K |
| Product E | $72K | $65K | $92K | $48K |
Color scale: Light blue ($25-50K) → Dark blue ($85-100K)
Key insights:
Scenario: Analyzing relationships between marketing channels and conversions
| Variable | Social | Search | Direct | Referral | |
|---|---|---|---|---|---|
| 1.00 | 0.72 | 0.45 | 0.23 | 0.51 | |
| Social | 0.72 | 1.00 | 0.58 | 0.31 | 0.65 |
| Search | 0.45 | 0.58 | 1.00 | 0.42 | 0.49 |
| Direct | 0.23 | 0.31 | 0.42 | 1.00 | 0.28 |
| Referral | 0.51 | 0.65 | 0.49 | 0.28 | 1.00 |
Color scale: Light green (weak correlation 0-0.4) → Dark green (strong correlation 0.7-1.0)
Key insights:
Choosing the right color scale is critical for effective heatmaps.
Use when: Data goes from low to high (one direction)
Colors: Single hue progressing from light to dark
Examples:
Use for: Sales amounts, website traffic, temperature
Use when: Data has a meaningful midpoint (positive and negative)
Colors: Two contrasting hues meeting at neutral center
Examples:
Use for: Profit/loss, sentiment scores, above/below average
Use when: Data represents distinct categories (not ordered)
Colors: Distinct, different colors for each category
Examples:
Use for: Product types, customer segments, status categories
Think of a heatmap like a thermal imaging camera. It instantly shows you where things are "hot" (high activity, high correlation, high sales) and where they're "cold" (low values), allowing you to spot patterns that would be invisible in a table of numbers.
A histogram shows the distribution of numerical data by dividing values into ranges (bins) and displaying the frequency (count) of values in each bin. It reveals the shape and spread of your data.
Imagine you have 100 coins of various ages:
This visualization instantly shows: Are most coins new? Old? Evenly distributed?
Analytics parallel: 100 student test scores divided into ranges (0-10, 11-20, 21-30...) showing how many students scored in each range. The pattern reveals if most students did well, poorly, or if scores were spread evenly.
Bins are the ranges that divide your numerical data. Choosing the right bin size is crucial.
Data: 50 students took a test (scores 0-100)
Option 1: Wide bins (0-25, 26-50, 51-75, 76-100)
Insight: Most students scored between 51-100, but you lose detail
Option 2: Narrow bins (0-10, 11-20, 21-30... 91-100)
Insight: Two distinct groups of students (maybe different study approaches?)
Rule of thumb: Start with 5-15 bins, then adjust based on your data volume
These look similar but serve different purposes:
Data type: Continuous numerical data
X-axis: Ranges of values (bins)
Visual: No gaps between bars (continuous)
Purpose: Show distribution and frequency
Examples: Heights, ages, temperatures, test scores, income
Data type: Categorical data
X-axis: Distinct categories
Visual: Gaps between bars (discrete)
Purpose: Compare quantities across categories
Examples: Sales by region, products, departments, months
Histogram: "How are 100 students' heights distributed?" → Shows the spread
Bar chart: "How many students are in each grade?" → Compares categories
If you can rearrange the categories without losing meaning, it's a bar chart. If the order matters (because it's a range), it's a histogram.
The shape of a histogram reveals important patterns about your data.
Shape: Symmetrical, highest in the middle, tapering on both sides
Example: Heights of adult women: most around 5'4"-5'6", fewer very short or very tall
What it means: Most values cluster around the average, natural variation
In practice: Test scores in a well-designed exam, measurement errors, many natural phenomena
Shape: Peak on left, long tail extending right
Example: Income distribution: many people earn $30-70K, few earn $500K+
What it means: Most values are low-to-moderate, with some high outliers
In practice: Salaries, house prices, website session duration
Shape: Peak on right, long tail extending left
Example: Age at retirement: most retire 60-70, few retire very early (40s)
What it means: Most values are high, with some low outliers
In practice: Test scores on easy exams, age of death in developed countries
Shape: Bars roughly equal height across all bins
Example: Rolling a fair die: each number (1-6) appears ~equally often
What it means: All values equally likely, no clustering
In practice: Random number generators, birthdays across year (roughly)
Shape: Two distinct peaks with a valley between
Example: Gym attendance: high morning (6-8 AM) and evening (5-7 PM), low midday
What it means: Two distinct groups or patterns in the data
In practice: Mixed populations, commute times (morning/evening rush)
Scenario 1: Customer Ages at a Toy Store
Scenario 2: Website Page Load Times
Scenario: Teacher analyzing how 80 students performed on an exam
| Score Range | Frequency | Percentage |
|---|---|---|
| 0-10 | 2 | 2.5% |
| 11-20 | 1 | 1.25% |
| 21-30 | 3 | 3.75% |
| 31-40 | 5 | 6.25% |
| 41-50 | 8 | 10% |
| 51-60 | 12 | 15% |
| 61-70 | 18 | 22.5% |
| 71-80 | 15 | 18.75% |
| 81-90 | 10 | 12.5% |
| 91-100 | 6 | 7.5% |
Key insights:
Scenario: E-commerce company analyzing 1,000 customer ages
| Age Range | Count | % of Total |
|---|---|---|
| 18-25 | 280 | 28% |
| 26-35 | 350 | 35% |
| 36-45 | 210 | 21% |
| 46-55 | 100 | 10% |
| 56-65 | 45 | 4.5% |
| 66+ | 15 | 1.5% |
Key insights:
Scenario: SaaS company measuring API response times (milliseconds)
| Response Time (ms) | Frequency | Cumulative % |
|---|---|---|
| 0-50 | 3,200 | 32% |
| 51-100 | 4,500 | 77% |
| 101-150 | 1,800 | 95% |
| 151-200 | 350 | 98.5% |
| 201-300 | 100 | 99.5% |
| 301+ | 50 | 100% |
Key insights:
Think of a histogram as a mountain range viewed from the side. The height of each section shows how many data points "pile up" in that range. Peaks show where data clusters, valleys show gaps, and the overall silhouette reveals the data's natural shape.
A box plot (box-and-whisker plot) provides a statistical summary of a dataset's distribution, showing the median, quartiles, range, and outliers all in one compact visualization.
Imagine sorting 100 packages by weight:
A box plot shows all these statistics in one simple graphic.
Analytics parallel: 100 customer order values: quickly see the median order, where the middle 50% fall, and identify unusually high or low orders.
Visual: The rectangular box in the middle
Represents: Middle 50% of the data (25th to 75th percentile)
Example: If box spans 60-80, then 50% of values fall in this range
Visual: Horizontal line cutting through the box
Represents: 50th percentile (middle value)
Example: If line is at 70, half the data is below 70, half above
Visual: Lines extending from the box to minimum/maximum
Represents: Full range of "normal" data (within 1.5× IQR)
Example: Whiskers at 40 and 100 mean data spans this range
Visual: Individual points beyond the whiskers
Represents: Unusual values far from the rest
Example: Dot at 150 is an outlier (investigate why)
Scenario: Company analyzing salaries in three departments
| Statistic | Engineering | Marketing | Sales |
|---|---|---|---|
| Minimum | $55K | $42K | $38K |
| 25th Percentile (Q1) | $75K | $52K | $48K |
| Median (Q2) | $95K | $65K | $62K |
| 75th Percentile (Q3) | $125K | $78K | $85K |
| Maximum | $180K | $95K | $150K |
| Outliers | $250K (CTO) | None | $200K, $220K (top sellers) |
Key insights:
Experiment with creating each specialized chart type using sample datasets.
Select a chart type and configure it with sample data to see how each visualization works.
Sample Dataset: Company Quarterly Profit Breakdown
Exercise: Drag categories to reorder them. See how the waterfall flows from start to end value.
Try: What happens if you put Tax before Operating Expenses? The final value stays the same, but the path changes!
Sample Dataset: SaaS Trial-to-Paid Conversion
Exercise: Adjust the values at each stage. See how conversion rates auto-calculate.
Try: Where's the biggest drop? Between trials (1,500) and completed onboarding (900) - 40% drop-off!
Sample Dataset: Sales by Product × Region (in thousands)
| North | South | East | West | |
|---|---|---|---|---|
| Product A | $52K | $38K | $88K | $65K |
| Product B | $35K | $95K | $42K | $58K |
| Product C | $78K | $55K | $68K | $82K |
Exercise: Change the color scale (sequential vs. diverging). Hover to see exact values.
Try: Spot the pattern - Product A dominates East ($88K), Product B dominates South ($95K)
Sample Dataset: 100 Customer Order Values
Current bin size: $20 increments
Exercise: Use slider to adjust bin size (try $10 increments vs. $50 increments).
Try: Too few bins ($50) hides detail. Too many bins ($5) creates noise. $20 is just right to see the shape!
For each scenario, select which chart type is most appropriate:
Scenario: Show how starting inventory of 1,000 units became 850 units after sales, returns, and new shipments.
Answer: Waterfall chart - Sequential additions/subtractions from start to end
Scenario: Track how 5,000 job applicants became 10 final hires through screening stages.
Answer: Funnel chart - Progressive filtering through sequential stages
Scenario: Display which hours of the week have the most customer service calls.
Answer: Heatmap - Patterns across two dimensions (day × hour)
Scenario: Show the spread of employee ages to identify if most are young, old, or evenly distributed.
Answer: Histogram - Distribution of continuous numerical data
Scenario: Compare salary distributions across five departments to see median, range, and outliers.
Answer: Box plot - Statistical summary for multiple groups
Company Profit Breakdown:
Questions:
E-commerce Funnel:
Calculate:
Call Volume by Day × Hour (darker = more calls):
Questions:
Website Session Duration (minutes):
Questions:
Determine if each scenario needs a waterfall or funnel chart:
Scenario: Monthly budget started at $50K, added $20K revenue, subtracted $15K expenses, subtracted $8K debt payment = $47K remaining
Answer: Waterfall (shows + and - changes to a running total)
Scenario: 1,000 leads → 300 qualified → 100 demos → 40 proposals → 15 closed deals
Answer: Funnel (sequential filtering with drop-offs)
Scenario: Website had 500 users, gained 200 new signups, lost 50 to churn = 650 total users
Answer: Waterfall (net change from start to end)
Scenario: 10,000 students applied → 2,000 admitted → 800 enrolled → 750 completed first year
Answer: Funnel (progressive reduction through stages)
Scenario: You have data on product returns by category and month. January-December (columns) × Electronics, Clothing, Home Goods, Toys (rows).
Questions:
Identify if each should be a histogram or bar chart:
Data: Customer satisfaction scores (1-5 stars) and count of responses for each rating
Answer: Bar chart (5 distinct categories, not continuous ranges)
Data: Employee ages from 22 to 68, grouped into ranges
Answer: Histogram (continuous numerical data, ranges)
Data: Sales by region (North, South, East, West)
Answer: Bar chart (categorical regions)
Data: Website load times from 0.1s to 8.5s, grouped into time ranges
Answer: Histogram (continuous numerical data)
Data: Number of orders per month (Jan, Feb, Mar... Dec)
Answer: Bar chart (12 distinct time categories)
Exam Scores - Box Plot Statistics:
Questions:
Scenario: You're analyzing your coffee shop's performance.
For each question, choose the best chart type:
Show how we went from $10K opening cash to $12.5K closing cash through daily sales and expenses
Chart: Waterfall
Track how 500 website visitors became 85 online orders through browsing stages
Chart: Funnel
Identify which day of week × hour of day has most foot traffic
Chart: Heatmap
Understand the distribution of transaction amounts (are most orders small or large?)
Chart: Histogram
Compare tip amounts across morning, afternoon, and evening shifts (median, range, outliers)
Chart: Box plot
Identify what's wrong with each visualization:
Waterfall chart: Categories are: Revenue, Taxes, COGS, Operating Expenses, Net Profit
Error: Illogical order - COGS and Operating Expenses should come before Taxes
Funnel chart: Stages show: 100 leads → 150 qualified → 80 demos → 30 closed
Error: Stage 2 (150) is larger than Stage 1 (100) - funnels should only decrease
Heatmap: Uses red for low values and green for high values
Error: Counterintuitive - typically red = high/hot, blue/light = low/cold
Histogram: Shows sales by region (North, South, East, West) with gaps between bars
Error: This should be a bar chart (categorical data), and gaps are fine for bar charts but wrong for histograms
Box plot: The median line is outside the box
Error: Median must always be inside the box (between Q1 and Q3)
Scenario: Your company's quarterly sales went from $200K to $285K. Here are the components:
Tasks:
Test your understanding of specialized chart types with instant feedback!
1. A company wants to show how their revenue of $500K became a net profit of $80K after various expenses. Which chart type is most appropriate?
2. A sales funnel shows: 1,000 leads → 400 qualified → 120 demos → 30 closed. What is the demo-to-closed conversion rate?
3. You're creating a heatmap showing profit/loss by product and month. Some products have positive profits, others have losses. Which color scale is most appropriate?
4. Which of the following datasets should be displayed as a histogram rather than a bar chart?
5. A histogram of customer order values shows most orders around $50, very few orders below $20, and a long tail extending up to $500. What distribution shape is this?
6. What is WRONG with this waterfall chart practice?
7. In a box plot, what does the line inside the box represent?
8. An e-commerce company wants to identify which day of week and hour of day combinations generate the most website traffic to optimize their email campaign scheduling. Which chart type would be MOST effective?
9. Which advanced chart type is best for showing the distribution of a single continuous variable like customer ages?
10. What is a Sankey diagram best used for?