Reading a Waterfall Chart

Follow these steps to interpret a waterfall chart:

  1. Start at the left: Find the initial value (usually blue)
  2. Follow the steps: Green bars add, red bars subtract
  3. Track the running total: Each bar starts where the previous one ended
  4. End at the right: The final bar shows the result of all changes

Waterfall Chart Use Cases

Waterfall charts excel at showing sequential financial changes and inventory movements.

Example 1: Company Profit Breakdown

Scenario: A company wants to show how they went from revenue to net profit

Category Amount Type
Revenue $500,000 Start (Blue)
Cost of Goods Sold -$200,000 Negative (Red)
Operating Expenses -$150,000 Negative (Red)
Marketing Costs -$50,000 Negative (Red)
Taxes -$30,000 Negative (Red)
Net Profit $70,000 End (Blue)

Visual flow:

Insight: "We started with $500K in revenue, but after all expenses, we have $70K in net profit—a 14% profit margin."

Example 2: Cash Flow Statement

Scenario: Tracking monthly cash changes

Activity Amount Running Total
Beginning Cash $100,000 $100,000
Sales Revenue +$85,000 $185,000
Payroll -$45,000 $140,000
Rent & Utilities -$12,000 $128,000
Equipment Purchase -$20,000 $108,000
Loan Payment -$8,000 $100,000
Ending Cash $0 change $100,000

Insight: "Despite $85K in revenue, we ended the month at the same cash level due to major expenses including equipment purchase."

Example 3: Inventory Flow

Scenario: Warehouse tracking monthly inventory changes

Event Units Total
Starting Inventory 1,200 1,200
Received Shipment A +500 1,700
Sales - Week 1 -300 1,400
Sales - Week 2 -350 1,050
Received Shipment B +600 1,650
Sales - Week 3 -400 1,250
Sales - Week 4 -250 1,000
Damaged Goods -50 950
Ending Inventory -250 net 950

Insight: "Started with 1,200 units, received 1,100 more, sold 1,300, lost 50 to damage = 950 units remaining (21% decrease)"

Waterfall Chart Best Practices

✅ DO

  • Use consistent colors: Green for increases, red for decreases, blue for totals
  • Label every bar: Show both the category name and value
  • Order logically: Arrange categories in a meaningful sequence
  • Show the running total: Help viewers track cumulative changes
  • Clearly mark start and end: Make totals stand out visually
  • Add data labels: Display exact values on or near bars

❌ DON'T

  • Mix up the order: Random sequences confuse the narrative
  • Use unclear colors: Avoid purple for positive and orange for negative
  • Omit labels: Viewers need to know what each bar represents
  • Hide the starting value: Context is essential
  • Use too many categories: More than 10-12 bars becomes cluttered
  • Forget connector lines: They help show the flow

Mental Model: The Bridge

Think of a waterfall chart as a bridge connecting two islands (starting and ending values). Each bar is a support pillar—some lifting you up (positive), others bringing you down (negative). Following the path shows exactly how you traveled from start to finish.

Part 2: Funnel Charts

A funnel chart visualizes stages in a sequential process where volume progressively decreases. The width of each stage represents the quantity or percentage, and the narrowing shows drop-off at each step.

⏳ Real-Life Analogy: Filtering Coffee

When you make coffee with a pour-over filter:

At each stage, some volume is lost. The funnel shape shows the progressive reduction.

Analytics parallel: 1,000 website visitors → 300 sign up for trial → 80 complete onboarding → 25 become paying customers

Visual Structure of Funnel Charts

1

Top Stage (Widest)

Visual: The widest section representing the largest volume

Example: "10,000 website visitors"

Represents: Everyone who enters the process

2

Middle Stages

Visual: Progressively narrower sections

Example: "2,000 create account" → "500 start trial" → "150 complete setup"

Represents: Filtering at each qualification step

3

Bottom Stage (Narrowest)

Visual: The smallest section representing final conversions

Example: "80 paid customers"

Represents: Successful completions of the entire process

Reading a Funnel Chart

  1. Start at the top: This is the broadest stage with the most volume
  2. Move downward: Each stage filters out some portion
  3. Note the width: Wider = more volume, narrower = fewer remaining
  4. Look for big drops: Large gaps between stages indicate problem areas
  5. Check percentages: Conversion rate from one stage to the next

Funnel Chart Use Cases

Example 1: Sales Pipeline Funnel

Scenario: B2B company tracking deals from lead to closed customer

Stage Count % of Previous % of Total
Leads Generated 1,000 - 100%
Qualified Leads 400 40% 40%
Demo Scheduled 150 37.5% 15%
Proposal Sent 80 53.3% 8%
Negotiation 50 62.5% 5%
Closed Won 25 50% 2.5%

Key insights:

Calculation: To convert 100 customers, you need: 100 ÷ 0.025 = 4,000 leads at the top

Example 2: E-Commerce Conversion Funnel

Scenario: Online store tracking visitor-to-purchase journey

Stage Users Conversion Rate Drop-off
Website Visitors 50,000 - -
Product Page Views 15,000 30% 35,000 left
Add to Cart 3,000 20% 12,000 left
Begin Checkout 1,500 50% 1,500 abandoned cart
Enter Payment Info 1,200 80% 300 left
Purchase Complete 1,000 83.3% 200 left

Key insights:

Example 3: Recruitment Funnel

Scenario: HR tracking hiring process from application to hire

Stage Candidates % Advancing
Applications Received 500 -
Resume Screening Pass 150 30%
Phone Screen Complete 100 67%
First Interview 40 40%
Second Interview 20 50%
Offer Extended 8 40%
Offer Accepted (Hired) 6 75%

Key insights:

Calculating Conversion Rates

Conversion rates measure the percentage of people who advance from one stage to the next.

The Formula

Conversion Rate = (Next Stage Count / Current Stage Count) × 100

Example: Multi-Stage Conversion Calculation

Data:

Step-by-step calculations:

  1. Visitor to Signup: 300 / 1,000 × 100 = 30%
  2. Signup to Trial: 100 / 300 × 100 = 33.3%
  3. Trial to Paid: 40 / 100 × 100 = 40%
  4. Overall (End-to-End): 40 / 1,000 × 100 = 4%

Insights:

Where to Optimize

Focus your improvement efforts on:

Example: Improving Stage 1 from 30% to 35% gives you 50 more signups, which cascade through the entire funnel. Improving Stage 3 from 33% to 38% only gives you 15 more trial users.

Funnel Chart Best Practices

✅ DO

  • Order stages correctly: Top to bottom must follow actual process flow
  • Show both numbers and percentages: Count + conversion rate
  • Highlight drop-offs: Call attention to biggest losses
  • Label each stage clearly: Viewers should understand immediately
  • Use consistent colors: Same color for all stages or gradient
  • Include benchmarks: "Industry average: 3.5%" for context

❌ DON'T

  • Mix up stage order: Stages must be sequential
  • Show only percentages: Absolute numbers matter too
  • Make stages equal width: Defeats the purpose of the funnel
  • Use for non-sequential data: Funnel implies a process flow
  • Overcomplicate with too many stages: 5-7 stages is ideal
  • Forget the denominator: Always clarify "% of what?"

Mental Model: The Filter

Think of a funnel chart like a series of increasingly fine sieves or filters. Each stage lets through only what meets the criteria. The narrowing shape instantly shows where your process is leaky and needs attention.

Part 3: Heatmaps

A heatmap uses color intensity to show patterns across two dimensions. Instead of reading numbers in a table, you instantly see hot spots (high values) and cold spots (low values) through color.

🌡️ Real-Life Analogy: Weather Map

A weather temperature map shows a region with colors:

At a glance, you see temperature patterns across the entire region without reading every number.

Analytics parallel: A sales heatmap shows which product-region combinations are hot (high sales) or cold (low sales) using color instead of requiring you to read 50+ numbers.

Visual Structure of Heatmaps

1

Rows (Y-axis)

Visual: Horizontal categories

Example: Days of the week, products, regions

2

Columns (X-axis)

Visual: Vertical categories

Example: Hours of the day, sales channels, customer segments

3

Cells (Intersection)

Visual: Color-coded rectangles where row meets column

Example: Dark blue = high value, light blue = low value

4

Color Legend

Visual: Scale showing what each color represents

Example: "0-100: White, 100-500: Light Blue, 500+: Dark Blue"

Reading a Heatmap

  1. Check the legend: Understand what the colors mean
  2. Scan for dark/bright areas: These are your extremes (high or low)
  3. Look for patterns: Clusters, rows, columns, or diagonals
  4. Find intersections: Where specific row + column meet
  5. Compare regions: Which areas are hotter or cooler than others?

Heatmap Use Cases

Example 1: Website Activity by Day and Hour

Scenario: Marketing team wants to find the best times to post content

Day / Hour 9 AM 12 PM 3 PM 6 PM 9 PM
Monday 450 1,200 850 600 350
Tuesday 500 1,400 1,100 650 400
Wednesday 550 1,350 1,050 700 450
Thursday 580 1,500 1,150 800 500
Friday 600 1,100 900 1,600 1,250
Saturday 300 650 850 1,200 1,300
Sunday 250 550 700 950 1,100

Color scale: Light blue (low traffic) → Dark blue (high traffic)

Key insights:

Example 2: Sales by Product and Region

Scenario: Company tracking which products sell best in each region

Product / Region North South East West
Product A $85K $45K $62K $78K
Product B $42K $95K $38K $55K
Product C $58K $48K $88K $72K
Product D $35K $28K $42K $98K
Product E $72K $65K $92K $48K

Color scale: Light blue ($25-50K) → Dark blue ($85-100K)

Key insights:

Example 3: Correlation Matrix

Scenario: Analyzing relationships between marketing channels and conversions

Variable Email Social Search Direct Referral
Email 1.00 0.72 0.45 0.23 0.51
Social 0.72 1.00 0.58 0.31 0.65
Search 0.45 0.58 1.00 0.42 0.49
Direct 0.23 0.31 0.42 1.00 0.28
Referral 0.51 0.65 0.49 0.28 1.00

Color scale: Light green (weak correlation 0-0.4) → Dark green (strong correlation 0.7-1.0)

Key insights:

Color Scales for Heatmaps

Choosing the right color scale is critical for effective heatmaps.

Sequential Scale

Use when: Data goes from low to high (one direction)

Colors: Single hue progressing from light to dark

Examples:

  • White → Light Blue → Dark Blue
  • Light Yellow → Orange → Dark Red
  • Light Green → Dark Green

Use for: Sales amounts, website traffic, temperature

Diverging Scale

Use when: Data has a meaningful midpoint (positive and negative)

Colors: Two contrasting hues meeting at neutral center

Examples:

  • Red → White → Blue
  • Red → Yellow → Green
  • Purple → Gray → Orange

Use for: Profit/loss, sentiment scores, above/below average

Categorical Scale

Use when: Data represents distinct categories (not ordered)

Colors: Distinct, different colors for each category

Examples:

  • Red, Blue, Green, Yellow, Purple
  • Each color = different product, region, or status

Use for: Product types, customer segments, status categories

Color Accessibility

Heatmap Best Practices

✅ DO

  • Include a clear legend: Show the color scale and what it represents
  • Choose appropriate colors: Sequential for one direction, diverging for two
  • Label axes clearly: Viewers should understand row/column categories
  • Use consistent intervals: Equal steps in your color scale
  • Keep cells same size: Unless intentionally showing hierarchy
  • Add hover values: In interactive versions, show exact numbers

❌ DON'T

  • Use too many colors: More than 5-7 shades becomes confusing
  • Omit the legend: Colors are meaningless without context
  • Use random color orders: Colors should progress logically
  • Make cells too small: Patterns become invisible
  • Forget to sort: Clustering similar rows/columns reveals patterns
  • Mix categorical and sequential: Use one color scheme type

Mental Model: The Thermal Camera

Think of a heatmap like a thermal imaging camera. It instantly shows you where things are "hot" (high activity, high correlation, high sales) and where they're "cold" (low values), allowing you to spot patterns that would be invisible in a table of numbers.

Part 4: Histograms

A histogram shows the distribution of numerical data by dividing values into ranges (bins) and displaying the frequency (count) of values in each bin. It reveals the shape and spread of your data.

📊 Real-Life Analogy: Sorting Coins

Imagine you have 100 coins of various ages:

This visualization instantly shows: Are most coins new? Old? Evenly distributed?

Analytics parallel: 100 student test scores divided into ranges (0-10, 11-20, 21-30...) showing how many students scored in each range. The pattern reveals if most students did well, poorly, or if scores were spread evenly.

Understanding Bins

Bins are the ranges that divide your numerical data. Choosing the right bin size is crucial.

Example: Test Scores with Different Bin Sizes

Data: 50 students took a test (scores 0-100)

Option 1: Wide bins (0-25, 26-50, 51-75, 76-100)

Insight: Most students scored between 51-100, but you lose detail

Option 2: Narrow bins (0-10, 11-20, 21-30... 91-100)

Insight: Two distinct groups of students (maybe different study approaches?)

Rule of thumb: Start with 5-15 bins, then adjust based on your data volume

Histogram vs. Bar Chart

These look similar but serve different purposes:

Histogram

Data type: Continuous numerical data

X-axis: Ranges of values (bins)

Visual: No gaps between bars (continuous)

Purpose: Show distribution and frequency

Examples: Heights, ages, temperatures, test scores, income

Bar Chart

Data type: Categorical data

X-axis: Distinct categories

Visual: Gaps between bars (discrete)

Purpose: Compare quantities across categories

Examples: Sales by region, products, departments, months

Key Difference

Histogram: "How are 100 students' heights distributed?" → Shows the spread

Bar chart: "How many students are in each grade?" → Compares categories

If you can rearrange the categories without losing meaning, it's a bar chart. If the order matters (because it's a range), it's a histogram.

Distribution Shapes

The shape of a histogram reveals important patterns about your data.

📊

Normal Distribution (Bell Curve)

Shape: Symmetrical, highest in the middle, tapering on both sides

Example: Heights of adult women: most around 5'4"-5'6", fewer very short or very tall

What it means: Most values cluster around the average, natural variation

In practice: Test scores in a well-designed exam, measurement errors, many natural phenomena

📈

Right-Skewed (Tail on Right)

Shape: Peak on left, long tail extending right

Example: Income distribution: many people earn $30-70K, few earn $500K+

What it means: Most values are low-to-moderate, with some high outliers

In practice: Salaries, house prices, website session duration

📉

Left-Skewed (Tail on Left)

Shape: Peak on right, long tail extending left

Example: Age at retirement: most retire 60-70, few retire very early (40s)

What it means: Most values are high, with some low outliers

In practice: Test scores on easy exams, age of death in developed countries

Uniform (Flat)

Shape: Bars roughly equal height across all bins

Example: Rolling a fair die: each number (1-6) appears ~equally often

What it means: All values equally likely, no clustering

In practice: Random number generators, birthdays across year (roughly)

🏔️

Bimodal (Two Peaks)

Shape: Two distinct peaks with a valley between

Example: Gym attendance: high morning (6-8 AM) and evening (5-7 PM), low midday

What it means: Two distinct groups or patterns in the data

In practice: Mixed populations, commute times (morning/evening rush)

Example: Recognizing Distribution Types

Scenario 1: Customer Ages at a Toy Store

Scenario 2: Website Page Load Times

Histogram Use Cases

Example 1: Student Test Score Distribution

Scenario: Teacher analyzing how 80 students performed on an exam

Score Range Frequency Percentage
0-10 2 2.5%
11-20 1 1.25%
21-30 3 3.75%
31-40 5 6.25%
41-50 8 10%
51-60 12 15%
61-70 18 22.5%
71-80 15 18.75%
81-90 10 12.5%
91-100 6 7.5%

Key insights:

Example 2: Customer Age Distribution

Scenario: E-commerce company analyzing 1,000 customer ages

Age Range Count % of Total
18-25 280 28%
26-35 350 35%
36-45 210 21%
46-55 100 10%
56-65 45 4.5%
66+ 15 1.5%

Key insights:

Example 3: Response Time Analysis

Scenario: SaaS company measuring API response times (milliseconds)

Response Time (ms) Frequency Cumulative %
0-50 3,200 32%
51-100 4,500 77%
101-150 1,800 95%
151-200 350 98.5%
201-300 100 99.5%
301+ 50 100%

Key insights:

Histogram Best Practices

✅ DO

  • Choose appropriate bin size: Not too wide (lose detail) or too narrow (too noisy)
  • Start at minimum value: X-axis should begin at lowest data point
  • No gaps between bars: Continuous data means continuous bars
  • Label axes clearly: X = "Test Score", Y = "Number of Students"
  • Equal bin widths: All ranges should be same size (e.g., all 10-point ranges)
  • Show the shape: Let the distribution tell its story

❌ DON'T

  • Use for categorical data: Use a bar chart instead
  • Leave gaps between bars: Defeats the purpose of showing continuity
  • Use inconsistent bin widths: Makes comparison misleading
  • Start Y-axis above zero: Distorts the visual proportions
  • Use too many bins: Creates noise instead of pattern
  • Use too few bins: Hides important distribution details

Mental Model: The Mountain Range

Think of a histogram as a mountain range viewed from the side. The height of each section shows how many data points "pile up" in that range. Peaks show where data clusters, valleys show gaps, and the overall silhouette reveals the data's natural shape.

Part 5: Box Plots (Bonus)

A box plot (box-and-whisker plot) provides a statistical summary of a dataset's distribution, showing the median, quartiles, range, and outliers all in one compact visualization.

📦 Real-Life Analogy: Package Sorting

Imagine sorting 100 packages by weight:

A box plot shows all these statistics in one simple graphic.

Analytics parallel: 100 customer order values: quickly see the median order, where the middle 50% fall, and identify unusually high or low orders.

Reading a Box Plot

1

The Box (Interquartile Range - IQR)

Visual: The rectangular box in the middle

Represents: Middle 50% of the data (25th to 75th percentile)

Example: If box spans 60-80, then 50% of values fall in this range

2

The Line Inside (Median)

Visual: Horizontal line cutting through the box

Represents: 50th percentile (middle value)

Example: If line is at 70, half the data is below 70, half above

3

The Whiskers (Range)

Visual: Lines extending from the box to minimum/maximum

Represents: Full range of "normal" data (within 1.5× IQR)

Example: Whiskers at 40 and 100 mean data spans this range

4

The Dots (Outliers)

Visual: Individual points beyond the whiskers

Represents: Unusual values far from the rest

Example: Dot at 150 is an outlier (investigate why)

Example: Comparing Salaries Across Departments

Scenario: Company analyzing salaries in three departments

Statistic Engineering Marketing Sales
Minimum $55K $42K $38K
25th Percentile (Q1) $75K $52K $48K
Median (Q2) $95K $65K $62K
75th Percentile (Q3) $125K $78K $85K
Maximum $180K $95K $150K
Outliers $250K (CTO) None $200K, $220K (top sellers)

Key insights:

When to Use Box Plots

Interactive Specialized Chart Builder

Experiment with creating each specialized chart type using sample datasets.

🛠️ Chart Builder Tool

Select a chart type and configure it with sample data to see how each visualization works.

1. Waterfall Chart Builder

Sample Dataset: Company Quarterly Profit Breakdown

  • Starting Revenue: $500,000
  • Cost of Goods: -$180,000
  • Operating Expenses: -$120,000
  • Marketing: -$45,000
  • Tax: -$35,000
  • Net Profit: $120,000

Exercise: Drag categories to reorder them. See how the waterfall flows from start to end value.

Try: What happens if you put Tax before Operating Expenses? The final value stays the same, but the path changes!

2. Funnel Chart Builder

Sample Dataset: SaaS Trial-to-Paid Conversion

  • Website Visitors: 10,000
  • Trial Signups: 1,500 (15% conversion)
  • Completed Onboarding: 900 (60% conversion)
  • Used Product 3+ Times: 450 (50% conversion)
  • Paid Customers: 180 (40% conversion)

Exercise: Adjust the values at each stage. See how conversion rates auto-calculate.

Try: Where's the biggest drop? Between trials (1,500) and completed onboarding (900) - 40% drop-off!

3. Heatmap Builder

Sample Dataset: Sales by Product × Region (in thousands)

North South East West
Product A $52K $38K $88K $65K
Product B $35K $95K $42K $58K
Product C $78K $55K $68K $82K

Exercise: Change the color scale (sequential vs. diverging). Hover to see exact values.

Try: Spot the pattern - Product A dominates East ($88K), Product B dominates South ($95K)

4. Histogram Builder

Sample Dataset: 100 Customer Order Values

Current bin size: $20 increments

  • $0-$20: 12 orders
  • $21-$40: 28 orders
  • $41-$60: 35 orders (peak)
  • $61-$80: 18 orders
  • $81-$100: 5 orders
  • $100+: 2 orders (outliers)

Exercise: Use slider to adjust bin size (try $10 increments vs. $50 increments).

Try: Too few bins ($50) hides detail. Too many bins ($5) creates noise. $20 is just right to see the shape!

Experiment Tips

Practice Exercises

Exercise 1: Choose the Right Chart

For each scenario, select which chart type is most appropriate:

  1. Scenario: Show how starting inventory of 1,000 units became 850 units after sales, returns, and new shipments.

    Answer: Waterfall chart - Sequential additions/subtractions from start to end

  2. Scenario: Track how 5,000 job applicants became 10 final hires through screening stages.

    Answer: Funnel chart - Progressive filtering through sequential stages

  3. Scenario: Display which hours of the week have the most customer service calls.

    Answer: Heatmap - Patterns across two dimensions (day × hour)

  4. Scenario: Show the spread of employee ages to identify if most are young, old, or evenly distributed.

    Answer: Histogram - Distribution of continuous numerical data

  5. Scenario: Compare salary distributions across five departments to see median, range, and outliers.

    Answer: Box plot - Statistical summary for multiple groups

Exercise 2: Interpret This Waterfall

Company Profit Breakdown:

Questions:

  1. What was the Cost of Goods Sold? (Revenue - After COGS = $800K - $500K = $300K)
  2. What expense category cost the most? (COGS at $300K)
  3. What's the profit margin? ($150K / $800K = 18.75%)

Exercise 3: Calculate Funnel Conversion Rates

E-commerce Funnel:

Calculate:

  1. Homepage → Product Page conversion: (8,000 / 20,000 = 40%)
  2. Add to Cart → Checkout conversion: (1,200 / 2,400 = 50%)
  3. Overall visitor → purchase conversion: (900 / 20,000 = 4.5%)
  4. Which stage has the worst conversion? (Checkout → Purchase: 900/1,200 = 75% is good; Cart → Checkout: 50% is the weakest)

Exercise 4: Read This Heatmap

Call Volume by Day × Hour (darker = more calls):

Questions:

  1. When should you schedule staff breaks? (Friday afternoon, Saturday - low call volume)
  2. When do you need maximum staffing? (Monday and Wednesday mornings)
  3. What pattern do you notice about weekends? (Consistently low volume - could reduce weekend staff)

Exercise 5: Interpret This Histogram

Website Session Duration (minutes):

Questions:

  1. What's the distribution shape? (Right-skewed - most sessions are short, few long ones)
  2. What percentage stay less than 4 minutes? ((450 + 320) / 1,100 = 70%)
  3. Is this concerning? (Yes - 70% of visitors leave within 4 minutes suggests engagement problem)
  4. Where should you focus improvement? (Content in first 4 minutes to keep users engaged longer)

Exercise 6: Waterfall or Funnel?

Determine if each scenario needs a waterfall or funnel chart:

  1. Scenario: Monthly budget started at $50K, added $20K revenue, subtracted $15K expenses, subtracted $8K debt payment = $47K remaining

    Answer: Waterfall (shows + and - changes to a running total)

  2. Scenario: 1,000 leads → 300 qualified → 100 demos → 40 proposals → 15 closed deals

    Answer: Funnel (sequential filtering with drop-offs)

  3. Scenario: Website had 500 users, gained 200 new signups, lost 50 to churn = 650 total users

    Answer: Waterfall (net change from start to end)

  4. Scenario: 10,000 students applied → 2,000 admitted → 800 enrolled → 750 completed first year

    Answer: Funnel (progressive reduction through stages)

Exercise 7: Design a Heatmap

Scenario: You have data on product returns by category and month. January-December (columns) × Electronics, Clothing, Home Goods, Toys (rows).

Questions:

  1. What should the color represent? (Number of returns or return rate)
  2. What color scale should you use? (Sequential: white/light (few returns) to red/dark (many returns))
  3. What patterns might you look for? (Seasonal spikes - toys in January post-holidays, clothing in September post-summer)
  4. How would you identify problem areas? (Dark red cells = high returns for that category-month combo)

Exercise 8: Histogram vs. Bar Chart

Identify if each should be a histogram or bar chart:

  1. Data: Customer satisfaction scores (1-5 stars) and count of responses for each rating

    Answer: Bar chart (5 distinct categories, not continuous ranges)

  2. Data: Employee ages from 22 to 68, grouped into ranges

    Answer: Histogram (continuous numerical data, ranges)

  3. Data: Sales by region (North, South, East, West)

    Answer: Bar chart (categorical regions)

  4. Data: Website load times from 0.1s to 8.5s, grouped into time ranges

    Answer: Histogram (continuous numerical data)

  5. Data: Number of orders per month (Jan, Feb, Mar... Dec)

    Answer: Bar chart (12 distinct time categories)

Exercise 9: Box Plot Interpretation

Exam Scores - Box Plot Statistics:

Questions:

  1. What score did half the students exceed? (74 - the median)
  2. What range contains the middle 50% of scores? (62-85 - the IQR)
  3. Is the distribution symmetric? (No - median (74) is closer to Q3 (85) than Q1 (62), suggesting slight left skew)
  4. What should you do about the outlier score of 18? (Investigate - did student misunderstand? Need help?)

Exercise 10: Real-World Application

Scenario: You're analyzing your coffee shop's performance.

For each question, choose the best chart type:

  1. Show how we went from $10K opening cash to $12.5K closing cash through daily sales and expenses

    Chart: Waterfall

  2. Track how 500 website visitors became 85 online orders through browsing stages

    Chart: Funnel

  3. Identify which day of week × hour of day has most foot traffic

    Chart: Heatmap

  4. Understand the distribution of transaction amounts (are most orders small or large?)

    Chart: Histogram

  5. Compare tip amounts across morning, afternoon, and evening shifts (median, range, outliers)

    Chart: Box plot

Exercise 11: Spot the Error

Identify what's wrong with each visualization:

  1. Waterfall chart: Categories are: Revenue, Taxes, COGS, Operating Expenses, Net Profit

    Error: Illogical order - COGS and Operating Expenses should come before Taxes

  2. Funnel chart: Stages show: 100 leads → 150 qualified → 80 demos → 30 closed

    Error: Stage 2 (150) is larger than Stage 1 (100) - funnels should only decrease

  3. Heatmap: Uses red for low values and green for high values

    Error: Counterintuitive - typically red = high/hot, blue/light = low/cold

  4. Histogram: Shows sales by region (North, South, East, West) with gaps between bars

    Error: This should be a bar chart (categorical data), and gaps are fine for bar charts but wrong for histograms

  5. Box plot: The median line is outside the box

    Error: Median must always be inside the box (between Q1 and Q3)

Exercise 12: Build Your Own Analysis

Scenario: Your company's quarterly sales went from $200K to $285K. Here are the components:

Tasks:

  1. Calculate Q2 ending sales: ($200K + $120K - $45K + $65K - $25K - $30K = $285K)
  2. Order these logically for a waterfall chart: (Start → New customers → Upsells → Churn → Refunds → Discounts → End)
  3. Which change had the biggest positive impact? (New customers +$120K)
  4. Which change had the biggest negative impact? (Churn -$45K)
  5. What's one action you'd recommend? (Reduce churn - it's the largest negative contributor)

📝 Knowledge Check

Test your understanding of specialized chart types with instant feedback!

1. A company wants to show how their revenue of $500K became a net profit of $80K after various expenses. Which chart type is most appropriate?

2. A sales funnel shows: 1,000 leads → 400 qualified → 120 demos → 30 closed. What is the demo-to-closed conversion rate?

3. You're creating a heatmap showing profit/loss by product and month. Some products have positive profits, others have losses. Which color scale is most appropriate?

4. Which of the following datasets should be displayed as a histogram rather than a bar chart?

5. A histogram of customer order values shows most orders around $50, very few orders below $20, and a long tail extending up to $500. What distribution shape is this?

6. What is WRONG with this waterfall chart practice?

7. In a box plot, what does the line inside the box represent?

8. An e-commerce company wants to identify which day of week and hour of day combinations generate the most website traffic to optimize their email campaign scheduling. Which chart type would be MOST effective?

9. Which advanced chart type is best for showing the distribution of a single continuous variable like customer ages?

10. What is a Sankey diagram best used for?

← Previous: Scatter & Area Charts Next: Choosing Visualizations →