Best Data Viz
Relationship

Scatter Plot

Plots two numeric variables as points, revealing correlations, clusters, and outliers.

Study Hours vs Test Score

50 students, midterm exam

View data (50 rows)
Chart data table: Study Hours vs Test Score
HoursScore
135
141
138
242
245
247
240
346
350
353
348
451
455
458
453
556
560
562
558
555
660
664
667
662
665
765
769
772
767
770
871
875
878
872
874
976
980
982
978
979
1081
1084
1088
1083
1086
1187
1190
1192
1293
1295
Make a scatter plot with your data

Use a scatter plot when…

  • Exploring correlation between two variables
  • Spotting outliers and clusters
  • Regression analysis

Avoid when…

  • Categorical data
  • Evenly-sampled time series (a line chart is usually clearer)
  • Too many overlapping points (>1000) — use hexbin or 2D contour

Data it needs

PropertyValue
Min Rows10
Min Columns2
Column Types
numbernumber

Visual anatomy

Marks
circle
Channels
position-xposition-ycolor-hue (optional)size (optional, for bubble variants)
Axes
x-quantitativey-quantitative

Guiding principles

Consider instead

Common mistakes

  • Assuming correlation = causation

  • Overplotting without transparency

  • Ignoring outliers

History

Pioneered by John Herschel in the 1830s; formalized by Francis Galton in the 1880s.

Accessibility notes

Report the correlation coefficient and describe trend as positive/negative/none. When using color to encode a third variable, pick a colorblind-safe palette and pair color with shape or label so the encoding survives a grayscale print.

Related reading

Got data? Let's see what works.

Drop your CSV. You'll get a Scatter Plot plus four alternatives - ranked by which one actually fits your data best.