
Scatter Plot: How to Show Relationships Between Two Variables
A scatter plot helps reveal relationships between two numerical variables by placing individual observations on an x–y coordinate system. Use it to explore patterns, correlations, clusters, and outliers that might not be visible in tables or summary statistics.
Turn to a scatter plot when understanding how variables interact is more important than comparing totals or categories. The distribution of points often tells the story: tight clusters may suggest strong relationships, while wide dispersion may indicate variability or independence between variables.
Scatter plots are especially powerful because they preserve the raw structure of the data. Instead of aggregating values into bars or averages, each observation remains visible, allowing deeper exploration of trends and anomalies.
Related Charts and Graphs
Several visuals relate closely to scatter plots and may be better choices depending on the goal.
- Line graphs emphasize trends over time rather than relationships between variables.
- Hexbin plots reduce overplotting when datasets become extremely dense.
- Bubble charts add a third variable using size encoding.
- Pair plots expand scatterplots into a matrix for exploring many variables at once.
- Connected scatter plots show progression or movement across points rather than independent observations.
Choosing between these options depends on whether the focus is density, comparison, or multidimensional exploration.
What a Scatter Plot Is
A scatter plot displays individual data points positioned according to two numerical variables. Each point A scatter plot displays individual data points positioned according to two quantitative variables. Each point represents a single observation, allowing viewers to see how values align or vary together.
Common elements include:
- A horizontal axis representing one variable
- A vertical axis representing a second variable
- Markers that represent individual observations
- Optional trend lines or regression curves
- Labels or annotations highlighting key insights
Unlike charts that aggregate or summarize data, scatter plots preserve detail. This makes them ideal for exploratory analysis, hypothesis testing, and uncovering hidden patterns.
Patterns in a scatter plot can take many forms. Points may form an upward trend, suggesting a positive relationship, or a downward pattern indicating a negative relationship. In some cases, points may cluster into groups or form curved shapes that hint at more complex relationships.
When to Use a Scatter Plot
Use a scatter plot when the goal is to understand how two variables relate or influence one another.
This visual works especially well when the goal is to:
- Identify positive or negative relationships
- Explore correlations between measurements
- Detect clusters or groupings
- Spot anomalies or outliers
- Support exploratory data analysis
- Evaluate assumptions before modeling or forecasting
Scatter plots are especially valuable early in an analysis process. They help reveal structure before more complex visualizations or statistical techniques are applied.
Scatter plots also work well when communicating uncertainty. Instead of presenting a single summary number, the viewer sees the full spread of observations, which can help avoid oversimplification.
Types of Data Sets That Work Best for a Scatter Plot
Scatter plots work best with paired quantitative data, where each observation contains two measurable values.
Strong candidates include:
- Height vs. weight measurements
- Study time vs. exam performance
- Price vs. demand or revenue
- Temperature vs. humidity or rainfall
- Risk vs. return in financial analysis
- Engagement metrics such as impressions vs. conversions
Datasets that include dozens or hundreds of observations often produce the most meaningful scatter plots, since patterns become more visible with increased sample size.
Scatter plots can also incorporate additional encodings. Color might distinguish groups, while marker size can represent a third variable. However, these enhancements should be used sparingly to maintain readability.
Real-World Examples of a Scatter Plot
Scatter plots appear frequently in research, analytics, and journalism whenever relationships between variables need to be explored.
Academic or Educational Data


Show how study habits or participation relate to performance outcomes.
Financial or Economic Analysis
Reveal relationships between economic indicators or market variables.
Marketing or Customer Behavior Analysis
Analyze how engagement metrics relate to conversion or performance.
Health or Demographic Studies


Show relationships between health indicators or demographic factors.
What to Avoid or Be Careful Of with a Scatter Plot
❌ Don’t assume correlation equals causation
Patterns suggest relationships but do not prove cause.
❌ Don’t ignore overplotting
Large datasets may require transparency, jittering, or alternative visuals like hexbin plots.
❌ Don’t mix scales without context
Axes should be labeled clearly to avoid misinterpretation.
❌ Don’t overload with too many encodings
Color, size, and shape should be used sparingly to maintain clarity.
❌ Don’t use it when categories are the main focus
Bar charts or grouped visuals may communicate categorical comparisons more effectively.
*Content on this page was curated and edited by expert humans with the creative assistance of AI.