A SQL JOIN combines rows from two or more tables based on a related column. The ON clause specifies the matching condition — typically a foreign key in one table matching a primary key in another. Without a JOIN, relational data locked in separate tables cannot be combined in a single query result.

Which JOIN is the fastest in SQL?

JOIN performance depends on indexes, data volume, and the query optimizer — not the JOIN type itself. INNER JOIN is often the fastest in practice because it works on the smallest result set (only matching rows). LEFT and RIGHT JOINs must preserve all rows from one side, which can mean more work. CROSS JOIN is the slowest for large tables as it produces a Cartesian product. Always index the JOIN columns and run EXPLAIN to see the actual query plan.

When should I use LEFT JOIN instead of INNER JOIN?

Use LEFT JOIN when you need all rows from the left table regardless of whether matching rows exist in the right table. Common examples: listing all customers including those with zero orders; showing all employees including those not yet assigned to a department; finding records with no related data (WHERE right_table.id IS NULL). If you only want records that have matches in both tables, use INNER JOIN.

What is a SELF JOIN in SQL?

A SELF JOIN joins a table to itself. It is used for hierarchical or recursive relationships stored in a single table — the most common example is an employees table where each row has a manager_id column that references another row in the same table. To self-join, alias the table twice: SELECT e.name, m.name AS manager FROM employees e LEFT JOIN employees m ON e.manager_id = m.id.

What is a Cartesian product in SQL?

A Cartesian product is the result of combining every row from one table with every row from another table, with no matching condition. It is produced by CROSS JOIN or by accidentally omitting the ON clause. A Cartesian product of a 1,000-row table and a 500-row table produces 500,000 rows. Always verify your JOIN conditions to avoid accidental Cartesian explosions.

Can you JOIN more than two tables in SQL?

Yes. SQL allows any number of JOINs in a single query. Each JOIN adds one more table: SELECT ... FROM a JOIN b ON b.a_id = a.id JOIN c ON c.b_id = b.id. The query optimizer processes them in the most efficient order. Always use table aliases and ensure every JOIN has an ON condition to avoid accidental Cartesian products.

Are SQL JOINs expensive?

JOINs on indexed columns are very efficient. The main cost factors are: missing indexes on JOIN columns (leading to full table scans), large intermediate result sets from early JOINs, and Cartesian products from missing conditions. Use EXPLAIN to inspect the query plan, add indexes on foreign key and JOIN columns, and filter early with WHERE to reduce rows before joining.

What is the difference between LEFT JOIN and LEFT OUTER JOIN?

They are identical. OUTER is optional syntax — LEFT JOIN and LEFT OUTER JOIN produce exactly the same result in all SQL databases. Similarly, INNER JOIN and JOIN are equivalent. The OUTER keyword is kept in SQL syntax for historical reasons but has no functional effect.

Home/Blog/SQL JOINs Explained: INNER, LEFT, RIGHT, FULL OUTER & Real Examples

SQL BasicsJOINsInterview PrepPerformance

SQL JOINs Explained: INNER, LEFT, RIGHT, FULL OUTER & Real Examples

Q: What is the difference between INNER JOIN and LEFT JOIN?

INNER JOIN returns only rows where a match exists in both tables — unmatched rows from either table are excluded. LEFT JOIN returns every row from the left table, plus matching rows from the right table; where no match exists, right-side columns are filled with NULL. Use INNER JOIN when you only want matched data; use LEFT JOIN when you need all left-table rows regardless of whether a match exists.

Q: What is FULL OUTER JOIN?

FULL OUTER JOIN returns all rows from both tables. Where a match exists, the row is combined normally. Where no match exists on either side, the missing columns are filled with NULL. It is useful for finding all records across two tables including unmatched rows on both sides — such as a reconciliation report showing customers with no orders AND orders with no associated customer.

SQLab Hub Team·2026-05-14·24 min read

SQL JOINs are the mechanism that makes relational databases relational. Every meaningful real-world database splits data across multiple tables — customers in one table, their orders in another, products in a third — and JOINs are the tool that brings that data back together into a single, usable result. If you cannot write confident JOIN queries, you cannot do meaningful work with relational databases. This guide is the complete reference for SQL JOINs. It covers every JOIN type — INNER JOIN, LEFT JOIN, RIGHT JOIN, FULL OUTER JOIN, SELF JOIN, and CROSS JOIN — from the first-principles explanation a beginner needs to the execution-plan details an engineer uses when optimizing a slow query. You will find 20+ real-world examples drawn from e-commerce, HR, analytics, and SaaS scenarios; a deep comparison of INNER JOIN vs LEFT JOIN; performance optimization techniques; the most common JOIN mistakes and how to fix them; 15 SQL JOIN interview questions with answers; and practice exercises at three difficulty levels. Every example uses the SQLab Hub schema — users, orders, order_items, products, employees — so you can run every query immediately in the free SQL playground without any setup.

Practice these queries in SQLab Hub →

Real SQLite engine · 8 tables · No signup needed

Open Playground

What Are SQL JOINs?

The core concept

A SQL JOIN combines rows from two or more tables into a single result set, based on a related column between them. The matching condition is specified in the ON clause — most commonly a foreign key in one table that references the primary key in another.

Without JOINs, relational data is trapped in separate tables. A users table tells you who a customer is. An orders table tells you what they bought. A JOIN connects those two tables so one query can answer: "Who bought what, and how much did they spend?"

SQL JOINs are a controlled, conditioned merge. The database compares every relevant row from table A against every relevant row from table B, returns the rows where the condition matches, and discards — or preserves, depending on JOIN type — the rows where it does not.

How relational databases use foreign keys

Relational databases store each piece of information in exactly one place. Customer details go in the users table. Order details go in the orders table. A thin link — the foreign key — connects them:

  users table:   id (primary key), first_name, email, city
  orders table:  id (primary key), user_id (foreign key → users.id), total, status

The user_id column in orders references the id column in users. A JOIN uses this link to reconstruct the combined picture:

SELECT u.first_name, o.total, o.status
FROM users u
INNER JOIN orders o ON o.user_id = u.id;

SQL compares every o.user_id against every u.id. Where they match, it merges those rows into one output row.

The six SQL JOIN types at a glance

JOIN Type         | What It Returns
------------------|----------------------------------------------------------
INNER JOIN        | Only rows with a match in BOTH tables
LEFT JOIN         | All rows from left table + matching rows from right (NULLs where no match)
RIGHT JOIN        | All rows from right table + matching rows from left (NULLs where no match)
FULL OUTER JOIN   | All rows from both tables (NULLs on either side where no match)
CROSS JOIN        | Every row from left × every row from right (Cartesian product)
SELF JOIN         | A table joined to itself using two different aliases

SQL JOIN Syntax

INNER JOIN syntax

-- INNER JOIN (also written as just JOIN — the INNER keyword is optional)
SELECT u.first_name, o.total, o.status
FROM users u
INNER JOIN orders o ON o.user_id = u.id
ORDER BY o.total DESC;

LEFT JOIN syntax

-- LEFT JOIN (also: LEFT OUTER JOIN — OUTER keyword is optional)
SELECT u.first_name, COUNT(o.id) AS order_count
FROM users u
LEFT JOIN orders o ON o.user_id = u.id
GROUP BY u.id, u.first_name
ORDER BY order_count DESC;

RIGHT JOIN syntax

-- RIGHT JOIN (also: RIGHT OUTER JOIN)
-- Returns all rows from orders (right), matching rows from users (left)
SELECT o.id AS order_id, o.total, u.first_name
FROM users u
RIGHT JOIN orders o ON o.user_id = u.id;

FULL OUTER JOIN syntax

-- FULL OUTER JOIN (also: FULL JOIN)
-- Returns all rows from both tables, NULLs on the unmatched side
SELECT u.first_name, o.id AS order_id, o.total
FROM users u
FULL OUTER JOIN orders o ON o.user_id = u.id;

-- Note: MySQL does not support FULL OUTER JOIN — use a UNION workaround (covered below)

SELF JOIN syntax

-- SELF JOIN — alias the same table twice
SELECT e.first_name AS employee, m.first_name AS manager
FROM employees e
LEFT JOIN employees m ON m.id = e.manager_id
ORDER BY e.department, e.first_name;

CROSS JOIN syntax

-- CROSS JOIN — Cartesian product, no ON clause
-- Every row from sizes × every row from colors
SELECT s.size, c.color
FROM sizes s
CROSS JOIN colors c
ORDER BY s.size, c.color;
-- Result: 4 sizes × 3 colors = 12 rows

INNER JOIN

What is INNER JOIN?

INNER JOIN returns only the rows where the ON condition is satisfied in both tables. If a row in the left table has no matching row in the right table, that left-table row is excluded from the result. The same applies to right-table rows with no match.

INNER JOIN is the most common JOIN type and the default when you write just JOIN without specifying a type.

How INNER JOIN works — visual

Table A (users):              Table B (orders):
  id | name                       id | user_id | total
  1  | Alice                       1  |    1    | 120.00
  2  | Bob                         2  |    1    |  45.00
  3  | Carol  ← no orders          3  |    2    |  89.00
                                   4  |    5    |  60.00  ← user 5 missing

INNER JOIN result — only rows with matches on both sides:
  name  | total
  Alice | 120.00
  Alice |  45.00
  Bob   |  89.00

Carol is excluded (no orders). Order for user_id=5 is excluded (no user).

Example: customers with their orders

SELECT u.first_name, u.email,
       o.id AS order_id, o.total, o.status
FROM users u
INNER JOIN orders o ON o.user_id = u.id
ORDER BY o.total DESC
LIMIT 10;

-- Only customers who have placed at least one order appear.
-- Customers with zero orders are silently excluded.

Example: order items with product details (3-table INNER JOIN)

SELECT o.id AS order_id, p.name AS product,
       oi.quantity, oi.unit_price,
       ROUND(oi.quantity * oi.unit_price, 2) AS line_total
FROM order_items oi
INNER JOIN orders o   ON o.id = oi.order_id
INNER JOIN products p ON p.id = oi.product_id
WHERE o.status = 'completed'
ORDER BY line_total DESC;

-- All three ON conditions must be satisfied.
-- Any order_item with a missing product or order is excluded.

Business use case

INNER JOIN is used in sales reporting when you only want records with complete data on both sides. Revenue dashboards, fulfillment reports, and purchase histories all use INNER JOIN because orphaned or incomplete records would distort the numbers.

LEFT JOIN

What is LEFT JOIN?

LEFT JOIN (also called LEFT OUTER JOIN) returns every row from the left table, plus the matching rows from the right table. Where no match exists in the right table, the right-side columns are filled with NULL.

The key insight: LEFT JOIN never removes rows from the left table. It is the JOIN to use when you want a complete list of one entity, with optional related data from another.

How LEFT JOIN works — visual

Table A (users):              Table B (orders):
  id | name                       id | user_id | total
  1  | Alice                       1  |    1    | 120.00
  2  | Bob                         2  |    1    |  45.00
  3  | Carol  ← no orders          3  |    2    |  89.00

LEFT JOIN result — ALL users, with order data where available:
  name  | order_id | total
  Alice |    1     | 120.00
  Alice |    2     |  45.00
  Bob   |    3     |  89.00
  Carol |   NULL   |  NULL   ← Carol preserved with NULLs

Example: all users with order count (including users with zero orders)

SELECT u.first_name, u.email,
       COUNT(o.id)                        AS order_count,
       COALESCE(ROUND(SUM(o.total), 2), 0) AS lifetime_value
FROM users u
LEFT JOIN orders o ON o.user_id = u.id
GROUP BY u.id, u.first_name, u.email
ORDER BY lifetime_value DESC;

-- COUNT(o.id) skips NULLs — gives 0 for users with no orders.
-- COALESCE converts NULL SUM to 0.00 for users with no orders.

Example: find users who have NEVER placed an order

SELECT u.id, u.first_name, u.email, u.created_at
FROM users u
LEFT JOIN orders o ON o.user_id = u.id
WHERE o.id IS NULL
ORDER BY u.created_at DESC;

-- LEFT JOIN + WHERE right_table.id IS NULL is the standard anti-join pattern.
-- Finds rows in the left table with no match in the right table.
-- Use cases: inactive users, unassigned items, orphaned records.

Business use case

LEFT JOIN is the most commonly used JOIN in analytics and reporting. Use cases include: all products including those never ordered (inventory analysis); all employees including those in no department (org chart audit); all campaigns including those with zero clicks (marketing performance); all customers including those who never converted.

RIGHT JOIN

What is RIGHT JOIN?

RIGHT JOIN (also RIGHT OUTER JOIN) is the mirror image of LEFT JOIN. It returns every row from the right table, plus matching rows from the left table. Where no match exists in the left table, the left-side columns are NULL.

In practice, RIGHT JOIN is rarely used. Any RIGHT JOIN can be rewritten as a LEFT JOIN by swapping the table order — which is almost always clearer.

Example: orphaned orders data integrity check

SELECT o.id AS order_id, o.total, o.status,
       u.first_name, u.email
FROM users u
RIGHT JOIN orders o ON o.user_id = u.id
WHERE u.id IS NULL;

-- Returns orders with no matching user record.
-- Useful for data quality audits where referential integrity is not enforced.
-- Equivalent as a LEFT JOIN:
-- FROM orders o LEFT JOIN users u ON u.id = o.user_id WHERE u.id IS NULL

Why RIGHT JOIN is rarely used in practice

Most SQL developers avoid RIGHT JOIN because it reduces readability. The natural mental model is "start with this table and look up related data." LEFT JOIN matches that model. Any RIGHT JOIN can be rewritten as a LEFT JOIN with the table order swapped — which is cleaner and more maintainable in every situation.

FULL OUTER JOIN

What is FULL OUTER JOIN?

FULL OUTER JOIN returns all rows from both tables. Where a match exists, the row is combined. Where no match exists on either side, NULLs fill the missing columns.

Think of it as LEFT JOIN + RIGHT JOIN combined: you get the matched rows, the unmatched rows from the left table (NULLs on the right), and the unmatched rows from the right table (NULLs on the left).

Result structure of FULL OUTER JOIN

  left_col | right_col
  ---------|-----------
  Alice    | order_101   ← matched row
  Bob      | order_102   ← matched row
  Carol    | NULL        ← left-only row (no matching order)
  NULL     | order_103   ← right-only row (orphaned order)

FULL OUTER JOIN surfaces three categories:
  1. Rows that matched on both sides
  2. Rows only in the left table  (right side = NULL)
  3. Rows only in the right table (left side = NULL)

Example: reconcile users and orders (find all gaps)

SELECT u.first_name, u.email,
       o.id AS order_id, o.total
FROM users u
FULL OUTER JOIN orders o ON o.user_id = u.id
ORDER BY u.id;

-- Users with no orders: o.id IS NULL
-- Orphaned orders with no user: u.id IS NULL
-- PostgreSQL and SQL Server support FULL OUTER JOIN natively.
-- MySQL does NOT — use the UNION workaround below.

MySQL workaround for FULL OUTER JOIN

-- MySQL does not support FULL OUTER JOIN — emulate with UNION:
SELECT u.first_name, o.id AS order_id, o.total
FROM users u
LEFT JOIN orders o ON o.user_id = u.id

UNION

SELECT u.first_name, o.id AS order_id, o.total
FROM users u
RIGHT JOIN orders o ON o.user_id = u.id
WHERE u.id IS NULL;

-- LEFT JOIN covers all left-side rows.
-- RIGHT JOIN + IS NULL adds the orphaned right-side rows.
-- UNION deduplicates the combined result.

Business use case

FULL OUTER JOIN is primarily used in data reconciliation: comparing two tables that should be in sync — customer records from two systems, bank transactions vs internal ledger entries, or product catalog vs inventory system. It immediately surfaces rows that exist in one source but not the other.

SELF JOIN in SQL

What is a SELF JOIN?

A SELF JOIN joins a table to itself. SQL requires you to alias the table twice so the query can distinguish between the two "sides" of the join. SELF JOIN is the standard technique for hierarchical data stored in a flat table — most commonly employee-manager relationships.

Example: employee and manager names from one table

-- employees: id, first_name, department, salary, manager_id
-- manager_id references the id of another row in the same table

SELECT e.first_name AS employee,
       m.first_name AS manager,
       e.department
FROM employees e
LEFT JOIN employees m ON m.id = e.manager_id
ORDER BY e.department, e.first_name;

-- Employees with no manager (top-level executives) have NULL in the manager column.
-- LEFT JOIN ensures they still appear — INNER JOIN would exclude them.

Example: find employees earning more than their manager

SELECT e.first_name AS employee, e.salary,
       m.first_name AS manager, m.salary AS manager_salary,
       ROUND(e.salary - m.salary, 2) AS salary_advantage
FROM employees e
INNER JOIN employees m ON m.id = e.manager_id
WHERE e.salary > m.salary
ORDER BY salary_advantage DESC;

-- Classic interview question — tests understanding of self-joins.

Example: find pairs of employees in the same department

SELECT a.first_name AS employee_1,
       b.first_name AS employee_2,
       a.department
FROM employees a
INNER JOIN employees b
  ON a.department = b.department
  AND a.id < b.id
ORDER BY a.department, a.first_name;

-- a.id < b.id prevents duplicate pairs (Alice+Bob and Bob+Alice)
-- and prevents self-pairs (Alice+Alice).

CROSS JOIN Explained

What is CROSS JOIN?

CROSS JOIN produces the Cartesian product of two tables — every row from the left table is paired with every row from the right table. No ON clause is used. If table A has 100 rows and table B has 50 rows, CROSS JOIN produces 5,000 output rows.

CROSS JOIN has legitimate use cases (generating combinations, seeding test data, creating time-series grids) but is dangerous when used accidentally — a missing ON clause on a regular JOIN creates an implicit CROSS JOIN.

Example: generate all product variants (size × color)

-- sizes: S, M, L, XL   |   colors: Red, Blue, Green
SELECT s.size, c.color
FROM sizes s
CROSS JOIN colors c
ORDER BY s.size, c.color;

-- Result: 4 sizes × 3 colors = 12 rows
-- S-Red, S-Blue, S-Green, M-Red, M-Blue, M-Green, ...
-- Use case: generating every product variant for an e-commerce catalog.

Warning: accidental Cartesian products

The most dangerous SQL mistake is an unintentional CROSS JOIN — a JOIN with a missing ON clause. On large tables this produces hundreds of millions of rows, overwhelming the database.

Rule: every JOIN must have an ON clause. Write deliberate CROSS JOINs explicitly as CROSS JOIN — never rely on missing conditions.

Joining Multiple Tables

How to JOIN three or more tables

SQL allows any number of JOINs in a single query. Each JOIN adds one more table. The optimizer determines the most efficient execution order. For complex queries with many JOINs on large tables, verify the plan with EXPLAIN.

Example: 4-table JOIN for a detailed order line report

SELECT u.first_name    AS customer,
       o.id             AS order_id,
       o.status,
       p.name           AS product,
       oi.quantity,
       oi.unit_price,
       ROUND(oi.quantity * oi.unit_price, 2) AS line_total
FROM users u
JOIN orders o       ON o.user_id = u.id
JOIN order_items oi ON oi.order_id = o.id
JOIN products p     ON p.id = oi.product_id
WHERE o.status IN ('completed', 'shipped')
ORDER BY o.id, line_total DESC;

-- Each JOIN adds one table to the chain.
-- Aliases (u, o, oi, p) prevent ambiguous column names.

Example: employees with department and project data

SELECT e.first_name,
       d.name     AS department,
       p.name     AS project,
       ep.role    AS project_role
FROM employees e
JOIN departments d          ON d.id = e.department_id
LEFT JOIN employee_projects ep ON ep.employee_id = e.id
LEFT JOIN projects p            ON p.id = ep.project_id
ORDER BY d.name, e.first_name;

-- INNER JOIN to departments: every employee must have a department.
-- LEFT JOINs to projects: employees with no project still appear.

Best practices for multi-table JOINs

1. Always use table aliases — they reduce verbosity and prevent ambiguous column errors.
2. Give every JOIN an explicit ON condition — never use implicit comma-join syntax.
3. Choose JOIN type intentionally — LEFT vs INNER has a major impact on row counts.
4. Filter early — WHERE conditions reduce rows before JOIN expansion.
5. Run EXPLAIN on any query JOINing tables with more than 100,000 rows.

SQL JOIN Examples: 20+ Real-World Queries

Example 1: Top 10 customers by lifetime spend

SELECT u.first_name, u.email,
       COUNT(o.id)            AS total_orders,
       ROUND(SUM(o.total), 2) AS lifetime_value
FROM users u
INNER JOIN orders o ON o.user_id = u.id
GROUP BY u.id, u.first_name, u.email
ORDER BY lifetime_value DESC
LIMIT 10;

Example 2: Customers who have never ordered

SELECT u.id, u.first_name, u.email, u.created_at
FROM users u
LEFT JOIN orders o ON o.user_id = u.id
WHERE o.id IS NULL
ORDER BY u.created_at DESC;

-- Anti-join pattern: LEFT JOIN + WHERE right_table.id IS NULL

Example 3: Revenue by product category

SELECT p.category,
       COUNT(DISTINCT o.id)                       AS orders_with_category,
       SUM(oi.quantity)                            AS units_sold,
       ROUND(SUM(oi.quantity * oi.unit_price), 2)  AS revenue
FROM order_items oi
JOIN orders o   ON o.id = oi.order_id
JOIN products p ON p.id = oi.product_id
WHERE o.status = 'completed'
GROUP BY p.category
ORDER BY revenue DESC;

Example 4: Monthly revenue with unique customer count

SELECT SUBSTR(o.created_at, 1, 7)  AS month,
       COUNT(DISTINCT o.user_id)      AS unique_customers,
       COUNT(o.id)                    AS total_orders,
       ROUND(SUM(o.total), 2)         AS monthly_revenue
FROM orders o
JOIN users u ON u.id = o.user_id
WHERE o.status = 'completed'
GROUP BY SUBSTR(o.created_at, 1, 7)
ORDER BY month;

Example 5: Products that have never been ordered

SELECT p.id, p.name, p.category, p.price
FROM products p
LEFT JOIN order_items oi ON oi.product_id = p.id
WHERE oi.id IS NULL
ORDER BY p.category, p.name;

-- Dead inventory identification: products with no sales history.

Example 6: Average order value by customer city

SELECT u.city,
       COUNT(o.id)            AS order_count,
       ROUND(AVG(o.total), 2) AS avg_order_value,
       ROUND(SUM(o.total), 2) AS total_revenue
FROM users u
INNER JOIN orders o ON o.user_id = u.id
WHERE o.status = 'completed'
GROUP BY u.city
HAVING COUNT(o.id) >= 5
ORDER BY avg_order_value DESC;

Example 7: Employee salary vs department average

SELECT e.first_name,
       e.department,
       e.salary,
       d_avg.avg_salary,
       ROUND(e.salary - d_avg.avg_salary, 2) AS vs_avg
FROM employees e
JOIN (
  SELECT department, ROUND(AVG(salary), 2) AS avg_salary
  FROM employees
  GROUP BY department
) d_avg ON d_avg.department = e.department
ORDER BY vs_avg DESC;

-- Joins each employee to their department's average salary.

Example 8: Complete order receipt (4-table JOIN)

SELECT u.first_name   AS customer,
       u.email        AS email,
       o.id           AS order_id,
       o.created_at   AS order_date,
       p.name         AS product,
       oi.quantity,
       oi.unit_price,
       o.status
FROM orders o
JOIN users u        ON u.id = o.user_id
JOIN order_items oi ON oi.order_id = o.id
JOIN products p     ON p.id = oi.product_id
WHERE o.id = 42;

Example 9: Employees with no manager (top-level leadership)

SELECT e.id, e.first_name, e.department, e.salary
FROM employees e
LEFT JOIN employees m ON m.id = e.manager_id
WHERE m.id IS NULL
ORDER BY e.department;

-- Self-join + LEFT JOIN: employees whose manager_id matches no id in the table.

Example 10: Best-selling products with revenue

SELECT p.name,
       p.category,
       SUM(oi.quantity)                           AS units_sold,
       ROUND(SUM(oi.quantity * oi.unit_price), 2) AS total_revenue
FROM products p
JOIN order_items oi ON oi.product_id = p.id
JOIN orders o       ON o.id = oi.order_id
WHERE o.status = 'completed'
GROUP BY p.id, p.name, p.category
ORDER BY total_revenue DESC
LIMIT 20;

Example 11: Repeat customers (ordered more than once)

SELECT u.first_name, u.email,
       COUNT(o.id)         AS order_count,
       MIN(o.created_at)   AS first_order,
       MAX(o.created_at)   AS most_recent_order
FROM users u
JOIN orders o ON o.user_id = u.id
GROUP BY u.id, u.first_name, u.email
HAVING COUNT(o.id) > 1
ORDER BY order_count DESC;

Example 12: Department headcount and salary report

SELECT d.name              AS department,
       COUNT(e.id)          AS headcount,
       ROUND(AVG(e.salary)) AS avg_salary,
       MIN(e.salary)        AS min_salary,
       MAX(e.salary)        AS max_salary
FROM departments d
LEFT JOIN employees e ON e.department_id = d.id
GROUP BY d.id, d.name
ORDER BY headcount DESC;

-- LEFT JOIN: departments with zero employees still appear with 0 headcount.

Example 13: Customers who ordered Electronics

SELECT DISTINCT u.first_name, u.email, o.id AS order_id
FROM users u
JOIN orders o       ON o.user_id = u.id
JOIN order_items oi ON oi.order_id = o.id
JOIN products p     ON p.id = oi.product_id
WHERE p.category = 'Electronics'
  AND o.status = 'completed'
ORDER BY o.id DESC;

-- DISTINCT prevents one customer appearing multiple times if
-- they ordered multiple electronics items in the same order.

Example 14: Loyal customers — ordered in 3+ months

SELECT u.first_name, u.email,
       COUNT(DISTINCT SUBSTR(o.created_at, 1, 7)) AS active_months,
       ROUND(SUM(o.total), 2)                      AS lifetime_value
FROM users u
JOIN orders o ON o.user_id = u.id
WHERE o.status = 'completed'
GROUP BY u.id, u.first_name, u.email
HAVING COUNT(DISTINCT SUBSTR(o.created_at, 1, 7)) >= 3
ORDER BY active_months DESC;

Example 15: SaaS — users who logged in AND purchased this month

SELECT u.id, u.first_name, u.email,
       MAX(l.login_at) AS last_login,
       COUNT(o.id)     AS purchases_this_month
FROM users u
JOIN user_logins l ON l.user_id = u.id
JOIN orders o      ON o.user_id = u.id
WHERE l.login_at   >= DATE('now', '-30 days')
  AND o.created_at >= DATE('now', '-30 days')
  AND o.status = 'completed'
GROUP BY u.id, u.first_name, u.email
ORDER BY purchases_this_month DESC;

INNER JOIN vs LEFT JOIN: Key Differences

Comparison table

Feature                 | INNER JOIN                           | LEFT JOIN
------------------------|--------------------------------------|------------------------------------------
Rows returned           | Only rows matching in BOTH tables    | ALL rows from left + matched from right
Unmatched left rows     | Excluded                             | Included (right columns = NULL)
Unmatched right rows    | Excluded                             | Excluded
Typical result size     | Smaller — only intersecting rows     | Larger — full left table preserved
NULLs in result         | Only if source data has NULLs        | NULLs where right table has no match
When to use             | You need complete data on both sides | You need all left rows, match optional
Common use case         | Sales reports, fulfillment data      | User lists, inventory with zero sales
Performance             | Often slightly faster (smaller set)  | Marginally more work to preserve NULLs

The critical difference: row elimination

-- INNER JOIN: only customers who have placed at least one order
SELECT u.first_name, COUNT(o.id) AS order_count
FROM users u
INNER JOIN orders o ON o.user_id = u.id
GROUP BY u.id, u.first_name;
-- Result: 250 rows (only the 250 users who have orders)

-- LEFT JOIN: ALL customers, including those with zero orders
SELECT u.first_name, COUNT(o.id) AS order_count
FROM users u
LEFT JOIN orders o ON o.user_id = u.id
GROUP BY u.id, u.first_name;
-- Result: 500 rows (all 500 users, 0 for those with no orders)

-- The choice between INNER and LEFT JOIN changes the denominator of your report.

When to use INNER JOIN

Use INNER JOIN when:
• You only want records with complete data on both sides
• Missing related data means the row is not relevant to your query
• You are building a report where every row must be "complete"
• You want the smallest, cleanest result set

Examples: completed order reports, active subscription analytics, purchase history.

When to use LEFT JOIN

Use LEFT JOIN when:
• You need ALL rows from the left table, even with no related data
• You want to find records that have no related data (WHERE right.id IS NULL)
• You are computing metrics where zero is a valid and meaningful answer
• The absence of a related record is itself important information

Examples: user acquisition funnels, inventory analysis, churn identification, full customer lists.

LEFT JOIN vs RIGHT JOIN

The only real difference: which table is preserved

LEFT JOIN preserves all rows from the left (first-named) table.
RIGHT JOIN preserves all rows from the right (second-named) table.

Every RIGHT JOIN can be rewritten as a LEFT JOIN by swapping the table order — they are functionally equivalent. Convention in the SQL community is to always use LEFT JOIN because it reads more naturally: name the main table first, then look up optional related data.

Same query as LEFT JOIN and RIGHT JOIN

-- RIGHT JOIN (uncommon)
SELECT o.id, o.total, u.first_name
FROM users u
RIGHT JOIN orders o ON o.user_id = u.id;

-- Equivalent LEFT JOIN (preferred)
SELECT o.id, o.total, u.first_name
FROM orders o
LEFT JOIN users u ON u.id = o.user_id;

-- Both return all orders, with NULLs for orders with no matching user.
-- The LEFT JOIN version reads clearly: "start with orders, optionally add user."

Recommendation

Write all outer joins as LEFT JOINs. Name the main entity first, then LEFT JOIN the optional table. This makes query intent immediately clear and avoids directional confusion in complex multi-table queries.

SQL JOINs vs Subqueries

Comparison table

Dimension             | JOIN                                 | Subquery
----------------------|--------------------------------------|-------------------------------------------
Readability           | Better for multi-column lookups      | Better for single existence checks
Columns exposed       | Can SELECT from any joined table     | WHERE subquery cannot expose inner columns
Duplicate rows        | May need DISTINCT with 1:many joins  | EXISTS and IN naturally deduplicate
Performance (modern)  | Comparable — optimizers equalize     | Correlated subqueries can be slow
Filtering rows        | Works but may inflate row count      | EXISTS and IN are clean for filtering
Column values         | Required for multi-table SELECT      | Scalar subquery works for single values
Debugging             | Easy to inspect intermediate tables  | Harder to isolate inner query behavior

When to prefer JOINs

-- Use JOIN when you need columns from both tables in the SELECT:
SELECT u.first_name, u.email, o.total, o.status
FROM users u
JOIN orders o ON o.user_id = u.id;
-- Cannot be done with a WHERE subquery —
-- subquery cannot expose o.total to the outer SELECT.

-- Use JOIN for aggregations spanning multiple tables:
SELECT u.city, ROUND(SUM(o.total), 2) AS revenue
FROM users u
JOIN orders o ON o.user_id = u.id
GROUP BY u.city;

When to prefer subqueries

-- Use EXISTS for clean existence checks (short-circuits at first match):
SELECT u.first_name, u.email
FROM users u
WHERE EXISTS (
  SELECT 1 FROM orders o
  WHERE o.user_id = u.id AND o.total > 500
);

-- Use IN for filtering against a set:
SELECT name FROM products
WHERE category_id IN (
  SELECT id FROM categories WHERE active = 1
);

-- These can also be written as JOINs, but the subquery versions
-- more clearly communicate intent: "where a qualifying record exists."

Performance in modern SQL engines

PostgreSQL, MySQL 8+, and SQL Server automatically rewrite many subqueries into equivalent JOIN plans. The performance difference is often negligible for non-correlated subqueries. The exception is correlated subqueries on large tables — these execute once per outer row and can be orders of magnitude slower than an equivalent JOIN. Always verify with EXPLAIN.

SQL JOIN Performance Optimization

Rule 1: Index your JOIN columns

-- Without an index on orders.user_id, every JOIN requires a full table scan.
-- For a table with 1 million orders, this reads 1 million rows per query.

-- Add indexes on foreign key columns (the JOIN columns in child tables):
CREATE INDEX idx_orders_user_id       ON orders (user_id);
CREATE INDEX idx_order_items_order_id ON order_items (order_id);
CREATE INDEX idx_order_items_prod_id  ON order_items (product_id);

-- Index scans vs full table scans: can reduce query time from seconds to milliseconds.

Rule 2: Filter before joining

-- Modern optimizers push WHERE conditions into the JOIN automatically,
-- but making the intent explicit helps both the optimizer and the reader.

-- For very large tables, pre-filter in a CTE or subquery:
WITH recent_completed AS (
  SELECT id, user_id, total
  FROM orders
  WHERE status = 'completed'
    AND created_at >= '2026-01-01'
)
SELECT u.first_name, rc.total
FROM users u
JOIN recent_completed rc ON rc.user_id = u.id;

-- Add a composite index to support both the filter and the join:
CREATE INDEX idx_orders_status_date ON orders (status, created_at);

Rule 3: Use EXPLAIN to inspect the query plan

-- PostgreSQL / MySQL
EXPLAIN SELECT u.first_name, o.total
FROM users u
JOIN orders o ON o.user_id = u.id
WHERE o.status = 'completed';

-- What to look for:
--   Seq Scan / ALL type  → full table scan → add an index
--   Index Scan / ref     → index in use (good)
--   Nested Loop with high row estimates → may need better indexes

-- PostgreSQL: EXPLAIN ANALYZE runs the query and shows actual row counts:
EXPLAIN ANALYZE SELECT u.first_name, o.total
FROM users u JOIN orders o ON o.user_id = u.id
WHERE o.status = 'completed';

Rule 4: Use composite indexes for multi-column filters

-- If queries frequently filter on both user_id AND status together:
CREATE INDEX idx_orders_user_status ON orders (user_id, status);

-- This composite index serves:
-- 1. JOIN ON orders.user_id = users.id   (uses user_id)
-- 2. WHERE orders.status = 'completed'   (adds status to the same index scan)

-- Column order matters: put the equality condition column first.

Rule 5: Avoid SELECT * in joined queries

-- AVOID: fetches every column from every joined table
SELECT * FROM users u JOIN orders o ON o.user_id = u.id;
-- 20 user columns + 15 order columns = 35 columns per row transferred,
-- even if you only need 3.

-- BETTER: select only what you need
SELECT u.first_name, u.email, o.total, o.status
FROM users u
JOIN orders o ON o.user_id = u.id;
-- Reduces network transfer, memory, and enables index-only scans.

Rule 6: Beware of row multiplication in 1:many joins

-- If a user has 10 orders, they appear 10 times after JOIN.
-- This is correct with GROUP BY — dangerous without it.

-- WRONG: joining to two 1:many tables without aggregating first
-- 10 orders × 5 addresses = 50 rows per user (totals are wrong)
SELECT u.first_name, SUM(o.total), COUNT(a.id)
FROM users u
JOIN orders o    ON o.user_id = u.id
JOIN addresses a ON a.user_id = u.id
GROUP BY u.id, u.first_name;  -- SUM(o.total) is inflated 5×

-- CORRECT: aggregate each 1:many relationship in a subquery first
SELECT u.first_name, ords.total_spend, addr.address_count
FROM users u
LEFT JOIN (SELECT user_id, SUM(total) AS total_spend FROM orders GROUP BY user_id) ords ON ords.user_id = u.id
LEFT JOIN (SELECT user_id, COUNT(*) AS address_count FROM addresses GROUP BY user_id) addr ON addr.user_id = u.id;

Common SQL JOIN Mistakes

Mistake 1: Missing ON clause (accidental Cartesian product)

-- WRONG ❌ — old comma-join syntax with no condition
SELECT u.first_name, o.total
FROM users u, orders o;

-- WRONG ❌ — modern syntax but missing ON clause
SELECT u.first_name, o.total
FROM users u JOIN orders o;

-- CORRECT ✅
SELECT u.first_name, o.total
FROM users u
JOIN orders o ON o.user_id = u.id;

-- Impact: 500 users × 10,000 orders = 5,000,000 rows with no condition.

Mistake 2: INNER JOIN when LEFT JOIN is needed (silent data loss)

-- Counting orders per user, including those with 0 orders:
-- WRONG ❌ — silently excludes users with no orders
SELECT u.first_name, COUNT(o.id) AS order_count
FROM users u
INNER JOIN orders o ON o.user_id = u.id
GROUP BY u.id, u.first_name;

-- CORRECT ✅
SELECT u.first_name, COUNT(o.id) AS order_count
FROM users u
LEFT JOIN orders o ON o.user_id = u.id
GROUP BY u.id, u.first_name;
-- COUNT(o.id) = 0 for users with no orders.

Mistake 3: COUNT(*) vs COUNT(column) after LEFT JOIN

-- WRONG ❌ — COUNT(*) counts the NULL row, giving 1 instead of 0
SELECT u.first_name, COUNT(*) AS order_count
FROM users u
LEFT JOIN orders o ON o.user_id = u.id
GROUP BY u.id, u.first_name;

-- CORRECT ✅ — COUNT(o.id) skips NULLs
SELECT u.first_name, COUNT(o.id) AS order_count
FROM users u
LEFT JOIN orders o ON o.user_id = u.id
GROUP BY u.id, u.first_name;

Mistake 4: Ambiguous column names

-- WRONG ❌ — 'id' exists in both tables; throws ambiguity error
SELECT id, first_name, total
FROM users JOIN orders ON orders.user_id = users.id;

-- CORRECT ✅ — qualify every column with its table alias
SELECT u.id AS user_id, u.first_name,
       o.id AS order_id, o.total
FROM users u
JOIN orders o ON o.user_id = u.id;

Mistake 5: Filtering LEFT JOIN with WHERE instead of ON

-- SUBTLE BUG ❌ — WHERE on right-table column turns LEFT JOIN into INNER JOIN
SELECT u.first_name, o.total
FROM users u
LEFT JOIN orders o ON o.user_id = u.id
WHERE o.status = 'completed';  -- excludes users with no completed orders

-- CORRECT ✅ — move the filter to the ON clause to preserve left-side rows
SELECT u.first_name, o.total
FROM users u
LEFT JOIN orders o ON o.user_id = u.id AND o.status = 'completed';
-- Users with no completed orders still appear; o.total = NULL for them.

Mistake 6: Row multiplication from two 1:many joins

-- WRONG ❌ — joining users to orders AND addresses simultaneously
-- multiplies rows: 3 orders × 2 addresses = 6 rows per user
SELECT u.first_name, SUM(o.total) AS spend, COUNT(a.id) AS addresses
FROM users u
JOIN orders o    ON o.user_id = u.id
JOIN addresses a ON a.user_id = u.id
GROUP BY u.id, u.first_name;
-- SUM(o.total) is inflated by the address join.

-- CORRECT ✅ — pre-aggregate each relationship independently
SELECT u.first_name, ords.spend, addr.addresses
FROM users u
LEFT JOIN (SELECT user_id, SUM(total) AS spend FROM orders GROUP BY user_id) ords
  ON ords.user_id = u.id
LEFT JOIN (SELECT user_id, COUNT(*) AS addresses FROM addresses GROUP BY user_id) addr
  ON addr.user_id = u.id;

Real-World SQL JOIN Use Cases

E-commerce: Customer lifetime value segmentation

SELECT u.first_name, u.email,
       COUNT(o.id)            AS total_orders,
       ROUND(SUM(o.total), 2) AS lifetime_value,
       CASE
         WHEN SUM(o.total) >= 1000 THEN 'VIP'
         WHEN SUM(o.total) >= 500  THEN 'High Value'
         WHEN SUM(o.total) >= 100  THEN 'Regular'
         ELSE 'Low Value'
       END AS segment
FROM users u
LEFT JOIN orders o ON o.user_id = u.id AND o.status = 'completed'
GROUP BY u.id, u.first_name, u.email
ORDER BY lifetime_value DESC NULLS LAST;

Finance: Payment reconciliation

-- Find orders where payment does not match order total
SELECT o.id AS order_id, o.total AS expected,
       p.amount AS received,
       ROUND(o.total - COALESCE(p.amount, 0), 2) AS discrepancy
FROM orders o
LEFT JOIN payments p ON p.order_id = o.id
WHERE o.status = 'completed'
  AND ABS(o.total - COALESCE(p.amount, 0)) > 0.01
ORDER BY discrepancy DESC;

HR systems: Org chart with direct report count

SELECT e.first_name    AS employee,
       e.salary,
       e.department,
       m.first_name    AS reports_to,
       COUNT(dr.id)    AS direct_reports
FROM employees e
LEFT JOIN employees m  ON m.id = e.manager_id
LEFT JOIN employees dr ON dr.manager_id = e.id
GROUP BY e.id, e.first_name, e.salary, e.department, m.first_name
ORDER BY direct_reports DESC, e.department;

SaaS: Feature adoption by plan tier

SELECT u.first_name, u.email, u.plan,
       COUNT(DISTINCT fa.feature_name) AS features_enabled
FROM users u
LEFT JOIN feature_activations fa
  ON fa.user_id = u.id
  AND fa.activated_at >= DATE('now', '-30 days')
WHERE u.plan = 'paid'
GROUP BY u.id, u.first_name, u.email, u.plan
ORDER BY features_enabled DESC;

Analytics: Weekly revenue trend with user acquisition

SELECT STRFTIME('%Y-W%W', o.created_at) AS week,
       COUNT(DISTINCT o.user_id)            AS buying_users,
       COUNT(o.id)                          AS orders,
       ROUND(SUM(o.total), 2)               AS revenue,
       ROUND(SUM(o.total) / COUNT(o.id), 2) AS avg_order_value
FROM orders o
JOIN users u ON u.id = o.user_id
WHERE o.status = 'completed'
  AND o.created_at >= DATE('now', '-90 days')
GROUP BY STRFTIME('%Y-W%W', o.created_at)
ORDER BY week;

SQL JOIN Interview Questions & Answers

Q1: What is the difference between INNER JOIN and OUTER JOIN?

INNER JOIN returns only rows where the join condition is met in both tables. OUTER JOIN (LEFT, RIGHT, or FULL) preserves rows from one or both tables even when no match exists, filling missing columns with NULL. The critical distinction: INNER JOIN can silently drop rows, while LEFT JOIN guarantees all left-table rows appear in the output.

Q2: Write a query to find customers with no orders.

-- Method 1: LEFT JOIN + IS NULL (standard anti-join pattern)
SELECT u.id, u.first_name, u.email
FROM users u
LEFT JOIN orders o ON o.user_id = u.id
WHERE o.id IS NULL;

-- Method 2: NOT EXISTS (often more explicit and readable)
SELECT u.id, u.first_name
FROM users u
WHERE NOT EXISTS (
  SELECT 1 FROM orders o WHERE o.user_id = u.id
);

Q3: What happens when you JOIN without an ON clause?

You get a Cartesian product — every row from table A paired with every row from table B. On large tables this produces millions of rows and can crash or lock the database. Some databases throw an error; others silently return the Cartesian product. Always verify every JOIN has an explicit ON condition.

Q4: Can a JOIN condition use inequality instead of equality?

-- Yes. Non-equijoins use comparison operators other than =:
SELECT e.first_name, e.salary, g.grade
FROM employees e
JOIN salary_grades g
  ON e.salary BETWEEN g.min_salary AND g.max_salary;

-- Also valid: >, <, >=, <=, LIKE.
-- Non-equijoins often generate more rows — verify the result count makes sense.

Q5: What is the difference between UNION and JOIN?

JOIN combines tables horizontally — it adds columns by matching rows across tables. UNION combines tables vertically — it stacks rows from two queries that have the same column structure. JOIN is for relating different entities. UNION is for combining the same type of entity from different sources (e.g., active + archived customers).

Q6: How would you optimize a slow JOIN query?

1. Run EXPLAIN — identify full table scans (Seq Scan / ALL)
2. Add indexes on JOIN columns (foreign keys especially)
3. Add indexes on WHERE filter columns
4. Ensure WHERE filters reduce rows before the JOIN expands them
5. Consider composite indexes for multi-column JOIN + filter combinations
6. Check for row multiplication from 1:many JOINs
7. Update table statistics (ANALYZE in PostgreSQL)
8. Rewrite correlated subqueries as JOINs if the optimizer is not doing it automatically

Q7: What is the difference between NATURAL JOIN and INNER JOIN?

NATURAL JOIN automatically joins tables on all columns with the same name — no ON clause needed. INNER JOIN requires an explicit ON condition. NATURAL JOIN is dangerous: if a new column is added to either table with a matching name, the join condition silently changes. Always use explicit INNER JOIN with a named ON condition in production code.

Q8: Write a query to find the second-highest salary using a self-join.

SELECT MAX(salary) AS second_highest
FROM employees
WHERE salary < (SELECT MAX(salary) FROM employees);

-- Alternatively using self-join:
SELECT DISTINCT e1.salary
FROM employees e1
JOIN employees e2 ON e2.salary > e1.salary
GROUP BY e1.salary
HAVING COUNT(DISTINCT e2.salary) = 1;

Q9: What is a hash join and when does the optimizer use it?

A hash join builds a hash table from the smaller table, then probes it with rows from the larger table. It is efficient for large unsorted datasets but requires memory. For small indexed tables, the optimizer prefers nested loop joins. You can see which algorithm is chosen with EXPLAIN.

Q10: Can you JOIN on multiple conditions?

-- Yes. Multiple conditions are combined with AND in the ON clause:
SELECT o.id, o.total, d.region
FROM orders o
JOIN distribution_zones d
  ON d.country = o.shipping_country
  AND d.warehouse = o.warehouse_id
  AND d.active = 1;

-- All three conditions must be satisfied for rows to match.
-- Multi-column JOINs often benefit from composite indexes.

Q11: How does NULL behave in JOIN conditions?

NULL does not equal NULL in SQL — NULL = NULL evaluates to NULL (not TRUE). This means rows with NULL in the JOIN column will never match, even if both sides have NULL. This is intentional: NULLs represent unknown values, and two unknowns are not considered equal. Rows with NULL in the JOIN column are silently excluded from INNER JOINs.

Q12: What is the difference between ON and USING in a JOIN?

-- ON: full flexibility, any condition
SELECT u.first_name, o.total
FROM users u JOIN orders o ON o.user_id = u.id;

-- USING: shorthand when the join column has the same name in both tables:
SELECT first_name, total
FROM users JOIN orders USING (id);
-- Note: USING removes the duplicate column from the result; ON keeps both.
-- ON is preferred for clarity and portability across all SQL databases.

Q13: When would you use FULL OUTER JOIN in production?

FULL OUTER JOIN is primarily used for data reconciliation — comparing two tables that should be in sync to find discrepancies on either side. Real examples: comparing orders in a payment gateway vs your application database; comparing employee records between HR and payroll systems; identifying new, deleted, or modified records between ETL source and target.

Q14: What is a lateral join / CROSS APPLY?

LATERAL (PostgreSQL) / CROSS APPLY (SQL Server) allows a subquery in the FROM clause to reference columns from preceding tables in the same FROM. It is commonly used to get top-N rows per group:

SELECT u.first_name, top3.total
FROM users u
CROSS JOIN LATERAL (
  SELECT total FROM orders
  WHERE user_id = u.id
  ORDER BY total DESC LIMIT 3
) top3;

This retrieves the top 3 orders per user efficiently without a window function.

Q15: What is the difference between a nested loop join, hash join, and merge join?

Nested Loop Join: for each row in table A, scan table B for matches. Best for small tables or indexed lookups on table B.

Hash Join: build a hash table from the smaller table, probe with rows from the larger table. Best for large unsorted tables with no useful indexes.

Merge Join (Sort-Merge): sort both tables by the join key, then step through in parallel. Best when both sides are already sorted (indexed). The query optimizer chooses automatically based on table size, indexes, and statistics.

Frequently Asked Questions

Which SQL JOIN type is the most commonly used?

INNER JOIN and LEFT JOIN together cover the vast majority of real-world queries. INNER JOIN is used when you need matched data from both tables. LEFT JOIN is used when you need all rows from the primary entity, with optional related data. RIGHT JOIN, FULL OUTER JOIN, and CROSS JOIN are used far less frequently.

Is LEFT JOIN the same as LEFT OUTER JOIN?

Yes — identical. The OUTER keyword is optional. LEFT JOIN, LEFT OUTER JOIN, RIGHT JOIN, RIGHT OUTER JOIN, FULL JOIN, and FULL OUTER JOIN are all valid. Most developers omit OUTER for brevity.

What is the difference between filtering in WHERE vs ON for a LEFT JOIN?

-- ON condition: filters which right-table rows are matched
-- but preserves all left-table rows (NULLs for non-matching right rows):
SELECT u.first_name, o.total
FROM users u
LEFT JOIN orders o ON o.user_id = u.id AND o.status = 'completed';
-- All users appear; only their completed orders are shown.

-- WHERE condition: filters AFTER the join — turns LEFT JOIN into INNER JOIN behavior:
SELECT u.first_name, o.total
FROM users u
LEFT JOIN orders o ON o.user_id = u.id
WHERE o.status = 'completed';
-- Users with no completed orders are EXCLUDED (WHERE eliminates NULL rows).
-- This is one of the most common subtle bugs in SQL.

Can I use JOINs in UPDATE and DELETE statements?

-- MySQL / SQL Server UPDATE with JOIN:
UPDATE orders o
JOIN users u ON u.id = o.user_id
SET o.discount = 0.10
WHERE u.segment = 'VIP';

-- PostgreSQL UPDATE with FROM:
UPDATE orders
SET discount = 0.10
FROM users
WHERE users.id = orders.user_id AND users.segment = 'VIP';

-- DELETE with JOIN (MySQL):
DELETE o FROM orders o
JOIN users u ON u.id = o.user_id
WHERE u.status = 'banned';

How do I join on a date range instead of an exact value?

-- Non-equijoin with BETWEEN or comparison operators:
SELECT e.first_name, e.salary, sg.grade_name
FROM employees e
JOIN salary_grades sg
  ON e.salary BETWEEN sg.min_salary AND sg.max_salary;

-- Or with explicit operators:
SELECT o.id, o.created_at, p.name AS promo_name
FROM orders o
JOIN promotions p
  ON o.created_at >= p.start_date
  AND o.created_at <= p.end_date;

Can two JOINs conflict with each other in the same query?

JOINs do not conflict, but they can interact in ways that cause unexpected row counts. The most common issue is row multiplication: joining a table on two separate 1:many relationships produces a Cartesian product within each group. The solution is to aggregate one or both relationships into a subquery before joining the aggregated results together.

Practice Exercises

Beginner: List all orders with customer name and email

-- Write a query to show every order with the customer's first name and email.
-- Columns: customer_name, email, order_id, total, status

SELECT u.first_name AS customer_name, u.email,
       o.id AS order_id, o.total, o.status
FROM orders o
JOIN users u ON u.id = o.user_id
ORDER BY o.id;

Beginner: Show all users and their order count (include users with 0 orders)

-- Show every user and how many orders they have placed.
-- Include users with 0 orders. Sort by order_count descending.

SELECT u.first_name, u.email,
       COUNT(o.id) AS order_count
FROM users u
LEFT JOIN orders o ON o.user_id = u.id
GROUP BY u.id, u.first_name, u.email
ORDER BY order_count DESC;

Intermediate: Revenue per product category (completed orders only)

-- Find total revenue per product category from completed orders.
-- Columns: category, units_sold, total_revenue
-- Sort by total_revenue descending.

SELECT p.category,
       SUM(oi.quantity)                           AS units_sold,
       ROUND(SUM(oi.quantity * oi.unit_price), 2) AS total_revenue
FROM products p
JOIN order_items oi ON oi.product_id = p.id
JOIN orders o       ON o.id = oi.order_id
WHERE o.status = 'completed'
GROUP BY p.category
ORDER BY total_revenue DESC;

Intermediate: Find products that have never been ordered

-- Which products have no entries in order_items?
-- Return product name, category, and price.

SELECT p.name, p.category, p.price
FROM products p
LEFT JOIN order_items oi ON oi.product_id = p.id
WHERE oi.id IS NULL
ORDER BY p.category, p.name;

Advanced: Employees who earn more than their manager

-- Show each employee alongside their manager's name and salary.
-- Include a salary_advantage column (employee - manager salary).
-- Only return employees who earn MORE than their manager.

SELECT e.first_name AS employee, e.salary,
       m.first_name AS manager, m.salary AS manager_salary,
       ROUND(e.salary - m.salary, 2) AS salary_advantage
FROM employees e
JOIN employees m ON m.id = e.manager_id
WHERE e.salary > m.salary
ORDER BY salary_advantage DESC;

Advanced: Customer first-purchase month with lifetime value

-- For each user, find the month of their first purchase and their total lifetime value.
-- Include users who have never ordered (lifetime_value = 0).

SELECT u.first_name, u.email,
       SUBSTR(MIN(o.created_at), 1, 7)            AS first_purchase_month,
       COALESCE(ROUND(SUM(o.total), 2), 0)         AS lifetime_value
FROM users u
LEFT JOIN orders o ON o.user_id = u.id AND o.status = 'completed'
GROUP BY u.id, u.first_name, u.email
ORDER BY first_purchase_month NULLS LAST;

Try it yourself

Open the SQL Playground and run every exercise above against the real SQLab Hub schema — users, orders, order_items, products, and employees tables are all available. The free playground runs a real SQLite engine with no signup required.

Conclusion

Summary: which JOIN to use

INNER JOIN    → Only matched rows (complete data from both sides required)
LEFT JOIN     → All rows from the primary table, optional matches from the secondary
RIGHT JOIN    → Same as LEFT JOIN with tables swapped (prefer LEFT JOIN for clarity)
FULL OUTER    → All rows from both tables, matched and unmatched
SELF JOIN     → Relate rows within the same table (hierarchies, peer comparisons)
CROSS JOIN    → Every combination of two sets (product variants, calendar grids)

Key performance rules

• Index every JOIN column — especially foreign keys
• Use EXPLAIN to verify index usage before running on large datasets
• Filter rows early with WHERE to reduce data before JOIN expansion
• Use COUNT(column) not COUNT(*) after LEFT JOINs to avoid counting NULLs
• Pre-aggregate 1:many relationships in subqueries to prevent row multiplication
• Prefer INNER JOIN when both sides are always present — smaller result sets perform better

Best practices to carry forward

1. Always use table aliases on every multi-table query
2. Give every JOIN an explicit ON clause — never use implicit comma syntax
3. Choose JOIN type intentionally — do not default to INNER JOIN without considering LEFT JOIN
4. Test with SELECT COUNT(*) first — unexpected row counts reveal missing conditions
5. Avoid SELECT * in joined queries — name only the columns you actually need
6. Run EXPLAIN on any query joining tables with more than 100,000 rows

Continue learning

JOINs are most powerful combined with other SQL building blocks. Explore these related guides:
• SQL Subqueries Explained — when a nested query is cleaner than a JOIN
• GROUP BY & HAVING — aggregate your JOIN results into meaningful metrics
• SQL Window Functions — rank, compare, and compute running totals across joined data
• SQL Performance Tips — a complete guide to indexing, EXPLAIN, and query optimization
• SQL Practice Challenges — apply your JOIN knowledge to graded real-world problems

Start practicing SQL for free

Run every query from this guide in the SQLab Hub interactive SQL playground — no signup, no download, instant results.

Open SQL Playground →View SQL Exercises →

Top 50 SQL Interview Questions & Answers (2026 Guide)25 min read GROUP BY vs HAVING: What's the Difference?10 min read SQL Window Functions Explained with Practical Examples (2026 Guide)28 min read