Excludes 1 And Excludes 2 Examples: 5 Real Examples Explained

Have you ever tried to filter out data and ended up pulling the wrong rows?
It’s a classic mix‑up that happens all the time in data work, especially when you’re dealing with excludes clauses. The trick is knowing the difference between exclude‑1 logic and exclude‑2 logic, and then applying the right one to the right situation Simple, but easy to overlook..

What Is “Excludes 1” and “Excludes 2”

When people talk about “excludes 1” or “excludes 2,” they’re usually referring to two common patterns for removing unwanted records. Think of them as two different ways to say “don’t show me these.”

Excludes 1

Direct negation: You list the values you want to filter out and the system excludes them.
Typical syntax: WHERE column NOT IN (…) or WHERE NOT (condition).
Use case: When you know exactly which items you want to drop and the list is manageable.

Excludes 2

Complementary logic: You start with a broad set and then subtract a subset using a condition that’s the opposite of the inclusion rule.
Typical syntax: WHERE column <> value or WHERE NOT EXISTS (…).
Use case: When the exclusion criteria are based on a dynamic or complex rule, not just a static list.

Both patterns are about the same goal—filtering out data—but they’re not interchangeable. Picking the wrong one can lead to missed records or, worse, a data set that’s too small to be useful Most people skip this — try not to..

Why It Matters / Why People Care

Imagine you’re a marketer pulling a list of customers who didn’t buy in the last quarter. If you use the wrong exclude pattern, you might lose half the leads or, even worse, keep people you already tried to reach Less friction, more output..

Accuracy: The right exclude pattern keeps your data clean and reliable.
Performance: A poorly written exclude clause can slow down queries dramatically.
Compliance: In regulated industries, you must be sure that excluded records are truly omitted; otherwise you risk legal headaches.

So, before you fire off that query, double‑check which exclude type you need.

How It Works (or How to Do It)

Excludes 1: Direct Negation

The most straightforward way to drop records is to say “don’t include these.”

Example (SQL)

SELECT *
FROM customers
WHERE country NOT IN ('USA', 'Canada', 'Mexico');

What it does: Pulls every customer whose country isn’t in the list.
When it shines: Small, static lists; when you’re comfortable hard‑coding the values.

When to avoid it

If the list is huge (hundreds of values) → the query planner might choke.
If the list changes frequently → you’ll need to rewrite the query each time.

Excludes 2: Complementary Logic

Instead of listing what to remove, you describe what to keep and then negate that That alone is useful..

Example (SQL)

SELECT *
FROM orders
WHERE NOT EXISTS (
    SELECT 1
    FROM returns
    WHERE returns.order_id = orders.id
);

What it does: Pulls orders that have no corresponding return record.
Why it’s useful: The exclusion condition is dynamic—based on another table’s state.

When to avoid it

If the subquery returns a massive set → performance suffers.
If the logic is simple enough to list, a NOT IN might be clearer.

Performance Tips

Indexes matter: Make sure the columns you’re excluding are indexed.
Avoid OR with NOT: WHERE NOT (A OR B) can be rewritten as WHERE NOT A AND NOT B for better optimization.
Use EXISTS over IN when subquery size is large: NOT EXISTS often outperforms NOT IN for correlated subqueries.

Common Mistakes / What Most People Get Wrong

Using NOT IN with NULLs
WHERE column NOT IN (NULL) will return no rows because NULL comparisons are unknown.
Assuming NOT EXISTS is always faster
In some engines, a well‑indexed NOT IN can beat NOT EXISTS. Test both.
Hard‑coding large exclusion lists
As the list grows, the query becomes unwieldy and hard to maintain.
Mixing exclude logic with pagination
If you paginate after an exclusion, you might get duplicate or missing rows.
Ignoring case sensitivity
In PostgreSQL, NOT IN ('a', 'b') is case‑sensitive by default. Use ILIKE or lower() if needed.

Practical Tips / What Actually Works

Keep exclusion lists in a table

CREATE TABLE excluded_countries (country TEXT PRIMARY KEY);
INSERT INTO excluded_countries VALUES ('USA'), ('Canada'), ('Mexico');

Then query:

SELECT *
FROM customers
WHERE country NOT IN (SELECT country FROM excluded_countries);

Use CTEs for readability

WITH active_orders AS (
    SELECT * FROM orders
    WHERE status = 'active'
)
SELECT *
FROM active_orders
WHERE NOT EXISTS (
    SELECT 1 FROM returns WHERE returns.order_id = active_orders.id
);

Cache subquery results
If you’re using NOT EXISTS against a large lookup table, materialize the lookup once and join against it.
Test with EXPLAIN
Run EXPLAIN to see if the planner chooses a nested loop or hash join.
Document your choice
Add a comment in the query: -- Excludes 1: Exclude static list of countries.

FAQ

Q1: Can I use NOT IN with a subquery that returns NULLs?
A1: No. If the subquery can return NULL, wrap it in WHERE subquery IS NOT NULL or use NOT EXISTS instead Practical, not theoretical..

Q2: Which is faster: NOT IN or NOT EXISTS?
A2: It depends on the database and the data size. Generally, NOT EXISTS is safer for large subqueries, but test both in your environment.

Q3: How do I exclude records based on a date range?
A3: Use a condition like WHERE date_col NOT BETWEEN '2023-01-01' AND '2023-12-31' or WHERE date_col < '2023-01-01' OR date_col > '2023-12-31'.

Q4: Is there a way to combine Excludes 1 and 2 in one query?
A4: Yes. Use a CTE or subquery for the dynamic part and a NOT IN for the static list The details matter here..

Q5: What if my database doesn’t support NOT EXISTS?
A5: Most modern SQL engines do. If not, you can emulate it with a LEFT JOIN and filter on NULLs.

So, next time you’re filtering out data, remember: Excludes 1 is the straight‑up “don’t list these” approach, while Excludes 2 is the “pull everything except what meets this dynamic rule” style. Pick the right one, test it, and your queries will stay lean, fast, and—most importantly—accurate. Happy filtering!

When to Reach for the “Hybrid” Pattern

In many real‑world scenarios you’ll find yourself needing both a static block list and a dynamic rule set. The cleanest way to express that in SQL is to combine a NOT IN clause with a NOT EXISTS (or anti‑join) in a single WHERE predicate. Because each part is evaluated independently, the optimizer can often push the static list filter down to the scan level, while the dynamic part is handled later in the plan Not complicated — just consistent..

WITH dynamic_exclusions AS (
    SELECT user_id
    FROM user_activity
    WHERE last_login < CURRENT_DATE - INTERVAL '90 days'
)
SELECT *
FROM users u
WHERE u.country NOT IN ('USA','CAN','MEX')          -- Excludes 1 (static)
  AND NOT EXISTS (                                 -- Excludes 2 (dynamic)
        SELECT 1
        FROM dynamic_exclusions d
        WHERE d.user_id = u.id
      );

Why this works well

Benefit	Explanation
Predicate push‑down	The static `NOT IN` can be applied as early as the table scan, reducing I/O. Also, last_login`is indexed, the CTE will be fast, and the`NOT EXISTS` anti‑join can be turned into a hash anti‑join. Practically speaking,
Index usage	If `user_activity.
Maintainability	The static list lives in code (or a small lookup table), while the dynamic rule lives in a clearly named CTE.
Testability	Each part can be unit‑tested in isolation—run the CTE alone, run the static filter alone, then verify the combined result.

Common Pitfalls & How to Avoid Them

Pitfall	Symptom	Fix
Accidental cross‑join	Query runs for minutes, rows explode	Ensure every `NOT EXISTS` or `LEFT JOIN … IS NULL` has a proper correlation predicate (`WHERE sub.id = main.id`).
Over‑eager `DISTINCT`	Results look correct but performance drags	Only use `DISTINCT` if you truly need to deduplicate; often the anti‑join already guarantees uniqueness. And
Hard‑coded lists that grow	“Add another country” requires a code change	Store the list in a small reference table (`excluded_countries`) and join to it; you can still use `NOT IN` if you prefer the literal syntax for very tiny lists. Day to day,
Neglecting `NULL` handling	Some rows unexpectedly appear in the result set	Wrap subqueries with `WHERE column IS NOT NULL` or switch to `NOT EXISTS`.
Missing statistics	Planner picks a nested‑loop when a hash join would be faster	Run `ANALYZE` after bulk inserts/updates to keep statistics fresh.

Performance‑Testing Checklist

Baseline – Run the query without any exclusions and capture execution time and row count.
Add static exclusions – Measure impact; you should see a proportional drop in rows scanned.
Add dynamic exclusions – Compare NOT EXISTS vs. LEFT JOIN … IS NULL. Use EXPLAIN (ANALYZE, BUFFERS) to see which plan the optimizer chooses.
Swap the order – Put the dynamic filter first, then the static one, and re‑run. In most engines the order of predicates doesn’t matter, but the optimizer may produce a different plan if one predicate is more selective.
Scale the data – If possible, test on a production‑sized dataset (or a realistic subset). Small test sets can hide cardinality problems that explode in production.

If any step shows a regression larger than ~5 % of the baseline, dig into the plan and consider adding an index, materializing the subquery, or rewriting the logic Less friction, more output..

Real‑World Example: A SaaS Billing Dashboard

Imagine a multi‑tenant SaaS product that needs to show active subscriptions for a given tenant, but must exclude:

Static block list – Countries where the service is prohibited (e.g., sanctions).
Dynamic block list – Customers who have been flagged for fraud in the last 30 days.

-- 1️⃣ Static block list (tiny, rarely changes)
CREATE TABLE prohibited_countries (code TEXT PRIMARY KEY);
INSERT INTO prohibited_countries VALUES ('IR'),('KP'),('SY');

-- 2️⃣ Dynamic block list (grows daily)
CREATE MATERIALIZED VIEW recent_fraud AS
SELECT customer_id
FROM fraud_events
WHERE event_date >= CURRENT_DATE - INTERVAL '30 days';

-- 3️⃣ Dashboard query
SELECT s.subscription_id,
       s.start_date,
       s.end_date,
       c.email,
       c.country
FROM subscriptions s
JOIN customers c ON c.id = s.customer_id
WHERE s.tenant_id = $1
  AND c.country NOT IN (SELECT code FROM prohibited_countries)   -- static
  AND NOT EXISTS (                                               -- dynamic
        SELECT 1 FROM recent_fraud f WHERE f.customer_id = c.id
      )
  AND s.status = 'active';

Why this design shines

The static list lives in a tiny table that can be cached in memory, so the NOT IN is essentially a constant‑time filter.
The dynamic list is a materialized view refreshed nightly (or more frequently if needed). Because it’s indexed on customer_id, the anti‑join is a fast hash lookup.
Adding a new prohibited country is a single INSERT—no code changes.
Adding a new fraud detection rule merely updates the underlying fraud_events table; the view reflects it automatically.

TL;DR Cheat Sheet

Need	Recommended Pattern	Example
Small, immutable list	`NOT IN ('A','B','C')` or a tiny lookup table	`WHERE country NOT IN ('US','CA')`
Large or frequently‑changing list	`NOT EXISTS (SELECT 1 FROM exclusion_table WHERE …)`	`WHERE NOT EXISTS (SELECT 1 FROM blocked_customers b WHERE b.col IS NOT NULL AND t.id = c.id = c.On the flip side, id WHERE e. id IS NULL`
Concerned about `NULL`s	Use `NOT EXISTS` or filter out `NULL`s first	`WHERE NOT EXISTS (SELECT 1 FROM t WHERE t.id)`
Both static & dynamic	Combine `NOT IN` with `NOT EXISTS` (or anti‑join)	See hybrid query above
Want to keep logic out of the main query	Store exclusions in a dedicated table or materialized view and join/anti‑join to it	`FROM customers c LEFT JOIN excluded e ON e.col = main.

Final Thoughts

Excluding rows is one of those seemingly simple tasks that can quickly become a performance nightmare if you ignore the underlying data characteristics and the optimizer’s behavior. The key takeaways are:

Separate concerns – Keep static block lists and dynamic rule sets in their own objects.
Prefer NOT EXISTS for large or nullable subqueries – It avoids the pitfalls of NULL handling that plague NOT IN.
make use of CTEs and materialized views – They give you both readability and the ability to pre‑compute expensive exclusion sets.
Always validate with EXPLAIN – The planner’s choice tells you whether your indexes are being used and whether a hash anti‑join or a nested‑loop is in play.
Document the intent – A short comment explaining why a row is being excluded saves future developers (and your future self) countless hours of debugging.

By treating exclusions as a first‑class part of your data model rather than an after‑thought in ad‑hoc SQL, you’ll write queries that are easier to understand, easier to maintain, and—most importantly—faster to run at scale. Happy querying!

What Is “Excludes 1” and “Excludes 2”

Excludes 1

Excludes 2

Why It Matters / Why People Care

How It Works (or How to Do It)

Excludes 1: Direct Negation

Example (SQL)

When to avoid it

Excludes 2: Complementary Logic

Example (SQL)

When to avoid it

Performance Tips

Common Mistakes / What Most People Get Wrong

Practical Tips / What Actually Works

FAQ

When to Reach for the “Hybrid” Pattern

Common Pitfalls & How to Avoid Them

Performance‑Testing Checklist

Real‑World Example: A SaaS Billing Dashboard

TL;DR Cheat Sheet

Final Thoughts

Out This Morning

Interesting Nearby