Nice to Know
I'll be honest — for a long time, I'd encounter things like __get__, metaclass=, or a triple-nested decorator factory in someone's codebase and do the engineering equivalent of backing out of the room slowly. I'd copy the pattern, hope it worked, and move on. The discomfort of not knowing what these things actually do eventually got unbearable. So I went down the rabbit holes. Here's what I found.
These aren't features you'll use every day. Think of them like specialty tools in a workshop — the ones you reach for twice a year, but when you need them, nothing else will do. More importantly, recognizing them on sight gives you a kind of X-ray vision when reading framework source code. The "magic" in Django, SQLAlchemy, and PyTorch stops being magic. It becomes machinery you can actually understand.
We'll work through these by building a small mental scenario: imagine you're writing a tiny ML experiment tracker — something that logs runs, stores configs, and wraps training functions. As we go, each feature shows up because we need it, not because it's next on some syllabus.
The Gotchas That Bite First
Before we build anything, let's talk about the traps. These are the bugs that don't throw errors — they produce wrong results silently, and you spend hours blaming your model when the problem was a Python quirk all along.
Mutable Default Arguments
Suppose our experiment tracker has a function that collects tags for a run.
def log_run(name, tags=[]):
tags.append("v1")
return {"name": name, "tags": tags}
print(log_run("run_a")) # {'name': 'run_a', 'tags': ['v1']}
print(log_run("run_b")) # {'name': 'run_b', 'tags': ['v1', 'v1']} ← wait, what?
That second call inherited the tags from the first. The default list [] is created exactly once — when the def statement executes, not on each call. Every call that doesn't pass tags shares the same list object in memory. It's like giving everyone at a restaurant the same plate — whatever the first person puts on it, the next person gets too.
I got burned by this in production. A logging function was accumulating metadata across requests, and the bug only showed up under load because sequential calls in testing happened to mask it. The fix is straightforward — use None as a sentinel.
def log_run(name, tags=None):
if tags is None:
tags = []
tags.append("v1")
return {"name": name, "tags": tags}
Now each call gets its own fresh list. The None sentinel pattern shows up everywhere in Python codebases. When you see it, you'll know exactly why it's there.
Late Binding Closures
Say we want to create a batch of callback functions — one for each metric our tracker records.
metrics = ["loss", "accuracy", "f1"]
callbacks = []
for m in metrics:
callbacks.append(lambda: print(f"Recording {m}"))
callbacks[0]() # Recording f1
callbacks[1]() # Recording f1
callbacks[2]() # Recording f1
All three print "f1." Every single one. Python closures capture variables by reference, not by value. By the time any callback runs, the loop has finished, and m points to the last value. It's like writing someone's phone number on a Post-it, but the Post-it always shows whatever was last written on the whiteboard — not what was there when you stuck it.
# Fix: bind the current value as a default argument
callbacks = []
for m in metrics:
callbacks.append(lambda m=m: print(f"Recording {m}"))
callbacks[0]() # Recording loss ✓
The m=m trick captures the value at that moment by binding it to the default parameter. It looks odd, but it's the standard pattern.
is vs ==
One more trap worth naming. == checks if two values are equal. is checks if they're the same object in memory. For most things, you want ==. But CPython caches small integers (-5 to 256) and some strings, which means is can give you the right answer for the wrong reason.
a = 256
b = 256
print(a is b) # True — CPython caches this integer
a = 257
b = 257
print(a is b) # False in the REPL — 257 isn't cached, so these are separate objects
print(a == b) # True — same value
One caveat: run that snippet as a script instead of typing it into the REPL and a is b may print True anyway, because CPython folds identical constants within a single file. That's exactly the problem — is on values gives answers that depend on interpreter internals, not on your data. The rule is simple: use is for None checks (if x is None), use == for everything else. Code that accidentally uses is for value comparison will work in testing with small numbers and break in production with large ones. I still occasionally catch this in code reviews, sometimes my own.
Shallow vs Deep Copy
Our experiment tracker stores nested config dicts. When we copy a config to tweak it for a new run, we hit the last gotcha in this set.
import copy
config = {"model": "resnet", "params": {"lr": 0.001, "layers": [64, 128]}}
config_v2 = copy.copy(config) # shallow copy
config_v2["params"]["lr"] = 0.01
print(config["params"]["lr"]) # 0.01 ← original is mutated!
copy.copy creates a new outer dict, but the inner objects are still shared references. It's like photocopying a folder — you get a new folder, but the documents inside are the same physical pages. Mutate one, mutate both.
config_v2 = copy.deepcopy(config) # recursively copies everything
config_v2["params"]["lr"] = 0.01
print(config["params"]["lr"]) # 0.001 ✓ original untouched
deepcopy walks the entire object tree and creates independent copies at every level. It's slower, but it's the only safe option for nested mutable structures.
Decorators Under the Hood
Our tracker needs timing and logging on every function that trains a model. We could paste timing code everywhere, but we've been writing Python long enough to know that's the path to madness. This is exactly what decorators are for — and understanding how they actually work changes them from "syntax I copy" to "machinery I control."
A decorator is not special syntax. It's a function that takes a function and returns a function. That's it. When you write this:
@timer
def train(epochs):
...
Python is doing exactly this:
def train(epochs):
...
train = timer(train)
The @ symbol is syntactic sugar. Nothing more. The real function train gets replaced by whatever timer returns. If timer returns a wrapper function, that wrapper runs instead of the original whenever anyone calls train().
Why functools.wraps Matters
Here's a decorator without it.
import time

def timer(func):
    def wrapper(*args, **kwargs):
        start = time.time()
        result = func(*args, **kwargs)
        print(f"{func.__name__} took {time.time() - start:.2f}s")
        return result
    return wrapper
@timer
def train(epochs): ...
print(train.__name__) # "wrapper" — not "train"
The original function's name, docstring, and module are lost. Stack traces show "wrapper." Debugging tools show "wrapper." Introspection is broken. In production, when you're reading logs at 2 AM trying to find which function is slow, "wrapper" tells you nothing.
from functools import wraps

def timer(func):
    @wraps(func)  # copies __name__, __doc__, __module__ from func
    def wrapper(*args, **kwargs):
        start = time.time()
        result = func(*args, **kwargs)
        print(f"{func.__name__} took {time.time() - start:.2f}s")
        return result
    return wrapper
print(train.__name__) # "train" ✓
@wraps(func) is a one-liner that preserves the decorated function's identity. Every decorator you write should include it. Every decorator in a library that doesn't include it has a small bug.
Decorator Factories — When You Need Arguments
What if we want our timer to only print when execution exceeds a threshold? We need a decorator that takes parameters, which means adding one more layer of nesting.
def timer(threshold=1.0):
    def decorator(func):
        @wraps(func)
        def wrapper(*args, **kwargs):
            start = time.time()
            result = func(*args, **kwargs)
            elapsed = time.time() - start
            if elapsed > threshold:
                print(f"⚠ {func.__name__} took {elapsed:.2f}s (threshold: {threshold}s)")
            return result
        return wrapper
    return decorator
@timer(threshold=0.5)
def train(epochs): ...
Three levels deep. The outermost function (timer) takes the arguments and returns the actual decorator. The middle function (decorator) takes the function. The innermost (wrapper) does the work. I still double-check this nesting every time I write one. It's the kind of thing that makes perfect sense once you trace through it, and is confusing every time you write it from scratch.
Stacking Decorators
When you stack decorators, they apply bottom-up but execute top-down.
@authenticate # outer: runs first on call
@timer # inner: applied first at definition
def train(epochs): ...
This is equivalent to train = authenticate(timer(train)). At definition time, timer wraps train first, then authenticate wraps the result. When called, authenticate's wrapper runs first (it's the outermost layer), then timer's, then the original function. Think of it like nesting Russian dolls — you build from the inside out, but open from the outside in.
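If you want to see both phases for yourself, here's a minimal sketch: two throwaway decorators (outer and inner are made-up names) that print once when they're applied and once on every call.
from functools import wraps

def outer(func):
    print("applying outer")  # definition time: runs second
    @wraps(func)
    def wrapper(*args, **kwargs):
        print("outer wrapper runs")  # call time: runs first
        return func(*args, **kwargs)
    return wrapper

def inner(func):
    print("applying inner")  # definition time: runs first
    @wraps(func)
    def wrapper(*args, **kwargs):
        print("inner wrapper runs")  # call time: runs second
        return func(*args, **kwargs)
    return wrapper

@outer
@inner
def train():
    print("train runs")

# Definition prints: "applying inner", then "applying outer" (bottom-up)
train()
# outer wrapper runs
# inner wrapper runs
# train runs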
Context Managers — Cleanup You Can't Forget
Our tracker opens files for logging, acquires database connections, and temporarily changes working directories. All of these share a pattern: acquire a resource, use it, guarantee cleanup even if something blows up. That's what context managers formalize.
The with statement isn't doing anything fancy. It calls __enter__ when the block starts and __exit__ when it ends — including when exceptions occur. That guarantee is the whole point.
class DatabaseConnection:
    def __enter__(self):
        self.conn = connect_to_db()  # placeholder: substitute your driver's connect
        return self.conn
    def __exit__(self, exc_type, exc_val, exc_tb):
        self.conn.close()  # always runs, even on exception
        return False  # don't suppress the exception
with DatabaseConnection() as conn:
conn.execute("INSERT ...")
If the INSERT fails, __exit__ still runs and closes the connection. No leaked handles. No finally blocks to remember. The cleanup is baked into the structure.
But writing a class with __enter__ and __exit__ for every small cleanup task gets tedious. That's where contextlib comes in.
import os
from contextlib import contextmanager

@contextmanager
def temp_directory(path):
    original = os.getcwd()
    os.chdir(path)
    try:
        yield path  # everything before yield = __enter__
    finally:
        os.chdir(original)  # everything after yield = __exit__
with temp_directory("/data/experiment_42"):
# we're in the temp dir here
save_results()
# back to original directory, guaranteed
The code before yield is setup. The code after is teardown. The try/finally ensures cleanup happens even on exceptions. This is how most library authors write context managers — it's more readable than a full class.
One more tool worth naming: ExitStack. When you don't know ahead of time how many resources you'll need — say, opening a variable number of log files — ExitStack manages them dynamically.
from contextlib import ExitStack
with ExitStack() as stack:
files = [stack.enter_context(open(f)) for f in log_paths]
# all files open, all will be closed when we leave this block
If you've made it this far, you already have X-ray vision for three of the most common "what is this?" moments in Python codebases: the silent gotchas that cause midnight debugging sessions, the decorator machinery that powers every framework, and the context manager protocol that keeps resources from leaking. You can stop here and be well-equipped. But if the mystery of how @property actually works is nagging at you — and how Django somehow turns class definitions into database tables — read on.
Descriptors — The Machinery Behind @property
When you write @property on a method, it feels like a special language feature. It's not. It's a descriptor — an object that implements __get__, __set__, or __delete__. Python calls these methods automatically when you access an attribute. The entire attribute access system in Python runs on this protocol.
class Celsius:
def __init__(self):
self._temp = 0
@property
def fahrenheit(self):
return self._temp * 9/5 + 32
t = Celsius()
print(t.fahrenheit) # 32.0
Under the hood, property is a class that defines __get__. When you access t.fahrenheit, Python doesn't find a regular attribute — it finds a descriptor object on the class, and calls its __get__ method, which runs your getter function. @classmethod and @staticmethod work the same way — they're all descriptor objects that intercept attribute access.
There's a subtlety worth knowing. Python distinguishes data descriptors (which define __set__ or __delete__) from non-data descriptors (which only define __get__). Data descriptors take priority over instance attributes. Non-data descriptors don't. That's why you can't accidentally override a property by assigning to the same name, but you can shadow a regular method.
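Seeing the protocol from the implementer's side makes both points concrete. Here's a minimal hand-written data descriptor, a hypothetical Positive validator for our tracker's configs; because it defines __set__, it wins over the instance dict:
class Positive:
    """Data descriptor that rejects non-positive values."""
    def __set_name__(self, owner, name):
        # Python calls this when the owning class is created
        self.name = "_" + name
    def __get__(self, instance, owner=None):
        if instance is None:
            return self  # accessed on the class, not an instance
        return getattr(instance, self.name)
    def __set__(self, instance, value):
        if value <= 0:
            raise ValueError(f"{self.name[1:]} must be positive, got {value}")
        setattr(instance, self.name, value)

class RunConfig:
    lr = Positive()
    def __init__(self, lr):
        self.lr = lr  # routed through Positive.__set__, not the instance dict

cfg = RunConfig(lr=0.001)
print(cfg.lr)  # 0.001, via Positive.__get__
cfg.lr = -1    # ValueError: lr must be positive, got -1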
I'll be honest — the lookup order (data descriptor → instance dict → non-data descriptor) took me several readings to internalize, and I still occasionally draw it out on paper when debugging something weird. You don't need to memorize it. But knowing it exists means the next time attribute access does something unexpected, you know where to look.
Metaclasses — The Nuclear Option
A metaclass is a class whose instances are themselves classes. If that sentence made you pause, good — it made me pause too. The analogy that finally made it click: if a class is a blueprint for objects, a metaclass is a blueprint for blueprints. It controls what happens at the moment a class is defined, not when instances are created.
Where do you encounter them? ORMs. When you write this in Django:
class Article(models.Model):
title = models.CharField(max_length=200)
published = models.BooleanField(default=False)
You're not writing a normal class. The metaclass ModelBase intercepts this class definition and does extraordinary things: it inspects every attribute, builds database column mappings from the field objects, registers the model with Django's app registry, sets up query managers, and wires signal handlers. All before a single instance is created. That's why you can write Article.objects.filter(published=True) — the metaclass set up objects for you.
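Django's ModelBase is hundreds of lines, but the core move, intercepting class creation and inspecting the attributes, fits in a sketch. Field and ModelMeta below are made-up stand-ins, not Django's actual classes:
class Field:
    """Stand-in for something like models.CharField."""
    def __init__(self, column_type):
        self.column_type = column_type

class ModelMeta(type):
    def __new__(mcls, name, bases, namespace):
        # inspect the class body before the class even exists
        columns = {attr: value.column_type
                   for attr, value in namespace.items()
                   if isinstance(value, Field)}
        cls = super().__new__(mcls, name, bases, namespace)
        cls._columns = columns  # attach the derived mapping to the new class
        return cls

class Model(metaclass=ModelMeta):
    pass

class Article(Model):
    title = Field("VARCHAR(200)")
    published = Field("BOOLEAN")

print(Article._columns)  # {'title': 'VARCHAR(200)', 'published': 'BOOLEAN'}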
But here's the thing — you'll read code that uses metaclasses far more often than you'll write them. And for most use cases where you think you need a metaclass, Python 3.6+ offers a simpler alternative.
class Plugin:
registry = {}
def __init_subclass__(cls, **kwargs):
super().__init_subclass__(**kwargs)
Plugin.registry[cls.__name__] = cls
class AudioPlugin(Plugin): ...
class VideoPlugin(Plugin): ...
print(Plugin.registry)
# {'AudioPlugin': <class '__main__.AudioPlugin'>, 'VideoPlugin': <class '__main__.VideoPlugin'>}
__init_subclass__ runs every time a subclass is created. It handles the most common metaclass use case — automatic registration — without any metaclass machinery. It covers about 80% of the cases that would have required a metaclass in older Python. The remaining 20% involves modifying the class namespace before the class body executes, which truly requires a metaclass, and which you'll encounter primarily when reading ORM and framework source code.
Modern Python Worth Knowing
Python has been evolving faster in recent versions than most people realize. A few features have crossed the threshold from "cool new thing" to "you'll see this in production code."
The Walrus Operator (:=)
Assigns and uses a value in a single expression. It looks strange the first time. Then it becomes indispensable in specific patterns.
# without walrus — assign first, then test
data = get_batch()
n = len(data)
if n > 0:
    print(f"Processing {n} items")
    process(data)

# with walrus — assign and test in one expression
if (n := len(data := get_batch())) > 0:
    print(f"Processing {n} items")
    process(data)
Where it really shines is in while loops and comprehension filters, where the alternative is awkward repeated computation or a clunky pre-assignment.
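For instance, reading a stream in fixed-size chunks, which otherwise needs a priming read or a while True with a break. A minimal sketch; the file name and process function are placeholders:
def process(chunk: bytes) -> None:
    print(f"got {len(chunk)} bytes")

with open("train.log", "rb") as f:
    # assign and test in one expression; the loop ends on an empty read
    while chunk := f.read(8192):
        process(chunk)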
Structural Pattern Matching
Python 3.10 introduced match/case, and it's far more powerful than a switch statement. It destructures data.
def handle_event(event):
match event:
case {"type": "metric", "name": name, "value": float(v)}:
record_metric(name, v)
case {"type": "error", "message": msg}:
log_error(msg)
case [x, y, *rest]:
print(f"Sequence starting with {x}, {y}")
case _:
print("Unknown event")
It's not string matching. It's structural — it checks the shape of the data and binds variables as it goes. Parsing API responses, handling command-line arguments, routing messages — anywhere you're doing if isinstance(...) elif isinstance(...) chains, pattern matching is cleaner.
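To back up that last claim, class patterns match your own types too, binding attributes as they go. Metric and Error here are made-up event classes:
from dataclasses import dataclass

@dataclass
class Metric:
    name: str
    value: float

@dataclass
class Error:
    message: str

def handle(event):
    match event:
        case Metric(name=name, value=value):
            print(f"{name} = {value}")
        case Error(message=msg):
            print(f"error: {msg}")
        case _:
            print("unknown event")

handle(Metric("loss", 0.42))  # loss = 0.42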
f-string Debug Mode
Tiny feature from 3.8. Add = after the expression and Python prints both the expression and its value.
batch_size = 64
lr = 3e-4
print(f"{batch_size=}, {lr=}")
# batch_size=64, lr=0.0003
print(f"{len(train_data)=}")
# len(train_data)=50000
It saves typing the variable name twice during debugging. Small, but once you start using it, you can't go back.
Dataclasses Beyond the Basics
Basic @dataclass gives you __init__, __repr__, and __eq__ for free. The advanced options turn it into something much more powerful.
from dataclasses import dataclass, field
@dataclass(frozen=True, slots=True)
class ExperimentConfig:
model: str
lr: float = 0.001
tags: list[str] = field(default_factory=list)
def __post_init__(self):
if self.lr <= 0:
raise ValueError(f"Learning rate must be positive, got {self.lr}")
frozen=True makes instances immutable — any attempt to change a field raises an error. This is invaluable for configs that shouldn't change mid-experiment. slots=True (Python 3.10+) eliminates the instance __dict__, cutting memory usage and speeding up attribute access. field(default_factory=list) solves the mutable default gotcha we saw earlier — each instance gets its own list. And __post_init__ runs after the auto-generated __init__, giving you a place for validation without writing the entire __init__ yourself.
For configs, data transfer objects, and anything that's "a bag of named values with validation," dataclasses with these options have essentially replaced manual __init__ methods in modern Python.
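A quick check of those guarantees, reusing ExperimentConfig from above (each failing line is shown commented out):
cfg = ExperimentConfig(model="resnet", lr=0.01)
# cfg.lr = 0.1  # dataclasses.FrozenInstanceError: cannot assign to field 'lr'

a = ExperimentConfig(model="resnet")
b = ExperimentConfig(model="vit")
a.tags.append("baseline")
print(b.tags)  # [] because default_factory gave b its own list

# ExperimentConfig(model="resnet", lr=-1)  # ValueError from __post_init__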
Type Hints That Actually Help
Beyond basic int and str annotations, two typing features have become genuinely useful in production.
Protocol gives you structural subtyping — static duck typing. Instead of requiring inheritance, you define what methods a type must have, and any class that has them qualifies.
from typing import Protocol
class Trainable(Protocol):
def fit(self, X, y) -> None: ...
def predict(self, X) -> list: ...
def evaluate(model: Trainable, test_X, test_y):
preds = model.predict(test_X)
...
Any class with fit and predict methods satisfies Trainable, no inheritance required. This is how you type-hint "anything that looks like a scikit-learn model" without coupling to scikit-learn.
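To make that concrete, here's a class that never mentions Trainable yet satisfies it; KNNStub is a made-up stand-in, and evaluate is the function defined above:
class KNNStub:
    """No inheritance from Trainable, but structurally compatible."""
    def fit(self, X, y) -> None:
        self.data = list(zip(X, y))
    def predict(self, X) -> list:
        return [0 for _ in X]  # dummy predictions

# a type checker accepts this because KNNStub has matching fit/predict
evaluate(KNNStub(), test_X=[[1.0, 2.0]], test_y=[0])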
The new type parameter syntax in Python 3.12 cleans up generics significantly.
# old way — separate TypeVar declaration
from typing import TypeVar
T = TypeVar("T")
def first(items: list[T]) -> T: ...
# new way (3.12+) — inline, clean
def first[T](items: list[T]) -> T: ...
Same meaning, less boilerplate. You'll see this in newer libraries that have dropped pre-3.12 support.
The GIL — What It Actually Means
The Global Interpreter Lock is Python's most misunderstood feature. In CPython (the standard interpreter), only one thread can execute Python bytecode at a time, even on a multi-core machine. This doesn't mean threading is useless — it means you have to know when it helps and when it doesn't.
For I/O-bound work (network requests, file reads, database queries), threading works fine. The GIL is released while waiting for I/O, so threads can genuinely run concurrently. For CPU-bound work (matrix math, data processing), threads give you zero speedup because the GIL prevents parallel execution. Use multiprocessing instead — separate processes, separate GILs.
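Here's a minimal sketch of that split using concurrent.futures; the URL list and the cpu_heavy workload are placeholders:
import urllib.request
from concurrent.futures import ThreadPoolExecutor, ProcessPoolExecutor

def fetch(url: str) -> int:
    # I/O-bound: the GIL is released while waiting on the network
    with urllib.request.urlopen(url) as resp:
        return len(resp.read())

def cpu_heavy(n: int) -> int:
    # CPU-bound: pure bytecode, holds the GIL the entire time
    return sum(i * i for i in range(n))

if __name__ == "__main__":  # guard required for multiprocessing on spawn platforms
    urls = ["https://example.com"] * 8
    with ThreadPoolExecutor(max_workers=8) as pool:
        sizes = list(pool.map(fetch, urls))  # threads overlap the waiting

    with ProcessPoolExecutor() as pool:
        totals = list(pool.map(cpu_heavy, [10_000_000] * 4))  # true parallelism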
And there's asyncio, which handles I/O concurrency with a single thread using cooperative scheduling. No GIL contention, no threading overhead, and it scales to thousands of concurrent connections. Most modern Python web frameworks (FastAPI, newer Django) are built on it.
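And the asyncio version of the I/O-bound case: one thread, cooperative waits. The network call is simulated with asyncio.sleep here; real code would use an async HTTP client:
import asyncio

async def fetch(url: str) -> str:
    await asyncio.sleep(0.1)  # stands in for a real network wait
    return f"response from {url}"

async def main():
    urls = [f"https://example.com/{i}" for i in range(100)]
    # all 100 "requests" wait concurrently on a single thread
    results = await asyncio.gather(*(fetch(u) for u in urls))
    print(len(results))  # 100, after roughly 0.1s instead of 10s

asyncio.run(main())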
None of these are features you'll use every day. They're recognition tools — the X-ray vision that lets you look at framework source code and see machinery instead of magic. The next time you encounter a triple-nested decorator, a descriptor protocol method, or a metaclass in someone's ORM, you won't need to back out of the room. You'll know what's going on under the hood, and that changes everything.
What You Should Now Be Able To Do
- Explain why mutable default arguments and late binding closures cause bugs — and fix them on sight
- Write decorators with functools.wraps, decorator factories with arguments, and reason about stacking order
- Use context managers and contextlib for resource management that survives exceptions
- Read code that uses descriptors, metaclasses, or __init_subclass__ without confusion
- Leverage modern Python — pattern matching, walrus operator, advanced dataclasses, Protocol — in production code
- Know when to use threads vs multiprocessing vs asyncio based on whether work is I/O-bound or CPU-bound