nx1.info | Object-Oriented Design in Python

These are notes from the book Practical Object-Oriented Design in Ruby that I have adapted into Python. | (link) | (More books r-5.org) This book is extremely good at explaining why to do things a certain way as opposed to simply how (which is what I found 99% of object OOP content on the internet seemed to be). The book is written for Ruby however the principles hold for any OO language.

Table of Contents (click to jump)

1. Object-Oriented Design 2. Designing Classes with a Single Responsibility - Deciding What Belongs in a Class - Grouping Methods into Classes - Organizing Code to Allow for Easy Changes - Creating Classes That Have a Single Responsibility - An Example Application: Bicycles and Gears - Why Single Responsibility Matters - Determining If a Class Has a Single Responsibility - Determining When to Make Design Decisions - Writing Code That Embraces Change - Depend on Behavior, Not Data - Enforce Single Responsibility Everywhere - Finally, the Real Wheel - Summary 3. Managing Dependencies - Understanding Dependencies - Recognizing Dependencies - Coupling Between Objects (CBO) - Other Dependencies - Writing Loosely Coupled Code - Inject Dependencies - Isolate Dependencies - Remove Argument-Order Dependencies - Managing Dependency Direction - Reversing Dependencies - Choosing Dependency Direction - Summary 4. Creating Flexible Interfaces - Understanding Interfaces - Defining Interfaces - Public Interfaces - Private Interfaces - Responsibilities, Dependencies, and Interfaces - Finding the Public Interface - An Example Application: Bicycle Touring Company - Constructing an Intention - Using Sequence Diagrams - Asking for “What” Instead of Telling “How” - Seeking Context Independence - Trusting Other Objects - Using Messages to Discover Objects - Creating a Message-Based Application - Writing Code That Puts Its Best (Inter)Face Forward - Create Explicit Interfaces - Honor the Public Interfaces of Others - Exercise Caution When Depending on Private Interfaces - Minimize Context - The Law of Demeter - Defining Demeter - Consequences of Violations - Avoiding Violations - Listening to Demeter - Summary 5. Reducing Costs with Duck Typing - Understanding Duck Typing - Overlooking the Duck - Compounding the Problem - Finding the Duck - Consequences of Duck Typing - Writing Code That Relies on Ducks - Recognizing Hidden Ducks - Placing Trust in Your Ducks - Documenting Duck Types - Sharing Code Between Ducks - Choosing Your Ducks Wisely - Conquering a Fear of Duck Typing - Subverting Duck Types with Static Typing - Static versus Dynamic Typing - Embracing Dynamic Typing - Summary 6. Acquiring Behavior Through Inheritance - Understanding Classical Inheritance - Recognizing Where to Use Inheritance - Starting with a Concrete Class - Embedding Multiple Types - Finding the Embedded Types - Choosing Inheritance - Drawing Inheritance Relationships - Misapplying Inheritance - Finding the Abstraction - Creating an Abstract Superclass - Promoting Abstract Behavior - Separating Abstract from Concrete - Using the Template Method Pattern - Implementing Every Template Method - Managing Coupling Between Superclasses and Subclasses - Understanding Coupling - Decoupling Subclasses Using Hook Messages - Summary 7. Sharing Role Behavior with Modules - Understanding Roles - Finding Roles - Organizing Responsibilities - Removing Unnecessary Dependencies - Writing the Concrete Code - Extracting the Abstraction - Looking Up Methods - Inheriting Role Behavior - Writing Inheritable Code - Recognize the Antipatterns - Insist on the Abstraction - Honor the Contract 8. Testing - Testing Private Methods - Ignoring Private Methods During Tests - Removing Private Methods from the Class Under Test - Choosing to Test a Private Method - Testing Outgoing Messages - Ignoring Query Messages - Proving Command Messages - Testing Duck Types - Testing Roles - Using Role Tests to Validate Doubles - Testing Inherited Code - Specifying the Inherited Interface - Specifying Subclass Responsibilities - Testing Unique Behavior - Summary

Chapter 1: Object-oriented design (OOD)

While procedural programming follows a fixed sequence of steps, OOD models the world as interacting objects with independent behaviors. This enables emergent behavior without explicitly coding every scenario. OOD requires shifting perspective from procedures to message-passing between objects. Software exists to fulfill a purpose, and efficient, enjoyable programming aligns with cost-effective development. Object-oriented design (OOD) addresses both technical and moral challenges by making code flexible and maintainable. While a perfectly static application wouldn't require design, real-world software inevitably changes. Changing requirements introduce challenges akin to friction and gravity, making adaptability crucial. Well-designed applications are easy to modify and extend, while rigid ones become costly and frustrating to maintain. Change is difficult because object-oriented applications consist of interconnected parts that depend on each other. Dependencies make objects resistant to modification, causing small changes to ripple unpredictably through the system. Poorly managed dependencies lead to inflexible, hard-to-test, and duplicated code. While bad design in small applications may be manageable, as they grow, they become fragile and difficult to maintain. Good design minimizes dependencies, making code easier to change, reuse, and test. Design is the arrangement of code within an application, and it varies between programmers like an art form. Good design balances immediate functionality with future adaptability, as the cost of change will eventually surpass initial development costs. Practical design does not try to predict specific future requirements but instead keeps options open. Its main purpose is to make future design possible and reduce the cost of change. Design is both an art and a science. While every programmer arranges code differently, good design follows established principles that reduce the cost of change and keep future options open. The SOLID principles, DRY, and the Law of Demeter emerged from experience, but their effectiveness was later confirmed through empirical research. Early studies measured object-oriented design metrics in small applications, and later studies—such as those analyzing NASA software—validated that well-designed code correlates with higher quality and lower costs. Ultimately, good design isn't just opinion; it has measurable benefits. The Gang of Four (GoF) formalized common solutions to recurring design problems, making software more modular, reusable, and understandable. Patterns provide a shared language for developers, improving communication and collaboration. However, misuse—applying patterns to the wrong problems—can lead to overly complex code. While this book does not focus on patterns, it aims to equip readers with the foundational knowledge needed to understand and apply them effectively. While object-oriented design (OOD) principles and patterns provide a strong foundation, applying them effectively remains a challenge. Design is not just about knowing the right principles—it requires skill, experience, and judgment. The analogy to woodworking highlights that having good tools doesn’t guarantee good results; mastery comes from practice. Well-designed software, like well-crafted furniture, depends on the expertise of the creator. ways design fails: Lack of Design: Beginners can write working applications without understanding design principles, but these applications become difficult to modify over time. Early ease leads to long-term frustration as changes break everything. Overdesign: Intermediate programmers, excited by their newfound knowledge, overapply principles and patterns, creating rigid, overly complex systems. Instead of allowing for flexibility, their designs become barriers to change. Disconnected Design: When design is separated from programming, often in rigid top-down approaches, it fails to evolve with real-world needs. Without iterative feedback, initial misunderstandings become baked into the system, leading to impractical implementations. The key takeaway is that good design is iterative and must evolve alongside development. Agile methodologies align well with this approach, ensuring that design remains adaptable and responsive to real-world changes. Good software design isn't about writing the most code or following arbitrary metrics—it’s about ``minimizing the long-term cost of change'' while delivering features in a reasonable timeframe.

Chapter 2: Designing Classes with a Single Responsibility

Common Questions in Class Design: - How many classes should there be? - What behavior should they implement? - How much should one class know about others? - How much should a class expose of itself to the outside world? Simplicity is Key: - In the early stages, the goal is to keep things simple. - The application should work but also be easy to change later. - Immediate functionality can be achieved through brute force, but ease of change is more challenging and requires thoughtful design. - Easy changeability is what separates good programming from functional programming. It involves skill, experience, and a bit of creativity. Creating Easy-to-Change Applications: - Making an application easy to change requires knowledge of design principles that promote flexibility and maintainability. - By understanding and applying them, one can structure classes to be both functional now and flexible in the future. Deciding What Belongs in a Class: When designing an object-oriented application, one is faced with the challenge of deciding what should belong in each class. While the implementation of the application might be clear, the organizational structure is what often proves difficult. The focus on grouping methods into classes and making these decisions in a way that allows for future flexibility. Grouping Methods into Classes: - The classes defined will shape how you and others perceive and interact with the code. They set the boundaries for how functionality is structured, and they can potentially constrain future development. - Perfect organization isn’t expected at the start. Design is a balance between the current needs and the potential future changes. - Preserving flexibility is more critical than achieving a perfect structure at the outset. Good design allows for easy changesin the future, even when initial choices are inevitably wrong. Organizing Code for Easy Changes: The goal is to create code that is easy to change when needed. Easy to change means: - No unexpected side effects from changes - Small changes lead to proportionally small code modifications - Code is reusable in new contexts - The easiest way to change code is to add new code that is also easy to change The mnemonic TRUE (Transparent, Reasonable, Usable, Exemplary): Transparent: The consequences of any change should be clear both in the code that is changing and in the code that relies on it. Reasonable: The cost of any change should be proportional to the benefit it provides. Avoid over-engineering or making changes that are costly without significant advantages. Usable: Existing code should be adaptable and usable in unexpected or new contexts without requiring significant rewrites. Exemplary: The code should encourage future developers to maintain these qualities, making it easy to modify in the future.

Creating Classes That Have a Single Responsibility

On a bicycle, the gear ratio is:
gear_ratio = N_chainring / N_cog
Nouns like "bicycle" and "gear" are candidates for classes, but while "bicycle" lacks specific behavior, "gear" involves data and behavior (chainrings, cogs, and ratios), making it suitable for a class.
class Gear:
    def __init__(self, chainring, cog):
        self.chainring = chainring
        self.cog = cog

    def ratio(self):
        return self.chainring / float(self.cog)

print(Gear(52, 11).ratio()) # 4.72727272727273
print(Gear(30, 27).ratio()) # 1.11111111111111
Your cyclist friend requests an enhancement to the Gear calculator. She has two bikes with the same gearing but different wheel sizes, and wants to account for the impact of wheel size on the calculation. Larger wheels cover more distance with each rotation compared to smaller ones. To address this, you need to calculate "gear inches" which combines both the gear ratio and wheel size using the formula: \( \mathrm{gear \ inches} = D_{\mathrm{wheel}} \times \mathrm{gear \ ratio} \) Where: \( D_{\mathrm{wheel}} = D_{\mathrm{rim}} + 2 D_{\mathrm{tire}} \) This allows cyclists to compare bikes with different wheel sizes based on their gearing and wheel size together. We can change the gear class to add this behavior:
class Gear:
    def __init__(self, chainring, cog, rim, tire):
        self.chainring = chainring
        self.cog = cog
        self.rim = rim
        self.tire = tire

    def ratio(self):
        return self.chainring / float(self.cog)

    def gear_inches(self):
        return self.ratio() * (self.rim + (2*self.tire))

print(Gear(52, 11, 26, 1.5).gear_inches()) # 137.1
print(Gear(30, 27, 24, 1.25).gear_inches()) # 125.2
Writing Code That Embraces Change: 1. Depend on Behavior, Not Data - Encapsulate behavior in methods instead of directly accessing data. DRY (Don't Repeat Yourself) to ensure each behavior exists in only one place. - This makes changes easier since modifying a single method updates behavior everywhere it's used. - Behavior exists in methods and is triggered by messages. - Single-responsibility classes ensure each piece of behavior has one authoritative location. 2. Hide Instance Variables - Instead of referring directly to instance variables, wrap them in accessor methods. - In Python this is usually done using the @property decorator. and private attributes proceeeded by an underscore.
class Gear:
def __init__(self, chainring, cog, adjustment_factor=1):
    self._chainring = chainring
    self._cog = cog
    self._adjustment_factor = adjustment_factor

@property
def cog(self):
    return self._cog * self._adjustment_factor  # Now cog is dynamic

def ratio(self):
        return self._chainring / float(self.cog)  # Uses encapsulated property
This may seem pointless, however, implementing this method changes cog from data (which is referenced all over) to behavior (which is defined once). If the _cog instance variable is referred to ten times and it suddenly needs to be adjusted, the code will need many changes. However, if _cog is wrapped in a method, you can change what cog means by implementing your own version of the method. Dealing with data as if it’s an object that understands messages introduces two new issues: 1. Visibility: Wrapping the _cog instance variable in a public cog method exposes this variable to the other objects in your application; any other object can now send cog to a Gear. It would have been just as easy to create a private wrapping method, although in python there is no explicit private methods, they are often indicated by proceeding underscores _ . Although thinking about this this is not really an issue in python since attributes in classes can be freely accessed by externally. 2. Because it’s possible to wrap every instance variable in a method and to therefore treat any variable as if it’s just another object, the distinction between data and a regular object begins to disappear. While it’s sometimes expedient to think of parts of your application as behavior-less data, most things are better thought of as plain old objects. Regardless of how far your thoughts move in this direction, you should hide data from yourself. Doing so protects the code from being affected by unexpected changes. Data very often has behavior that you don’t yet know about. Send messages to access variables, even if you think of them as data.

Hide Data Structures

This principle applies directly to Python, and the equivalent approach would involve encapsulation and creating structured objects instead of relying on nested data structures. Bad Approach: Exposing a Raw Data Structure
class ObscuringReferences:
    def __init__(self, data):
        self.data = data  # Stores a 2D array

    def diameters(self):
        # Hardcoded knowledge of the structure (BAD)
        return [cell[0] + (cell[1] * 2) for cell in self.data]
    
# Example usage 
data = [[622, 20], [622, 23], [559, 30], [559, 40]]
obscure = ObscuringReferences(data)
print(obscure.diameters())  # [662, 668, 619, 639]
Why is this bad? - `diameters()` directly references the structure of `data`. If `data` changes, this method (and others) will break. - If other methods also depend on `cell[0]` and `cell[1]`, this structure is duplicated across the class. - Hard-to-maintain: If `data` changes (e.g., from lists to dictionaries), you have to update every reference. Good Approach: Encapsulate the Data in Objects Instead of exposing raw arrays, we encapsulate the structure in a `Wheel` class, similar to the Ruby `Struct`.
from collections import namedtuple

class RevealingReferences:
    Wheel = namedtuple('Wheel', ['rim', 'tire'])  # Struct-like object

    def __init__(self, data):
        self.wheels = self._wheelify(data)  # Convert raw data to structured objects

    def _wheelify(self, data):
        """Convert list of lists into list of Wheel objects"""
        return [self.Wheel(rim, tire) for rim, tire in data]

    def diameters(self):
        """Compute diameters using structured data"""
        return [wheel.rim + (wheel.tire * 2) for wheel in self.wheels]

# Example usage:
data = [[622, 20], [622, 23], [559, 30], [559, 40]]
revealed = RevealingReferences(data)
print(revealed.diameters())  # [662, 668, 619, 639]
Why is this better? - `diameters()` no longer cares about the internal structure of the data. - `cell[0]` --> `wheel.rim`, making the code more readable and less error-prone. - `_wheelify()` isolates the transformation logic. If the input format changes, you only modify this one method. Even Better: Use a Proper Class Instead of `namedtuple` If `Wheel` needs more behavior, we can use a class instead of `namedtuple`:
class Wheel:
    def __init__(self, rim, tire):
        self.rim = rim
        self.tire = tire

    def diameter(self):
        return self.rim + (self.tire * 2)

class RevealingReferences:
    def __init__(self, data):
        self.wheels = self._wheelify(data)

    def _wheelify(self, data):
        return [Wheel(rim, tire) for rim, tire in data]

    def diameters(self):
        return [wheel.diameter() for wheel in self.wheels]

# Example usage:
data = [[622, 20], [622, 23], [559, 30], [559, 40]]
revealed = RevealingReferences(data)
print(revealed.diameters())  # [662, 668, 619, 639]
Benefits of using a full class: - The logic of `diameter()` belongs inside `Wheel`, making `RevealingReferences` even simpler. - If `Wheel` later needs more behavior (e.g., weight calculations), it can be added without modifying `RevealingReferences`. Takeaways: - Don't expose raw data structures. Convert them into meaningful objects. - Encapsulate transformation logic. Use a method like `_wheelify()` to isolate messy data. - Use namedtuples or classes to replace nested data structures. - Make behavior belong to the object. The `diameter()` method should be in `Wheel`, not in `RevealingReferences`.

Enforce Single Responsibility Everywhere

Designing classes with a single responsibility improves maintainability, but the same principle applies beyond class design—it should be used throughout your code. Extract Extra Responsibilities from Methods: Just like classes, methods should also have a single responsibility. This makes them easier to modify and reuse. You can apply the same design techniques—ask what the method does and try to summarize its responsibility in one sentence. Consider the `diameters` method in a `RevealingReferences` class:
def diameters(self):
    return [wheel.rim + (wheel.tire * 2) for wheel in self.wheels]
This method has two responsibilities: 1. iterating over `self.wheels` 2. calculating the diameter of each wheel. To simplify the code, separate these responsibilities into distinct methods. The refactored version moves the diameter calculation to its own method:
def diameters(self): 
    return [self.diameter(wheel) for wheel in self.wheels] 

def diameter(self, wheel):
    return wheel.rim + (wheel.tire * 2)
Would you ever need to calculate the diameter of just one wheel? You already do! This refactoring isn’t overengineering it simply makes the code cleaner and easier to use elsewhere.

Identifying Hidden Responsibilities

Separating iteration from the action performed on each element is an easy-to-spot case of multiple responsibilities. However, sometimes the issue is more subtle. Take the `gear_inches` method from a `Gear` class:
def gear_inches(self):
    return self.ratio * (self.rim + (self.tire * 2))
Does `gear_inches` belong in `Gear`? It might seem reasonable at first, but something about it feels wrong it’s messy and unclear. The real issue is that it does more than one thing. Hidden inside `gear_inches` is the calculation for wheel diameter. Extracting that logic into a separate method makes the class's responsibilities clearer:
def gear_inches(self):
    return self.ratio * self.diameter()

def diameter(self):
    return self.rim + (self.tire * 2)
Now, `gear_inches` simply retrieves the diameter instead of computing it. The logic remains unchanged, but the structure is far better. Even when you aren’t sure about the final design, these refactorings help. Good practices don’t require knowing the end goal—they help reveal it. The benefits of single-responsibility methods include: - Making hidden design flaws obvious – Refactoring methods to have a single responsibility clarifies what a class actually does. Even if you don’t move methods into different classes right away, making their purpose explicit helps future design decisions. - Reducing the need for comments – Comments often become outdated, as they aren’t executable. If a method requires a comment to explain what part of it does, extract that part into a separate method instead. The method name now conveys the meaning. - Encouraging reuse – Small methods naturally lead to reusable code. Other developers (or your future self) will reuse methods instead of duplicating logic, reinforcing good coding habits. - Making code easier to refactor – When design changes arise, small methods are easier to move between classes, reducing the effort needed to restructure code. While each individual refactoring may seem small, their combined impact results in cleaner, more maintainable code.

Finally, the Real Wheel

While you're refining the design of the Gear class, a new request comes in. You show your calculator to your cyclist friend again, and she tells you that while it’s nice, she’d also like a way to calculate "bicycle wheel circumference." Her bike computer needs this information to properly calculate speed. This is exactly the kind of feature request that informs your next design decision. \(\mathrm{circumference} = Dπ\). embedded Wheel class already calculates diameter, adding a method to compute circumference is straightforward. However, this request also reveals that your application now has a clear need for a standalone Wheel class, independent of Gear. It’s time to separate Wheel into its own class. Because you've already isolated wheel-related behavior within the Gear class, extracting it is painless. You just need to convert the existing Wheel logic into a separate class and add the new circumference method:
import math

class Gear:
    def __init__(self, chainring, cog, wheel=None):
        self.chainring = chainring
        self.cog = cog
        self.wheel = wheel

    def ratio(self):
        return self.chainring / float(self.cog)

    def gear_inches(self):
        return self.ratio() * self.wheel.diameter()

class Wheel:
    def __init__(self, rim, tire):
        self.rim = rim
        self.tire = tire

    def diameter(self):
        return self.rim + (self.tire * 2)

    def circumference(self):
        return self.diameter() * math.pi

# Example usage:
wheel = Wheel(26, 1.5)
print(wheel.circumference())  
# -> 91.106186954104

print(Gear(52, 11, wheel).gear_inches())  
# -> 137.090909090909

print(Gear(52, 11).ratio())  
# -> 4.72727272727273

Both classes now have a single responsibility. While the design isn’t perfect,
it achieves an important goal: it is good enough. Summary

The foundation of changeable and maintainable object-oriented software lies in
classes with a single responsibility. Classes that focus on one thing remain
isolated from the rest of the application, making them easy to change without
unintended consequences and easy to reuse without duplication.

Chapter 3: Managing Dependencies

Collaborating Objects and Managing Dependencies

Object-oriented programming languages model real-world problems naturally. Objects represent qualities of a real-world system, and their interactions provide solutions. These interactions are unavoidable no single object can know everything, so it must communicate with others. If you could observe the internal workings of a complex application, the flow of messages between objects might seem overwhelming. However, stepping back to see the bigger picture reveals a pattern: - every [message] originates from an [object] to trigger some [behavior]. The [behavior] itself is distributed among different [objects]. For any given [action] an [object]: - either knows how to do it itself - inherits the behavior - knows another object that can do it. The previous chapter focused on the first case: ensuring that a class implements its own behavior. The second case inheriting behavior will be covered in Chapter 6. This chapter focuses on the third: accessing behavior implemented in other objects. Because well-designed objects follow the single responsibility principle, they must collaborate to perform complex tasks. This collaboration is both powerful and risky. For an object to work with another, it must have some knowledge of it. This creates **dependencies**, which, if not carefully managed, can make your application fragile and difficult to maintain.

Understanding Dependencies

An object depends on another object if changes in one force modifications in the other:
class Gear:
    def __init__(self, chainring, cog, rim, tire):
        self.chainring = chainring
        self.cog = cog
        self.rim = rim
        self.tire = tire

    def gear_inches(self):
        return self.ratio() * Wheel(self.rim, self.tire).diameter()

    def ratio(self):
        return self.chainring / float(self.cog)


class Wheel:
    def __init__(self, rim, tire):
        self.rim = rim
        self.tire = tire

    def diameter(self):
        return self.rim + (self.tire * 2)


print(Gear(52, 11, 26, 1.5).gear_inches())
Gear has dependencies on Wheel. If Wheel changes, Gear may need to change as well. Here are four dependencies that make Gear harder to maintain: 1. Explicit instantiation: Gear() directly creates a Wheel() instance instead of accepting an existing one. If Wheel's initialization changes, Gear must also change. 2. Hardcoded structure: Gear assumes that Wheel will always be initialized with rim and tire. If Wheel's arguments change, Gear will break. 3. Direct method call: Gear calls Wheel().diameter() directly, assuming this method exists. If diameter() is renamed or removed, Gear must be updated. 4. Implicit coupling: Gear knows too much about Wheel’s internal behavior, reducing flexibility and making modifications risky. These dependencies are unnecessary and make Gear harder to change. A better design would inject a Wheel instance into Gear rather than having Gear create it internally.

Recognizing Dependencies in Python

A class has a dependency when it knows: 1. The name of another class: Gear() explicitly references Wheel(), assuming it exists. 2. The name of a method it intends to call on another object: Gear expects Wheel to have a diameter() method. 3. The arguments required by another class's constructor: Gear assumes Wheel is initialized with rim and tire. 4. The order of those arguments: Gear expects Wheel(rim, tire), meaning changes to this order could break it. Each of these dependencies makes Gear tightly coupled to Wheel, increasing the risk that changes to Wheel will force changes in Gear. Why Is This a Problem? If Wheel changes, Gear must also change: If Wheel.__init__ changes, Gear's call to Wheel(self.rim, self.tire) will break. Reusability is reduced: Gear cannot easily work with a different wheel-like object because it assumes a specific Wheel class. Testing becomes harder: Gear cannot be tested in isolation since it always requires a Wheel instance. The Design Challenge A good design minimizes dependencies so that each class only knows what it absolutely needs to function. The next step is refactoring to reduce coupling and improve flexibility.

Coupling Between Objects (CBO)

These dependencies couple Gear to Wheel. In other words, each coupling creates a dependency. The more Gear knows about Wheel, the tighter the coupling. When two objects are tightly coupled, they effectively behave as a single unit. hen objects are so tightly coupled that they act as one unit, it becomes impossible to reuse just one. Changes to one force changes to all, and unmanaged dependencies can turn minor tweaks into major undertakings. Other dependency issues include: - Chained messages: When an object relies on a series of calls to reach a behavior in a distant object, increasing the chance that any change affects the entire chain. - Over-coupled tests: Tests written too tightly to the code may break every time the code is refactored, even if the core behavior remains the same. Despite these challenges, you can avoid a tangled mess of dependencies by recognizing and managing them carefully. The design goal is to have each class know only what is necessary for its function.

Writing Loosely Coupled Code

Inject Dependencies

class Gear:
def __init__(self, chainring, cog, rim, tire):
        self.chainring = chainring
        self.cog = cog
        self.rim = rim
        self.tire = tire

    def gear_inches(self):
        return self.ratio() * Wheel(self.rim, self.tire).diameter()

    def ratio(self):
        return self.chainring / float(self.cog)
Gear explicitly calls Wheel(self.rim, self.tire) If the name of the Wheel class changes, Gear's gear_inches method must also change. Although this dependency might seem minor (and can be fixed with a global find/ replace), it reveals a deeper problem. When Gear hard-codes a reference to Wheel, it declares that it is only willing to calculate gear inches for instances of Wheel. Gear refuses to collaborate with any other kind of object, even if that object has a diameter. If your application later includes objects such as disks or cylinders that have a diameter, Gear won't be able to calculate their gear inches because it is stuck to Wheel. The code above forces an unjustified attachment to static types. Gear only needs an object that responds to diameter; it doesn't care about the object's class. This tight coupling reduces Gear's reusability and makes it more likely to break if any dependency changes. To decouple, refactor Gear so it is initialized with an object that responds to diameter:
class Gear:
    def __init__(self, chainring, cog, wheel):
        self.chainring = chainring
        self.cog = cog
        self.wheel = wheel

    def gear_inches(self):
        return self.ratio() * self.wheel.diameter()

    def ratio(self):
        return self.chainring / float(self.cog)
Now, Gear does not care if the object is an instance of Wheel. It only needs an object that implements a diameter method. This technique is known as dependency injection. It reduces Gear's explicit dependencies on the Wheel class and its constructor arguments, limiting Gear's dependency to just the diameter method. Gear becomes smarter by knowing less. Dependency injection relies on the idea that the responsibility for knowing a class's name and the responsibility for sending a specific message may belong in different objects. Just because Gear needs to call diameter does not mean it should know about Wheel. In summary, by injecting dependencies, you reduce coupling, increase reusability, and make your code more flexible.

Isolate Dependencies

It's best to break all unnecessary dependencies, but while this is always technically possible, it may not be practically feasible. When working on an existing application, you might face severe constraints on how much you can change. If perfection is out of reach, aim to improve the overall situation by leaving the code better than you found it. If you cannot remove unnecessary dependencies, isolate them within your class. In "Designing Classes with a Single Responsibility," you isolated extra responsibilities so they would be easy to recognize and remove when the time was right. Here, isolate unnecessary dependencies so they are easy to spot and reduce when circumstances permit. Think of every dependency as an alien bacterium trying to infect your class. Give your class a vigorous immune system; quarantine each dependency. Dependencies are foreign invaders that represent vulnerabilities, so they should be concise, explicit, and isolated.

Isolate Instance Creation

If you're so constrained that you cannot change the code to inject a Wheel into a Gear, isolate the creation of a new Wheel within the Gear class. The idea is to explicitly expose the dependency while reducing its reach in your class. Example 1: Move the creation of a new Wheel from the gear_inches method to the initialization method. This cleans up gear_inches and publicly exposes the dependency in __init__. Note that this technique creates a new Wheel every time a new Gear is instantiated.
class Gear:
    def __init__(self, chainring, cog, rim, tire):
        self.chainring = chainring
        self.cog = cog
        self._wheel = Wheel(rim, tire)

    def gear_inches(self):
        return self.ratio() * self._wheel.diameter()

    def ratio(self):
        return self.chainring / float(self.cog)
Example 2: Isolate the creation of a new Wheel in its own wheel method. This method lazily creates a new Wheel using a caching technique. The new Wheel is created only when gear_inches calls the wheel method.
class Gear:
    def __init__(self, chainring, cog, rim, tire):
        self.chainring = chainring
        self.cog = cog
        self.rim = rim
        self.tire = tire
        self._wheel = None

    def gear_inches(self):
        return self.ratio() * self.wheel().diameter()

    def ratio(self):
        return self.chainring / float(self.cog)

    def wheel(self):
        if self._wheel is None:
            self._wheel = Wheel(self.rim, self.tire)
        return self._wheel
In both examples, Gear still takes rim and tire as initialization arguments and creates its own new instance of Wheel, meaning it is still tightly coupled to Wheel; it can calculate gear inches for no other kind of object. However, these coding styles reduce the number of dependencies within gear_inches while publicly exposing Gear's dependency on Wheel. They reveal dependencies instead of concealing them, lowering the barriers to reuse and making the code easier to refactor when circumstances allow. This change makes the code more agile and better able to adapt to an unknown future. The way you manage dependencies on external class names profoundly affects your application. If you are mindful of dependencies and develop a habit of routinely injecting them, your classes will naturally be loosely coupled. If you ignore this issue and let class references proliferate, your application will resemble a big woven mat rather than a set of independent objects. An application whose classes are sprinkled with entangled and obscure class name references is unwieldy and inflexible, while one whose class name dependencies are concise, explicit, and isolated can easily adapt to new requirements.

Isolate Vulnerable External Messages

Calls to external functions in a class (i.e. not self) should also be isolated. This means that changes to that external function or class will mean that your function code does not have to change, only the isolated function would have to change. In the `gear_inches()` method below, `ratio` is sent to `self`, but `diameter` is sent to `wheel`:
def gear_inches(self):
    return self.ratio * self.wheel.diameter()
This is fine in simple cases, but imagine that calculating `gear_inches` required more complex math:
def gear_inches(self):
    # ... a few lines of complex calculations
    foo = some_intermediate_result * self.wheel.diameter()
    # ... more calculations
`wheel.diameter()` is deeply embedded in a complex method, increasing its vulnerability to changes in wheel. `gear_inches()` now depends on `Gear` responding to `self.wheel` and on `Wheel` responding to `diameter()`. To make `gear_inches()` more resilient, isolate the dependency in its own method:
def gear_inches(self):
    # ... a few lines of complex calculations
    foo = some_intermediate_result * self.diameter()
    # ... more calculations

def diameter(self):
    return self.wheel.diameter()
At first glance, this might seem unnecessary—why create `diameter()` when there's only one reference to `wheel.diameter()`? However, this technique proactively removes an external dependency, making the code easier to modify in the future. Before refactoring: `gear_inches()` knew that `wheel` had a `diameter()`. This creates a direct dependency between `Gear` and `Wheel`. If `Wheel` ever changes the method name or its signature, `gear_inches` will break. By isolating `wheel.diameter()` in a separate method, `gear_inches` now relies only on a message to `self`. If `Wheel` changes, the only adjustment needed is within the `diameter()` method. This technique is especially useful when: - A method depends on an external message that is likely to change. - A class contains multiple scattered references to an external method. - You want to reduce the unintended side effects when modifying dependencies. Not every external call requires this treatment, but identifying the most vulnerable dependencies and wrapping them in methods can significantly improve maintainability.

Remove Argument-Order Dependencies

Having fixed order arguments to function calls are vulnerable to changes. This can be especially bad during early development when one is constantly changing what exactly arguments will be passed to a function. One way around this is by using keyword arguments, especially for classes that are likely to change in the future, for example, a Model() class that may have many parameters that are subject to change.
class Gear:
    def __init__(self, **kwargs):
        self.chainring = kwargs.get('chainring')
        self.cog       = kwargs.get('cog')
        self.wheel     = kwargs['wheel']
using `kwargs.get('name')` will return None if the key does not exist, which is essentially equivalent to: `__init__(self, name=None)` in contrast using kwargs['name'] will return a KeyError if the variable does not exist, This can actually be useful for compulsary arguments.

Explicitly Define Defaults

You may however want the lack of an argument to not simply default to None. If this is the case, the dictionary `get()` function can just be given an extra argument that corresponds to the default value.
class Gear:
    def __init__(self, **kwargs):
        self.chainring = kwargs.get('chainring', 40)
        self.cog       = kwargs.get('cog', 18)
        self.wheel     = kwargs.get('wheel')

# Usage examples:
gear1 = Gear(chainring=52, cog=11, wheel="WheelObject")
gear2 = Gear(wheel="WheelObject")  # Defaults used: chainring=40, cog=18
If the default arguments are more complicated than simple values or strings, we can ask a `defaults()` function to go and get these, then overwrite them with whatever the user has inputted.
class Gear:
    def __init__(self, **kwargs):
        args = {**self.defaults(), **kwargs}
        self.chainring = kwargs.get('chainring')
        self.cog       = kwargs.get('cog')
        self.wheel     = kwargs.get('wheel')

    def default(self):
        args = {'chainring' : 40,
                'cog'       : self.default_cog(),
                'wheel'     : self.default_wheel(),
        return args

    def default_cog(self): ...
    def default_wheel(self): ... 
The line:
args = {**self.defaults(), **kwargs}
unpacks the defaults first into a dictionary, then the kwargs from the user, since there can only be one unique key in a dinctionary, any existing ones are overwriten.

Isolate Multiparameter Initialization

When using external libraries, you may not be able to change the order of the arguments. A solution involves wrapping of the external dependency:
# External dependency we don't control:
class ExternalGear:
    def __init__(self, chainring, cog, wheel):
        self.chainring = chainring
        self.cog = cog
        self.wheel = wheel

    def gear_inches(self):
        return (self.chainring / float(self.cog)) * self.wheel.diameter()

class Wheel:
    def __init__(self, rim, tire):
        self.rim = rim
        self.tire = tire

    def diameter(self):
        return self.rim + (self.tire * 2)

# Our wrapper function isolates the fixed-order initialization.
# It accepts keyword arguments, so callers aren't dependent on the order.
def gear_wrapper(**kwargs):
    # The external ExternalGear requires fixed-order parameters,
    # so we extract them by name and pass them in the correct order.
    return ExternalGear(kwargs['chainring'], kwargs['cog'], kwargs['wheel'])

# Usage example:
g = gear_wrapper(chainring=52, cog=11, wheel=Wheel(26, 1.5))
print(g.gear_inches())
This isolates the dependency on argument order so that the rest of your code can use a more flexible, self-documenting interface without worrying about the fixed parameter order required by `ExternalGear`.

Managing Dependency Direction

Dependencies have a direction. In our earlier examples: `Gear` --> `Wheel` But there is no reason the reverse cannot be true. You can reverse the dependency direction so that the external dependency is injected rather than hardcoded. For example, instead of having `Gear` directly call `wheel.diameter()`, you pass the diameter value into `Gear.gear_inches()`. This shifts the dependency so that `Wheel` now relies on `Gear` rather than the other way around. Why? Encapsulation: The complex dependency on an external object's method is isolated in a single wrapper method. Flexibility: If the external object's interface changes (e.g., `Wheel` changes how it computes diameter), you only need to update the wrapper method. Maintenance: It minimizes the ripple effect of changes, making the code easier to modify and less fragile over time. Choosing Dependency Direction: Generally, it is better for classes depend on abstractions rather than concrete implementations. If a class is likely to change, it should be lower in the dependency hierarchy, and other classes should depend on more stable, abstract interfaces.
class Gear:
    def __init__(self, chainring, cog):
        self.chainring = chainring
        self.cog = cog

    def gear_inches(self, diameter):
        # Uses the provided diameter instead of calling wheel.diameter() directly.
        return self.ratio() * diameter

    def ratio(self):
        return self.chainring / float(self.cog)


class Wheel:
    def __init__(self, rim, tire, chainring, cog):
        self.rim = rim
        self.tire = tire
        # Create a Gear instance using provided chainring and cog.
        self.gear = Gear(chainring, cog)

    def diameter(self):
        # Compute wheel diameter: rim + (tire * 2)
        return self.rim + (self.tire * 2)

    def gear_inches(self):
        # Pass the calculated diameter to Gear's gear_inches method.
        return self.gear.gear_inches(self.diameter())

wheel_instance = Wheel(26, 1.5, 52, 11)
print(wheel_instance.gear_inches())
This Python example demonstrates how to reverse the dependency: - `Gear.gear_inches()` now receives the diameter from `Wheel` instead of directly calling a method on `Wheel`. - This isolates the dependency on `Wheel.diameter()` to a single place in `Wheel`, reducing the risk of widespread changes if `Wheel`'s implementation changes. Summary: Dependency management is core to creating future-proof applications. Injecting dependencies creates loosely coupled objects that can be reused in novel ways. Isolating dependencies allows objects to quickly adapt to unexpected changes. Depending on abstractions decreases the likelihood of facing these changes. The key to managing dependencies is to control their direction. The road to maintenance nirvana is paved with classes that depend on things that change less often than they do

4. Creating Flexible Interfaces

An object oriented application may be made up of classes, but it is defined by the messages. Designed must be concerned with these messages, responsibilities and dependencies. Conversation between objects takes places using their `interfaces' Creating flexible interfaces is important for applications to grow and change.

Understanding Interfaces

Although in python there is no explicit public and private keywords to define what can and cannot be accessed outside a class. Properly thinking about what is revealed to other objects and what isn't is of key importance. the public interface comprimises of the exposed methods of a class.

Defining Interfaces

Imagine the following: Resteraunt() Kitchen() Menu() <-- Public interface for the Kitchen and menu Customer() The Kitchen will have many internal processes relating how to make a certain dish but these should all be private to the Kithcen(). Public Interface: - Reveal its primary responsibility - Are expected to be invoked by others - Will not change on a whim - Are safe for others to depend on - Are thoroughly documented in the tests Private Interface: - Handle implementation details - Are not expected to be sent by other objects - Can change for any reason whatsoever - Are unsafe for others to depend on - May not even be referenced in the tests

Responsibilities, Dependencies, and Interfaces

Finding the Public Interface

An Example Application: Bicycle Touring Company

Constructing an Intention

Using Sequence Diagrams

Asking for “What” Instead of Telling “How”

Seeking Context Independence

Trusting Other Objects

Using Messages to Discover Objects

Creating a Message-Based Application

Writing Code That Puts Its Best (Inter)Face Forward

Create Explicit Interfaces

Honor the Public Interfaces of Others

Exercise Caution When Depending on Private Interfaces

Minimize Context

The Law of Demeter

Defining Demeter

Consequences of Violations

Avoiding Violations

Listening to Demeter

Summary

5. Reducing Costs with Duck Typing

Duck Types are public interfaced that are not tied to any specific class. if an object quacks like a duck and walks like a duck, then its class is immaterial, it’s a duck

Understanding Duck Typing

An object's type is less about its class and more about the behavior it exhibits through its methods and properties. This means that if an object implements the necessary methods, it can be used interchangeably with others, regardless of its actual class. This is known as known as duck typing. Focusing on an object’s behavior (its public interface) rather than its class allows for flexible and adaptable designs. It’s important to document these interfaces so that users of your code understand what methods an object is expected to have. Benefits: Increased flexibility, easier code reuse, and more dynamic interactions between objects. Cautions: Without explicit contracts (as provided by interfaces in statically typed languages), designs can become chaotic if the expected behaviors aren’t clearly defined and adhered to.

Overlooking the Duck

In the following example: Trip.prepare() sends the message .prepare_bicycles() to the object contained in self.mechanic No reference is made to the Mechanic class.
class Trip:
    def __init__(self, bicycles, customers, vehicle):
        self.bicycles = bicycles
        self.customers = customers
        self.vehicle = vehicle

    def prepare(self, mechanic):
        mechanic.prepare_bicycles(self.bicycles)

class Mechanic:
    def prepare_bicycles(self, bicycles):
        for bicycle in bicycles:
            self.prepare_bicycle(bicycle)

    def prepare_bicycle(self, bicycle):
        # Implementation for preparing a single bicycle.
        print(f"Preparing {bicycle}")


# Example usage:
trip = Trip(bicycles=["Bike1", "Bike2"], customers=[], vehicle=None)
mechanic = Mechanic()
trip.prepare(mechanic)
Using this code as it shown in the example requires that some external object sends .prepare() to Trip along with a mechanic argument. Even though the prepare() method has no explicit dependency on the Mechanic class, it does depend on receiving an object that can respond to prepare_bicycles(). This dependency is so fundamental that it’s easy to miss or to discount, but nonetheless, it exists. Trip’s prepare method firmly believes that its argument contains a preparer of bicycles.

Compounding the Problem

Imagine that requirements change. In addition to a mechanic, trip preparation now involves a trip coordinator and a driver. You Create TripCoordinator() and Driver(). You also change Trip.prepare() to invoke the correct behavior from each of its arguments.
class Trip:
    def __init__(self, bicycles, customers, vehicle):
        self.bicycles = bicycles
        self.customers = customers
        self.vehicle = vehicle

    def prepare(self, preparers):
        for preparer in preparers:
            if isinstance(preparer, Mechanic):
                preparer.prepare_bicycles(self.bicycles)
            elif isinstance(preparer, TripCoordinator):
                preparer.buy_food(self.customers)
            elif isinstance(preparer, Driver):
                preparer.gas_up(self.vehicle)
                preparer.fill_water_tank(self.vehicle)

class Mechanic:
    def prepare_bicycles(self, bicycles):
        # Implementation for preparing bicycles.
        print("Mechanic: Preparing bicycles.")

class TripCoordinator:
    def buy_food(self, customers):
        # Implementation for buying food.
        print("TripCoordinator: Buying food for customers.")

class Driver:
    def gas_up(self, vehicle):
        # Implementation for gassing up.
        print("Driver: Gassing up vehicle.")

    def fill_water_tank(self, vehicle):
        # Implementation for filling water tank.
        print("Driver: Filling water tank.")

# Example usage:
bicycles = ["Bike1", "Bike2"]
customers = ["Alice", "Bob"]
vehicle = "Van"
trip = Trip(bicycles, customers, vehicle)
preparers = [Mechanic(), TripCoordinator(), Driver()]
trip.prepare(preparers)
Trip.prepare() now refers to three different classes by name and knows specific methods implemented in each. Risks have dramatically gone up. Trip’s prepare method might be forced to change because of a change elsewhere and it might unexpectedly break as the result of a distant, unrelated change. This code is the first step in a process that will paint you into a corner with no way out. Blinded by existing classes and neglecting overlooked messages results in this dependent-laden code that is a natural outgrowth of a class-based perspective. Initially, prepare() required a mechanic that knew how to prepare_bicycle(). After a change, prepare() now received arguments of other classes that do not know how to prepare_bicycle(). As a result, you now go hunting for messages in the new classes (TripCoordinator() and Drive()) for the behavior we want buy_food() gas_up() and fill_water_tank() The most obvious way to invoke this behavior is to send these very messages, but now you’re stuck. Every one of your arguments is of a different class and implements different methods; you must determine each argument’s class to know which message to send. Adding a case statement that switches on class solves the problem of sending the correct message to the correct object but causes an explosion of dependencies. Count the number of new dependencies in the prepare method. - It relies on specific classes, no others will do. - It relies on the explicit names of those classes. - It knows the names of the messages that each class understands - It knows the arguments that those messages require. many distant changes will now have side effects on this code. To make matters worse, this style of code propagates itself. When another new trip preparer appears, you, or the next person down the programming line, will add a new when branch to the case statement. Your application will accrue more and more methods like this, where the method knows many class names and sends a specific message based on class. The logical endpoint of this programming style is a stiff and inflexible application, where it eventually becomes easier to rewrite everything than to change anything.

Finding the Duck

Removing the dependencies requires the recognicion that Trip.prepare() serves a single purpose. The arguments passed prepare() wish to collaborate to accomplish a single goal. Every argument is here for the same reason and that reason is unrelated to the argument’s underlying class. Avoid getting sidetracked by your knowledge of what each argument’s class already does Think instead about what prepare() needs. Considered from prepare’s point of view, the problem is straightforward. The prepare method wants to prepare the trip. Its arguments arrive ready to collaborate in trip preparation. The design would be simpler if prepare just trusted them to do so. Therefore, prepare needs a Preparer(). You’ve pried yourself loose from existing classes and invented a duck type. The next step is to ask what message the prepare method can fruitfully send each Preparer. From this point of view, the answer is obvious: prepare_trip. Preparer() a this point is an abstraction, objects that implement prepare_trip() are Preparers.
class Trip:
    def __init__(self, bicycles=None, customers=None, vehicle=None):
        self.bicycles = bicycles or []
        self.customers = customers or []
        self.vehicle = vehicle

    def prepare(self, preparers):
        for preparer in preparers:
            preparer.prepare_trip(self)


class Mechanic:
    def prepare_trip(self, trip):
        for bicycle in trip.bicycles:
            self.prepare_bicycle(bicycle)

    def prepare_bicycle(self, bicycle):
        print(f"Preparing bicycle {bicycle}")


class TripCoordinator:
    def prepare_trip(self, trip):
        self.buy_food(trip.customers)

    def buy_food(self, customers):
        print(f"Buying food for {len(customers)} customers")


class Driver:
    def prepare_trip(self, trip):
        vehicle = trip.vehicle
        self.gas_up(vehicle)
        self.fill_water_tank(vehicle)

    def gas_up(self, vehicle):
        print(f"Gassing up vehicle {vehicle}")

    def fill_water_tank(self, vehicle):
        print(f"Filling water tank for vehicle {vehicle}")
There is a temptation here to create a Prepare() class or interface (abstract base class) and then have Driver() TripCoordinator() and mechnic all inherit from it ensuring that the prepare_trip() method is implemented. But when you step back, you realize this isn't needed. If prepare() gets an object that doesn't implement prepare_trip, you will get an error that will tell you what you need to know (that object does not have .prepapre_trip()).

Consequences of Duck Typing

Duck typing shifts your code from relying on concrete classes to depending on behavior, which makes it easier to extend without modifying existing code. Although concrete code is simpler to understand, it’s costly to change; abstract, duck-typed code requires a deeper initial understanding but is far more adaptable. This design approach embraces polymorphism—the idea that many different objects can respond to the same message—allowing them to be used interchangeably as long as they behave correctly. Ultimately, by focusing on behavior rather than class, you gain flexibility, but you must ensure that all objects reliably implement the expected methods.

Writing Code That Relies on Ducks

Recognizing Hidden Ducks

1. Case statements that switch on class
class Trip:
    def __init__(self, bicycles, customers, vehicle):
        self.bicycles = bicycles
        self.customers = customers
        self.vehicle = vehicle

    def prepare(self, preparers):
        for preparer in preparers:
            if isinstance(preparer, Mechanic):
                preparer.prepare_bicycles(self.bicycles)
            elif isinstance(preparer, TripCoordinator):
                preparer.buy_food(self.customers)
            elif isinstance(preparer, Driver):
                preparer.gas_up(self.vehicle)
                preparer.fill_water_tank(self.vehicle)
2. kind_of? and is_a?
if isinstance(preparer, Mechanic):
    preparer.prepare_bicycles(bicycle)
elif isinstance(preparer, TripCoordinator):
    preparer.buy_food(customers)
elif isinstance(preparer, Driver):
    preparer.gas_up(vehicle)
    preparer.fill_water_tank(vehicle)
3. responds_to?
if hasattr(preparer, 'prepare_bicycles'):
    preparer.prepare_bicycles(bicycle)
elif hasattr(preparer, 'buy_food'):
    preparer.buy_food(customers)
elif hasattr(preparer, 'gas_up'):
    preparer.gas_up(vehicle)
    preparer.fill_water_tank(vehicle)

Placing Trust in Your Ducks

Use of kind_of?, is_a?, responds_to?, and case statements that switch on your classes indicate the presence of an unidentified duck. In each case the code is effectively saying “I know who you are and because of that I know what you do.” This knowledge exposes a lack of trust in collaborating objects and acts as a millstone around your object’s neck. It introduces dependencies that make code difficult to change. Just as in Demeter violations, this style of code is an indication that you are missing an object, one whose public interface you have not yet discovered. The fact that the missing object is a duck type instead of a concrete class matters not at all; it’s the interface that matters, not the class of the object that implements it. Flexible applications are built on objects that operate on trust; it is your job to make your objects trustworthy. When you see these code patterns, concentrate on the offending code’s expectations and use those expectations to find the duck type. Once you have a duck type in mind, define its interface, implement that interface where necessary, and then trust those implementers to behave correctly

Documenting Duck Types

Because the Prepare() duck type is an abstraction, it doesn't appear anywhere in the code, there is no class Prepare() anywhere. This can make it difficult to see that it exists, therefore good documentation in the form of tests should be used, more in Chapter 9.

Sharing Code Between Ducks

in our example the only thing Mechanic, Driver and TripCoordinator have in common is prepare_trip() however, if there is more share code it becomes more involved, more in Chapter 7.

Summary

Messages are at the center of object-oriented applications and they pass among objects along public interfaces. Duck typing detaches these public interfaces from specific classes, creating virtual types that are defined by what they do instead of by who they are. Duck typing reveals underlying abstractions that might otherwise be invisible. Depending on these abstractions reduces risk and increases flexibility, making your application cheaper to maintain and easier to change.

6. Acquiring Behavior Through Inheritance

Understanding Classical Inheritance

Inheritance is a mechanism for automatic message delegation. It does this by creating a forwarding path for messages that are not understood by a subclass to a superclass. Classical inheritance just refers to inheritance through superclass and subclasses. As opposed to other mechnisms such as prototypical inheritance in javascript.

Recognizing Where to Use Inheritance

Scenario: A company called FastFeet offers road bike trips, these bikes are maintained by mechanics who take an asortment of spare parts on every trip.

Starting with a Concrete Class

class Bicycle: def __init__(self, size, tape_color): self.size = size self.tape_color = tape_color def spares(self): s = {'chain' : '10-speed', 'tire_size' : '23', 'tape_color': self.tape_color} return s bike = Bicycle(size='M', 'tape_color'='red') print(bike.size) # -> 'M' print(bike.spares()) # -> {'chain': '10-speed', 'tire_size': '23', 'tape_color': 'red'} Bicycle instances can respond to the spares, size, and tape_color messages and a Mechanic can figure out what spare parts to take by asking each Bicycle for its spares. Imagine now that FastFeet begins to lead mountain bike trips. Mountain bikes and road bikes are much alike but there are clear differences between them. Mountain bikes are meant to be ridden on dirt paths instead of paved roads. They have sturdy frames, fat tires, straight-bar handlebars (with rubber hand grips instead of tape), and suspension. The bicycle in Figure 6.2 has front suspension only, but some mountain bikes also have rear, or “full” suspension. Your design task is to add support for mountain bikes to FastFeet’s application. Much of the behavior that you need already exists; mountain bikes are definitely bicycles. They have an overall bike size and a chain and tire size. The only differences between road and mountain bikes are that road bikes need handlebar tape and mountain bikes have suspension.

Embedding Multiple Types

When a preexisting concrete class contains most of the behavior you need, it’s tempting to solve this problem by adding code to that class. class Bicycle: def __init__(self, **kwargs): self.style = kwargs['style'] self.size = kwargs['size'] self.tape_color = kwargs['tape_color'] self.front_shock = kwargs['front_shock'] self.rear_shock = kwargs['rear_shock'] # This starts down a slippery slope. def spares(self): if self.style == 'road' s = {'chain' : '10-speed', 'tire_size' : '23', 'tape_color': self.tape_color} else: s = {'chain' : '10-speed', 'tire_size' : '2.1', 'tape_color': self.rear_shock} return s bike1 = Bicycle(style='road', size='M', tape_color='red', front_shock=None, rear_shock=None) bike2 = Bicycle(style='mountain', size='S', tape_color'='red', front_shock='Manitou', rear_shock='Fox') The new front_shock and rear_shock variables hold mountain bike specific parts. The new style variable determines which parts appear on the spares list. This example illustrates an antipattern in object-oriented design. The `Bicycle` class embeds multiple types by using a `style` attribute to determine behavior in the `spares` method. This approach introduces several problems: - If you add a new style you must change the if statement. - If you write careless code where the last option is the default (as does the code above) an unexpected style will do something but perhaps not what you expect. - The spares method started out containing embedded default strings, some of these strings are now duplicated on each side of the if statement. - Bicycle has an implied public interface that includes spares, size, and all the individual parts. - The size method still works, spares generally works, but the parts methods are now unreliable. - It’s impossible to predict, for any specific instance of Bicycle, whether a specific part has been initialized. - Objects holding onto an instance of Bicycle may, for example, be tempted to check style before sending it tape_color or rear_shock. The code wasn’t great to begin with; this change did nothing to improve it. The initial Bicycle class was imperfect but its imperfections were hidden, encapsulated within the class. These new flaws have broader consequences. Bicycle now has more than one responsibility, contains things that might change for different reasons, and cannot be reused as is. checking an attribute like `style' to determine behavior is the same as checking an object's class. In the first approach, we ask: "What is your style?" → Then behave accordingly. In the second approach, Python asks: "What class are you?" → And behaves accordingly.

Finding the Embedded Types

The if statement in the spares method above switches on a variable named style, but it would have been just as natural to call that variable type or category. Variables with these kinds of names are your cue to notice the underlying pattern. Type and category are words perilously similar... The style variable effectively divides instances of Bicycle into two different kinds of things. These two things share a great deal of behavior but differ along the style dimension. Some of Bicycle’s behavior applies to all bicycles, some only to road bikes, and some only to mountain bikes. This single class contains several different, but related, types. This is the exact problem that inheritance solves; that of highly related types that share common behavior but differ along some dimension.

Choosing Inheritance

Objects receive messages. The receiving object ultimately handles any message in one of two ways: 1. It either responds directly 2. or it passes the message on to some other object for a response. Inheritance provides a way to define two objects as having a relationship such that when the first receives a message that it does not understand, it automatically forwards, or delegates, the message to the second.

Misapplying Inheritance

In this example, we illustrate a problematic use of inheritance. The Bicycle class was originally designed with road bike behavior in mind. When MountainBike is subclassed from Bicycle, it inherits both the general bicycle behavior and road-specific defaults, leading to mixed-up, inconsistent behavior. For example, a MountainBike may report a tire size of "23" and have a tape_color (meant for road bikes), even though these attributes don’t apply. The problematic design forces us to modify the Bicycle class (a concrete class) when creating a subclass that needs different behavior, which violates the Open-Closed Principle and makes the design rigid and error-prone. class Bicycle: def __init__(self, size, tape_color=None, tire_size="23", chain="10-speed"): self.size = size self.tape_color = tape_color self.tire_size = tire_size self.chain = chain def spares(self): return { "tire_size": self.tire_size, "chain": self.chain, "tape_color": self.tape_color } class MountainBike(Bicycle): def __init__(self, size, front_shock, rear_shock, **kwargs): self.front_shock = front_shock self.rear_shock = rear_shock super().__init__(size, **kwargs) def spares(self): base_spares = super().spares() base_spares.update({"rear_shock": self.rear_shock}) return base_spares # Example usage: mountain_bike = MountainBike( size='S', front_shock='Manitou', rear_shock='Fox', tape_color=None ) print(mountain_bike.size) # Expected: 'S' print(mountain_bike.spares()) # {'tire_size': '23', 'chain': '10-speed', 'tape_color': None, 'rear_shock': 'Fox'} This design shows how MountainBike, as a subclass of Bicycle, ends up with behavior that mixes road bike defaults (like tire_size and tape_color) with mountain bike parts (front_shock and rear_shock). Embedding these defaults in Bicycle forces MountainBike to inherit unwanted behavior, which is a common antipattern when a concrete class is used as a superclass. A better design would separate the general bicycle behavior from the specifics of each bike type, possibly through duck typing or by refactoring Bicycle into more abstract components.

Finding the Abstraction

Initially Bicycle() was designed for road bikes. However, once MountainBike is introduced, the name Bicycle becomes misleading, since it implies a general type while actually representing only road bikes. Inheritance works best when a subclass (like MountainBike) is a true specialization of its superclass (a generic Bicycle). A MountainBike should have all the features of a Bicycle, plus extra features. For inheritance to be effective, your classes must reflect a true generalization–specialization relationship and be implemented correctly. Since the current Bicycle class mixes general bicycle behavior with road-specific code, it’s better to separate them. Move the road bike–specific code into a RoadBike subclass, leaving Bicycle as a true, generic abstraction for all bicycles.

Creating an Abstract Superclass

What we want to do is create an abstract class Bicycle() that will contain the common behavior. Abstract classes are not meant to be instanced themselves, in Python, the abc module allows you to explicitly defined abstract classes, however this is not strictly needed, in many cases the programmer should be able to use their brain to figure out what is abstract or not. It almost never makes sense to create an abstract superclass with only one sub- class. Even though its possible you anticipate having other kinds of bikes and it is possible to imagine modeling it as two classes from the very beginning, do not. Regardless of how strongly you anticipate having other kinds of bikes, that day may never come. Even two bikes, it still may not be right to commit to inheritance, creating a hierarchy has costs, and the best way to minimize this cost is to ensure you have the abstract right. One is more likely to get the abstraction right once once more behaviour is implemented as you can see what is shared between the objects. The following steps show a possible approach when refactoring: # Create the abstract class class Bicycle: pass # Rename the old Bicycle() to RoadBike() and inherit from Bicycle class RoadBike(Bicycle): # unchanged # Create the MountainBike which subclass of Bicycle class MountainBike(Bicycle): # empty

Promoting Abstract Behavior

.size and spares() are common to both bikes we can promote them to the super-class. .size is simpler, so we do that first: class Bicycle: def __init__(self, **kwargs): self.size = kwargs['size'] class RoadBike(Bicycle): def __init__(self, **kwargs): super().__init__(**kwargs) self.tape_color = kwargs['tape_color'] class MountainBike(Bicycle): def __init__(self, **kwargs): super().__init__(**kwargs) self.front_shock = kwargs['front_shock'] self.rear_shock = kwargs['rear_shock'] b1 = RoadBike(size='M', tape_color='red') b2 = MountainBike(size='L', front_shock='Manitou', rear_shock='Fox') print(b1.size, b2.size) # M L An approach when refactoring classes is to first push everything down into the concrete classes, i.e. atrributes and functions. Then only after proceed to promote up the shared abstractions between the classes. The reason for this is to try and avoid leaving remnants of concrete parts of the code in the super class. A more general way of looking at this is to ask the question of: "What will happen if I'm wrong?" In the case of failing to correctly identify and promote the abstraction, and as a result, some of it is left behind in the subclass, then the problem will be realized when another subclass requires that behaviour. You'll either have to duplicate the code or promote that behaviour. In the case of not pushing down the concrete behaviour, and leaving them in the super-class, it now means that all the sub-classes will inherit it. this is a problem as the subclasses now do not truely specializations of their superclasses and the hierarchy becomes untrustworthy. The result is that new code will embed knowlege of these quirks, such as checking classes of objects, the consequences of not having concrete behaviour in the superclass are far more severe.

Separating Abstract from Concrete

We now return to the .spares() function, which in the example in the book ended up as a weird missmatch between the two classes. In a nutshell,spares should give back: RoadBike: chain, tire_size, tape_color MountainBike: chain, tire_size, front_shock, rear_shock The rest of this chapter is quite confusing, so for now i'll just implement the most straightforward solution I can think of... class Bicycle: def __init__(self, size, chain=None, tire_size=None, **kwargs): self.size = size self.chain = chain self.tire_size = tire_size def spares(self): return {'chain' : self.chain, 'tire_size' : self.size} class RoadBike(Bicycle): def __init__(self, **kwargs): super().__init__(**kwargs) self.tape_color = kwargs['tape_color'] def spares(self): s = super().spares() s['tape_color'] = self.tape_color return s class MountainBike(Bicycle): def __init__(self, **kwargs): super().__init__(**kwargs) self.front_shock = kwargs['front_shock'] self.rear_shock = kwargs['rear_shock'] def spares(self): s = super().spares() s['front_shock'] = self.front_shock s['rear_shock'] = self.rear_shock return s b1 = RoadBike(size='M', tape_color='red') b2 = MountainBike(size='L', front_shock='Manitou', rear_shock='Fox') print(b1.spares()) print(b2.spares())

Using the Template Method Pattern

The template method pattern is used to define a common initialization structure in a superclass (Bicycle) while allowing subclasses (RoadBike and MountainBike) to customize parts of the behavior. The Bicycle class sets up defaults for attributes, and if a value isn’t provided, it calls a template method to obtain the default. Subclasses can override these template methods to supply their own default values. class Bicycle: def __init__(self, size, chain=None, tire_size=None): self.size = size self.chain = chain or self.default_chain() self.tire_size = tire_size or self.default_tire_size() def default_chain(self): return "10-speed" def default_tire_size(self): """Template method to be overwritten by subclass.""" raise NotImplementedError("Subclasses must implement default_tire_size()") class RoadBike(Bicycle): def __init__(self, size, tape_color, chain=None, tire_size=None): super().__init__(size, chain, tire_size) self.tape_color = tape_color def default_tire_size(self): return "23" class MountainBike(Bicycle): def __init__(self, size, front_shock, rear_shock, chain=None, tire_size=None): super().__init__(size, chain, tire_size) self.front_shock = front_shock self.rear_shock = rear_shock def default_tire_size(self): return "2.1" road_bike = RoadBike(size="M", tape_color="red") print("RoadBike tire size:", road_bike.tire_size) # Output: '23' print("RoadBike chain:", road_bike.chain) # Output: '10-speed' mountain_bike = MountainBike(size="S", front_shock="Manitou", rear_shock="Fox") print("MountainBike tire size:", mountain_bike.tire_size) # Output: '2.1' print("MountainBike chain:", mountain_bike.chain) # Output: '10-speed' default_chain() is a common default for both bikes. default_tire_size() is a template method, overwritten by the implementation in the subclass. NOTE: using the or syntax as is done above is less explicit than using: tire_size if tire_size is not None else self.default_tire_size() using the or syntax will treat empty strings, lists and 0s as null entries. This can be useful, the syntax is clearer but it is imporatnt to be aware of this. If the default tire size can be 0 for example, then don't use the or. Another way of defining the defaults is as follows: class Bicycle: def __init__(self, **kwargs): self._size = kwargs.get('size') # Use provided chain or get default; use provided tire_size or get default self._chain = kwargs.get('chain', self.default_chain) self._tire_size = kwargs.get('tire_size', self.default_tire_size)

Implementing Every Template Method

In the previous example, when using the template method pattern, the superclass Bicycle() sends messages to get default values that subclasses must override. If a subclass forgets to implement default_tire_size, Python wont complain, but it will cause an error later on. Bicycle() imposes a requirement on its subclasses that isn’t clear by just reading the code. The original designer knew that every subclass needed to define default_tire_size, but new programmers might overlook this requirement. The abstract base class (abc) module in python lets you explicitly define template functions that will scream at you if you don't implement them. from abc import ABC, abstractmethod class Bicycle(ABC): # Same as before @abstractmethod def default_tire_size(self): pass class RecumbentBike(Bicycle): def default_chain(self): return "9-speed" bike = RecumbentBike(size="S") # TypeError: Can't instantiate abstract class RecumbentBike without # an implementation for abstract method 'default_tire_size'

Managing Coupling Between Superclasses and Subclasses

The final piece of the puzzle is the implementation of .spares() There are a few ways to do this, some resulting in more tightly coupled classes than others. What follows are two different implementations:

Understanding Coupling

When using inheritance, one must be careful about how much each class knows about the other. Simple solutions may lead to tightly coupled classes which may cause problems later. from abc import ABC, abstractmethod class Bicycle(ABC): def __init__(self, **kwargs): self.size = kwargs.get('size') self.chain = kwargs.get('chain', self.default_chain()) self.tire_size = kwargs.get('tire_size', self.default_tire_size()) def default_chain(self): return '10-speed' @abstractmethod def default_tire_size(self): pass def spares(self): return { 'tire_size': self.tire_size, 'chain': self.chain } class RoadBike(Bicycle): def __init__(self, **kwargs): self.tape_color = kwargs.get('tape_color') super().__init__(**kwargs) def spares(self): s = super().spares() s['tape_color'] = self.tape_color return s def default_tire_size(self): return '23' class MountainBike(Bicycle): def __init__(self, **kwargs): self.front_shock = kwargs.get('front_shock') self.rear_shock = kwargs.get('rear_shock') super().__init__(**kwargs) def spares(self): s = super().spares() s['front_shock'] = self.front_shock s['rear_shock'] = self.rear_shock return s def default_tire_size(self): return '2.1' rb = RoadBike(size='XL', tire_size=29, tape_color='red') mb = MountainBike(size='L', chain='test', tire_size=24, front_shock='bob', rear_shock='smith') print(rb.spares()) print(mb.spares()) #{'tire_size': 29, 'chain': '10-speed', 'tape_color': 'red'} #{'tire_size': 24, 'chain': 'test', 'front_shock': 'bob', 'rear_shock': 'smith'} Everything works as expected, but the fact that the sub-classes in the above example have to call super() in both the __init__() and also the spares() means that a form of coupling has been created. Any new programmer implementing a subclass of bicycle must remember to add this to the functions or else things will not work as expected.

Decoupling Subclasses Using Hook Messages

There is a way to avoid having the subclasses needing to send super() to the parent class. This can be done via the use of a hook: from abc import ABC, abstractmethod class Bicycle(ABC): def __init__(self, **kwargs): self.size = kwargs.get('size') self.chain = kwargs.get('chain', self.default_chain()) self.tire_size = kwargs.get('tire_size', self.default_tire_size()) self.post_init(**kwargs) def post_init(self, **kwargs): pass # To be overwritten by subclass def default_chain(self): return '10-speed' @abstractmethod def default_tire_size(self): pass def spares(self): return {'tire_size': self.tire_size, 'chain' : self.chain, **self.local_spares} class RoadBike(Bicycle): def post_init(self, **kwargs): self.tape_color = kwargs.get('tape_color') @property def local_spares(self): return {'tape_color': self.tape_color} def default_tire_size(self): return '23' class MountainBike(Bicycle): def post_init(self, **kwargs): self.front_shock = kwargs.get('front_shock') self.rear_shock = kwargs.get('rear_shock') @property def local_spares(self): return {'front_shock': self.front_shock, 'rear_shock' : self.rear_shock} def default_tire_size(self): return '2.1' rb = RoadBike(size='XL', tire_size=29, tape_color='red') mb = MountainBike(size='L', chain='test', tire_size=24, front_shock='bob', rear_shock='smith') print(rb.spares()) print(mb.spares()) Adding post_init() to the parent __init__() means that it is always called on constructing a bike. Implementing this function in subclasses means that it is called in the parent class, this means we do not need an __init__ in the subclasses. A similar thing is done for spares() where we just add the local_spares to the dictionary in the parent class and return it.

Summary

The best way to create an abstract superclass is by pushing code up from concrete subclasses. Identifying the correct abstraction is easiest if you have access to at least three existing concrete classes. Abstract superclasses use the template method pattern to invite inheritors to supply specializations, and use hook methods to allow these inheritors to contribute these specializations without being forced to send super. Hook methods allow subclasses to contribute specializations without knowing the abstract algorithm. They remove the need for subclasses to send super and therefore reduce the coupling between layers of the hierarchy and increase its tolerance for change. Well-designed inheritance hierarchies are easy to extend with new subclasses, even for programmers who know very little about the application. This ease of extension is inheritance’s greatest strength. When your problem is one of needing numerous specializations of a stable, common abstraction, inheritance can be an extremely low-cost solution.

7. Sharing Role Behavior with Modules

Imagine there is a need for a recumbent mountain bikes. Creation of a recumbent mountain bike subclass requires combining the qualities of two existing subclasses, something that inheritance cannot readily accommodate.

Understanding Roles

Some problems require sharing behavior among unrelated objects. These objects can be thought of sharing a role. Using a role creates dependencies among the objects involved and these dependencies introduce risks that you must take into account when deciding among design options. This section unearths a hidden role and creates code to share its behavior among all players, while at the same time minimizing the dependencies thereby incurred.

Finding Roles

In chapter 5, we used a "preparer" as duck type. Mechanic().prepare_trip() Driver().prepare_trip() TripCoordinator().prepare_trip() # are all interchangable for perparer.prepare_trip() The existence of a Preparer role suggests that there is also a Preparable role.

Organizing Responsibilities

Removing Unnecessary Dependencies

Writing the Concrete Code

Extracting the Abstraction

Looking Up Methods

Inheriting Role Behavior

Writing Inheritable Code

Recognize the Antipatterns

Insist on the Abstraction

Honor the Contract