📘 Property-Based Testing: Hypothesis Framework

🎯 Introduction

Welcome to the exciting world of property-based testing with Hypothesis! 🎉 Ever felt like you’re only testing the cases you can think of? What about the ones you can’t? That’s where property-based testing comes to the rescue! 🦸‍♂️

Imagine you’re a quality inspector at a toy factory 🏭. Instead of checking just a few specific toys, property-based testing is like having a magical robot that creates thousands of random toys and checks if they all follow the safety rules!

By the end of this tutorial, you’ll be writing tests that find bugs you never knew existed. Let’s transform your testing game! 🚀

📚 Understanding Property-Based Testing

🤔 What is Property-Based Testing?

Property-based testing is like having a super-smart testing assistant 🤖 that generates test cases automatically (100 per test by default). Instead of writing specific test examples, you describe the properties your code should have, and Hypothesis creates the test data for you!

Think of it as the difference between:

Traditional testing: “Check if 2 + 2 = 4” 🔢
Property-based testing: “Check if addition always gives the same result regardless of order” 🔄

In Python terms, Hypothesis generates random inputs and verifies that certain properties always hold true. This means you can:

✨ Find edge cases you never thought of
🚀 Test with many generated inputs on every run
🛡️ Catch subtle bugs before they reach production
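
The "regardless of order" property from the comparison above can be written as a short test. This is a minimal sketch: the test name is only illustrative, and integers are used because Python integer addition is exact.

```python
from hypothesis import given, strategies as st


@given(st.integers(), st.integers())
def test_addition_is_commutative(a, b):
    # The property: swapping the operands never changes the sum.
    assert a + b == b + a


# Calling the decorated function runs it against many generated pairs.
test_addition_is_commutative()
```

Notice that no specific numbers appear anywhere in the test; Hypothesis picks them.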

💡 Why Use Hypothesis?

Here’s why developers love property-based testing:

Automatic Test Generation 🎲: Hypothesis generates the inputs, so you can focus on describing properties
Edge Case Discovery 🔍: Finds the weird inputs that break your code
Shrinking 🔬: Automatically simplifies failing cases
Reproducible 📌: Hypothesis saves failing examples and replays them on later runs

Real-world example: Imagine testing a shopping cart 🛒. Instead of manually testing with 1, 2, or 10 items, Hypothesis could test with 0, 1, 1000, negative numbers, and find that your cart breaks with exactly 256 items!
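
To get a feel for shrinking, here is a small sketch using `find`, which searches for an input that satisfies a condition and then reduces it to a simpler one. The `breaks_cart` condition is a made-up stand-in for "an input that breaks the cart", not a real cart bug, and the sketch assumes `find` is available in your Hypothesis version.

```python
from hypothesis import find, strategies as st


def breaks_cart(quantities):
    # Hypothetical failure condition: three or more line items totalling over 10 units.
    return len(quantities) >= 3 and sum(quantities) > 10


# find() looks for a list that satisfies the condition, then shrinks it.
smallest = find(st.lists(st.integers(min_value=0, max_value=1000)), breaks_cart)
print(smallest)
```

The printed list should be far smaller than the first random list that happened to satisfy the condition, which is what makes shrunk failures easy to read.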

🔧 Basic Syntax and Usage

📝 Installation and Setup

First, let’s install Hypothesis:

# 🚀 Install Hypothesis (run this in your shell, not in Python)
pip install hypothesis

# 📦 Import what we need (in your Python test file)
from hypothesis import given, strategies as st

🎯 Your First Property-Based Test

Let’s start with a simple example:

# 👋 Hello, Hypothesis!
from hypothesis import given
import hypothesis.strategies as st

# 🎨 Our function to test
def reverse_string(s):
    return s[::-1]

# 🧪 Property-based test
@given(st.text())
def test_reverse_twice_returns_original(s):
    """Reversing a string twice should return the original! 🔄"""
    assert reverse_string(reverse_string(s)) == s
    
# 🎮 Run the test
if __name__ == "__main__":
    test_reverse_twice_returns_original()
    print("✅ All tests passed!")

💡 Explanation: The @given decorator tells Hypothesis to generate random text strings. It then checks our property: reversing twice returns the original!
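
You can also tune how `@given` behaves. The sketch below raises the number of generated examples with `settings` and pins two inputs with `@example` so they are always tried. The length property and the pinned strings are illustrative choices, not part of the original example.

```python
from hypothesis import example, given, settings, strategies as st


def reverse_string(s):
    return s[::-1]


@settings(max_examples=500)   # generate more inputs than the default
@example("")                  # always try the empty string
@example("héllo 🎉")          # always try some non-ASCII text
@given(st.text())
def test_reverse_preserves_length(s):
    # Reversing reorders characters but never adds or removes any.
    assert len(reverse_string(s)) == len(s)


test_reverse_preserves_length()
```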

🎯 Common Strategies

Here are the strategies you’ll use daily:

# 🏗️ Basic strategies
@given(st.integers())  # 🔢 Generate integers
def test_with_integers(n):
    assert isinstance(n, int)

@given(st.floats(allow_nan=False))  # 🎲 Generate floats
def test_with_floats(f):
    assert isinstance(f, float)

@given(st.text(min_size=1))  # 📝 Generate non-empty text
def test_with_text(s):
    assert len(s) >= 1

@given(st.lists(st.integers()))  # 📋 Generate lists of integers
def test_with_lists(lst):
    assert isinstance(lst, list)

# 🎨 Nested strategies
@given(st.dictionaries(
    keys=st.text(min_size=1),
    values=st.integers()
))  # 🗂️ Generate dictionaries
def test_with_dicts(d):
    assert isinstance(d, dict)
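
Strategies can also be transformed and combined. Here is a small sketch of `.map()`, `st.tuples()`, `st.one_of()`, and `st.sampled_from()`; the strategy names and the status values are invented for illustration.

```python
from hypothesis import given, strategies as st

# .map() transforms generated values: every integer becomes an even integer.
even_integers = st.integers().map(lambda n: n * 2)

# st.tuples() combines strategies positionally.
points = st.tuples(st.integers(), st.integers())

# st.one_of() picks between alternatives; st.sampled_from() picks from fixed values.
optional_status = st.one_of(st.none(), st.sampled_from(["new", "paid", "shipped"]))


@given(even_integers)
def test_even_integers_are_even(n):
    assert n % 2 == 0


@given(points)
def test_points_have_two_coordinates(point):
    x, y = point
    assert isinstance(x, int) and isinstance(y, int)


@given(optional_status)
def test_status_is_known_or_missing(status):
    assert status is None or status in {"new", "paid", "shipped"}


test_even_integers_are_even()
test_points_have_two_coordinates()
test_status_is_known_or_missing()
```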

💡 Practical Examples

🛒 Example 1: Shopping Cart Calculator

Let’s test a shopping cart with property-based testing:

# 🛍️ Shopping cart implementation
class ShoppingCart:
    def __init__(self):
        self.items = []  # 📦 Items in cart
        
    def add_item(self, name, price, quantity):
        """Add item to cart 🛒"""
        if price < 0 or quantity < 0:
            raise ValueError("Price and quantity must be non-negative! 💰")
        self.items.append({
            'name': name,
            'price': price,
            'quantity': quantity,
            'emoji': '🛍️'
        })
    
    def get_total(self):
        """Calculate total price 💵"""
        return sum(item['price'] * item['quantity'] for item in self.items)
    
    def apply_discount(self, percentage):
        """Apply percentage discount 🏷️"""
        if not 0 <= percentage <= 100:
            raise ValueError("Discount must be between 0 and 100! 🚫")
        discount_factor = 1 - (percentage / 100)
        return self.get_total() * discount_factor

# 🧪 Property-based tests
from hypothesis import given, assume
import hypothesis.strategies as st

# 🎯 Strategy for valid items
item_strategy = st.fixed_dictionaries({
    'name': st.text(min_size=1, max_size=50),
    'price': st.floats(min_value=0.01, max_value=10000),
    'quantity': st.integers(min_value=1, max_value=100)
})

@given(st.lists(item_strategy, max_size=20))
def test_cart_total_is_sum_of_items(items):
    """Total should equal sum of all items! 🧮"""
    cart = ShoppingCart()
    
    # ➕ Add all items
    for item in items:
        cart.add_item(item['name'], item['price'], item['quantity'])
    
    # 🔍 Calculate expected total
    expected = sum(item['price'] * item['quantity'] for item in items)
    
    # ✅ Check property
    assert abs(cart.get_total() - expected) < 0.01  # Float comparison

@given(
    st.lists(item_strategy, min_size=1, max_size=10),
    st.floats(min_value=0, max_value=100)
)
def test_discount_reduces_price(items, discount):
    """Discount should always reduce or maintain price! 💸"""
    cart = ShoppingCart()
    
    # 🛒 Fill cart
    for item in items:
        cart.add_item(item['name'], item['price'], item['quantity'])
    
    original_total = cart.get_total()
    discounted_total = cart.apply_discount(discount)
    
    # ✅ Properties to check
    assert discounted_total <= original_total  # Never increase price!
    assert discounted_total >= 0  # Never negative!
    
    # 🎯 Special cases
    if discount == 0:
        assert discounted_total == original_total
    elif discount == 100:
        assert discounted_total == 0

# 🎮 Run the tests!
print("🧪 Testing shopping cart...")
test_cart_total_is_sum_of_items()
test_discount_reduces_price()
print("✅ All shopping cart tests passed! 🎉")

🎯 Try it yourself: Add a test for “buy 2 get 1 free” promotions!
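
If you want a starting point, here is one possible sketch. It assumes a hypothetical helper, `buy_two_get_one_free_total`, in which every third unit of an item is free, and it uses integer cents so the properties can be checked exactly.

```python
from hypothesis import given, strategies as st


def buy_two_get_one_free_total(price_cents, quantity):
    """Hypothetical promotion: every third unit of an item is free."""
    free_units = quantity // 3
    return price_cents * (quantity - free_units)


@given(
    st.integers(min_value=1, max_value=1_000_000),  # price in cents
    st.integers(min_value=0, max_value=100),        # quantity
)
def test_promotion_properties(price_cents, quantity):
    full_price = price_cents * quantity
    total = buy_two_get_one_free_total(price_cents, quantity)

    assert total <= full_price           # the promotion never costs more
    assert 3 * total >= 2 * full_price   # and takes off at most one third
    if quantity < 3:
        assert total == full_price       # too few items to trigger it


test_promotion_properties()
```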

🎮 Example 2: Password Validator

Let’s test a password validator with tricky edge cases:

# 🔐 Password validator
class PasswordValidator:
    def __init__(self, min_length=8):
        self.min_length = min_length
        self.emoji_strength = {
            'weak': '🔴',
            'medium': '🟡', 
            'strong': '🟢'
        }
    
    def is_valid(self, password):
        """Check if password meets requirements 🔍"""
        if len(password) < self.min_length:
            return False
        
        has_upper = any(c.isupper() for c in password)
        has_lower = any(c.islower() for c in password)
        has_digit = any(c.isdigit() for c in password)
        has_special = any(c in "!@#$%^&*" for c in password)
        
        return has_upper and has_lower and has_digit and has_special
    
    def get_strength(self, password):
        """Rate password strength 💪"""
        if not self.is_valid(password):
            return 'weak'
        
        score = 0
        if len(password) >= 12:
            score += 1
        if len(password) >= 16:
            score += 1
        if any(c in "!@#$%^&*()_+-=" for c in password):
            score += 1
            
        if score >= 2:
            return 'strong'
        elif score >= 1:
            return 'medium'
        return 'weak'

# 🧪 Property-based tests
from string import ascii_lowercase, ascii_uppercase, digits

from hypothesis import given
import hypothesis.strategies as st

# 🎯 Strategy for valid passwords
valid_password = st.text(
    alphabet=ascii_lowercase + ascii_uppercase + digits + "!@#$%^&*",
    min_size=8
).filter(
    lambda s: (
        any(c.islower() for c in s) and
        any(c.isupper() for c in s) and
        any(c.isdigit() for c in s) and
        any(c in "!@#$%^&*" for c in s)
    )
)

@given(valid_password)
def test_valid_passwords_are_accepted(password):
    """Valid passwords should always be accepted! ✅"""
    validator = PasswordValidator()
    assert validator.is_valid(password)
    assert validator.get_strength(password) != 'weak'

@given(st.text(max_size=7))
def test_short_passwords_are_invalid(password):
    """Short passwords should always fail! 🚫"""
    validator = PasswordValidator(min_length=8)
    assert not validator.is_valid(password)
    assert validator.get_strength(password) == 'weak'

@given(st.text(alphabet=ascii_lowercase, min_size=8))
def test_lowercase_only_is_weak(password):
    """Lowercase-only passwords are weak! 🔴"""
    validator = PasswordValidator()
    assert not validator.is_valid(password)
    
# 🎮 Property: strength ordering
@given(valid_password)
def test_longer_passwords_not_weaker(password):
    """Longer passwords shouldn't be weaker! 📏"""
    validator = PasswordValidator()
    
    # 🔍 Test with extended password
    extended = password + "A1!"
    
    strength_order = ['weak', 'medium', 'strong']
    original_strength = validator.get_strength(password)
    extended_strength = validator.get_strength(extended)
    
    # ✅ Extended should be at least as strong
    assert strength_order.index(extended_strength) >= strength_order.index(original_strength)

print("🔐 Testing password validator...")
test_valid_passwords_are_accepted()
test_short_passwords_are_invalid()
test_lowercase_only_is_weak()
test_longer_passwords_not_weaker()
print("✅ All password tests passed! 🎉")
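
The `valid_password` strategy above relies on `.filter()`, which throws away every generated string that misses a character class. An alternative sketch builds valid passwords directly: draw one character from each required class, add filler, and shuffle. The name `constructed_password` and the size bounds are assumptions made for this example.

```python
from string import ascii_lowercase, ascii_uppercase, digits

from hypothesis import given, strategies as st

SPECIALS = "!@#$%^&*"
ALPHABET = list(ascii_lowercase + ascii_uppercase + digits + SPECIALS)


@st.composite
def constructed_password(draw):
    # One guaranteed character from each required class.
    required = [
        draw(st.sampled_from(list(ascii_lowercase))),
        draw(st.sampled_from(list(ascii_uppercase))),
        draw(st.sampled_from(list(digits))),
        draw(st.sampled_from(list(SPECIALS))),
    ]
    # At least 4 filler characters brings the total to 8 or more.
    filler = draw(st.lists(st.sampled_from(ALPHABET), min_size=4, max_size=28))
    # Shuffle so the required characters are not always at the front.
    return "".join(draw(st.permutations(required + filler)))


@given(constructed_password())
def test_constructed_passwords_meet_every_rule(password):
    assert len(password) >= 8
    assert any(c.islower() for c in password)
    assert any(c.isupper() for c in password)
    assert any(c.isdigit() for c in password)
    assert any(c in SPECIALS for c in password)


test_constructed_passwords_meet_every_rule()
```

Because nothing is rejected, every draw produces a usable password, which tends to keep generation fast.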

🚀 Advanced Concepts

🧙‍♂️ Stateful Testing

For testing stateful systems, Hypothesis provides state machines:

# 🎯 Testing a bank account state machine
from hypothesis.stateful import RuleBasedStateMachine, rule, invariant
import hypothesis.strategies as st

class BankAccount:
    def __init__(self):
        self.balance = 0
        self.transactions = []
        self.emoji = "🏦"
    
    def deposit(self, amount):
        if amount <= 0:
            raise ValueError("Deposit must be positive! 💰")
        self.balance += amount
        self.transactions.append(f"➕ Deposited ${amount}")
        
    def withdraw(self, amount):
        if amount <= 0:
            raise ValueError("Withdrawal must be positive! 💸")
        if amount > self.balance:
            raise ValueError("Insufficient funds! 🚫")
        self.balance -= amount
        self.transactions.append(f"➖ Withdrew ${amount}")

# 🧪 Stateful test
class BankAccountStateMachine(RuleBasedStateMachine):
    def __init__(self):
        super().__init__()
        self.account = BankAccount()
        self.model_balance = 0  # 📊 Track expected balance
    
    @rule(amount=st.floats(min_value=0.01, max_value=10000))
    def deposit(self, amount):
        """Test deposits 💵"""
        self.account.deposit(amount)
        self.model_balance += amount
        
    @rule(amount=st.floats(min_value=0.01, max_value=10000))
    def withdraw(self, amount):
        """Test withdrawals 💸"""
        if amount <= self.model_balance:
            self.account.withdraw(amount)
            self.model_balance -= amount
    
    @invariant()
    def balance_matches_model(self):
        """Balance should always match our model! 🎯"""
        assert abs(self.account.balance - self.model_balance) < 0.01
        
    @invariant()
    def balance_never_negative(self):
        """Balance should never go negative! 🛡️"""
        assert self.account.balance >= 0

# 🎮 Run stateful test
TestBankAccount = BankAccountStateMachine.TestCase
print("🏦 Testing bank account state machine...")
# This would be run by pytest normally
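
If you want to run a state machine without pytest, `run_state_machine_as_test` can do it from a plain script. The sketch below uses a deliberately small, hypothetical `CentsAccount` that stores integer cents (so the model comparison is exact) and a reduced example budget so it finishes quickly.

```python
from hypothesis import settings, strategies as st
from hypothesis.stateful import (
    RuleBasedStateMachine,
    invariant,
    rule,
    run_state_machine_as_test,
)


class CentsAccount:
    """Hypothetical account that stores its balance in integer cents."""

    def __init__(self):
        self.balance = 0

    def deposit(self, cents):
        self.balance += cents

    def withdraw(self, cents):
        if cents > self.balance:
            raise ValueError("Insufficient funds")
        self.balance -= cents


class CentsAccountMachine(RuleBasedStateMachine):
    def __init__(self):
        super().__init__()
        self.account = CentsAccount()
        self.model_balance = 0  # independent model of the expected balance

    @rule(cents=st.integers(min_value=1, max_value=1_000_000))
    def deposit(self, cents):
        self.account.deposit(cents)
        self.model_balance += cents

    @rule(cents=st.integers(min_value=1, max_value=1_000_000))
    def withdraw(self, cents):
        # Only attempt withdrawals the model says are affordable.
        if cents <= self.model_balance:
            self.account.withdraw(cents)
            self.model_balance -= cents

    @invariant()
    def balance_matches_model(self):
        assert self.account.balance == self.model_balance
        assert self.account.balance >= 0


# Run the machine directly, with a small budget so the script finishes quickly.
run_state_machine_as_test(
    CentsAccountMachine,
    settings=settings(max_examples=50, stateful_step_count=20),
)
```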

🏗️ Custom Strategies

Create your own strategies for domain-specific data:

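from string import ascii_lowercase, digits

from hypothesis import given
import hypothesis.strategies as st

# 🚀 Custom strategy for email addresses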
@st.composite
def email_strategy(draw):
    """Generate valid email addresses 📧"""
    username = draw(st.text(
        alphabet=ascii_lowercase + digits + "._-",
        min_size=1,
        max_size=20
    ))
    domain = draw(st.text(
        alphabet=ascii_lowercase,
        min_size=2,
        max_size=10
    ))
    tld = draw(st.sampled_from(['com', 'org', 'net', 'edu']))
    
    return f"{username}@{domain}.{tld}"

# 🎨 Custom strategy for RGB colors
@st.composite  
def rgb_color_strategy(draw):
    """Generate RGB colors 🎨"""
    r = draw(st.integers(0, 255))
    g = draw(st.integers(0, 255))
    b = draw(st.integers(0, 255))
    return {'r': r, 'g': g, 'b': b, 'hex': f"#{r:02x}{g:02x}{b:02x}"}

# 🧪 Using custom strategies
@given(email_strategy())
def test_email_format(email):
    """Emails should have @ and . 📮"""
    assert '@' in email
    assert '.' in email.split('@')[1]

@given(rgb_color_strategy())
def test_rgb_to_hex(color):
    """RGB values should convert to valid hex 🌈"""
    assert color['hex'].startswith('#')
    assert len(color['hex']) == 7
    assert all(c in '0123456789abcdef' for c in color['hex'][1:])
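
When your data already has a class, `st.builds()` is often a shorter route than `@st.composite`. This sketch assumes a hypothetical `Color` dataclass and checks that its hex string round-trips back to the original channels.

```python
from dataclasses import dataclass

from hypothesis import given, strategies as st


@dataclass
class Color:
    """Hypothetical RGB color used only for this example."""

    r: int
    g: int
    b: int

    def to_hex(self):
        return f"#{self.r:02x}{self.g:02x}{self.b:02x}"


channel = st.integers(min_value=0, max_value=255)

# st.builds() calls Color(...) with values drawn from the given strategies.
colors = st.builds(Color, r=channel, g=channel, b=channel)


@given(colors)
def test_hex_round_trips(color):
    hex_value = color.to_hex()
    assert int(hex_value[1:3], 16) == color.r
    assert int(hex_value[3:5], 16) == color.g
    assert int(hex_value[5:7], 16) == color.b


test_hex_round_trips()
```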

⚠️ Common Pitfalls and Solutions

😱 Pitfall 1: Flaky Tests

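# ❌ Wrong - asserting on wall-clock time
import time

# 🧩 Hypothetical stand-in for the function under test
def process_number(n):
    return n * 2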

@given(st.integers())
def test_with_time(n):
    start_time = time.time()  # 😰 Changes every run!
    result = process_number(n)
    assert time.time() - start_time < 1  # 💥 Might fail randomly!

# ✅ Correct - use deterministic properties
@given(st.integers())
def test_deterministic(n):
    result = process_number(n)
    assert isinstance(result, int)  # ✅ Always consistent!
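
Timing matters in one more way: Hypothesis applies its own per-example deadline (200 ms by default), so slow or jittery code can fail even without a manual timer. The sketch below turns the deadline off and uses `derandomize=True` so every run tries the same examples. `process_number` is a hypothetical stand-in, and whether disabling the deadline is appropriate depends on your code.

```python
from hypothesis import given, settings, strategies as st


def process_number(n):
    # Hypothetical stand-in for the function under test.
    return n * 2


@settings(deadline=None, derandomize=True)
@given(st.integers())
def test_process_number_doubles(n):
    # A deterministic property: no clocks, no randomness outside Hypothesis.
    assert process_number(n) == n + n


test_process_number_doubles()
```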

🤯 Pitfall 2: Too Broad Strategies

# ❌ Dangerous - allows problematic inputs
@given(st.floats())
def test_divide(x):
    result = 1 / x  # 💥 ZeroDivisionError at 0.0; inf and nan slip through as 0.0 and nan!

# ✅ Safe - constrain your inputs
@given(st.floats(min_value=0.1, max_value=1000, allow_nan=False))
def test_divide_safe(x):
    result = 1 / x  # ✅ Safe now!
    assert 0.001 <= result <= 10
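
If narrowing the range would hide inputs you care about, `assume()` is another option: it tells Hypothesis to discard an example that does not meet a precondition. This sketch keeps the full finite float range and only skips zero. The sign property is an illustrative choice.

```python
import math

from hypothesis import assume, given, strategies as st


@given(st.floats(allow_nan=False, allow_infinity=False))
def test_reciprocal_keeps_sign(x):
    # Skip 0.0 and -0.0, the only inputs that raise ZeroDivisionError here.
    assume(x != 0)
    result = 1 / x
    # The reciprocal of a positive number is positive, and likewise for negatives.
    assert math.copysign(1.0, result) == math.copysign(1.0, x)


test_reciprocal_keeps_sign()
```

Use `assume()` sparingly: if most examples are discarded, Hypothesis has little left to test with.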

🤔 Pitfall 3: Assuming Too Much

# ❌ Wrong - assuming the list is never empty
@given(st.lists(st.integers()))
def test_sorting(lst):
    sorted_lst = sorted(lst)
    assert sorted_lst[0] == min(lst)  # 💥 IndexError on an empty list!

# ✅ Correct - handle edge cases
@given(st.lists(st.integers()))
def test_sorting_safe(lst):
    sorted_lst = sorted(lst)
    if lst:  # ✅ Check if not empty
        assert sorted_lst[0] == min(lst)
        assert sorted_lst[-1] == max(lst)
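
Another way around the empty-list problem is to pick properties that hold for every list, empty or not. A minimal sketch, with an illustrative test name:

```python
from collections import Counter

from hypothesis import given, strategies as st


@given(st.lists(st.integers()))
def test_sorted_is_an_ordered_permutation(lst):
    sorted_lst = sorted(lst)
    # Each element is no larger than the one after it (vacuously true when empty).
    assert all(a <= b for a, b in zip(sorted_lst, sorted_lst[1:]))
    # Sorting keeps exactly the same elements, duplicates included.
    assert Counter(sorted_lst) == Counter(lst)


test_sorted_is_an_ordered_permutation()
```

Together these two properties describe sorting much more completely than checking only the first and last elements.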

🛠️ Best Practices

🎯 Start Simple: Begin with basic properties, add complexity gradually
📝 Name Properties Clearly: test_addition_is_commutative not test_add
🛡️ Constrain Inputs: Use min_value, max_value, filter()
🎨 Use Composite Strategies: Build complex data from simple parts
✨ Let Hypothesis Shrink: Don’t override shrinking without good reason
🔍 Check the Examples: Use @example for specific important cases
📊 Profile Your Tests: Use --hypothesis-show-statistics

🧪 Hands-On Exercise

🎯 Challenge: Build a String Processor Test Suite

Create property-based tests for a string processor:

📋 Requirements:

✅ Test a function that removes duplicate characters
🏷️ Test a function that counts word frequencies
👤 Test a function that masks sensitive data (emails, phones)
📅 Test a function that parses dates in multiple formats
🎨 Each test should check multiple properties!

🚀 Bonus Points:

Use custom strategies for phone numbers
Test with Unicode and emojis
Create stateful tests for a text buffer

💡 Solution

# 🎯 String processor implementation and tests
import re
from collections import Counter
from hypothesis import given, assume
import hypothesis.strategies as st

class StringProcessor:
    def __init__(self):
        self.emoji = "📝"
        
    def remove_duplicates(self, text):
        """Remove duplicate characters while preserving order 🧹"""
        seen = set()
        result = []
        for char in text:
            if char not in seen:
                seen.add(char)
                result.append(char)
        return ''.join(result)
    
    def count_words(self, text):
        """Count word frequencies 📊"""
        words = re.findall(r'\b\w+\b', text.lower())
        return dict(Counter(words))
    
    def mask_sensitive(self, text):
        """Mask emails and phone numbers 🔒"""
        # Mask emails
        text = re.sub(r'\b[\w.-]+@[\w.-]+\.\w+\b', '***@***.***', text)
        # Mask phone numbers (simple pattern)
        text = re.sub(r'\b\d{3}[-.]?\d{3}[-.]?\d{4}\b', '***-***-****', text)
        return text

# 🧪 Property-based tests
processor = StringProcessor()

@given(st.text())
def test_remove_duplicates_reduces_length(text):
    """Removing duplicates never increases length! 📏"""
    result = processor.remove_duplicates(text)
    assert len(result) <= len(text)

@given(st.text())
def test_remove_duplicates_preserves_unique(text):
    """Already unique strings stay the same! 🎯"""
    unique_text = processor.remove_duplicates(text)
    # Applying again should give same result
    assert processor.remove_duplicates(unique_text) == unique_text

@given(st.text())
def test_remove_duplicates_has_all_chars(text):
    """Result contains all unique characters! ✅"""
    result = processor.remove_duplicates(text)
    assert set(result) == set(text)

# 🎯 Custom strategy for text with words
words_text = st.text(alphabet=st.characters(whitelist_categories=("Lu", "Ll", "Nd"), whitelist_characters=" "), min_size=1)

@given(words_text)
def test_word_count_values_positive(text):
    """All word counts are positive! 🔢"""
    counts = processor.count_words(text)
    assert all(count > 0 for count in counts.values())

@given(words_text)
def test_word_count_sum_matches(text):
    """Sum of counts equals total words! 🧮"""
    counts = processor.count_words(text)
    words = re.findall(r'\b\w+\b', text.lower())
    assert sum(counts.values()) == len(words)

# 🎨 Custom strategy for emails
@st.composite
def email_in_text_strategy(draw):
    """Generate text with emails 📧"""
    prefix = draw(st.text(max_size=20))
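    # fullmatch=True keeps the email exact; prefix and suffix supply the surrounding text
    email = draw(st.from_regex(r'[a-z]+@[a-z]+\.com', fullmatch=True))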
    suffix = draw(st.text(max_size=20))
    return f"{prefix} {email} {suffix}"

@given(email_in_text_strategy())
def test_mask_hides_emails(text):
    """Emails are properly masked! 🔐"""
    result = processor.mask_sensitive(text)
    # Original email pattern shouldn't exist
    assert not re.search(r'\b[\w.-]+@[\w.-]+\.\w+\b', result)
    # But masked version should
    assert '***@***.***' in result

# 🏆 Stateful test for text buffer
from hypothesis.stateful import RuleBasedStateMachine, rule, invariant

class TextBufferStateMachine(RuleBasedStateMachine):
    def __init__(self):
        super().__init__()
        self.buffer = []
        self.processor = StringProcessor()
    
    @rule(text=st.text())
    def append_text(self, text):
        """Append text to buffer 📝"""
        self.buffer.append(text)
    
    @rule()
    def process_buffer(self):
        """Process and clear buffer 🔄"""
        if self.buffer:
            combined = ''.join(self.buffer)
            result = self.processor.remove_duplicates(combined)
            # Property: result has all unique chars from buffer
            assert set(result) == set(combined)
            self.buffer.clear()
    
    @invariant()
    def buffer_size_reasonable(self):
        """Buffer doesn't grow too large 📦"""
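        assert len(self.buffer) <= 1000

# 🎮 Expose the state machine so pytest can collect it
TestTextBuffer = TextBufferStateMachine.TestCase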

# 🎮 Run all tests
print("🧪 Testing string processor...")
test_remove_duplicates_reduces_length()
test_remove_duplicates_preserves_unique()
test_remove_duplicates_has_all_chars()
test_word_count_values_positive()
test_word_count_sum_matches()
test_mask_hides_emails()
print("✅ All string processor tests passed! 🎉")

🎓 Key Takeaways

You’ve mastered property-based testing with Hypothesis! Here’s what you can now do:

✅ Write property-based tests that find edge cases automatically 💪
✅ Use Hypothesis strategies to generate test data 🎲
✅ Create custom strategies for domain-specific needs 🎨
✅ Test stateful systems with state machines 🤖
✅ Avoid common pitfalls in property-based testing 🛡️

Remember: Property-based testing doesn’t replace example-based tests, it complements them! Use both for maximum confidence. 🤝

🤝 Next Steps

Congratulations! 🎉 You’ve unlocked the power of property-based testing!

Here’s what to do next:

💻 Install Hypothesis and try the exercises above
🏗️ Add property-based tests to your existing projects
📚 Explore more advanced strategies in the Hypothesis docs
🌟 Share your coolest bug finds with the community!

Remember: The best tests are the ones that find bugs you didn’t know existed. Keep testing, keep discovering, and most importantly, have fun finding those edge cases! 🚀

Happy testing! 🎉🧪✨
