Debugging Python Code: Tips for AI Beginners

Master Python debugging for AI projects. Learn to read error messages, use print debugging, leverage debuggers, handle common errors, and debug machine learning code effectively.

By Techietory on February 1, 2026

Introduction: Debugging is a Core Skill, Not an Afterthought

Every programmer writes code with bugs. This isn’t a sign of incompetence—it’s an inevitable reality of software development. The difference between novice and experienced programmers isn’t that experienced programmers write bug-free code; it’s that they can find and fix bugs quickly and systematically. Debugging is as fundamental as writing code itself, yet many beginners treat it as something they’ll figure out eventually. This approach leads to hours of frustration staring at code, randomly changing things, and hoping problems disappear.

Effective debugging is methodical, not magical. It involves understanding error messages, forming hypotheses about what’s wrong, testing those hypotheses systematically, and fixing the underlying problem rather than just symptoms. Good debuggers approach problems scientifically: they observe behavior, form theories, run experiments, and iterate until they identify root causes. This systematic approach works regardless of programming language, domain, or problem complexity.

In machine learning, debugging presents unique challenges beyond typical software bugs. Your code might run without errors but produce wrong results—models that don’t learn, predictions that make no sense, or accuracy that’s suspiciously high. These logical errors are harder to detect than syntax errors because Python doesn’t complain. You must understand both your code and your problem domain deeply enough to recognize when results are wrong and trace back to the cause.

This comprehensive guide will transform you from someone who gets stuck on bugs into someone who debugs efficiently and confidently. We’ll start by understanding what bugs are and why they happen, building intuition about common failure modes. We’ll master reading and interpreting Python error messages, which contain crucial debugging information if you know how to read them. We’ll explore debugging techniques from simple print statements through sophisticated debuggers. We’ll examine common error types and their fixes. We’ll dive into machine learning-specific debugging challenges. Throughout, we’ll emphasize systematic approaches that work across different problems rather than memorizing specific fixes.

Understanding Bugs: Types and Causes

Before learning to fix bugs, you need to understand what bugs are and why they occur. Different bug types require different debugging approaches.

The Three Categories of Bugs

Syntax errors occur when Python can’t parse your code. Missing colons, unmatched parentheses, incorrect indentation—these prevent code from running at all. Python catches syntax errors immediately when it tries to read your file, before executing any code. These are the easiest bugs to fix because Python tells you exactly where the problem is.

Runtime errors (exceptions) occur during execution when Python encounters something it can’t do: dividing by zero, accessing a nonexistent dictionary key, or calling a method on None. The code is syntactically correct, so Python starts running it, but then encounters an impossible operation. Runtime errors produce tracebacks showing exactly where the error occurred and what type of error happened.

Logical errors (semantic errors) occur when code runs successfully but produces wrong results. Python can’t detect these because the code is valid—it just doesn’t do what you intended. These are the hardest bugs to find and fix because there’s no error message. You must recognize that results are wrong and trace through logic to find mistakes.

Python

# Syntax Error Example
def calculate_average(numbers)  # Missing colon
    return sum(numbers) / len(numbers)

# This code won't run at all. Python immediately says:
# SyntaxError: expected ':'

# Runtime Error Example
def calculate_average(numbers):
    return sum(numbers) / len(numbers)

calculate_average([])  # Division by zero - empty list
# This runs until it tries to divide by zero, then:
# ZeroDivisionError: division by zero

# Logical Error Example
def calculate_average(numbers):
    return sum(numbers) / len(numbers)

prices = [10, 20, 30]
average_price = calculate_average(prices)
total_cost = average_price  # Should multiply by quantity!

# This runs fine, but produces wrong business results
# No error - just wrong calculation

# Syntax Error Example
def calculate_average(numbers)  # Missing colon
    return sum(numbers) / len(numbers)

# This code won't run at all. Python immediately says:
# SyntaxError: expected ':'

# Runtime Error Example
def calculate_average(numbers):
    return sum(numbers) / len(numbers)

calculate_average([])  # Division by zero - empty list
# This runs until it tries to divide by zero, then:
# ZeroDivisionError: division by zero

# Logical Error Example
def calculate_average(numbers):
    return sum(numbers) / len(numbers)

prices = [10, 20, 30]
average_price = calculate_average(prices)
total_cost = average_price  # Should multiply by quantity!

# This runs fine, but produces wrong business results
# No error - just wrong calculation

What this demonstrates: Different bugs require different approaches. Syntax errors: read the error message and fix the indicated location. Runtime errors: examine the traceback to see what operation failed. Logical errors: test your code with known inputs and verify outputs match expectations.

Why Bugs Happen

Understanding why bugs occur helps you prevent them and recognize patterns:

Typos and mistakes: Simple typing errors create bugs. Variable names spelled wrong, methods called incorrectly, missing characters. These are inevitable but usually easy to fix once found.

Misunderstood requirements: You implement something correctly but it’s not what was needed. The code works as written but doesn’t solve the actual problem.

Edge cases: Code works for typical inputs but fails on unusual ones—empty lists, zero values, None, very large numbers. Forgetting to handle edge cases causes many bugs.

Incorrect assumptions: You assume something about your data or code that isn’t true. You assume all customers have email addresses, but some don’t. You assume the API always returns data, but sometimes it returns errors. Violated assumptions cause unexpected failures.

Copy-paste errors: You copy code and forget to change all relevant parts. Variables, file names, or logic remain from the original context where they don’t belong.

Type confusion: Python’s dynamic typing means variables can hold any type. You assume a variable contains a list but it’s actually a string, causing errors when you try to use list methods.

Off-by-one errors: Indexing mistakes are common. You iterate over range(10) expecting 1-10 but get 0-9. You slice [1:5] expecting 5 elements but get 4.

Recognizing these patterns helps you quickly identify likely causes when debugging.

Reading Error Messages: Python Tells You What’s Wrong

Python’s error messages contain valuable debugging information, but beginners often ignore them in panic. Learning to read error messages carefully and systematically is one of the most important debugging skills.

Anatomy of a Traceback

When Python encounters a runtime error, it produces a traceback showing the call stack—the sequence of function calls that led to the error:

Python

# Example code with error
def process_customer_data(customers):
    """Process customer purchase data."""
    for customer in customers:
        analyze_purchases(customer)

def analyze_purchases(customer):
    """Analyze customer purchases."""
    total = calculate_total(customer['purchases'])
    print(f"Customer {customer['name']} total: ${total}")

def calculate_total(purchases):
    """Calculate total purchase amount."""
    return sum(purchase['amount'] for purchase in purchases)

# Data with problem
customers = [
    {'name': 'Alice', 'purchases': [{'amount': 50}, {'amount': 75}]},
    {'name': 'Bob', 'purchases': [{'amount': 100}]},
    {'name': 'Charlie'}  # Missing 'purchases' key!
]

# Run the code
process_customer_data(customers)

# Example code with error
def process_customer_data(customers):
    """Process customer purchase data."""
    for customer in customers:
        analyze_purchases(customer)

def analyze_purchases(customer):
    """Analyze customer purchases."""
    total = calculate_total(customer['purchases'])
    print(f"Customer {customer['name']} total: ${total}")

def calculate_total(purchases):
    """Calculate total purchase amount."""
    return sum(purchase['amount'] for purchase in purchases)

# Data with problem
customers = [
    {'name': 'Alice', 'purchases': [{'amount': 50}, {'amount': 75}]},
    {'name': 'Bob', 'purchases': [{'amount': 100}]},
    {'name': 'Charlie'}  # Missing 'purchases' key!
]

# Run the code
process_customer_data(customers)

This produces:

Python

Traceback (most recent call last):
  File "example.py", line 21, in <module>
    process_customer_data(customers)
  File "example.py", line 4, in process_customer_data
    analyze_purchases(customer)
  File "example.py", line 8, in analyze_purchases
    total = calculate_total(customer['purchases'])
KeyError: 'purchases'

Traceback (most recent call last):
  File "example.py", line 21, in <module>
    process_customer_data(customers)
  File "example.py", line 4, in process_customer_data
    analyze_purchases(customer)
  File "example.py", line 8, in analyze_purchases
    total = calculate_total(customer['purchases'])
KeyError: 'purchases'

How to read this traceback:

The bottom line shows the error type (KeyError) and message ('purchases'). This is the most important information—what went wrong.

Lines above show the call stack in chronological order (oldest to newest). Each line shows: the file name (example.py), line number (line 21), function name (<module> means top-level code, not in a function), and the actual code on that line.

Reading from bottom to top (newest to oldest):

The error occurred at line 8: customer['purchases'] tried to access a key that doesn’t exist
This happened inside analyze_purchases() which was called at line 4
Which was called from process_customer_data() at line 21
Which was called from top-level code

What this tells you: The error is a KeyError for 'purchases', meaning a dictionary was missing that key. Looking at the stack, the problem is in analyze_purchases() trying to access customer['purchases']. The bug isn’t in calculate_total()—that function never ran. The problem is earlier, when accessing the dictionary.

The fix: Either ensure all customers have a 'purchases' key, or check before accessing:

Python

def analyze_purchases(customer):
    """Analyze customer purchases."""
    if 'purchases' not in customer:
        print(f"Warning: Customer {customer.get('name', 'Unknown')} has no purchases")
        return
    
    total = calculate_total(customer['purchases'])
    print(f"Customer {customer['name']} total: ${total}")

def analyze_purchases(customer):
    """Analyze customer purchases."""
    if 'purchases' not in customer:
        print(f"Warning: Customer {customer.get('name', 'Unknown')} has no purchases")
        return
    
    total = calculate_total(customer['purchases'])
    print(f"Customer {customer['name']} total: ${total}")

Common Error Types and What They Mean

Understanding common error types helps you quickly identify problems:

SyntaxError: Python can’t parse your code. Look at the line indicated and the line above—often the problem is a missing closing parenthesis or quote.

IndentationError: Inconsistent indentation. Python is strict about indentation—mixing tabs and spaces or incorrect indentation levels cause this.

NameError: You’re using a variable that doesn’t exist. Either it’s undefined, or you spelled it wrong, or it’s defined later in code.

TypeError: Operation on incompatible types. Adding string and integer, calling non-callable objects, wrong number of function arguments.

ValueError: Correct type but inappropriate value. Converting ‘abc’ to integer, unpacking wrong number of values.

KeyError: Accessing nonexistent dictionary key. Either key doesn’t exist, or you spelled it wrong.

IndexError: Accessing list/tuple index that doesn’t exist. Usually trying to access beyond the end of a sequence.

AttributeError: Accessing nonexistent attribute or method. Either object doesn’t have that attribute, or object is None when you expected something else.

ZeroDivisionError: Dividing by zero. Usually indicates missing validation of denominators.

FileNotFoundError: Opening file that doesn’t exist. Check file path and working directory.

Python

# Examples of common errors with explanations

# NameError - using undefined variable
print(total_price)  # NameError: name 'total_price' is not defined
# Fix: Define variable before using it

# TypeError - incompatible types
result = "5" + 3  # TypeError: can only concatenate str to str
# Fix: result = int("5") + 3  or  result = "5" + str(3)

# ValueError - wrong value for type
age = int("twenty")  # ValueError: invalid literal for int() with base 10
# Fix: Validate input before converting

# KeyError - missing dictionary key
person = {'name': 'Alice', 'age': 30}
email = person['email']  # KeyError: 'email'
# Fix: email = person.get('email', 'no-email@example.com')

# IndexError - index out of range
numbers = [1, 2, 3]
value = numbers[5]  # IndexError: list index out of range
# Fix: Check length first or use try-except

# AttributeError - wrong attribute
value = None
length = value.lower()  # AttributeError: 'NoneType' object has no attribute 'lower'
# Fix: Check if value is not None before calling methods

# ZeroDivisionError - division by zero
average = total / count  # ZeroDivisionError: division by zero
# Fix: Check if count > 0 before dividing

# Examples of common errors with explanations

# NameError - using undefined variable
print(total_price)  # NameError: name 'total_price' is not defined
# Fix: Define variable before using it

# TypeError - incompatible types
result = "5" + 3  # TypeError: can only concatenate str to str
# Fix: result = int("5") + 3  or  result = "5" + str(3)

# ValueError - wrong value for type
age = int("twenty")  # ValueError: invalid literal for int() with base 10
# Fix: Validate input before converting

# KeyError - missing dictionary key
person = {'name': 'Alice', 'age': 30}
email = person['email']  # KeyError: 'email'
# Fix: email = person.get('email', 'no-email@example.com')

# IndexError - index out of range
numbers = [1, 2, 3]
value = numbers[5]  # IndexError: list index out of range
# Fix: Check length first or use try-except

# AttributeError - wrong attribute
value = None
length = value.lower()  # AttributeError: 'NoneType' object has no attribute 'lower'
# Fix: Check if value is not None before calling methods

# ZeroDivisionError - division by zero
average = total / count  # ZeroDivisionError: division by zero
# Fix: Check if count > 0 before dividing

What this demonstrates: Each error type has characteristic causes and fixes. Reading the error type immediately suggests likely problems. The error message provides specifics about what went wrong. Together, they point you toward solutions.

Debugging Techniques: From Print Statements to Debuggers

Effective debugging requires a toolkit of techniques. Start with simple approaches and progress to sophisticated tools as needed.

Technique 1: Print Debugging

Print debugging is simple but effective. Add print statements to see what’s happening:

Python

# Original code with bug
def calculate_discount(price, customer_type):
    """Calculate discount based on customer type."""
    if customer_type == 'regular':
        discount = price * 0.1
    elif customer_type == 'premium':
        discount = price * 0.2
    elif customer_type == 'vip':
        discount = price * 0.3
    
    final_price = price - discount
    return final_price

# Bug: Returns wrong value for some inputs
result = calculate_discount(100, 'gold')
print(result)  # UnboundLocalError: local variable 'discount' referenced before assignment

# Add print debugging
def calculate_discount(price, customer_type):
    """Calculate discount based on customer type."""
    print(f"DEBUG: price={price}, customer_type={customer_type}")
    
    if customer_type == 'regular':
        discount = price * 0.1
        print(f"DEBUG: regular customer, discount={discount}")
    elif customer_type == 'premium':
        discount = price * 0.2
        print(f"DEBUG: premium customer, discount={discount}")
    elif customer_type == 'vip':
        discount = price * 0.3
        print(f"DEBUG: vip customer, discount={discount}")
    
    print(f"DEBUG: about to calculate final price with discount={discount}")
    final_price = price - discount
    return final_price

# Now when we run it, we see:
# DEBUG: price=100, customer_type=gold
# DEBUG: about to calculate final price with discount=discount
# ERROR: discount is never set because 'gold' doesn't match any condition!

# Fix: Add default case
def calculate_discount(price, customer_type):
    """Calculate discount based on customer type."""
    if customer_type == 'regular':
        discount = price * 0.1
    elif customer_type == 'premium':
        discount = price * 0.2
    elif customer_type == 'vip':
        discount = price * 0.3
    else:
        discount = 0  # Default: no discount
        print(f"Warning: Unknown customer type '{customer_type}', no discount applied")
    
    final_price = price - discount
    return final_price

# Original code with bug
def calculate_discount(price, customer_type):
    """Calculate discount based on customer type."""
    if customer_type == 'regular':
        discount = price * 0.1
    elif customer_type == 'premium':
        discount = price * 0.2
    elif customer_type == 'vip':
        discount = price * 0.3
    
    final_price = price - discount
    return final_price

# Bug: Returns wrong value for some inputs
result = calculate_discount(100, 'gold')
print(result)  # UnboundLocalError: local variable 'discount' referenced before assignment

# Add print debugging
def calculate_discount(price, customer_type):
    """Calculate discount based on customer type."""
    print(f"DEBUG: price={price}, customer_type={customer_type}")
    
    if customer_type == 'regular':
        discount = price * 0.1
        print(f"DEBUG: regular customer, discount={discount}")
    elif customer_type == 'premium':
        discount = price * 0.2
        print(f"DEBUG: premium customer, discount={discount}")
    elif customer_type == 'vip':
        discount = price * 0.3
        print(f"DEBUG: vip customer, discount={discount}")
    
    print(f"DEBUG: about to calculate final price with discount={discount}")
    final_price = price - discount
    return final_price

# Now when we run it, we see:
# DEBUG: price=100, customer_type=gold
# DEBUG: about to calculate final price with discount=discount
# ERROR: discount is never set because 'gold' doesn't match any condition!

# Fix: Add default case
def calculate_discount(price, customer_type):
    """Calculate discount based on customer type."""
    if customer_type == 'regular':
        discount = price * 0.1
    elif customer_type == 'premium':
        discount = price * 0.2
    elif customer_type == 'vip':
        discount = price * 0.3
    else:
        discount = 0  # Default: no discount
        print(f"Warning: Unknown customer type '{customer_type}', no discount applied")
    
    final_price = price - discount
    return final_price

Print debugging best practices:

Add DEBUG: prefix to distinguish from normal output
Print variable names and values together: f"variable_name={value}"
Print before and after important operations
Print at the beginning of functions to verify they’re called
Remove or comment out debug prints once fixed

Technique 2: Assertions

Assertions check assumptions and fail fast when violated:

Python

def calculate_average(numbers):
    """Calculate average of numbers."""
    # Assert preconditions
    assert isinstance(numbers, list), "numbers must be a list"
    assert len(numbers) > 0, "numbers list cannot be empty"
    assert all(isinstance(n, (int, float)) for n in numbers), "all elements must be numeric"
    
    total = sum(numbers)
    count = len(numbers)
    average = total / count
    
    # Assert postconditions
    assert isinstance(average, (int, float)), "average must be numeric"
    
    return average

# Assertions catch problems early
try:
    result = calculate_average([])  # AssertionError: numbers list cannot be empty
except AssertionError as e:
    print(f"Assertion failed: {e}")

try:
    result = calculate_average([1, 2, 'three'])  # AssertionError: all elements must be numeric
except AssertionError as e:
    print(f"Assertion failed: {e}")

def calculate_average(numbers):
    """Calculate average of numbers."""
    # Assert preconditions
    assert isinstance(numbers, list), "numbers must be a list"
    assert len(numbers) > 0, "numbers list cannot be empty"
    assert all(isinstance(n, (int, float)) for n in numbers), "all elements must be numeric"
    
    total = sum(numbers)
    count = len(numbers)
    average = total / count
    
    # Assert postconditions
    assert isinstance(average, (int, float)), "average must be numeric"
    
    return average

# Assertions catch problems early
try:
    result = calculate_average([])  # AssertionError: numbers list cannot be empty
except AssertionError as e:
    print(f"Assertion failed: {e}")

try:
    result = calculate_average([1, 2, 'three'])  # AssertionError: all elements must be numeric
except AssertionError as e:
    print(f"Assertion failed: {e}")

When to use assertions:

Check function preconditions (inputs)
Check function postconditions (outputs)
Verify invariants that should always be true
Catch logic errors during development

Note: Assertions can be disabled in production (python -O), so don’t use them for input validation in production code. Use them for catching bugs during development.

Technique 3: Using the Python Debugger (pdb)

The Python debugger lets you pause execution, inspect variables, and step through code:

Python

import pdb

def analyze_data(data):
    """Analyze customer data."""
    total = 0
    count = 0
    
    for item in data:
        # Set breakpoint here
        pdb.set_trace()  # Execution pauses, interactive debugger starts
        
        if item['valid']:
            total += item['value']
            count += 1
    
    average = total / count
    return average

data = [
    {'valid': True, 'value': 100},
    {'valid': False, 'value': 50},
    {'valid': True, 'value': 150}
]

result = analyze_data(data)

import pdb

def analyze_data(data):
    """Analyze customer data."""
    total = 0
    count = 0
    
    for item in data:
        # Set breakpoint here
        pdb.set_trace()  # Execution pauses, interactive debugger starts
        
        if item['valid']:
            total += item['value']
            count += 1
    
    average = total / count
    return average

data = [
    {'valid': True, 'value': 100},
    {'valid': False, 'value': 50},
    {'valid': True, 'value': 150}
]

result = analyze_data(data)

Common pdb commands:

n (next): Execute current line, move to next
s (step): Step into function calls
c (continue): Continue execution until next breakpoint
p variable (print): Print variable value
pp variable (pretty print): Print variable with formatting
l (list): Show surrounding code
w (where): Show call stack
q (quit): Exit debugger

Using pdb effectively:

Set breakpoint with pdb.set_trace() before suspected problem area
When debugger starts, use l to see where you are
Use p to inspect variables
Use n to step through code line by line
Watch variables change to identify where values become wrong

Technique 4: Try-Except for Specific Error Handling

Catch specific errors to handle them gracefully or get more information:

Python

def safe_divide(a, b):
    """Divide a by b with error handling."""
    try:
        result = a / b
        return result
    except ZeroDivisionError:
        print(f"Error: Cannot divide {a} by zero")
        return None
    except TypeError as e:
        print(f"Error: Invalid types for division: {type(a)} and {type(b)}")
        print(f"  Details: {e}")
        return None

# Usage
print(safe_divide(10, 2))    # Works: 5.0
print(safe_divide(10, 0))    # Handles error: None
print(safe_divide(10, '2'))  # Handles error: None

def safe_divide(a, b):
    """Divide a by b with error handling."""
    try:
        result = a / b
        return result
    except ZeroDivisionError:
        print(f"Error: Cannot divide {a} by zero")
        return None
    except TypeError as e:
        print(f"Error: Invalid types for division: {type(a)} and {type(b)}")
        print(f"  Details: {e}")
        return None

# Usage
print(safe_divide(10, 2))    # Works: 5.0
print(safe_divide(10, 0))    # Handles error: None
print(safe_divide(10, '2'))  # Handles error: None

Try-except best practices:

Catch specific exceptions, not bare except:
Only catch exceptions you can handle meaningfully
Log or print error information for debugging
Re-raise exceptions you can’t handle: raise
Use finally: for cleanup that must always happen

Technique 5: Logging for Production Debugging

Logging provides persistent debugging information without cluttering code with prints:

Python

import logging

# Configure logging
logging.basicConfig(
    level=logging.DEBUG,
    format='%(asctime)s - %(name)s - %(levelname)s - %(message)s',
    filename='analysis.log'
)

logger = logging.getLogger(__name__)

def process_batch(data_batch):
    """Process a batch of data."""
    logger.info(f"Processing batch of {len(data_batch)} items")
    
    processed = 0
    errors = 0
    
    for item in data_batch:
        try:
            result = process_item(item)
            processed += 1
            logger.debug(f"Processed item {item['id']}: {result}")
        except Exception as e:
            errors += 1
            logger.error(f"Error processing item {item['id']}: {e}", exc_info=True)
    
    logger.info(f"Batch complete: {processed} processed, {errors} errors")
    return processed, errors

def process_item(item):
    """Process single item."""
    # Processing logic
    return item['value'] * 2

import logging

# Configure logging
logging.basicConfig(
    level=logging.DEBUG,
    format='%(asctime)s - %(name)s - %(levelname)s - %(message)s',
    filename='analysis.log'
)

logger = logging.getLogger(__name__)

def process_batch(data_batch):
    """Process a batch of data."""
    logger.info(f"Processing batch of {len(data_batch)} items")
    
    processed = 0
    errors = 0
    
    for item in data_batch:
        try:
            result = process_item(item)
            processed += 1
            logger.debug(f"Processed item {item['id']}: {result}")
        except Exception as e:
            errors += 1
            logger.error(f"Error processing item {item['id']}: {e}", exc_info=True)
    
    logger.info(f"Batch complete: {processed} processed, {errors} errors")
    return processed, errors

def process_item(item):
    """Process single item."""
    # Processing logic
    return item['value'] * 2

Logging levels (lowest to highest severity):

DEBUG: Detailed information for diagnosing problems
INFO: Confirmation that things are working as expected
WARNING: Something unexpected but not critical
ERROR: More serious problem, function couldn’t complete
CRITICAL: Very serious error, program may not continue

Why use logging:

Output goes to file, doesn’t clutter console
Can change verbosity without changing code
Includes timestamps and context automatically
Can log to multiple destinations
Standard tool for production debugging

Common Python Errors and How to Fix Them

Understanding common errors and their typical causes helps you debug faster.

Error 1: NameError

Cause: Using undefined variable, wrong spelling, or using before definition.

Python

# Problem
print(user_name)  # NameError: name 'user_name' is not defined

# Solution 1: Define variable first
user_name = "Alice"
print(user_name)

# Solution 2: Check spelling
username = "Alice"
print(username)  # Was trying to use 'user_name' but defined 'username'

# Solution 3: Check scope
def greet():
    name = "Alice"

greet()
print(name)  # NameError: 'name' is local to greet()

# Fix: Return value or use different scope
def greet():
    return "Alice"

name = greet()
print(name)

# Problem
print(user_name)  # NameError: name 'user_name' is not defined

# Solution 1: Define variable first
user_name = "Alice"
print(user_name)

# Solution 2: Check spelling
username = "Alice"
print(username)  # Was trying to use 'user_name' but defined 'username'

# Solution 3: Check scope
def greet():
    name = "Alice"

greet()
print(name)  # NameError: 'name' is local to greet()

# Fix: Return value or use different scope
def greet():
    return "Alice"

name = greet()
print(name)

Error 2: TypeError with None

Cause: Trying to use None as if it were another type.

Python

# Problem
def get_user_email(user_id):
    """Get user email from database."""
    if user_id == 1:
        return "alice@example.com"
    # Implicitly returns None for other IDs

email = get_user_email(2)
print(email.lower())  # AttributeError: 'NoneType' object has no attribute 'lower'

# Solution 1: Always return appropriate value
def get_user_email(user_id):
    """Get user email from database."""
    if user_id == 1:
        return "alice@example.com"
    return "unknown@example.com"  # Default value instead of None

# Solution 2: Check for None before using
email = get_user_email(2)
if email is not None:
    print(email.lower())
else:
    print("No email found")

# Solution 3: Use default
email = get_user_email(2)
email = email or "unknown@example.com"
print(email.lower())

# Problem
def get_user_email(user_id):
    """Get user email from database."""
    if user_id == 1:
        return "alice@example.com"
    # Implicitly returns None for other IDs

email = get_user_email(2)
print(email.lower())  # AttributeError: 'NoneType' object has no attribute 'lower'

# Solution 1: Always return appropriate value
def get_user_email(user_id):
    """Get user email from database."""
    if user_id == 1:
        return "alice@example.com"
    return "unknown@example.com"  # Default value instead of None

# Solution 2: Check for None before using
email = get_user_email(2)
if email is not None:
    print(email.lower())
else:
    print("No email found")

# Solution 3: Use default
email = get_user_email(2)
email = email or "unknown@example.com"
print(email.lower())

Error 3: List Index Out of Range

Cause: Accessing index beyond list length.

Python

# Problem
scores = [85, 92, 78]
print(scores[3])  # IndexError: list index out of range (indices are 0, 1, 2)

# Solution 1: Check length first
if len(scores) > 3:
    print(scores[3])
else:
    print("Index 3 doesn't exist")

# Solution 2: Use try-except
try:
    print(scores[3])
except IndexError:
    print("Index out of range")

# Solution 3: Use negative indexing for end
print(scores[-1])  # Last element: 78

# Solution 4: Iterate instead of indexing
for score in scores:
    print(score)

# Problem
scores = [85, 92, 78]
print(scores[3])  # IndexError: list index out of range (indices are 0, 1, 2)

# Solution 1: Check length first
if len(scores) > 3:
    print(scores[3])
else:
    print("Index 3 doesn't exist")

# Solution 2: Use try-except
try:
    print(scores[3])
except IndexError:
    print("Index out of range")

# Solution 3: Use negative indexing for end
print(scores[-1])  # Last element: 78

# Solution 4: Iterate instead of indexing
for score in scores:
    print(score)

Error 4: Dictionary KeyError

Cause: Accessing key that doesn’t exist.

Python

# Problem
person = {'name': 'Alice', 'age': 30}
print(person['email'])  # KeyError: 'email'

# Solution 1: Use .get() with default
email = person.get('email', 'no-email@example.com')
print(email)

# Solution 2: Check if key exists
if 'email' in person:
    print(person['email'])
else:
    print("No email in record")

# Solution 3: Use try-except
try:
    print(person['email'])
except KeyError:
    print("Email key doesn't exist")

# Solution 4: Use defaultdict for counters/collectors
from collections import defaultdict
word_counts = defaultdict(int)
for word in ['apple', 'banana', 'apple']:
    word_counts[word] += 1  # No KeyError even on first access

# Problem
person = {'name': 'Alice', 'age': 30}
print(person['email'])  # KeyError: 'email'

# Solution 1: Use .get() with default
email = person.get('email', 'no-email@example.com')
print(email)

# Solution 2: Check if key exists
if 'email' in person:
    print(person['email'])
else:
    print("No email in record")

# Solution 3: Use try-except
try:
    print(person['email'])
except KeyError:
    print("Email key doesn't exist")

# Solution 4: Use defaultdict for counters/collectors
from collections import defaultdict
word_counts = defaultdict(int)
for word in ['apple', 'banana', 'apple']:
    word_counts[word] += 1  # No KeyError even on first access

Error 5: Off-by-One Errors

Cause: Incorrect range or slice boundaries.

Python

# Problem: Want first 10 items
items = list(range(100))
first_ten = items[1:10]  # Gets items 1-9, not 0-9!
print(len(first_ten))  # 9, not 10

# Solution: Remember Python uses 0-indexing and exclusive end
first_ten = items[0:10]  # Or items[:10]
print(len(first_ten))  # 10

# Problem: Want to iterate 10 times
for i in range(1, 10):  # Runs 9 times (1-9)
    print(i)

# Solution: Range end is exclusive
for i in range(1, 11):  # Runs 10 times (1-10)
    print(i)

# Or start from 0
for i in range(10):  # Runs 10 times (0-9)
    print(i)

# Problem: Want first 10 items
items = list(range(100))
first_ten = items[1:10]  # Gets items 1-9, not 0-9!
print(len(first_ten))  # 9, not 10

# Solution: Remember Python uses 0-indexing and exclusive end
first_ten = items[0:10]  # Or items[:10]
print(len(first_ten))  # 10

# Problem: Want to iterate 10 times
for i in range(1, 10):  # Runs 9 times (1-9)
    print(i)

# Solution: Range end is exclusive
for i in range(1, 11):  # Runs 10 times (1-10)
    print(i)

# Or start from 0
for i in range(10):  # Runs 10 times (0-9)
    print(i)

Debugging Machine Learning Code: Special Challenges

Machine learning code has unique debugging challenges because bugs often don’t cause errors—they just produce wrong results.

Challenge 1: Model Not Learning

Symptoms: Loss doesn’t decrease, accuracy doesn’t improve, predictions are random.

Common causes and solutions:

Python

# Problem 1: Learning rate too high
# Model diverges instead of converging
optimizer = Adam(learning_rate=1.0)  # Too high!

# Solution: Use smaller learning rate
optimizer = Adam(learning_rate=0.001)  # Typical starting point

# Problem 2: Learning rate too low
# Model learns extremely slowly
optimizer = Adam(learning_rate=0.0000001)  # Too low!

# Solution: Increase learning rate
optimizer = Adam(learning_rate=0.001)

# Problem 3: Labels not matching model output format
# Binary classification but using categorical loss
y_true = [0, 1, 0, 1]  # Binary labels
model.compile(loss='categorical_crossentropy')  # Wrong! Expects one-hot

# Solution: Match loss to label format
model.compile(loss='binary_crossentropy')  # Correct for binary

# Problem 4: Features not normalized
# Different feature scales cause training instability
X = np.array([[1, 1000], [2, 2000], [3, 3000]])  # Very different scales

# Solution: Normalize features
from sklearn.preprocessing import StandardScaler
scaler = StandardScaler()
X_scaled = scaler.fit_transform(X)

# Problem 5: Forgot to shuffle data
# Model sees all class 0, then all class 1, learns wrong pattern
X = np.concatenate([class_0_data, class_1_data])

# Solution: Shuffle before training
indices = np.random.permutation(len(X))
X_shuffled = X[indices]
y_shuffled = y[indices]

# Problem 1: Learning rate too high
# Model diverges instead of converging
optimizer = Adam(learning_rate=1.0)  # Too high!

# Solution: Use smaller learning rate
optimizer = Adam(learning_rate=0.001)  # Typical starting point

# Problem 2: Learning rate too low
# Model learns extremely slowly
optimizer = Adam(learning_rate=0.0000001)  # Too low!

# Solution: Increase learning rate
optimizer = Adam(learning_rate=0.001)

# Problem 3: Labels not matching model output format
# Binary classification but using categorical loss
y_true = [0, 1, 0, 1]  # Binary labels
model.compile(loss='categorical_crossentropy')  # Wrong! Expects one-hot

# Solution: Match loss to label format
model.compile(loss='binary_crossentropy')  # Correct for binary

# Problem 4: Features not normalized
# Different feature scales cause training instability
X = np.array([[1, 1000], [2, 2000], [3, 3000]])  # Very different scales

# Solution: Normalize features
from sklearn.preprocessing import StandardScaler
scaler = StandardScaler()
X_scaled = scaler.fit_transform(X)

# Problem 5: Forgot to shuffle data
# Model sees all class 0, then all class 1, learns wrong pattern
X = np.concatenate([class_0_data, class_1_data])

# Solution: Shuffle before training
indices = np.random.permutation(len(X))
X_shuffled = X[indices]
y_shuffled = y[indices]

Challenge 2: Suspiciously High Accuracy

Symptoms: 99%+ accuracy on complex problems, perfect train accuracy.

Common causes:

Python

# Problem 1: Data leakage - test data in training
X_train = data[:800]
X_test = data[200:400]  # Overlaps with training! Leakage!

# Solution: Use proper split
X_train = data[:800]
X_test = data[800:]  # No overlap

# Problem 2: Target variable in features
# Accidentally included the answer in input features
features = ['age', 'income', 'ACTUAL_CHURN_STATUS', 'purchases']
X = df[features]
y = df['ACTUAL_CHURN_STATUS']  # Same column in X and y!

# Solution: Remove target from features
features = ['age', 'income', 'purchases']
X = df[features]
y = df['ACTUAL_CHURN_STATUS']

# Problem 3: Using test data for preprocessing
# Fit scaler on ALL data, information leakage
scaler = StandardScaler()
X_scaled = scaler.fit_transform(X)  # Fit on all data!
X_train, X_test = X_scaled[:800], X_scaled[800:]

# Solution: Fit only on training data
X_train, X_test = X[:800], X[800:]
scaler = StandardScaler()
X_train_scaled = scaler.fit_transform(X_train)  # Fit only on train
X_test_scaled = scaler.transform(X_test)  # Transform only, no fit

# Problem 1: Data leakage - test data in training
X_train = data[:800]
X_test = data[200:400]  # Overlaps with training! Leakage!

# Solution: Use proper split
X_train = data[:800]
X_test = data[800:]  # No overlap

# Problem 2: Target variable in features
# Accidentally included the answer in input features
features = ['age', 'income', 'ACTUAL_CHURN_STATUS', 'purchases']
X = df[features]
y = df['ACTUAL_CHURN_STATUS']  # Same column in X and y!

# Solution: Remove target from features
features = ['age', 'income', 'purchases']
X = df[features]
y = df['ACTUAL_CHURN_STATUS']

# Problem 3: Using test data for preprocessing
# Fit scaler on ALL data, information leakage
scaler = StandardScaler()
X_scaled = scaler.fit_transform(X)  # Fit on all data!
X_train, X_test = X_scaled[:800], X_scaled[800:]

# Solution: Fit only on training data
X_train, X_test = X[:800], X[800:]
scaler = StandardScaler()
X_train_scaled = scaler.fit_transform(X_train)  # Fit only on train
X_test_scaled = scaler.transform(X_test)  # Transform only, no fit

Challenge 3: Shape Mismatches

Symptoms: Errors about incompatible dimensions, broadcasting failures.

Debugging approach:

Python

# Add shape printing to track dimensions
def debug_model():
    print("Loading data...")
    X = load_data()
    print(f"X shape: {X.shape}")
    
    print("Splitting data...")
    X_train, X_test, y_train, y_test = train_test_split(X, y)
    print(f"X_train shape: {X_train.shape}")
    print(f"y_train shape: {y_train.shape}")
    
    print("Training model...")
    model.fit(X_train, y_train)
    
    print("Making predictions...")
    predictions = model.predict(X_test)
    print(f"Predictions shape: {predictions.shape}")
    
    # If error occurs, you know which step and what shapes were involved

# Add shape printing to track dimensions
def debug_model():
    print("Loading data...")
    X = load_data()
    print(f"X shape: {X.shape}")
    
    print("Splitting data...")
    X_train, X_test, y_train, y_test = train_test_split(X, y)
    print(f"X_train shape: {X_train.shape}")
    print(f"y_train shape: {y_train.shape}")
    
    print("Training model...")
    model.fit(X_train, y_train)
    
    print("Making predictions...")
    predictions = model.predict(X_test)
    print(f"Predictions shape: {predictions.shape}")
    
    # If error occurs, you know which step and what shapes were involved

Conclusion: Systematic Debugging as a Professional Skill

Debugging is not about memorizing fixes for specific errors—it’s about developing a systematic, scientific approach to finding and fixing problems. The skills covered in this guide—reading error messages carefully, using appropriate debugging techniques for different situations, recognizing common error patterns, and applying machine learning-specific debugging strategies—transform debugging from frustrating guesswork into methodical problem-solving.

The most important lesson is to slow down when debugging. Resist the urge to randomly change things hoping the problem disappears. Instead, read error messages completely. Form hypotheses about what’s wrong. Test those hypotheses systematically. Fix root causes, not symptoms. This disciplined approach works across all programming problems.

As you gain experience, you’ll recognize error patterns instantly. You’ll know that NoneType errors usually mean a function returned None unexpectedly. You’ll recognize KeyErrors mean missing dictionary keys. You’ll see shape mismatch errors and immediately check dimensions. This pattern recognition comes from seeing errors repeatedly and understanding their causes deeply.

Build your debugging toolkit gradually. Start with print debugging for simple problems. Add assertions to catch assumption violations. Learn to use pdb for stepping through complex logic. Implement logging for production code. Choose appropriate tools for each situation. Simple problems don’t need sophisticated debuggers; complex problems benefit from powerful tools.

Remember that every bug you fix teaches you something. Each debugging session builds pattern recognition and intuition. Each error message you decipher makes future messages clearer. Each fixed bug makes you a better programmer. Embrace debugging as a learning opportunity rather than a frustrating chore.

The mark of an experienced programmer isn’t writing bug-free code—it’s debugging efficiently when bugs inevitably occur. Develop systematic approaches, use appropriate tools, stay calm and methodical, and you’ll find that debugging becomes one of your strongest skills rather than a source of frustration.

0 Comments

Inline Feedbacks

View all comments

Discover More

Operator Overloading in C++: Making Your Classes Intuitive

Click For More

Search Techietory

Debugging Python Code: Tips for AI Beginners

Introduction: Debugging is a Core Skill, Not an Afterthought

Understanding Bugs: Types and Causes

The Three Categories of Bugs

Why Bugs Happen

Reading Error Messages: Python Tells You What’s Wrong

Anatomy of a Traceback

Common Error Types and What They Mean

Debugging Techniques: From Print Statements to Debuggers

Technique 1: Print Debugging

Technique 2: Assertions

Technique 3: Using the Python Debugger (pdb)

Technique 4: Try-Except for Specific Error Handling

Technique 5: Logging for Production Debugging

Common Python Errors and How to Fix Them

Error 1: NameError

Error 2: TypeError with None

Error 3: List Index Out of Range

Error 4: Dictionary KeyError

Error 5: Off-by-One Errors

Debugging Machine Learning Code: Special Challenges

Challenge 1: Model Not Learning

Challenge 2: Suspiciously High Accuracy

Challenge 3: Shape Mismatches

Conclusion: Systematic Debugging as a Professional Skill

Discover More

Operator Overloading in C++: Making Your Classes Intuitive

What Is a Control Loop and Why Does Every Robot Have One?

Working with std::set and std::unordered_set

Linear Regression: Your First Machine Learning Algorithm

The Purpose of Bypass Capacitors in Power Supply Filtering

Understanding Variables and Data Types in C++