david.dev - AI x Ruby on Rails

Every Rails application eventually faces the same problem: users enter data inconsistently. Email addresses arrive with trailing spaces, phone numbers come with random formatting, and names appear in mixed case. The instinct is to scatter strip and downcase calls throughout controllers and callbacks, but Rails 8 offers a cleaner solution with built-in attribute normalization.

The Problem with Scattered Normalization

Before exploring the solution, consider what typically happens without a systematic approach. Normalization logic ends up in before_validation callbacks, controller strong parameters, form objects, and even view helpers. When the business rule changes—say, email addresses should preserve case for display but compare case-insensitively—tracking down every transformation becomes a nightmare.

Rails 8 addresses this with the normalizes class method, providing a single declarative location for all attribute transformations.

Basic Normalization Patterns

The normalizes method accepts an attribute name (or multiple names) and a block that transforms the value. The block runs automatically when the attribute is assigned, not just on save.

# app/models/user.rb
class User < ApplicationRecord
  normalizes :email, with: ->(email) { email.strip.downcase }
  normalizes :first_name, :last_name, with: ->(name) { name.strip.titleize }
  normalizes :phone, with: ->(phone) { phone.gsub(/\D/, '') }
end

With this configuration, normalization happens immediately on assignment:

# In rails console or anywhere in the application
user = User.new
user.email = "  [email protected]  "
user.email  # => "[email protected]"

user.first_name = "  jane  "
user.first_name  # => "Jane"

user.phone = "(555) 123-4567"
user.phone  # => "5551234567"

This immediate normalization means forms display the normalized value on validation failures, and comparisons work correctly without explicit transformation.

Handling Nil and Blank Values

By default, Rails skips normalization for nil values, which prevents NoMethodError exceptions. However, blank strings require explicit handling. The apply_to_nil option changes this behavior when needed.

# app/models/contact.rb
class Contact < ApplicationRecord
  # Convert blank strings to nil for cleaner database storage
  normalizes :middle_name, with: ->(value) { value.presence }
  
  # Apply normalization even to nil values (use carefully)
  normalizes :status, with: ->(value) { value || 'pending' }, apply_to_nil: true
  
  # Safe pattern: handle both nil and blank
  normalizes :website, with: ->(url) {
    return nil if url.blank?
    url.strip.downcase.then { |u| u.start_with?('http') ? u : "https://#{u}" }
  }
end

The presence method converts empty strings to nil, which often produces cleaner database queries and avoids the empty-string-vs-nil ambiguity that plagues many applications.

Complex Normalization with Service Objects

For complex transformations, inline lambdas become unwieldy. Extract the logic into dedicated normalizer classes that respond to call:

# app/normalizers/slug_normalizer.rb
class SlugNormalizer
  def self.call(value)
    return nil if value.blank?
    
    value
      .strip
      .downcase
      .gsub(/[^a-z0-9\s-]/, '')  # Remove special characters
      .gsub(/\s+/, '-')           # Replace spaces with hyphens
      .gsub(/-+/, '-')            # Collapse multiple hyphens
      .gsub(/^-|-$/, '')          # Trim leading/trailing hyphens
  end
end

# app/models/article.rb
class Article < ApplicationRecord
  normalizes :slug, with: SlugNormalizer
  normalizes :title, with: ->(title) { title&.strip }
  
  before_validation :generate_slug, if: -> { slug.blank? && title.present? }
  
  private
  
  def generate_slug
    self.slug = title  # Normalization applies automatically
  end
end

This pattern keeps models clean while allowing thorough testing of normalization logic in isolation:

# spec/normalizers/slug_normalizer_spec.rb
RSpec.describe SlugNormalizer do
  describe '.call' do
    it 'converts spaces to hyphens' do
      expect(described_class.call('hello world')).to eq('hello-world')
    end
    
    it 'removes special characters' do
      expect(described_class.call('Hello, World!')).to eq('hello-world')
    end
    
    it 'handles nil values' do
      expect(described_class.call(nil)).to be_nil
    end
    
    it 'collapses multiple hyphens' do
      expect(described_class.call('hello---world')).to eq('hello-world')
    end
  end
end

Normalization in Queries

One powerful feature often overlooked: normalization applies to query parameters through special finder methods. This ensures consistent lookups without manual transformation at every call site.

# app/models/user.rb
class User < ApplicationRecord
  normalizes :email, with: ->(email) { email.strip.downcase }
end

# The normalized value is used in the query
User.find_by(email: "  [email protected]  ")
# Executes: SELECT * FROM users WHERE email = '[email protected]'

# Works with where clauses using normalize_value_for
email_input = "  [email protected]  "
normalized = User.normalize_value_for(:email, email_input)
User.where(email: normalized)

The normalize_value_for class method exposes the normalization logic for cases where explicit transformation is needed, such as building complex queries or validating input before database interaction.

Normalization vs Validation

A common question: should normalization replace validation? The answer is no—these serve different purposes. Normalization transforms valid input into a canonical form. Validation rejects invalid input entirely.

# app/models/product.rb
class Product < ApplicationRecord
  # Normalization: transform valid input
  normalizes :sku, with: ->(sku) { sku&.strip&.upcase }
  
  # Validation: reject invalid input
  validates :sku, presence: true, 
                  format: { with: /\A[A-Z]{2}-\d{4}\z/, 
                            message: 'must follow format XX-0000' }
end

With this setup, a SKU like " ab-1234 " normalizes to "AB-1234" and passes validation. Input like "invalid" normalizes to "INVALID" but fails the format validation.

Common Normalization Recipes

Several patterns appear repeatedly across applications. Here's a collection of reusable normalizers:

# app/models/concerns/common_normalizations.rb
module CommonNormalizations
  extend ActiveSupport::Concern
  
  class_methods do
    def normalizes_email(*attributes)
      normalizes(*attributes, with: ->(v) { v&.strip&.downcase })
    end
    
    def normalizes_phone(*attributes)
      normalizes(*attributes, with: ->(v) { v&.gsub(/\D/, '')&.presence })
    end
    
    def normalizes_whitespace(*attributes)
      normalizes(*attributes, with: ->(v) { v&.squish&.presence })
    end
    
    def normalizes_url(*attributes)
      normalizes(*attributes, with: ->(v) {
        return nil if v.blank?
        url = v.strip.downcase
        url.match?(/\Ahttps?:\/\//) ? url : "https://#{url}"
      })
    end
  end
end

# app/models/company.rb
class Company < ApplicationRecord
  include CommonNormalizations
  
  normalizes_email :contact_email, :billing_email
  normalizes_phone :phone, :fax
  normalizes_whitespace :name, :description
  normalizes_url :website
end

This concern-based approach provides consistent normalization across the entire application while keeping individual models readable.

Gotchas and Edge Cases

Several behaviors catch developers off guard. First, normalization runs on every assignment, including when loading records from the database. Ensure normalizers are idempotent—running them twice should produce the same result as running once.

Second, normalization bypasses dirty tracking for the transformation itself. If the original value was " test " and it normalizes to "test", the model considers "test" the assigned value for change tracking purposes.

Third, update_column and update_columns bypass normalization entirely since they skip callbacks and model logic. Use update or update! when normalization must apply.

Summary

Attribute normalization in Rails 8 eliminates scattered transformation logic by providing a single, declarative location for data cleanup. The normalizes method applies transformations on assignment, ensuring consistent data throughout the application lifecycle. For complex transformations, extract logic into dedicated normalizer classes. Combine normalization with validation—normalization transforms valid input while validation rejects invalid input. Build a library of common normalizers in a concern for consistent behavior across models.