validate migration data comprehensively

discourse/discourse

Based on 5 comments

Ruby

Migration scripts must thoroughly validate data conditions and handle edge cases to ensure reliable data transformation. Always check for data existence before applying operations, properly handle duplicate records, and avoid migrating calculated or derived fields.

Migrations Ruby

Reviewer Prompt

Key practices:

Check data existence: Verify that related data exists before performing operations (e.g., “Should we update that site setting only when there are tag groups?”)
Handle duplicates gracefully: Ensure migrations can continue when duplicates are found, returning existing records rather than failing
Skip calculated fields: Exclude computed columns like first_unread_pm_at that should be recalculated rather than migrated
Cover edge cases: Consider scenarios like missing seed data or schema evolution (e.g., custom palettes based on non-seeded built-in schemes)
Manage schema changes: When fields are split or renamed, maintain backward compatibility and proper synchronization

Example of proper duplicate handling:

duplicate = DB.query_single(<<~SQL, name: name).first
  SELECT id from ai_personas where name = :name
SQL

if duplicate.present?
  duplicate  # Return existing record to continue migration
else
  # Create new record
end

This approach prevents migration failures and ensures data integrity across different migration scenarios and schema versions.

Comments Analyzed

Ruby

Primary Language

Migrations

validate migration data comprehensively

Reviewer Prompt

Source Discussions

Add Repository

Private Repository