efcore Stop all migrations and roll back everything on failure

When looking at #22613, I realized something... In general, if an earlier migration fails, later migrations will still be attempted and may succeed, leaving the database in a potentially inconsistent and confusing state. I understand that the idea is to allow the idempotent migration to be re-executed later and "fill in" the missing migrations, but if there's any dependency between one migration and another, this doesn't work (think about data migrations where a later migration implicitly depends on an earlier one).

Our new 5.0 behavior isn't a regression (or even worse) compared to the previous behavior. However, if we wrapped all migrations in a single transaction (and set XACT_ABORT to ON in SQL Server), we should get the safer behavior above, while at the same time addressing the original ask in #7681. The problem here may be with transaction suppression - but isn't that always problematic (and also SQL Server-specific).

/cc @bricelam

Sep 19 '20 07:09 roji

Consider as part of #19587

Sep 21 '20 19:09 ajcvickers

Triage: Wrap all the migrations inside a single transaction (instead of individual transactions per migration)

Sep 21 '20 21:09 bricelam

@bricelam From what I can tell, there are only two types of migrations operations that are always TransactionSuppressed:

SqlServerCreateDatabaseOperation
SqlServerDropDatabaseOperation

The rest of the operations that are sometimes TransactionSuppressed are when the resource is memory optimized:

AddColumnOperation
AddForeignKeyOperation
AddPrimaryKeyOperation
AlterColumnOperation
RenameIndexOperation
CreateTableOperation
DropTableOperation
CreateIndexOperation
DropPrimaryKeyOperation
AlterDatabaseOperation
AlterTableOperation
DropForeignKeyOperation
DropIndexOperation
DropColumnOperation

Could this be a way of achieving this?

The "up" migration specifically places the SqlServerCreateDatabaseOperation before the migration transaction
The "down" migration specifically places the SqlServerDropDatabaseOperation after the migration transaction
If --no-transaction is not specified and any of the migrations have TransactionSuppressed = true then return an error stating that "Migrations in a transaction are only supported when not using memory optimized tables"

Sep 23 '20 01:09 ChristopherHaws

Don't worry about SqlServerCreateDatabaseOperation and DropDatabase. The fact that they're migration operations is merely an implementation detail. No migration will contain them.

TransactionSuppressed is orthogonal to --no-transaction. TransactionSuppressed means it will commit the current transaction, execute the operation, and begin a new one. --no-transactions means the script will not include those COMMIT and BEGIN TRANSACTION statements.

For this issue, we just need to lift the transaction management logic inside MigrationCommandExecutor into Migrator--similar to how GenerateScript works. Then update both to stop committing and beginning new transactions between migrations (but continue doing it for TransactionSuppressed).

Sep 23 '20 16:09 bricelam

@bricelam Ok, that mostly makes sense to me. I wasn't aware of the create and drop db commands being outside the migrations, so that's good to know.

If we continue generating the begin tran and commit's for the TransactionSuppressed commands, won't it sort of defeat the purpose of the whole migration executing inside of a single transaction since now there would be committed schema/data? I'm not sure of any other way around this, but maybe a warning would be useful.

Also, should the creation of the migration history table be inside or outside of the transaction? I don't see any reason it can't be inside.

Sep 23 '20 16:09 ChristopherHaws

I like the idea of warning about TransactionSuppressed. I'm just not sure where we can do it that isn't already too late (i.e. during Update-Database). And the script plainly shows where the transaction will be committed, obviating the need for a warning. But maybe extra warnings about data loss and suppressed transactions would actually help; not sure.

Should the creation of the migration history table be inside or outside of the transaction?

I'd keep it outside. It's more of a prerequisite than part of the migration.

Sep 23 '20 21:09 bricelam

Triage: Wrap all the migrations inside a single transaction (instead of individual transactions per migration)

Just a thought: When a migration fails to apply, rolling back the already successfully applied ones is not necessarily desirable.

It is not unusual for a migration to take quite a while to execute on a large database (think 10+ GB), and it is not unusual to apply a batch of migrations all at once, for example during a scheduled release to production. If a trivial mistake in the last migration makes it fail, it would be quite annoying to lose the many minutes/hours of time spent applying the migrations that succeeded. In such cases, it would be much nicer to be able to simply fix the last migration and re-apply that one only, just as you could in EF6.

Sep 27 '20 13:09 Zero3

@Zero3 that's definitely a valid argument... The problem is that if you leave the database with that last migration un-applied, that could mean that your program can't start at all, because the schema is in some intermediate unsupported state. At that point users have to figure out what happened, what state they're in, and how to fix it - all under the pressures of downtime; leaving a production database in such a state seems very problematic, even more so than the (possible) loss of time.

Note that it's always possible to edit a migration script and manage transaction manually - which one can definitely do around a migration that's expected to run for very long.

Sep 27 '20 13:09 roji

@roji

The problem is that if you leave the database with that last migration un-applied, that could mean that your program can't start at all, because the schema is in some intermediate unsupported state.

Indeed!

At that point users have to figure out what happened, what state they're in, and how to fix it - all under the pressures of downtime

Given the use case I presented, this can be preferred over losing hours of migration time. And given the long migration time, the downtime aspect should already be handled already anyway.

leaving a production database in such a state seems very problematic, even more so than the (possible) loss of time.

I think that depends on the use case. In the one I presented, I would definitely prefer EF not rolling back the successfully applied migrations.

Note that it's always possible to edit a migration script and manage transaction manually - which one can definitely do around a migration that's expected to run for very long.

I was actually thinking of how it will be done when using context.Database.Migrate();.

Either way, I don't mind whatever the default strategy is, as long as it is possible to make EF not roll back the successfully applied ones :).

Sep 27 '20 15:09 Zero3

Ultimately it would probably be nice to have hooks before and after the entire migration process and before and after each individual migration. This would allow people to customize the the generation of migrations to fit their needs. This is how I handled adding the pre and post migration scripts in my open PR: https://github.com/dotnet/efcore/pull/22654/files#diff-a98268716bc62993b273028b6b462e39

Perhaps I could modify my PR to have the following (currently the Pre and Post commands in my PR run before and after the entire migration, not each):

GenerateInitializeMigrationCommands: Run once before all the migrations
GeneratePreMigrationCommands: Runs before each migration
GeneratePostMigrationCommands: Runs after each migration
GenerateFinalizeMigrationCommands: Runs once after all the migrations

I do believe that the default behavior should be wrapping the entire script in a transaction and anyone who wants to change that behavior could do so via the IMigrationsSqlGenerator interface. This seems like it would be an advanced use case that the majority of users would never encounter.

Sep 28 '20 00:09 ChristopherHaws

@Zero3 and others, for transaction-per-migration there's also another problem: the problematic migration may fail and leave all previous migrations intact, but subsequent migrations will still get executed as well. In effect, what you get is a database with all migrations applied except one in the middle. There may ways to avoid this later migration execution effect in some database, but it's definitely going to be tricky cross-database.

If the database is left in this state and migrations aren't idempotent, running them again will immediately fail. Even for idempotent migrations, there are still possible dependencies between migrations that can wreak havoc - idempotency means you can safely execute the migration script twice, but not necessarily if the first run had a "hole" in the middle.

So while I do see the case for per-migration transactions, the complications seem very... complicated. I tend to agree with @ChristopherHaws that we should expose hooks to allow users to customize their migration process, and allow them to do transaction-per-migration that way instead of attempting to provide it as a built-in option.

But I think all this needs more thought and design in 6.0.

Sep 28 '20 11:09 roji

@roji

for transaction-per-migration there's also another problem: the problematic migration may fail and leave all previous migrations intact, but subsequent migrations will still get executed as well

This could be avoided with the same measure we are taking here: https://github.com/dotnet/efcore/issues/22613#issuecomment-696413779 All it would require is removing this if statement: https://github.com/dotnet/efcore/blob/02c470084b0f00c76c962226f7129bcb4d5847e4/src/EFCore.SqlServer/Migrations/SqlServerMigrationsSqlGenerator.cs#L144

This would allow the script to fail and not move on if there is an error in any batch regardless of if transactions will be used or not.

Sep 28 '20 17:09 ChristopherHaws

@ChristopherHaws that's purely SQL Server-specific, isn't it? We have to think about other databases as well...

Sep 28 '20 17:09 roji

Unless I'm mistaken, even within SQL Server this also depends on sqlcmd mode. That may be fine - I don't know, I'm trying to still think of migration scripts as something that doesn't depend on the specific tooling which executes them. But that may not be viable.

Sep 28 '20 17:09 roji

@roji Agreed. My understanding was that the other DB's that are supported (PostgreSQL, SQLite, etc) already have the behavior of "stop executing on first failure" and that it was only SQL server that behaves in this way.

Sep 28 '20 17:09 ChristopherHaws

@roji

for transaction-per-migration there's also another problem: the problematic migration may fail and leave all previous migrations intact, but subsequent migrations will still get executed as well.

I don't understand why EF would want to do this, instead of stopping after the first failed migration (like EF6 did). I don't think EF should consider skipping a failed migration a valid thing to do.

Maybe some of these issues arose from the introduction of migration scripts? I'm only thinking in the context of context.Database.Migrate(), of which I am a happy user and appreciated the way EF6 worked (one transaction per migration, and error instead of skipping failed migrations).

Oct 04 '20 14:10 Zero3

I don't understand why EF would want to do this, instead of stopping after the first failed migration (like EF6 did). I don't think EF should consider skipping a failed migration a valid thing to do.

I don't think anyone wants this to be the behavior - but it's a possible consequence of doing transaction-per-migration (as opposed to transaction-for-all-migrations).

Oct 05 '20 14:10 roji

@roji But only when scripting and only when the script keeps running inappropriately. We don't have this behavior for Update-Database, for example. It will stop[ after the first failed migration.

Oct 05 '20 15:10 ajcvickers

Yeah, this is indeed an SQL script problem (as opposed to when applying migrations programmatic or via CLI). That could warrant different transactional strategies based on the mechanism we use.

Oct 05 '20 15:10 roji

Personally, I would definitely not vote for transaction per migration. At least it shouldn't be baked in and maybe an opt-in feature.

I am voting 110% on introducing an option to wrap all the migration scripts execution in a single transaction. Otherwise, as @roji mentioned above, the database is ending up in an inconsistent state which is a major issue!

I just tried migrations bundles hoping at least it would wrap things in a transaction, unfortunately, it isn't. And it made the database inconsistent. First Migration is applied and second has failed (on purpose). If I use migration scripts, at least I could have manually wrapped script execution inside a transaction myself (for example when running in pipelines), but with migration bundles, I don't think it's even possible, may be I can still apply the same logic using native Transactions, but this I think is a basic requirement.

Aug 25 '21 10:08 jaliyaudagedara

Yes! As @jaliyaudagedara pointed out I also would like to see transaction support for this. It is a great feature that we can definitely use in our deployment process but without native transactions support in the bundle tool we will have to implement that behavior manually. I would vote for an option to wrap the migration bundle in a single transaction so they all pass/fail atomically. No chance for inconsistent state in the application database.

Aug 26 '21 18:08 sbraswell

I have had the migration fail too many times. This would be great to have it in a transaction to roll back. Otherwise, it ruins my day and I have to manually remove the migrations or restore the database.

How do we enable the transactions?

Mar 14 '23 14:03 michaelakin

@michaelakin What version are you on? EF Core 5 introduced transactions for migrations.

EDIT: Or do you perhaps mean all migrations in one single transaction? In that case, this is the issue tracking that :)

Mar 14 '23 14:03 OskarKlintrot

@michaelakin What version are you on? EF Core 5 introduced transactions for migrations.

EDIT: Or do you perhaps mean all migrations in one single transaction? In that case, this is the issue tracking that :)

I am on EF 6 right now.

I think I mean a single set of migrations in one transaction, so it doesn't get caught in the middle of a migration. Can you point me to a code example?

Mar 14 '23 19:03 michaelakin

EF 6 or EF Core 6?

This issue tracks running all migrations inside one single transaction. Before EF Core 5 we made a small PS-script to wrap each migration in one transaction by generating an idempotent script. Until this feature is implemented you could do the same but just add one transaction to wrap all migrations.

Something like this:

$project = ".\YourProject.WebApi"

$tempScript = ".\temp\migrations.sql"
$scriptWithTransaction = ".\drop\migrations.sql"

# Create an idempotent script to work with
dotnet ef migrations script --no-build --idempotent --no-transactions --configuration Release --project $project --output $tempScript

# ArrayList is the easiest way to work with a file on a row-to-row basis, so to speak
[System.Collections.ArrayList]$file = Get-Content $tempScript

$file.insert(0,"BEGIN TRAN")
$file.insert(1,"SET XACT_ABORT ON;") # Make sure we rollback on any errors, some errors will not result in a rollback when XACT_ABORT is set to OFF!

$file.add("COMMIT")

# There should not be a migrations.sql-file at this point since the file created by EF Core was outputted to the temp-folder
if (Test-Path $scriptWithTransaction) {
  Write-Host "##vso[task.LogIssue type=error;]Migration-script already exists!"
  exit 1 # General error code
}
  
Set-Content $scriptWithTransaction $file

Mar 15 '23 07:03 OskarKlintrot

EF 6 or EF Core 6?

Thanks Oskar. and sorry @OskarKlintrot I meant EF Core 6. Do you have an example with EF Core 6?

Mar 15 '23 14:03 michaelakin

@michaelakin The example is for EF Core 6 since I assumed that was what you meant :)

Mar 15 '23 16:03 OskarKlintrot

Thanks @OskarKlintrot

Mar 24 '23 19:03 michaelakin

in the official documentation we have the following :

...The transaction handling and continue-on-error behavior of these tools are inconsistent and sometimes unexpected. This can leave your database in an undefined state if a failure occurs when applying migrations...

I'am confused about strategy used by the bundle while applying migrations. from what i understand the expected behaviour : the bundle will attemp to apply migrations ( a set of them), if it fails it wont continue and revert what is applied. Is this correct ?

Sep 02 '24 13:09 AlaRaies

@AlaRaies the sentence you quote above are about applying migrations via SQL scripts, e.g. using sqlcmd for SQL Server. Migration bundles are meant precisely to solve these shortcomings.

Starting with 9.0, migration bundles will execute all migrations inside a single transaction, so yes, if any error occurs at some point everything will be rolled back. The only exception is a very limited set of migration operations which cannot be executed inside a transaction (e.g. anything related to a SQL Server in-memory table).

Sep 02 '24 14:09 roji

efcore efcore copied to clipboard

Stop all migrations and roll back everything on failure

efcore
efcore copied to clipboard