In a nutshell, it says:

After one attempt, GPT-4 is 50% worse at fixing your bug.
After three attempts, it’s 80% worse.
After seven attempts, it’s 99% worse.
This problem is called debugging decay.
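
To make those figures concrete, here is a minimal sketch that turns each "percent worse" number into the share of debugging effectiveness that is left. It assumes "X% worse" means an X% drop relative to the starting point, which is one reading of the summary rather than something it states outright. The names `DECLINE_PERCENT` and `remaining_effectiveness` are made up for illustration.

```python
# Decline figures quoted above, keyed by number of attempts.
# Assumption: "X% worse" is an X% drop relative to the starting effectiveness.
DECLINE_PERCENT = {1: 50, 3: 80, 7: 99}


def remaining_effectiveness(attempts: int) -> int:
    """Return the percent of starting effectiveness left after `attempts`.

    Only the attempt counts quoted in the text are known, so any other
    value raises KeyError instead of guessing an in-between number.
    """
    return 100 - DECLINE_PERCENT[attempts]


if __name__ == "__main__":
    for attempts in sorted(DECLINE_PERCENT):
        left = remaining_effectiveness(attempts)
        print(f"after {attempts} attempt(s): {left}% of effectiveness left")
```

Read this way, by the seventh attempt only about 1% of the original effectiveness is left.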
