Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

ebs br: resume gc and scheduler when volume snapshots created #5288

Merged
merged 8 commits into from
Sep 25, 2023

Conversation

WangLe1321
Copy link
Contributor

@WangLe1321 WangLe1321 commented Sep 12, 2023

What problem does this PR solve?

  1. resume gc and pd scheduler when all the volume snapshots are created
    Closes ebs br: shrink the window of pause gc and schedule during backup #5256

  2. send SIGTERM to the BR process when tidb-backup-manager receives SIGTERM

  3. read stderr async in case that the pipe of stderr is full and blocks the BR process
    Closes backup job is stuck(snapshot is completed) #5281

  4. The EBS backup task may cause the import task to fail
    Closes The EBS backup task may cause the import task to fail #5282

What is changed and how does it work?

data plane:

  1. check if the backupmeta file is existed when the backup enter execute phase. if it is existed, modify backup status VolumeBackupSnapshotsCreated

control plane:
2. if all the backups in data plane has status VolumeBackupSnapshotsCreated, modify resumeGcSchedule field true in the backup that is responsible to pause gc and pd schduler.

data plane:
3. if resumeGcSchedule field is true, delete the initialize pod and modify backup status VolumeBackupInitializeComplete

control plane:
5. if the backup with resumeGcSchedule: true has status VolumeBackupInitializeComplete, modify volume backup status SnapshotsCreated. It means all the volume snapshots are created and gc and pd schedulers are resumed.

Code changes

  • Has Go code change
  • Has CI related scripts change

Tests

  • Unit test
  • E2E test
  • Manual test
  • No code

Side effects

  • Breaking backward compatibility
  • Other side effects:

Related changes

  • Need to cherry-pick to the release branch
  • Need to update the documentation

Release Notes

Please refer to Release Notes Language Style Guide before writing the release note.


@BornChanger BornChanger changed the title br: modify to resume gc and scheduler when volume snapshots created br: resume gc and scheduler when volume snapshots created Sep 13, 2023
@BornChanger BornChanger changed the title br: resume gc and scheduler when volume snapshots created ebs br: resume gc and scheduler when volume snapshots created Sep 13, 2023
@BornChanger BornChanger requested review from BornChanger and csuzhangxc and removed request for howardlau1999 and lichunzhu September 13, 2023 23:23
@codecov-commenter
Copy link

codecov-commenter commented Sep 14, 2023

Codecov Report

Merging #5288 (3ec02e0) into master (2c3cd13) will increase coverage by 2.29%.
The diff coverage is 29.74%.

Additional details and impacted files
@@            Coverage Diff             @@
##           master    #5288      +/-   ##
==========================================
+ Coverage   57.60%   59.90%   +2.29%     
==========================================
  Files         252      255       +3     
  Lines       30461    34113    +3652     
==========================================
+ Hits        17548    20436    +2888     
- Misses      11182    11832     +650     
- Partials     1731     1845     +114     
Flag Coverage Δ
e2e 29.44% <4.76%> (?)
unittest 57.48% <31.42%> (-0.13%) ⬇️

@WangLe1321
Copy link
Contributor Author

/test pull-e2e-kind pull-e2e-kind-across-kubernetes pull-e2e-kind-basic

@ti-chi-bot
Copy link
Contributor

ti-chi-bot bot commented Sep 15, 2023

@WangLe1321: The label(s) /pull-e2e-kind, /pull-e2e-kind-across-kubernetes, /pull-e2e-kind-basic cannot be applied, because the repository doesn't have them.

In response to this:

/test pull-e2e-kind pull-e2e-kind-across-kubernetes pull-e2e-kind-basic

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the ti-community-infra/tichi repository.

@ti-chi-bot
Copy link
Contributor

ti-chi-bot bot commented Sep 15, 2023

@WangLe1321: No presubmit jobs available for pingcap/tidb-operator@master

In response to this:

/test pull-e2e-kind pull-e2e-kind-across-kubernetes pull-e2e-kind-basic

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

@WangLe1321
Copy link
Contributor Author

/test pull-e2e-kind-across-kubernetes pull-e2e-kind-basic

@ti-chi-bot
Copy link
Contributor

ti-chi-bot bot commented Sep 15, 2023

@WangLe1321: The label(s) /pull-e2e-kind-across-kubernetes, /pull-e2e-kind-basic cannot be applied, because the repository doesn't have them.

In response to this:

/test pull-e2e-kind-across-kubernetes pull-e2e-kind-basic

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the ti-community-infra/tichi repository.

@ti-chi-bot
Copy link
Contributor

ti-chi-bot bot commented Sep 15, 2023

@WangLe1321: No presubmit jobs available for pingcap/tidb-operator@master

In response to this:

/test pull-e2e-kind-across-kubernetes pull-e2e-kind-basic

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

@WangLe1321
Copy link
Contributor Author

/test pull-e2e-kind-serial pull-e2e-kind-br

@ti-chi-bot
Copy link
Contributor

ti-chi-bot bot commented Sep 15, 2023

@WangLe1321: The label(s) /pull-e2e-kind-serial, /pull-e2e-kind-br cannot be applied, because the repository doesn't have them.

In response to this:

/test pull-e2e-kind-serial pull-e2e-kind-br

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the ti-community-infra/tichi repository.

@ti-chi-bot
Copy link
Contributor

ti-chi-bot bot commented Sep 15, 2023

@WangLe1321: No presubmit jobs available for pingcap/tidb-operator@master

In response to this:

/test pull-e2e-kind-serial pull-e2e-kind-br

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

@BornChanger
Copy link
Contributor

/test pull-e2e-kind-serial

@ti-chi-bot
Copy link
Contributor

ti-chi-bot bot commented Sep 18, 2023

@BornChanger: The label(s) /pull-e2e-kind-serial cannot be applied, because the repository doesn't have them.

In response to this:

/test pull-e2e-kind-serial

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the ti-community-infra/tichi repository.

@ti-chi-bot
Copy link
Contributor

ti-chi-bot bot commented Sep 18, 2023

@BornChanger: No presubmit jobs available for pingcap/tidb-operator@master

In response to this:

/test pull-e2e-kind-serial

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

@BornChanger
Copy link
Contributor

/label ok-to-test

@ti-chi-bot
Copy link
Contributor

ti-chi-bot bot commented Sep 21, 2023

@BornChanger: The label(s) /ok-to-test cannot be applied, because the repository doesn't have them.

In response to this:

/label ok-to-test

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the ti-community-infra/tichi repository.

@BornChanger
Copy link
Contributor

/test pull-e2e-kind-across-kubernetes

@ti-chi-bot
Copy link
Contributor

ti-chi-bot bot commented Sep 21, 2023

@BornChanger: The label(s) /pull-e2e-kind-across-kubernetes cannot be applied, because the repository doesn't have them.

In response to this:

/test pull-e2e-kind-across-kubernetes

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the ti-community-infra/tichi repository.

@ti-chi-bot
Copy link
Contributor

ti-chi-bot bot commented Sep 21, 2023

@BornChanger: No presubmit jobs available for pingcap/tidb-operator@master

In response to this:

/test pull-e2e-kind-across-kubernetes

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

@purelind
Copy link
Contributor

/test pull-e2e-kind-basic

@ti-chi-bot
Copy link
Contributor

ti-chi-bot bot commented Sep 21, 2023

@purelind: The label(s) /pull-e2e-kind-basic cannot be applied, because the repository doesn't have them.

In response to this:

/test pull-e2e-kind-basic

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the ti-community-infra/tichi repository.

@ti-chi-bot
Copy link
Contributor

ti-chi-bot bot commented Sep 21, 2023

@purelind: No presubmit jobs available for pingcap/tidb-operator@master

In response to this:

/test pull-e2e-kind-basic

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

@BornChanger
Copy link
Contributor

/run-e2e-tests

1 similar comment
@purelind
Copy link
Contributor

/run-e2e-tests

@BornChanger
Copy link
Contributor

/test-pull-e2e-kind-tikv-scale-simultaneously

@WangLe1321
Copy link
Contributor Author

/run-pull-e2e-kind

@WangLe1321
Copy link
Contributor Author

/run-pull-e2e-kind-br

@WangLe1321
Copy link
Contributor Author

/run-pull-e2e-kind-tikv-scale-simultaneously

1 similar comment
@WangLe1321
Copy link
Contributor Author

/run-pull-e2e-kind-tikv-scale-simultaneously

@ti-chi-bot ti-chi-bot bot removed the lgtm label Sep 22, 2023
@ti-chi-bot
Copy link
Contributor

ti-chi-bot bot commented Sep 22, 2023

[LGTM Timeline notifier]

Timeline:

  • 2023-09-20 02:45:48.27226362 +0000 UTC m=+658314.239851671: ☑️ agreed by BornChanger.
  • 2023-09-22 06:34:45.328215977 +0000 UTC m=+844851.295804028: ✖️🔁 reset by WangLe1321.

@ti-chi-bot
Copy link
Contributor

ti-chi-bot bot commented Sep 22, 2023

New changes are detected. LGTM label has been removed.

@WangLe1321
Copy link
Contributor Author

/rtest

@WangLe1321
Copy link
Contributor Author

/retest

@BornChanger
Copy link
Contributor

/test pull-e2e-kind

@BornChanger
Copy link
Contributor

/test pull-e2e-kind-br

@ti-chi-bot
Copy link
Contributor

ti-chi-bot bot commented Sep 23, 2023

@BornChanger: No presubmit jobs available for pingcap/tidb-operator@master

In response to this:

/test pull-e2e-kind

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

@ti-chi-bot
Copy link
Contributor

ti-chi-bot bot commented Sep 23, 2023

@BornChanger: No presubmit jobs available for pingcap/tidb-operator@master

In response to this:

/test pull-e2e-kind-br

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

@csuzhangxc csuzhangxc merged commit 4df3649 into pingcap:master Sep 25, 2023
4 checks passed
@BornChanger
Copy link
Contributor

/cherry-pick release-1.5

@ti-chi-bot
Copy link
Member

@BornChanger: new pull request created to branch release-1.5: #5298.

In response to this:

/cherry-pick release-1.5

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the ti-community-infra/tichi repository.

csuzhangxc added a commit that referenced this pull request Sep 25, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

ebs br: shrink the window of pause gc and schedule during backup
7 participants