Lots of "stream to store closed" log and CDC sends huge pending tasks to TiKV which results in TiKV OOM #10136

Closed
fubinzh opened this issue Nov 22, 2023 · 2 comments · Fixed by #10237
Labels
affects-6.5 This bug affects the 6.5.x(LTS) versions. affects-7.1 This bug affects the 7.1.x(LTS) versions. affects-7.5 This bug affects the 7.5.x(LTS) versions. area/ticdc Issues or PRs related to TiCDC. severity/moderate type/bug The issue is confirmed as a bug.

Comments

fubinzh commented Nov 22, 2023

What did you do?

This issue was created from tikv/tikv#16035 to track the CDC-side problem: CDC logs a large number of "stream to store closed" messages
and sends a huge number of pending tasks to TiKV, which results in TiKV OOM.
While the TiKV-side fix will limit the pending initial scan tasks, the CDC side needs to further investigate the "stream to store closed" issue.

[2023/11/19 16:11:13.208 +08:00] [INFO] [client.go:981] ["stream to store closed"] [namespace=default] [changefeed=test1] [addr=tc-tikv-2.tc-tikv-peer.webank-cdc-tps-4590005-1-42.svc:20160] [storeID=8]

What did you expect to see?

CDC should not send so many tasks to TiKV

What did you see instead?

CDC sends a huge number of pending tasks to TiKV, which results in TiKV OOM.

Versions of the cluster

Upstream TiKV version (execute tikv-server --version):

/ # /tikv-server -V                                                                                                                            
TiKV                                                                                                                                           
Release Version:   6.5.3                                                                                                                       
Edition:           Enterprise                                                                                                                  
Git Commit Hash:   5165943fbffce298b61698a59620f415df239420                                                                                    
Git Commit Branch: heads/refs/tags/v6.5.3-20231116-5165943                                                                                     
UTC Build Time:    2023-11-16 07:07:20                                                                                                         
Rust Version:      rustc 1.67.0-nightly (96ddd32c4 2022-11-14)                                                                                 
Enable Features:   pprof-fp jemalloc mem-profiling portable test-engine-kv-rocksdb test-engine-raft-raft-engine cloud-aws cloud-gcp cloud-azure
Profile:           dist_release                                                                                                                

TiCDC version (execute cdc version):

bash-5.1# /cdc version                                    
Release Version: v6.5.3-20231116-255d810                  
Git Commit Hash: 255d8104c8b9b4f8af191ee69d301ced723ccc3d 
Git Branch: heads/refs/tags/v6.5.3-20231116-255d810       
UTC Build Time: 2023-11-16 06:35:28                       
Go Version: go version go1.19.12 linux/amd64              
Failpoint Build: false                                    
@fubinzh fubinzh added area/ticdc Issues or PRs related to TiCDC. type/bug The issue is confirmed as a bug. labels Nov 22, 2023

fubinzh commented Nov 22, 2023

/severity moderate

Contributor

hicqu commented Dec 8, 2023

Closed by #10237
