Throughput Drops

If we have very long-running operations, then may we have drop in throughput.

Possible causes for a drop in throughput

Long-running queries
- Collection scans
- Poorly anchored regex
- Inefficient index usage
Index builds
Write contention

Diagnose and solve drop in throughput

Long-running queries

Read server logs to inspect whatever some useful indexes has been dropped.
Show the current operation to inspect whatever there is long-running operation: db.currentOp().
Turn on the profiler in analytic server.
Use explain() query.

Index builds

It depends on planning. We can do rolling index building if it possible. But, if data freshness is not priority, then we may prefer background index building.

Write contention

WiredTiger uses copy-on-write approach, also called as multiversion concurrency control:

A new version of the document is prepared.
During this process only the original document is visible to any applications.
Then the update committed by switching a pointer in a single CPU operation.
New version of the document will be available.

Problem: Optimistic concurrency protocols may caused no-lock on write. Or in other words, multiple writers trying to update the same document at the same time, caused the writers don't realize that the other writes are updating the document. The result is, all writers create their own version, then CPU choose arbitrary version caused any other versions will be removed and repeat any other uncommitted writes.

Diagnose: Simulate multiple writers

Get the information about current server status: var servStat0 = db.serverStatus();
Optional, Execute mongostat to show connection statistics: mongostat --port 3000 -o "insert,update,delete,command,dirty,used,conn"
Execute python write_to_the_same_document.py --port 3000 to simulate multiple writers run as parallel processes that try to write on the same document.
Get the information about new server status: var servStat1 = db.serverStatus();
Get number of updates attempted: servStat1.opcounters.update - servStat0.opcounters.update
Get number of inserts attempted: servStat1.opcounters.insert - servStat0.opcounters.insert
get number of inserts succeeded: servStat1.opcounters.inserted - servStat0.opcounters.inserted
Show log information: mlogfilter <LOG_PATH> | less
Look for severity level of WRITE, then check value of writeConflicts and numYields.

Solution:

If there are lot of write conflicts, then try revise the schema. Try to execute python write_to_the_same_document.py --docPerProcess --port 3000

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

throughput_drops.md

throughput_drops.md

Throughput Drops

Possible causes for a drop in throughput

Diagnose and solve drop in throughput

Long-running queries

Index builds

Write contention

Files

throughput_drops.md

Latest commit

History

throughput_drops.md

File metadata and controls

Throughput Drops

Possible causes for a drop in throughput

Diagnose and solve drop in throughput

Long-running queries

Index builds

Write contention