Understand impact of batching #18

divega · 2017-12-04T22:44:18Z

In general batching is considered good for performance because it can help reduce the number of roundtrips to the database, however understanding more its impact across different database could help us prioritize when to use it and the addition of APIs.

E.g. this would help inform the prioritization of

Perf: provide a way to batch the generated commands for a complex query dotnet/efcore#10465 Perf: provide a way to batch the generated commands for a complex query
https://github.com/dotnet/corefx/issues/3688 Better API for ADO.NET command batching
https://github.com/dotnet/corefx/issues/8954 Retrieving records affected by statement

Some questions we would like to address/answer:

Concatenated SQL in a SQL command vs. proper multi-statement per network packet support: Depending on the server and protocol capabilities, the first leads to either parsing and splitting the query on the client or to polluting the server's query cache. The second one you can still send multiple statements in one network roundtrip but using an API that allows to collect the individual SQL queries and parameters without stitching them together, which avoids having to parse the SQL on the client and polluting the cache on the server.
How does this affect different database? We need to measure. We know that PostgreSQL could use this and that SqlClient has the basic capability but only when you use DataAdapter APIs (see https://blogs.msdn.microsoft.com/dataaccess/2005/05/19/does-ado-net-update-batching-really-do-something/), therefore things like EF Core can only use concatenated SQL approach.
Multiple calls to ExecuteReader vs. one call to ExecuteReader and multiple calls NextResult: For reading scenarios batching can help too. To understand how much, we would need to measure. ADO.NET already has the right APIs but the question is how much we should be using it in our higher-level APIs. We have precedent in NHibernate's future queries. For things like EF Core this could be used automatically when we generate multiple queries, or we could come up with a similar API to future queries.

dario-l · 2018-05-22T08:09:52Z

Proper multi-statement per network packet is a good choice. We are using System.Data.SqlClient.SqlCommandSet but it is internal. Thanks to NHibernate we are using exactly this implementation.

This solution give us big improvement in performance even if we sending just few records (from 5 statements and higher is unbeatable). Unfortunately for us ASP.NET Core doesn't have that implementation.

divega mentioned this issue Dec 5, 2017

Meeting notes for 12/4/2017 #19

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Understand impact of batching #18

Understand impact of batching #18

divega commented Dec 4, 2017 •

edited

Loading

dario-l commented May 22, 2018

Understand impact of batching #18

Understand impact of batching #18

Comments

divega commented Dec 4, 2017 • edited Loading

dario-l commented May 22, 2018

divega commented Dec 4, 2017 •

edited

Loading