
Cache Mongo-DB calls (in-memory and/or Redis) #926

Draft · wants to merge 29 commits into master

Conversation

@jason-fox (Contributor) commented Oct 26, 2020

Mongo-DB access is slow. This PR minimizes the need to make database calls by caching the last 1000 device calls and 100 group calls in a refreshable cache. This in turn reduces network traffic and increases maximum throughput.

Multiple caching strategies are supported: in-memory (memCache), Redis, or both. All parameters can be set via config or as Docker ENV variables.

Adding dontCache=true as part of any Device or Group configuration will ensure that the data is never cached and therefore always retrieved from the MongoDB database.
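
For illustration, a device provisioning payload opting out of the cache might look something like this (the surrounding fields follow the usual IoT Agent provisioning API; only dontCache comes from this PR and its exact placement here is an assumption):

```json
{
    "devices": [
        {
            "device_id": "motion001",
            "entity_name": "urn:ngsi-ld:Motion:001",
            "entity_type": "Motion",
            "dontCache": true
        }
    ]
}
```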

Note that because cacheable query results may be served from multiple sources (cache or database), they must be retrieved as plain JSON objects rather than Mongoose Documents. Therefore the lean option has been enabled on the relevant read queries.

As the mongoose docs state:

Lean is great for high-performance, read-only cases,

so there should be a significant improvement even when a cache is not in use.
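
For reference, this is roughly what enabling lean on a read query looks like in Mongoose (a minimal sketch; the model accessor and query shown are illustrative, not the exact code in the PR):

```javascript
const Device = require('../../model/Device'); // path as used in the library's MongoDB device registry

// lean() makes Mongoose return plain JavaScript objects instead of full
// Mongoose Documents, skipping the per-document hydration cost.
Device.model
    .find({ service: 'smartgondor', subservice: '/gardens' })
    .lean()
    .exec(function (error, devices) {
        // devices are plain objects, so they can be cached or serialized directly
    });
```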

@AlvaroVega (Member) commented:

Did you test it with IoT Agents in HA environments? Is there really an improvement there?

@jason-fox (Contributor, Author) commented Oct 27, 2020

Last time I checked, stress-test throughput for the in-memory registry was noticeably higher than for MongoDB - this just piggy-backs off that result.

The degree of improvement will depend on how you set up MongoDB - I guess a sharded environment with many read-only replicas would alleviate the situation somewhat, but why rely on end users being knowledgeable enough to architect their way out of a problem? (Hint: plenty of people seem to use my tutorial docker-compose files as the basis of their production architecture 😱). Of course you could always pay for scaling out with more IoT Agents as well.

I agree this needs proper benchmarking. Maybe you could stress test it yourself, or share a common HA benchmark configuration?

@mrutid (Member) commented Oct 27, 2020

Cache policies do have an impact on HA scenarios. We always deploy IoT Agents in HA (Active-Active) clusters. Any proposed solution must be fully tested and studied in HA environments.

It's not a matter of benchmarking, it's a matter of availability and consistency (ACID). Take into account that full availability and consistency are expected across the agent cluster (so if you deploy a device, the device must be available and updated throughout the whole cluster).

const Device = require('../../model/Device');
const async = require('async');
const cacheManager = require('cache-manager');
const memoryCache = cacheManager.caching({store: 'memory', max: 1000, ttl: 10/*seconds*/});
Member:

Magic numbers. MUST be config.

Member:

In addition, I think the possibility of not using the cache at all (e.g. something like cache: true|false in configuration) should also be provided.

Contributor (Author):

Fixed 9471f42 and subsequent pushes.
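
For context, a minimal sketch of what a config-driven, switchable cache set-up might look like (the property names are illustrative, not necessarily the ones used in the fix):

```javascript
const cacheManager = require('cache-manager');

// Hypothetical config shape - the real option names in the PR may differ.
const config = {
    memCache: {
        enabled: true,
        max: 1000, // maximum number of entries
        ttl: 10 // seconds before an entry expires
    }
};

// Only create a cache when it is explicitly enabled in configuration.
const memoryCache = config.memCache.enabled
    ? cacheManager.caching({ store: 'memory', max: config.memCache.max, ttl: config.memCache.ttl })
    : null;
```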

@jason-fox (Contributor, Author) commented Oct 27, 2020

If you deploy a device, the device must be available and updated throughout the whole cluster

IoT Agent A

  • When you first deploy a device in IoT Agent A, it will read from the DB.
  • If you send a measure for that device to IoT Agent A, it will be retrieved from the local cache.
  • If no cache hit occurs, or the cache entry times out, it will be retrieved from the DB once again.

IoT Agent B

  • If you send a measure for a device to IoT Agent B, it will initially read from the DB.
  • If you send a second measure to IoT Agent B, it will be retrieved from the local cache.
  • If no cache hit occurs, or the cache entry times out, it will be retrieved from the DB once again.
  1. If you use a single IoT Agent, this will act like a single database since there is only one source of truth.
  2. If you use multiple IoT Agent instances and provision once, it will act like a single database since there is only one source of truth.
  3. If you use multiple IoT Agent instances and provision multiple times, there is a short latency when updating/deleting. If I modify or delete a group or device then only the local cache of IoT Agent A will be invalidated. IoT Agent B will continue to retrieve the data from its cache until the record times out (see the sketch below).
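
A rough illustration of the read-through behaviour described above, using cache-manager's wrap helper (the helper name and cache key format are assumptions, not the exact code in the PR):

```javascript
const cacheManager = require('cache-manager');
const Device = require('../../model/Device'); // as in the PR's device registry code

const memoryCache = cacheManager.caching({ store: 'memory', max: 1000, ttl: 10 /* seconds */ });

// Hypothetical lookup: serve the device from the local cache when possible,
// otherwise read it from MongoDB and keep it until the TTL expires.
function getDeviceCached(id, service, subservice, callback) {
    const key = service + ':' + subservice + ':' + id;
    memoryCache.wrap(
        key,
        function (cacheCallback) {
            // Cache miss: fall back to a lean MongoDB read.
            Device.model.findOne({ id: id, service: service, subservice: subservice }).lean().exec(cacheCallback);
        },
        callback
    );
}
```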

@jason-fox (Contributor, Author)

So it depends on how you want the system to fall over. Do I want it to fail because it can't handle the throughput or do I want it to fail because it takes around 10 seconds for the data to settle?

- Ensure memoryCache is reset/busted on each test config
- Amend test config to use memoryCache to test an additional path.
@jason-fox (Contributor, Author)

@mrutid @fgalan - PR updated, no magic numbers and an off-switch added.

  • Added config elements and Docker ENV variable equivalents.
  • Added documentation.
  • Moved set-up to a separate function called from fiware-iotagent-lib.js - this means the existing tests don't need extra clean-up.
  • Amended a single test to use the cache to ensure test coverage.

@@ -22,7 +22,7 @@
*/
'use strict';

var iotAgentLib = require('../../../lib/fiware-iotagent-lib'),
var iotAgentLib = require('../../../lib/fiware-iotagent-lib'),
Member:

Suggested change
var iotAgentLib = require('../../../lib/fiware-iotagent-lib'),
var iotAgentLib = require('../../../lib/fiware-iotagent-lib'),

Contributor (Author):

Whitespace fixed by prettier - 3c1190c, 60d7dca

@fgalan (Member) commented Nov 5, 2020

A MongoDB cache for IOTAs can be an interesting feature. However, let me clarify that for us the only scenario that makes sense in real-world deployments is number 3 (a cluster of several IoT Agent nodes sharing the same DB, with provisions happening over time, not only an initial one). Numbers 1 and 2 are not realistic.

Taking this into account, the following requirements should be covered by a cache implementation as the one suggested in this PR:

  1. It MUST be possible to enable/disable the cache per configuration group at provision time. That is, a new parameter in the configuration group API would allow specifying whether that configuration group (i.e. the configuration group information itself and the devices associated with it) has to be cached or not (see the illustrative payload below). The rationale is that in productive deployments we can have clients giving more importance to speed than to consistency, and clients the other way around, all of them using the same IOTAs cluster. Thus, a way of setting this in a client’s configuration groups is a must.
  2. It MUST NOT use any hardwired setting. These settings should be part of the configuration (with reasonable defaults), taking backward compatibility into account. Looking at the last version of the PR (as of 5d08c0e) it seems this is the idea at the end. Nice!
  3. It MUST be properly documented, i.e. information about the cache policies, which one is recommended for each deployment type, trade-offs in using the cache (speed vs. consistency), etc. At 5d08c0e we see some basic documentation about the configuration parameters, but we think more topics should be covered (maybe in a separate .md file).
  4. It SHOULD allow “segmentation”. That is, instead of one cache “box” for everything (so a high load in one client configuration group may starve the cache for other clients), several cache “boxes”, one per configuration group. Each configuration group with cache enabled (according to item 1) would use its own isolated cache slice. Some cache parameters would be configurable through the configuration group API (e.g. cache slice size) although others could be common (e.g. cache policy).

Side-note: maybe we can provide one shared cache for groups, and several distinct ones for devices, since in node.js RAM is a hard-limited resource. The size of the device cache belonging to a group should be part of the configuration (maybe in terms of how many devices will be stored).
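
For illustration, requirement 1 could look something like this in a configuration group provisioning payload (the cache field is purely hypothetical and not part of the current API; the other fields follow the usual group provisioning format):

```json
{
    "services": [
        {
            "apikey": "4jggokgpepnvsb2uv4s40d59ov",
            "entity_type": "Motion",
            "resource": "/iot/d",
            "cache": true
        }
    ]
}
```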

@jason-fox mentioned this pull request Nov 18, 2020
@chicco785 (Contributor) commented:

Probably you need something like Redis; in-memory cache and HA don't work well together.

@jason-fox (Contributor, Author) commented Nov 25, 2020

Probably you need something like Redis; in-memory cache and HA don't work well together.

The cache mechanism has an option to connect to Redis. I'll look at this when I have time. I guess the final architecture will look a bit like this (a rough sketch follows the list):

  • Configurable Caching Policy use:
    • None ✅
    • MemCache ✅
    • RedisCache ✅
    • Both. ✅
  • Size and Retention Limits set for MemCache ✅
  • Retention Limits plus connection config set for RedisCache ✅
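
As a rough sketch of the "Both" option using cache-manager's multi-tier support (connection details and the Redis store package are assumptions, not necessarily what the PR uses):

```javascript
const cacheManager = require('cache-manager');
const redisStore = require('cache-manager-redis-store'); // assumed Redis store plugin

// In-memory tier: small and short-lived, checked first.
const memoryCache = cacheManager.caching({ store: 'memory', max: 1000, ttl: 10 });

// Redis tier: shared between IoT Agent instances, checked on a memory miss.
const redisCache = cacheManager.caching({ store: redisStore, host: 'localhost', port: 6379, ttl: 60 });

// multiCaching checks each tier in order on reads; wrap() stores results in every tier.
const multiCache = cacheManager.multiCaching([memoryCache, redisCache]);
```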

It MUST be allowed to enable/disable cache per configuration group at provision time. That is, a new parameter in the configuration group API would allow to specify if that configuration group

  • I think there is a "don't cache this" option in the API somewhere - this could be based around a regex or maybe the payload ✅

It MUST NOT use any hardwired setting.

Updated and magic numbers removed ✅ but more config will be needed for the Redis switches ✅

It MUST be properly documented

Easier to do this once the basic architecture is agreed. ✅

It SHOULD allow “segmentation”.

The config could point to separate Redis instances. ✅

@jason-fox (Contributor, Author)

@mrutid @fgalan - This PR is now ready for review.

@jason-fox requested a review from fgalan December 9, 2020 11:48
@jason-fox (Contributor, Author)

@mapedraza @mrutid @fgalan - Getting back after the Christmas break, is there any progress on this? Is there anything else that needs to be done from my side?

@mapedraza (Collaborator)

Hello Jason, thank you for your contribution! The pull request needs to be checked, but we still need some time to review it in depth. We will let you know when we have reviewed it.

@mapedraza (Collaborator)

I have finally been able to check this PR with the team.

As you know, in a production environment adding another component means adding more complexity to platform operation, so the performance improvement has to be really clear. MongoDB is very fast for key accesses with in-memory datasets, and all three cases (MongoDB, Redis, and MemCache) are penalized by similar network accesses (with Redis and MemCache we are also adding extra writes that do not exist in the base scenario). It is more complex, and it is not clear to me, without tests, that it significantly improves the current scenario (with a well-configured Mongo). Could you provide any figures or a comparison between both scenarios?

Moreover, having Redis and MongoDB at the same time adds architectural redundancy, as they cover the same architectural place (external shared state). Also, MongoDB is still needed, as some other GEs need it for use cases that cannot be covered with Redis (e.g. geo-queries done by the CB are addressable in MongoDB but not in Redis).

When @mrutid was talking about consistency, it would be enough to make it configurable for each config group. Each scenario may or may not require the cache, with a specific cache policy (for instance, cache timeout). As @fgalan also mentioned, segmentation would allow each provision group to have a different, isolated cache slice, preventing a single user from monopolizing the whole system cache.

A good approach to keep in mind when using an in-memory cache, and a real difference compared to Redis, MongoDB, and MemCache, is the Orion subscription cache. The difference with respect to the Orion cache is that, instead of having the same policy for everything (as Orion does), in IOTAs we should have specific policies for each config group.

@jason-fox (Contributor, Author) commented Feb 19, 2021

Two points -

  1. Adding the cache elements (MemCache, Redis or both) to an architecture is entirely optional - you can continue to use a "cacheless" system as before.
  2. Even without a cache, there is a performance change in this PR, since the MongoDB code has been altered to use lean() on each request - this should bring a significant benefit because it avoids hydrating every single MongoDB document when all you need is the JSON.

The Redis cache is just doing the same job it already does when used with QuantumLeap, for example (see #926 (comment)). A cache is optional there too. The reason for picking Redis is that cache-manager supports it - I guess someone could look at adding custom Orion cache support if they wanted.

@jason-fox (Contributor, Author) commented Feb 19, 2021

Relevant StackOverflow discussion of the architectural differences between Redis and Mongo: https://stackoverflow.com/questions/5252577/how-much-faster-is-redis-than-mongodb. The summary is that Redis should be faster on reads, provided that the data lies on a single machine. The data held in a cache shouldn't be too large for that to be an issue.

@jason-fox (Contributor, Author)

After discussions with @mapedraza I'm going to split this into smaller chunks for easier review and consumption.

@jason-fox changed the title from "Cache Mongo-DB calls." to "Cache Mongo-DB calls (in-memory and/or Redis)" Mar 2, 2021
Assume that caching is not used by default and then additively use caching. This is necessary for backwards compatibility and tenant slicing.
@jason-fox marked this pull request as draft April 14, 2021 10:11