Request: paging/limit results sets #24

Open
edperry opened this issue Sep 8, 2018 · 8 comments
edperry commented Sep 8, 2018

When walking a large tree such as 1.3.6.1, the result set can be very large and produce huge documents:

[2018-09-08T12:44:26,747][DEBUG][logstash.outputs.kafka   ] KafkaProducer.send() failed: org.apache.kafka.common.errors.RecordTooLargeException: The message is 1101208 bytes when serialized which is larger than the maximum request size you have configured with the max.request.size configuration. {:exception=>java.util.concurrent.ExecutionException: org.apache.kafka.common.errors.RecordTooLargeException: The message is 1101208 bytes when serialized which is larger than the maximum request size you have configured with the max.request.size configuration.}

So obviously I could raise the maximum message size on the Kafka side, but I think it would be great if there were a feature to split the input's document output based on some criteria.
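
Just to show what I mean by the Kafka-side workaround, it would be something like this in the kafka output (sketch only; the topic name is just an example, and the broker-side message.max.bytes may need raising too):

  output {
    kafka {
      topic_id => "snmp"
      max_request_size => 2097152   # ~2 MB, up from the 1 MB producer default
    }
  }

But that just hides the problem instead of fixing it.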

Some ideas I have are:

  • every entry gets its own event
  • split by MIB tree
  • set a maximum number of fields and emit multiple events based on that maximum
@colinsurprenant
Contributor

@edperry Trying to think about this with you: wouldn't an option be to simply split the walk into multiple sub-walks, each of which would produce its own individual document?
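
For example, something along these lines (rough sketch; the host and sub-tree OIDs are just placeholders), where each input walks a smaller sub-tree and therefore emits a smaller document per poll:

  input {
    snmp {
      hosts => [{ host => "udp:192.168.7.29/161", community => "public" }]
      walk  => ["1.3.6.1.2.1.1"]   # system group only
    }
    snmp {
      hosts => [{ host => "udp:192.168.7.29/161", community => "public" }]
      walk  => ["1.3.6.1.2.1.2"]   # interfaces group only
    }
  }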

@colinsurprenant colinsurprenant added the enhancement New feature or request label Sep 27, 2018
@edperry
Author

edperry commented Oct 23, 2018

Sorry for the delay, I was on vacation.

That may be a good way of doing it. I was mostly focused on the size of the payload going through Kafka, but let me give that a try; I totally missed the idea of handling it as a filter. 👍

@edperry
Author

edperry commented Oct 23, 2018

Hmmm, I just looked at the data structure and I'm not sure how I would split this data; it's not a JSON array.
I don't see how I would split root fields off into other events :( Any suggestions?

"_source": {
    "1.org.dod.internet.mgmt.1.1.9.1.4.8": 8,
    "1.org.dod.internet.mgmt.1.1.9.1.4.4": 8,
    "1.org.dod.internet.mgmt.1.1.5.0": "hd1pbk01lx",
    "1.org.dod.internet.mgmt.1.1.9.1.2.3": "1.3.6.1.6.3.10.3.1.1",
    "1.org.dod.internet.mgmt.1.1.9.1.3.6": "The MIB module for managing IP and ICMP implementations",
    "@timestamp": "2018-10-23T15:06:28.650Z",
    "1.org.dod.internet.mgmt.1.1.9.1.2.6": "1.3.6.1.2.1.4",
    "1.org.dod.internet.mgmt.1.1.6.0": "Linux",
    "1.org.dod.internet.mgmt.1.1.9.1.4.6": 8,
    "host_ip": "192.168.7.29",
    "dst_index": "snmp",
    "1.org.dod.internet.mgmt.1.1.9.1.3.4": "The MIB module for SNMPv2 entities",
    "1.org.dod.internet.mgmt.1.1.9.1.4.2": 8,
    "1.org.dod.internet.mgmt.1.1.9.1.2.5": "1.3.6.1.2.1.49",
    "1.org.dod.internet.mgmt.1.1.9.1.4.3": 8,
    "1.org.dod.internet.mgmt.1.1.9.1.3.8": "View-based Access Control Model for SNMP.",
    "1.org.dod.internet.mgmt.1.1.9.1.2.1": "1.3.6.1.6.3.11.2.3.1.1",
    "type": "snmp-systeminfo",
    "1.org.dod.internet.mgmt.1.1.9.1.3.3": "The SNMP Management Architecture MIB.",
    "1.org.dod.internet.mgmt.1.1.9.1.3.1": "The MIB for Message Processing and Dispatching.",
    "1.org.dod.internet.mgmt.1.1.9.1.4.1": 8,

@edperry
Author

edperry commented Nov 3, 2018

So, thinking more about what you wrote:

I think walking the MIB by groups would be nicer, but you still have to worry about the size of the results. For example, if I polled the installed packages or running processes, the result could still easily exceed any limit.

I think there should be an option, just a flag, to create an event for every MIB object. That would create a lot of little events, but it also makes it possible to attach the meta information/description for each MIB object.

@eperry

eperry commented Nov 5, 2018

Issue logstash-plugins/logstash-integration-snmp#25

I was thinking about converting from

  {
      "OID": oid_value
  }

to something like

  {
      "oid_name": "OID",
      "value": oid_value
  }

So this would make three options:

mode => batch
mode => single
mode => single_name_value
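
Usage could look something like this (hypothetical sketch; the mode option does not exist yet, and the host/OID values are just examples):

  input {
    snmp {
      hosts => [{ host => "udp:192.168.7.29/161", community => "public" }]
      walk  => ["1.3.6.1.2.1.1"]
      mode  => "single_name_value"   # proposed option: one event per OID/value pair
    }
  }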

@eperry

eperry commented Nov 5, 2018

I don't know if raising the total-fields limit and depth limit to higher values is good enough, but let me know what you think:

  "settings": {
    "index": {
      "mapping": {
        "total_fields": {
          "limit": "600000"
        },
        "depth": {
          "limit": "100"
        }
      },
      "refresh_interval": "5s"
    }
  }
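
(If I read the docs right, those mapping limits are dynamic index settings, so something like this should also work on an existing index; the index pattern is just an example:)

  PUT snmp-*/_settings
  {
    "index.mapping.total_fields.limit": "600000",
    "index.mapping.depth.limit": "100"
  }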

@colinsurprenant
Contributor

Looking at this and #42; will get back to you shortly 👀

@colinsurprenant colinsurprenant self-assigned this Nov 7, 2018
@edperry
Author

edperry commented Nov 19, 2018

I was just thinking about this:

  {
      "oid_name": "OID",
      "value": oid_value
  }

oid_value could be a string, an int, etc., so I don't think this single-value format will work; or maybe we just create multiple type-specific fields:

  {
      "oid_name": "OID",
      "oid_int_val": oid_int_value,
      "oid_str_val": "oid_str_value",
      ... other types? ...
  }

I am not sure anymore that creating a mode => single_name_value is a great idea, because it is not a straightforward name/value combination when it comes to ELK.
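
If we did go the multiple-fields route, the pipeline could route each value by type, roughly like this (sketch; the field names are just the ones from my example above):

  filter {
    ruby {
      code => '
        v = event.get("value")
        case v
        when Integer then event.set("oid_int_val", v)    # whole numbers
        when Float   then event.set("oid_float_val", v)  # decimals
        else              event.set("oid_str_val", v.to_s)
        end
        event.remove("value")
      '
    }
  }

That keeps each field a single Elasticsearch type, at the cost of sparser documents.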
