splunkninja

The dojo of Splunk. Learn, share, teach, mentor.

Michael Wilde

Getting more intelligence on how much data splunk is eating.

As you know, there is a License pane in Splunk Manager (admin interface) that lets you know your "peak daily volume", and that figure is compared against your license volume. (free, or enterprise)

In the Splunk search app, (as of version 4.0.5) there is an "Index Activity" status dashboard in the search app (http://yoursplunkserver:8000/en-US/app/search/index_status). It does give you more information such as:

  • Top five sourcetypes (by total KB indexed) in the last 24 hours
  • Indexing rate over time by sourcetype (including internal indexes)
  • Indexing amount per server

But most people want more detail, such as:

  • For each host thats sending data, how much is it sending to Splunk?
  • Which sourcetypes are consuming the lions share of data on a daily basis?
  • Which sources are generating the most data today?
  • Which indexes are receiving the bulk of the data?

Splunk eats up its own metrics logs — as you would expect it to (after all.. its a log). Those logs are stored in and index called "_internal". We can use Splunk search to ask Splunk a bit more about what its really doing when its indexing.

If you look at the search below, notice a few things.

I have it set for the last month (monthsago=1), so it will look over the last month and summarize the amount of data indexed. You can change the timerange—in search language, or just in the GUI time picker.
I am searching for “group=per_host_thruput” from that specific log. Additionally, since splunk is measuring in kilobytes, i am using the "eval" search command to multiply it so you can see it in megabytes, and thus creating a new field called "totalmb" -- which if ran, with the span=1d, will give you total megabytes per day. Get it?

index=_internal metrics kb series!=_* "group=per_host_thruput" monthsago=1 | eval indexed_mb = kb / 1024 | timechart fixedrange=t span=1d sum(indexed_mb) by series | rename sum(indexed_mb) as totalmb

After the summary table loads in your browser.. Click “Show Report”

But.. What other dimensions can I get from the data being indexed? I can modify the search and change the “group=’ and get summaries for each different vector.

group=per_host_thruput -
group=per_index_thruput
group=per_source_thruput
group=per_sourcetype_thruput

Example:


Turn it in to a graph by clicking "Show Report"


Got questions, fire away. Remember, these are pretty darn good estimates of what you are doing on a daily basis, but the License Manager rules all -- and does not use the log data to calculate your daily volume -- actually its the converse -- License Manager helps spit that log data out, and stores its understanding of "peak" in its own special database.

Comment

You need to be a member of splunkninja to add comments!

Join splunkninja

Michael Wilde Comment by Michael Wilde on April 18, 2010 at 6:30pm
Steve...

If you want to alert on the actual violation, so you know when it happens, this search should work. Maybe run that search every day and alert if number of events are greater than zero.

index=_internal source=*license_audit.log LicenseManager-Audit | delta quotaExceededCount as quotadiff | stats first(quotadiff) as quotadiff | search quotadiff>0
Michael Wilde Comment by Michael Wilde on April 18, 2010 at 6:05pm
Tony... you should be able to just add "useother=f" to your timechart command.
Steve Comment by Steve on April 13, 2010 at 11:14am
I have been trying to figure out how to create an alert that would notifiy me when I have exceeded my license limit. I have a saved search that reports on license usage, but in setting up the search, I am not sure what value to alert on (i.e. what is the condition I need to match on). Help? Thank you.
Tony Reinke Comment by Tony Reinke on January 18, 2010 at 11:50am
This is great! How would I go about not having an OTHER column? I would like to see a report of all my servers.

Latest Splunk Community Postings

Loading feed

Latest Splunk Forum Posts

Loading feed

© 2010   Created by Michael Wilde.   Powered by .

Badges  |  Report an Issue  |  Terms of Service

Sign in to chat!