The API Fetcher offers way to configure fetching data from APIs every defined scrape interval.
Download the logzio-api-fetcher image:
docker pull logzio/logzio-api-fetcher
Create a local config file config.yaml
.
Configure your API inputs under apis
. For every API, mention the input type under type
field:
General API
For structuring custom API calls use type general
API with the parameters below.
Parameter Name | Description | Required/Optional | Default |
---|---|---|---|
name | Name of the API (custom name) | Optional | the defined url |
url | The request URL | Required | - |
headers | The request Headers | Optional | {} |
body | The request body | Optional | - |
method | The request method (GET or POST ) |
Optional | GET |
pagination | Pagination settings if needed (see options below) | Optional | - |
next_url | If needed to update the URL in the next request based on the last response. Supports using variables (see below) | Optional | - |
next_body | If needed to update the body in the next request based on the last response. Supports using variables (see below) | Optional | - |
response_data_path | The path to the data inside the response | Optional | response root |
additional_fields | Additional custom fields to add to the logs before sending to logzio | Optional | Add type as api-fetcher |
scrape_interval | Time interval to wait between runs (unit: minutes ) |
Optional | 1 (minute) |
If needed, you can configure pagination.
Parameter Name | Description | Required/Optional | Default |
---|---|---|---|
type | The pagination type (url , body or headers ) |
Required | - |
url_format | If pagination type is url , configure the URL format used for the pagination. Supports using variables (see below). |
Required if pagination type is url |
- |
update_first_url | True or False ; If pagination type is url , and it's required to append new params to the first request URL and not reset it completely. |
Optional if pagination type is url |
False |
headers_format | If pagination type is headers , configure the headers format used for the pagination. Supports using variables (see below). |
Required if pagination type is headers |
- |
body_format | If pagination type is body , configure the body format used for the pagination. Supports using variables (see below). |
Required if pagination type is body |
- |
stop_indication | When should the pagination end based on the response. (see options below). | Optional (if not defined will stop on max_calls ) |
- |
max_calls | Max calls that the pagination can make. (Supports up to 1000) | Optional | 1000 |
Parameter Name | Description | Required/Optional | Default |
---|---|---|---|
field | The name of the field in the response body, to search the stop indication at | Required | - |
condition | The stop condition (empty , equals or contains ) |
Required | - |
value | If condition is equals or contains , the value of the field that we should stop at |
Required if condition is equals or contains |
- |
Using variables allows taking values from the response of the first request, to structure the request after it.
Mathematical operations +
and -
are supported, in order to add or reduce a number from the variable value.
Use case examples for variable usage:
- Update a date filter at every call
- Update a page number in pagination
To use variables:
- Wrap the variable name in curly brackets
- Provide the full path to that variable in the response
- Add
res.
prefix to the path.
Example: Say this is my response:
{
"field": "value",
"another_field": {
"nested": 123
},
"num_arr": [1, 2, 3],
"obj_arr": [
{
"field2": 345
},
{
"field2": 567
}
]
}
Paths to fields values are structured like so:
{res.field}
="value"
{res.another_field.nested}
=123
{res.num_arr.[2]}
=3
{res.obj_arr.[0].field2}
=345
Using the fields values in the next_url
for example like so:
next_url: https://logz.io/{res.field}/{res.obj_arr[0].field2}
Would update the URL at every call to have the value of the given fields from the response, in our example the url for the next call would be:
https://logz.io/value/345
And in the call after it, it would update again according to the response and the next_url
structure, and so on.
OAuth API
For structuring custom OAuth calls use type oauth
API with the parameters below.
Parameter Name | Description | Required/Optional | Default |
---|---|---|---|
name | Name of the API (custom name) | Optional | the defined url |
token_request | Nest here any detail relevant to the request to get the bearer access token. (Options in General API) | Required | - |
data_request | Nest here any detail relevant to the data request. (Options in General API) | Required | - |
scrape_interval | Time interval to wait between runs (unit: minutes ) |
Optional | 1 (minute) |
additional_fields | Additional custom fields to add to the logs before sending to logzio | Optional | Add type as api-fetcher |
Azure Graph
For Azure Graph, use type azure_graph
with the below parameters.
Parameter Name | Description | Required/Optional | Default |
---|---|---|---|
name | Name of the API (custom name) | Optional | azure api |
azure_ad_tenant_id | The Azure AD Tenant id | Required | - |
azure_ad_client_id | The Azure AD Client id | Required | - |
azure_ad_secret_value | The Azure AD Secret value | Required | - |
date_filter_key | The name of key to use for the date filter in the request URL params | Optional | createdDateTime |
data_request.url | The request URL | Required | - |
additional_fields | Additional custom fields to add to the logs before sending to logzio | Optional | - |
days_back_fetch | The amount of days to fetch back in the first request | Optional | 1 (day) |
scrape_interval | Time interval to wait between runs (unit: minutes ) |
Optional | 1 (minute) |
Azure Mail Reports
For Azure Mail Reports, use type azure_mail_reports
with the below parameters.
Parameter Name | Description | Required/Optional | Default |
---|---|---|---|
name | Name of the API (custom name) | Optional | azure api |
azure_ad_tenant_id | The Azure AD Tenant id | Required | - |
azure_ad_client_id | The Azure AD Client id | Required | - |
azure_ad_secret_value | The Azure AD Secret value | Required | - |
start_date_filter_key | The name of key to use for the start date filter in the request URL params. | Optional | startDate |
end_date_filter_key | The name of key to use for the end date filter in the request URL params. | Optional | EndDate |
data_request.url | The request URL | Required | - |
additional_fields | Additional custom fields to add to the logs before sending to logzio | Optional | - |
days_back_fetch | The amount of days to fetch back in the first request | Optional | 1 (day) |
scrape_interval | Time interval to wait between runs (unit: minutes ) |
Optional | 1 (minute) |
Azure General API
For structuring custom general Azure API calls use type azure_general
API with the parameters below.
Parameter Name | Description | Required/Optional | Default |
---|---|---|---|
name | Name of the API (custom name) | Optional | azure api |
azure_ad_tenant_id | The Azure AD Tenant id | Required | - |
azure_ad_client_id | The Azure AD Client id | Required | - |
azure_ad_secret_value | The Azure AD Secret value | Required | - |
data_request | Nest here any detail relevant to the data request. (Options in General API) | Required | - |
additional_fields | Additional custom fields to add to the logs before sending to logzio | Optional | - |
days_back_fetch | The amount of days to fetch back in the first request | Optional | 1 (day) |
scrape_interval | Time interval to wait between runs (unit: minutes ) |
Optional | 1 (minute) |
Cloudflare
For Cloudflare API, use type as cloudflare
.
By default cloudflare
API type:
- has built in pagination settings
- sets the
response_data_path
toresult
field.
Parameter Name | Description | Required/Optional | Default |
---|---|---|---|
name | Name of the API (custom name) | Optional | the defined url |
cloudflare_account_id | The CloudFlare Account ID | Required | - |
cloudflare_bearer_token | The Cloudflare Bearer token | Required | - |
url | The request URL | Required | - |
next_url | If needed to update the URL in next requests based on the last response. Supports using variables (see General API) | Optional | - |
additional_fields | Additional custom fields to add to the logs before sending to logzio | Optional | - |
scrape_interval | Time interval to wait between runs (unit: minutes ) |
Optional | 1 (minute) |
pagination_off | True if builtin pagination should be off, False otherwise | Optional | False |
1Password
By default 1password
API type has built in pagination settings and sets the response_data_path
to items
field.
Parameter Name | Description | Required/Optional | Default |
---|---|---|---|
name | Name of the API (custom name) | Optional | the defined url |
onepassword_bearer_token | The 1Password Bearer token | Required | - |
url | The request URL | Required | - |
method | The request method (GET or POST ) |
Optional | GET |
additional_fields | Additional custom fields to add to the logs before sending to logzio | Optional | - |
days_back_fetch | The amount of days to fetch back in the first request. Applies a filter on 1password start_time parameter. |
Optional | - |
scrape_interval | Time interval to wait between runs (unit: minutes ) |
Optional | 1 (minute) |
onepassword_limit | 1Password limit for number of events to return in a single request (allowed range: 100 to 1000) | Optional | 100 |
pagination_off | True if builtin pagination should be off, False otherwise | Optional | False |
Dockerhub
For dockerhub audit logs, use type dockerhub
with the below parameters.
Parameter Name | Description | Required/Optional | Default |
---|---|---|---|
name | Name of the API (custom name) | Optional | the defined url |
dockerhub_user | DockerHub username | Required | - |
dockerhub_token | DockerHub personal access token or password | Required | - |
url | The request URL | Required | - |
next_url | URL for the next page of results (used for pagination) | Optional | - |
method | The request method (GET or POST ) |
Optional | GET |
days_back_fetch | Number of days to fetch back in the first request. Adds a filter on from parameter. |
Optional | -1 |
refresh_token_interval | Interval in minutes to refresh the JWT token | Optional | 30 (minute) |
scrape_interval | Time interval to wait between runs (unit: minutes ) |
Optional | 1 (minute) |
additional_fields | Additional custom fields to add to the logs before sending to logzio | Optional | - |
And your logzio output under logzio
:
Parameter Name | Description | Required/Optional | Default |
---|---|---|---|
url | The logzio Listener address | Optional | https://listener.logz.io:8071 |
token | The logzio shipping token | Required | - |
apis:
- name: random
type: general
additional_fields:
type: random
...
- name: another api
type: oauth
additional_fields:
type: oauth-api
...
logzio:
url: https://listener-ca.logz.io:8071 # for us-east-1 region delete url param (default)
token: <<SHIPPING_TOKEN>>
In the path where you saved your config.yaml
, run:
docker run --name logzio-api-fetcher \
-v "$(pwd)":/app/src/shared \
logzio/logzio-api-fetcher
The default logging level is INFO
. To change it, add --level
flag to the command:
docker run --name logzio-api-fetcher \
-v "$(pwd)":/app/src/shared \
logzio/logzio-api-fetcher \
--level DEBUG
Available Options: INFO
, WARN
, ERROR
, DEBUG
When you stop the container, the code will run until completion of the iteration. To make sure it will finish the iteration on time, please give it a grace period of 30 seconds when you run the docker stop command:
docker stop -t 30 logzio-api-fetcher
- 0.3.1:
- Handle internal logger configuration in memory instead of disk
- 0.3.0:
- Add support for dockerhub audit logs
- 0.2.2:
- Resolve Azure mail reports bug
- 0.2.1:
- Add 1Password Support
- Add
next_body
support to allow more customization in general settings - Support integers and boolean as values in pagination 'equals' stop condition
- 0.2.0:
- Breaking changes!!
- Deprecate configuration fields:
credentials
,start_date_name
,end_date_name
,json_paths
,settings.days_back_fetch
settings.time_interval
>>scrape_interval
- OAuth Changes:
token_http_request
>>token_request
data_http_request
>>data_request
- All APIs now nested in the config under
apis
and supported types are:general
,oauth
,azure_general
,azure_graph
,azure_mail_reports
,cloudflare
.
- Upgrade General API settings to allow more customization
- Adding Authorization Header is required, if needed, due to
credentials
field deprecation. - Adding
Content-Type
may be required based on the API
- Adding Authorization Header is required, if needed, due to
- Deprecate Cisco SecureX
- Deprecate configuration fields:
- New Features!!
- Add Pagination support
- Add variables support in
next_url
and pagination settings- Supports
+
and-
mathematical operations on values - Supports arrays
- Supports
- Upgrade python version
3.9
>>3.12
- Add Cloudflare Support
- Add option to control logging level with
--level
flag.
- Breaking changes!!
- 0.1.3:
- Support another date format for
azure_graph
.
- Support another date format for
- 0.1.2:
- Support another date format for
security_alerts
.
- Support another date format for
- 0.1.1:
- Bug fix for
security_alerts
task fails on second cycle.
- Bug fix for
- 0.1.0:
- Added
azure_mail_reports
type.
- Added
- 0.0.6:
- Improved documentation.
- Added error log.
- 0.0.5:
- Bug fix for
azure_graph
task fails on second cycle. - Changed start date filter mechanics for auth_api.
- Bug fix for
- 0.0.4:
- Bug fix for
azure_graph
task fails when trying to get start date from file.
- Bug fix for