Update platform_settings.py#

The /etc/eclecticiq/platform_settings.py file allows you to set advanced configuration parameters for your Intelligence Center instance.

Once you’ve installed EclecticIQ Intelligence Center, you should review the /etc/eclecticiq/platfrom_settings.py file to make sure the parameters set suit your environment.

Note

For changes to platfrom_settings.py to take effect, you must restart Intelligence Center services. See Restart services for changes to take effect .

Restart services for changes to take effect #

After making changes to platform_settings.py, you must restart the backend services for your changes to take effect.

To restart the services, run as root:

systemctl restart eclecticiq-platform-backend-services

Secrets #

Set secret key and session token #

When you install, update or reinstall EclecticIQ Intelligence Center, change the following default values in the platform_settings.py configuration file:

Attribute name	Default value	Description
`SECRET_KEY`	`''`	Required. Must be set for EclecticIQ Intelligence Center to start. Set this to a random string at least 32 characters long, enclosed within quotes (`""` or `''`).

Attribute name

Default value

Description

SECRET_KEY

''

Required. Must be set for EclecticIQ Intelligence Center to start.

Set this to a random string at least 32 characters long, enclosed within quotes ("" or '').

Tip

You can use /dev/urandom to generate such a random string. Run:

cat /dev/urandom | tr -dc a-zA-Z0-9 | fold -w 32 | head -n 1

Set reset password link expiration #

Attribute name	Default value	Description
`ONE_TIME_PASSWORD_EXPIRATION_MINUTES`	60	Time for password reset link to expire, in minutes.

To reset and to change their password, click Reset password on the sign-in page.

EclecticIQ Intelligence Center sends you an automatic email message with a link to a password reset page, where they can complete the operation.

By default, the password reset link in the automatic email expires 60 minutes after sending the message.

Data retention and policies #

Enable observable actions in policies #

Data retention policies allow you to set policies that perform a Delete observables action when they run.

This is disabled by default from 2.12.0. Policies will skip any Delete observables actions that are set, while the rest of the policy runs as configured.

Attribute name	Default value	Description
`DISABLE_OBSERVABLE_RETENTION_POLICIES`	`True`	(Not recommended) Set to `False` to allow policies to run Delete observables actions. May cause high resource consumption.

Attribute name

Default value

Description

DISABLE_OBSERVABLE_RETENTION_POLICIES

True

(Not recommended) Set to False to allow policies to run Delete observables actions.

May cause high resource consumption.

Prune old records in database #

You can configure EclecticIQ Intelligence Center to remove old records from blob and content_block tables in the PostgreSQL database after a given number of days.

The blob table stores raw data downloaded by incoming feeds that is subsequently processed.
The content_block table stores data that is distributed through outgoing feeds.

By default, the data in these tables is kept forever.

To set EclecticIQ Intelligence Center to delete records from these tables after a given number of days, set the following parameters:

Attribute name	Default value	Description
`CONTENT_BLOCK_RETENTION`	`None`	Number of days to retain records in the `content_block` table.
`BLOB_RETENTION`	`None`	Number of days to retain records in the `blob` table.

Change data retention period of logs and metrics indices #

These Elasticsearch indices are periodically pruned:

logstash*: Aggregates event log data.
statsite*: Aggregates metrics to monitor system health and performance.

Set how long data in these indices are kept by configuring these parameters:

Parameter	Type	Description
`LOG_RETENTION`	int	The value represents days. Sets the total number of days Elasticsearch stores Logstash log entries, before deleting them from the corresponding log indices. Default value: `365` (1 year)
`METRIC_RETENTION`	int	The value represents days. Sets the total number of days Elasticsearch stores Statsite metrics data entries, before deleting them from the corresponding metrics indices. Default value: `730` (2 years)

Parameter

Type

Description

LOG_RETENTION

int

The value represents days.

Sets the total number of days Elasticsearch stores Logstash log entries, before deleting them from the corresponding log indices.

Default value: 365 (1 year)

METRIC_RETENTION

int

The value represents days.

Sets the total number of days Elasticsearch stores Statsite metrics data entries, before deleting them from the corresponding metrics indices.

Default value: 730 (2 years)

If you find that the eiq.utilities.prune_indices task is failing with a timeout, you can extend the task’s time limit by changing the CELERY_X_EIQ_TIME_LIMIT_PER_TASK["eiq.utilities.prune_indices"] parameter:

# Time limits for specific tasks (in seconds)
CELERY_X_EIQ_TIME_LIMIT_PER_TASK = {
    # ...
    "eiq.utilities.prune_indices": 4 * 60 * 60,  # 4 hours
    # ...
}

Audit trail logs #

Audit trail log levels are configured with AUDIT_TRAIL_LOG_LEVEL. See Audit trail logs.

Limit the number of days audit trail logs are kept by setting AUDIT_TRAIL_RETENTION. By default, this is set to 360 days (AUDIT_TRAIL_RETENTION = 360).

Note

AUDIT_TRAIL_ENABLED is deprecated from 2.13.0 onward. Use AUDIT_TRAIL_LOG_LEVEL instead.

Data directories #

Allow list for feed mount points #

To use directories on EclecticIQ Intelligence Center host for incoming and outgoing feeds (such as with the “Mount point download” and “Mount point upload” transport types), you must explicitly set the directories the transport type is allowed to use.

Set these parameters:

Attribute name	Default value	Description
`MOUNT_POINT_POLL_ALLOWED_DIRECTORIES`	`[]`	Python list of allowed paths on EclecticIQ Intelligence Center host that incoming feed transport types can use. Example value: [ "/mnt/", "/media/", "/media/data/" ]
`MOUNT_POINT_PUSH_ALLOWED_DIRECTORIES`	`[]`	Python list of allowed paths on EclecticIQ Intelligence Center host that outgoing feed transport types can use. Example value: [ "/mnt/", "/media/", "/media/data/" ]

Attribute name

Default value

Description

MOUNT_POINT_POLL_ALLOWED_DIRECTORIES

[]

Python list of allowed paths on EclecticIQ Intelligence Center host that incoming feed transport types can use.

Example value:

[ "/mnt/", "/media/", "/media/data/" ]

MOUNT_POINT_PUSH_ALLOWED_DIRECTORIES

[]

Python list of allowed paths on EclecticIQ Intelligence Center host that outgoing feed transport types can use.

Example value:

[ "/mnt/", "/media/", "/media/data/" ]

Services #

TAXII 2.1 API root service #

Tip

For more information about the TAXII 2.1 API root service, see Configure Opentaxii server

Configure the TAXII 2.1 API root service (/taxii2/api_root/). As root:

Edit /etc/eclecticiq/platform_settings.py.

Add or change the TAXII2_API_ROOT attribute.

This table describes the possible keys and values:

Attribute name	Default	Description
`TAXII2_API_ROOT`	TAXII2_API_ROOT = { "title": "...", "description": "...", "is_public": True, }	Attribute that configures the TAXII 2 API root. Keys in this dictionary are described in this table.
`TAXII2_API_ROOT["title"]`	`"EIQ TAXII 2.1 api root"`	Title assigned to the API root.
`TAXII2_API_ROOT["description"]`	`"The EIQ TAXII 2.1 api root for passive outgoing feeds"`	Description assigned to API root.
`TAXII2_API_ROOT["is_public"]`	`True`	(Recommended) Set to `False` to restrict access to the following endpoints: `/taxii2/api_root/` `/taxii2/api_root/collections/` Users need to authenticate by sending their API key as a `Bearer` token, or use Basic authentication.

Save platform_settings.py.

Restart the OpenTaxii service:

systemctl restart eclecticiq-platform-backend-opentaxii

Elasticsearch: specify number of shards #

Specify the number of Elasticsearch shards per index in your cluster with these parameters:

These values are only used when:

Installing IC for the first time.
When an Elasticsearch data migration during IC upgrades contains a migration for a given Elasticsearch index.

Attribute name	Default value	Description
`ELASTICSEARCH_SHARD_NUMBER`	`1`	Number. Set this to the expected number of shards per index in your Elasticsearch cluster. This is a blanket value for all indices in your Elasticsearch cluster. `SEARCH_INDEX_SHARDS` takes higher precedence. If both are set in `platform_settings.py`, application indices that are set in `SEARCH_INDEX_SHARDS` will take their shard counts from `SEARCH_INDEX_SHARDS` instead.
`SEARCH_INDEX_SHARDS`	{ "stix": 5, "relations": 5, "extracts-unique": 3, "extracts": 3, }	Dictionary of values. New in IC 3.5.0. Set individual dictionary values to the expected number of shards per index in your Elasticsearch cluster. If set, this takes precedence over `ELASTICSEARCH_SHARD_NUMBER`.

Attribute name

Default value

Description

ELASTICSEARCH_SHARD_NUMBER

1

Number. Set this to the expected number of shards per index in your Elasticsearch cluster.

This is a blanket value for all indices in your Elasticsearch cluster. SEARCH_INDEX_SHARDS takes higher precedence. If both are set in platform_settings.py, application indices that are set in SEARCH_INDEX_SHARDS will take their shard counts from SEARCH_INDEX_SHARDS instead.

SEARCH_INDEX_SHARDS

{
  "stix": 5,
  "relations": 5,
  "extracts-unique": 3,
  "extracts": 3,
}

Dictionary of values.

New in IC 3.5.0.

Set individual dictionary values to the expected number of shards per index in your Elasticsearch cluster.

If set, this takes precedence over ELASTICSEARCH_SHARD_NUMBER.

SEARCH_INDEX_SHARDS takes precedence over ELASTICSEARCH_SHARD_NUMBER. Prefer SEARCH_INDEX_SHARDS when setting shard counts for application-related indices.

For example, with these values:

ELASTICSEARCH_SHARD_NUMBER = 1
SEARCH_INDEX_SHARDS = {
  "stix": 5,
  "relations": 5,
  "extracts-unique": 3,
  "extracts": 3,
}

The effective values are:

stix: 5 shards
relations: 5 shards
extracts-unique: 3 shards
extracts: 3 shards

Elasticsearch: specify data nodes #

EclecticIQ Intelligence Center needs to know the network location of your Elasticsearch data nodes in order to query the Elasticsearch cluster.

Attribute name	Default value	Description
`SEARCH_URLS`	`[]`	List. One or more network addresses to reach Elasticsearch data nodes at. Example value: [ "https://127.0.0.1:9200", "https://127.0.0.1:9201", "https://es-node-3.example.com:9200" ]

Attribute name

Default value

Description

SEARCH_URLS

[]

List. One or more network addresses to reach Elasticsearch data nodes at.

Example value:

[ "https://127.0.0.1:9200", "https://127.0.0.1:9201", "https://es-node-3.example.com:9200" ]

Note

DEPRECATED: EclecticIQ Intelligence Center 2.11 and older use the SEARCH_URL attribute instead, which only takes a single string.

Enable Statsite in EclecticIQ Intelligence Center #

To enable the statsite service, set the following parameters:

Attribute name	Default value	Description
`STATSD_ENABLED`	`True`	Boolean. Set to `False` to disable the service.
`STATSD_HOST`	`"127.0.0.1"`	IP address or fully qualified domain name the statsite service can be accessed at.
`STATSD_PORT`	`8125`	Port the statsite service is bound to on `STATSD_HOST`.

Set expected Celery host count for health checks #

CELERY_X_EIQ_HOSTS sets the number of Celery hosts that EclecticIQ Intelligence Center should expect.

Attribute name	Default value	Description
`CELERY_X_EIQ_HOSTS`	`1`	Number hosts running the Celery service.

When pinging Celery to check its health status, EclecticIQ Intelligence Center expects to receive as many replies as the integer value assigned to CELERY_X_EIQ_HOSTS.

If you set the value to a number that is lower than the actual amount of hosts that run Celery queues, the health status check for Celery fails, and it is highlighted in red in EclecticIQ Intelligence Center system health pane in the GUI.
If you set the value to a number that is greater than the actual amount of hosts that run Celery queues, the health status check for Celery may take a long time, since the Intelligence Center keeps pinging and waiting for responses from non-existing hosts before it gives up.

Recommended values:

For environments with a predefined, static amount of Celery hosts, the default value of 1 is a good starting point.

It means that EclecticIQ Intelligence Center instance expects there to be one host running Celery queues.
For environments with a dynamic amount of Celery hosts such as Kubernetes deployments, set the value to 0 to fall back on standard timeouts.

To inspect how long it takes to run Celery checks, including health checks, from the command line run eiq-platform diagnose run.

For more information about diagnostic tests, see Run diagnostic tests.
For more information about the command line tool, see eiq-platform command line.

Ingestion and Enrichment #

Set EXTENSION_SDK_STASH_REDIS_URL #

Extensions released for IC 3.5.0 and later may use the new stash interface.

These extensions require EXTENSION_SDK_STASH_REDIS_URL to be set in /etc/eclecticiq/platform_settings.py.

Tip

If you encounter the error redis.exceptions.AuthenticationError: Authentication required when running an incoming feed, outgoing feed, or enricher, you must set EXTENSION_SDK_STASH_REDIS_URL.

The value of EXTENSION_SDK_STASH_REDIS_URL should be set to:

EXTENSION_SDK_STASH_REDIS_URL = "redis://<redis_user>:<redis_password>@<redis_url>:<redis_port>/1<query_parameters>"

Note

EXTENSION_SDK_STASH_REDIS_URL should use a different Redis database. For most deployments, it is sufficient to just use database 1 e.g.: redis://<redis_url>:6379/1.

For most deployments, we use the same instance specified in REDIS_URL. E.g.:

REDIS_URL = 'redis://:[email protected]:6379'
EXTENSION_SDK_STASH_REDIS_URL = 'redis://:[email protected]:6379/1'

# If REDIS_URL uses query parameters to specify an SSL connection
REDIS_URL = "rediss://:[email protected]:6380?ssl_cert_reqs=required&ssl_ca_certs=/etc/eclecticiq-ca.crt"
EXTENSION_SDK_STASH_REDIS_URL = "rediss://:[email protected]:6380/1?ssl_cert_reqs=required&ssl_ca_certs=/etc/eclecticiq-ca.crt"

Set package size limits #

The platform_settings.py configuration file includes settings that limits the file sizes of packages handled by:

Incoming feeds
Outgoing feeds
Manual uploads

The hard limit for all file sizes is 100MB, or 100 * 1024 * 1024.

Caution

Changing these values may have a negative impact on performance.

Setting large size limits may lead to issues where tasks time out before it can finish processing the file, or an ingestion pipeline that is slowed down by tasks that have to process large files.

Attribute name	Default value	Description
`MAX_BLOB_SIZE`	`20 * 1024 * 1024`	This sets the maximum allowed file size for packages ingested through incoming feeds, or published through outgoing feeds. The value here is expressed in bytes, and can be a number or an expression that evaluates to a number. For example, the default value is the expression `20 * 1024 * 1024` which evaluates to 20MB.
`MAX_UPLOADED_BLOB_SIZE`	`MAX_BLOB_SIZE / 2`	This sets the maximum allowed filed size allowed for files uploaded through Manual uploads. The value here is expressed in bytes, and can be a number or an expression that evaluates to a number. We recommend leaving it at the default of `MAX_BLOB_SIZE / 2`, or half the maximum allowed file size set for `MAX_BLOB_SIZE`.

Attribute name

Default value

Description

MAX_BLOB_SIZE

20 * 1024 * 1024

This sets the maximum allowed file size for packages ingested through incoming feeds, or published through outgoing feeds.

The value here is expressed in bytes, and can be a number or an expression that evaluates to a number.

For example, the default value is the expression 20 * 1024 * 1024 which evaluates to 20MB.

MAX_UPLOADED_BLOB_SIZE

MAX_BLOB_SIZE / 2

This sets the maximum allowed filed size allowed for files uploaded through Manual uploads.

The value here is expressed in bytes, and can be a number or an expression that evaluates to a number.

We recommend leaving it at the default of MAX_BLOB_SIZE / 2, or half the maximum allowed file size set for MAX_BLOB_SIZE.

Tune timeout limits for ingestion tasks #

Caution

Avoid changing the default values for these parameters.

Changing these values may have a negative impact on performance.

If you are encountering timeouts when running certain tasks, you may want to adjust the default Celery task time limits in the CELERY_X_EIQ_TIME_LIMIT_PER_FAMILY.

# Time limits per task families (in seconds)
CELERY_X_EIQ_TIME_LIMIT_PER_FAMILY = {
    "eiq.enrichers": 2 * 60,  # 2 minutes
    "eiq.incoming-transports": 8 * 60 * 60,  # 8 hours
    "eiq.outgoing-transports": 2 * 60,  # 2 minutes
}

Parameter	Type	Description
`eiq.enrichers`	int	Set a time limit in seconds. Sets time limit for each Enrichment task run.
`eiq.incoming-transports`	int	Set a time limit in seconds. Sets time limit for each incoming feed run.
`eiq.outgoing-transports`	int	Set a time limit in seconds. Sets time limit for each outgoing feed run.

Parameter

Type

Description

eiq.enrichers

int

Set a time limit in seconds.

Sets time limit for each Enrichment task run.

eiq.incoming-transports

int

Set a time limit in seconds.

Sets time limit for each incoming feed run.

eiq.outgoing-transports

int

Set a time limit in seconds.

Sets time limit for each outgoing feed run.

Tune the ingestion scheduler #

Caution

Avoid changing the default values for these parameters.

Changing these values may have a negative impact on performance.

Intelligence Center ingestion attempts to optimize system resources to ingest data as quickly and as efficiently as possible. It features jobs to monitor feed activity, to prioritize feeds, to distribute and rotate feed priority, as well as a ranking system to periodically assess and reassign priority to incoming feeds. The score decays over time in a similar way to entity half-life. This way, EclecticIQ Intelligence Center distributes the ingestion workload to minimize idle time and bottlenecks.

Tune the ingestion scheduler by changing values in the INGESTION_QUUZ_SCHEDULER_OPTIONS parameter.

INGESTION_QUUZ_SCHEDULER_OPTIONS = {
    "prefer_full_batches": True,
    "score_half_time": 24 * 60 * 60, 
    "sweep_same_depth_max_jobs": 500,
    "sweep_same_depth_max_enqueued_jobs": 1000,
    "sweep_same_depth_max_time_total": 60,
    "sweep_same_depth_max_time_between": 30,
}

Note

Job depth: The “job depth” is then the number of hops in that chain of dependencies: if Job A depends on Job B, and Job B depends on Job C, then Job C has a depth of 2.

Jobs with the “higher” depth are more likely, but not guaranteed, to be scheduled to run first: if Job A has a depth of 1, and Job B has a depth of 10, then Job B is more likely to run before Job A.

If one or more jobs depend on the same job, then they have the same “job depth”: if Job A depends on Job B and Job C, then both Job C and Job B have a depth of 1.

The depth of a job is ‘lifted’ when all its dependencies finish running: if Job B depends on Job A, and Job A finishes running, then Job B has a depth of 0.

Parameter	Type	Description
`prefer_full_batches`	boolean	`True` by default. Allows the scheduler to continue with lower depth jobs after executing a partial batch, allowing the scheduler to distribute its time across jobs.
`score_half_time`	int	Set a time limit in seconds. Ingestion tasks are assigned a priority score that decays over time, so that ingestion workers can distribute their time fairly across all running tasks. This parameter sets the time taken for a task’s priority score to be reduced by half (a half-life).
`sweep_same_depth_max_jobs`	int	Sets the maximum number of concurrent jobs at a given job depth, before ingestion workers can move on to jobs with the highest depth value. If no jobs are available at the current depth level, the scheduler moves one level up, and it searches there for jobs that it can execute.
`sweep_same_depth_max_enqueued_jobs`	int	Similar to `sweep_same_depth_max_jobs`. It sets the the maximum number of waiting jobs that can be added to a queue at a given depth level, before an ingestion worker can move on to jobs with the highest depth value.
`sweep_same_depth_max_time_total`	int	Set a time limit in seconds. Sets the maximum mount of time workers can spend searching for jobs to execute at a given depth level, before they move on to jobs with the highest depth value. In practice, this is how long ingestion workers has for transforming packages, before moving on to process the corresponding entities.
`sweep_same_depth_max_time_between`	int	Set a time limit in seconds. This limits the time spent looking for the next job to run at the a given depth. If the scheduler takes longer than the set time to find another job at this depth to run, it resets and tries again. Meanwhile, depths assigned to existing jobs may have changed, allowing the scheduler to sweep through a fresh list of jobs at that depth.

Tune task batch size in the ingestion pipeline #

Caution

Avoid changing the default values for these parameters.

Changing these values may have a negative impact on performance.

Intelligence Center ingestion tasks run and are processed in batches to optimize system resources. The ingestion process works like a pipeline, where different tasks perform specific actions.

Based on system resources and the type of content being ingested, you may want to increase or decrease the batch size to improve overall ingestion performance.

Change the values in the INGESTION_TASK_BATCH_SIZES parameter to control task batch sizes:

# Batch sizes used for ingestion tasks.
INGESTION_TASK_BATCH_SIZES = {
    "ingest_blob_task": 100,
    "index_extracts_task": 1000,
    "search_synchronize_entity_task": 1000,
}

Parameter	Type	Description
`ingest_blob_task`	int	Sets a limit to the maximum number of concurrent tasks that initiate ingesting packages from incoming feeds and enrichers. Default limit: `100`
`index_extracts_task`	int	Sets a limit to the maximum number of concurrent tasks that index ingested observables. Default limit: `1000`
`search_synchronize_entity_task`	int	Sets a limit to the maximum number of concurrent tasks that sync ingested entities between PostgreSQL and Elasticsearch. Default limit: `1000`

Parameter

Type

Description

ingest_blob_task

int

Sets a limit to the maximum number of concurrent tasks that initiate ingesting packages from incoming feeds and enrichers.

Default limit: 100

index_extracts_task

int

Sets a limit to the maximum number of concurrent tasks that index ingested observables.

Default limit: 1000

search_synchronize_entity_task

int

Sets a limit to the maximum number of concurrent tasks that sync ingested entities between PostgreSQL and Elasticsearch.

Default limit: 1000

Automatically disable enricher after continuously failing #

By default, enrichers can fail up to ten times in a row before they are automatically disabled.

Change this by setting these parameters:

Attribute name	Default value	Description
`ENRICHER_FAILURES_TO_DISABLE`	`10`	Number of times an enricher can fail in a row before it is automatically disabled.

Tune discovery limits #

Warning

(Not recommended) Changing discovery limits can negatively impact the performance of your EclecticIQ Intelligence Center instance.

Configure limits on the number of entities discovered per discovery rule run:

Parameter	Default
`DISCOVERY_SEARCH_PAGE_SIZE`	`2500`
`DISCOVERY_RESULTS_LIMIT`	`2500`
`DISCOVERY_WORKSPACE_SIZE_LIMIT`	`2500`

Tune search timeout #

EclecticIQ Intelligence Center relies on Elasticsearch to provide various search services. If you find that searches on EclecticIQ Intelligence Center are failing due to timeouts, you can extend the timeout value

Attribute name	Default value	Description
`ELASTICSEARCH_QUERY_TIMEOUT`	`20s`	Set the timeout for search queries on EclecticIQ Intelligence Center.

Mitigate slow performance caused by hyperconnected observables #

Caution

Contact support for advice before changing these settings. Enabling this can cause unexpected behavior on EclecticIQ Intelligence Center.

Tip

Hyperconnected observables are observables that have a large number of relationships/links to entities.

Ingestion performance may be slow for instances that have hyperconnected observables. To mitigate this issue:

Remove the hyperconnected observable(s)
Or set enable the SOURCES_ACL_REDIS_CACHE_ENABLED setting.

Attribute name	Description
`SOURCES_ACL_REDIS_CACHE_ENABLED`	Default: `False` Set to `True` to mitigate slow performance caused by hyperconnected observables. When set to `True`, source and TLP information for observables are cached for the period of time set for `SOURCES_ACL_REDIS_CACHE_TTL`. Caution Enable with caution. This causes graphs and search results to show outdated information for observables.
`SOURCES_ACL_REDIS_CACHE_TTL`	Default: `12 * 3600` Value in seconds. Set to 12 hours (`12 * 3600`) by default. Only takes effect if `SOURCES_ACL_REDIS_CACHE_ENABLED` is set to `True`.
`EXTRACT_ENTITY_LINK_PROPAGATION_LIMIT`	Default: `10000` Integer When a new version of an Observable is ingested, only the Entities related to it that were most recently updated before the ingestion are updated with the new characteristics of the Observable after the ingestion, and only as many Entities as the number set here.

Attribute name

Description

SOURCES_ACL_REDIS_CACHE_ENABLED

Default: False

Set to True to mitigate slow performance caused by hyperconnected observables.

When set to True, source and TLP information for observables are cached for the period of time set for SOURCES_ACL_REDIS_CACHE_TTL.

Caution

Enable with caution.

This causes graphs and search results to show outdated information for observables.

SOURCES_ACL_REDIS_CACHE_TTL

Default: 12 * 3600

Value in seconds. Set to 12 hours (12 * 3600) by default. Only takes effect if SOURCES_ACL_REDIS_CACHE_ENABLED is set to True.

EXTRACT_ENTITY_LINK_PROPAGATION_LIMIT

Default: 10000

Integer
When a new version of an Observable is ingested, only the Entities related to it that were most recently updated before the ingestion are updated with the new characteristics of the Observable after the ingestion, and only as many Entities as the number set here.

Other #

Clean up EclecticIQ Intelligence Center notifications #

Warning

Changing default values for parameters listed in this section may cause the Intelligence Center to behave unexpectedly.

Set the maximum number of notifications retained by EclecticIQ Intelligence Center for each user.

Attribute name	Default value	Description
`NOTIFICATIONS_CLEAN_UP_EVERY`	`1000`	Set the maximum number of notifications retained for each user. Once this number is reached, older notifications are discarded.

Attribute name

Default value

Description

NOTIFICATIONS_CLEAN_UP_EVERY

1000

Set the maximum number of notifications retained for each user.

Once this number is reached, older notifications are discarded.

Enable IP binding for session tokens #

The tokens created when a session is initiated can be bound to the IP address of the user initiating the session. This way, the tokens are not valid when entered from a different IP address.
To enable this, set:

JWT_IP_ADDRESS_BINDING = TRUE

Sample platform_settings.py file #

The following example serves as a guideline:

Example platform_settings.py

settings.py (Sourced from EIQ platform-backend)

"""
Default settings module.

These are just some application defaults and will be overridden by
a custom settings file.
"""

import os
from typing import Any, Mapping, Optional, Sequence, Tuple


# Note: ensure that all settings used by the applications are listed here with
# a default value. This makes it easier to see which settings are available.


#
# Flask web app
#

# Application setup (do not override)
APPLICATION_PREFIX = "/private"
APPLICATION_PREFIX_PUBLIC = "/api"

# Static content
PLATFORM_STATIC_FOLDER = "/opt/eclecticiq-platform-docs"
PLATFORM_SWAGGER_INDEX = os.path.join(PLATFORM_STATIC_FOLDER, "swagger/index.html")
PLATFORM_DOCUMENTATION_INDEX = os.path.join(
    PLATFORM_STATIC_FOLDER, "documentation/index.html"
)

DEFAULT_SERVER_NAME = "no-server-name-configured"

#
# Logging
#

LOG_FORMAT = "json"
LOG_LEVEL = "warning,eiq:info,quuz:warning"

SENTRY_DSN = None

# Whether to store audit trails for all user requests in the database.
# Possible levels are ['none', 'write', 'read-write', 'extended read-write']
AUDIT_TRAIL_LOG_LEVEL = "write"
# How many days we should keep audit_trail events in the database.
# `None` means keep it forever.
AUDIT_TRAIL_RETENTION = 360

# default tags for statsd metrics
METRIC_TAGS: Mapping[str, str] = {}

# TP47152. Maximum value for gauge metric of pending blobs.
BLOB_QUEUE_METRIC_MAX_VALUE = 100

#
# PostgreSQL / SQLAlchemy
#

# URI to the database.
SQLALCHEMY_DATABASE_URI = ""

# it is possible to use any of the libpd supported connection parameters.
# https://docs.sqlalchemy.org/en/13/core/engines.html#sqlalchemy.create_engine
# https://www.postgresql.org/docs/10/libpq-connect.html#LIBPQ-PARAMKEYWORDS
SQLALCHEMY_ENGINE_OPTIONS = {
    "pool_recycle": 120,
    "pool_pre_ping": True,
    "pool_use_lifo": True,
    "connect_args": {"connect_timeout": 25},
}

#
# S3 blob storage
#

S3_BLOB_STORAGE_ENABLED = False
S3_BLOB_STORAGE_BUCKET = "eiq-platform-blob-storage"
S3_BLOB_STORAGE_KEY_PREFIX = None
# Enable unauthenticated access, used for local S3 testing
S3_BLOB_STORAGE_UNSIGNED = False
# Upload new blobs to database, use when need to migrate back from S3 to database
S3_BLOB_STORAGE_PAUSE_S3_UPLOADS = False

#
# Elasticsearch
#

# Format: ["host[:port]","host[:port]"]
SEARCH_URLS: Sequence[str] = []
# Use only one of the auth mechanism at a time
# ElasticSearch should be set to accept auth
SEARCH_BASIC_AUTH = ()  # ("username", "password")'
SEARCH_API_KEY_AUTH = ()  # ("api_key.id", "api_key.api_key")
# Certificate to use to connect to elasticsearch: "/path/to/ca/ca.cert"
SEARCH_CA_CERT = None

ELASTICSEARCH_SHARDS_NUMBER = 1

# customize number of shards for search indexes,
# otherwise - `ELASTICSEARCH_SHARDS_NUMBER` is used
SEARCH_INDEX_SHARDS: Mapping[str, int] = {
    "stix": 5,
    "relations": 5,
    "extracts-unique": 3,
    "extracts": 3,
}

ELASTICSEARCH_REPLICAS_NUMBER = 1

# Periodic Celery tasks configuration:
# How many days we store `logstash` indices in ES:
LOG_RETENTION = 365
# How many days we store `statsite` indices in ES:
METRIC_RETENTION = 730

#
# Redis
#

REDIS_URL = "redis://localhost/0"

# Backup (Redis) queue names
SEARCH_QUEUE_NAME = "queue:search:inbound"

#
# Statsd
#

STATSD_ENABLED = True
STATSD_HOST = "127.0.0.1"
STATSD_PORT = 8125

#
# Kibana
#

KIBANA_URL = ""


#
# OpenTAXII
#

OPENTAXII_INTERNAL_URL = "http://localhost:8009/"


#
# Email
#

SMTP_HOST = "localhost"
SMTP_PORT = 25
SMTP_USERNAME: Optional[str] = None
SMTP_PASSWORD: Optional[str] = None
SMTP_ENCRYPTION: Optional[str] = None  # None, 'tls', 'ssl'
SMTP_TIMEOUT = 30


#
# Celery (background tasks)
#

CELERY_BROKER_URL = REDIS_URL
CELERY_BROKER_TRANSPORT_OPTIONS = {
    # Big `socket_timeout` makes celery diagnostics slow.
    # Small `socket_timeout` can fail too soon during high load on broker.
    "socket_timeout": 10,
    "socket_connect_timeout": 30,
}

CELERY_ACCEPT_CONTENT = ["json"]
CELERY_ENABLE_UTC = True
CELERY_WORKER_REDIRECT_STDOUTS_LEVEL = "DEBUG"
CELERY_RESULT_BACKEND = REDIS_URL
CELERY_RESULT_SERIALIZER = "json"
CELERY_TASK_STORE_ERRORS_EVEN_IF_IGNORED = True
CELERY_TASK_PROTOCOL = 2
CELERY_TASK_SERIALIZER = "json"
CELERY_TASK_TRACK_STARTED = True
CELERY_SEND_SENT_EVENT = True
CELERY_SEND_EVENTS = True

CELERY_TASK_QUEUES = [
    {"name": "enrichers", "routing_key": "eiq.enrichers.#"},
    {"name": "enrichers-priority", "routing_key": "priority.enrichers.#"},
    {"name": "incoming-transports", "routing_key": "eiq.incoming-transports.#"},
    {
        "name": "incoming-transports-priority",
        "routing_key": "priority.incoming-transports.#",
    },
    {"name": "outgoing-transports", "routing_key": "eiq.outgoing-transports.#"},
    {
        "name": "outgoing-transports-priority",
        "routing_key": "priority.outgoing-transports.#",
    },
    {"name": "outgoing-feeds", "routing_key": "eiq.outgoing-feeds.#"},
    {"name": "outgoing-feeds-priority", "routing_key": "priority.outgoing-feeds.#"},
    {"name": "utilities", "routing_key": "eiq.utilities.#"},
    {"name": "utilities-priority", "routing_key": "priority.utilities.#"},
    {"name": "discovery", "routing_key": "eiq.discovery.#"},
    {"name": "discovery-priority", "routing_key": "priority.discovery.#"},
    {"name": "entity-rules-priority", "routing_key": "priority.entity-rules.#"},
    {"name": "entity-rules", "routing_key": "eiq.entity-rules.#"},
    {"name": "extract-rules-priority", "routing_key": "priority.extract-rules.#"},
    {"name": "reindexing", "routing_key": "eiq.reindexing.#"},
    {"name": "retention-policies", "routing_key": "eiq.retention-policies.#"},
    {"name": "synchronization", "routing_key": "eiq.synchronization.#"},
    {
        "name": "retention-policies-priority",
        "routing_key": "priority.retention-policies.#",
    },
    {"name": "metrics", "routing_key": "eiq.metrics.#"},
]

CELERY_TASK_DEFAULT_QUEUE = "default"
CELERY_TASK_DEFAULT_EXCHANGE_TYPE = "direct"
CELERY_TASK_DEFAULT_ROUTING_KEY = "default"
CELERY_TASK_ROUTES = ("eiq.platform.taskrunner.routing.TaskRouter",)

CELERY_TASK_ACKS_LATE = True
CELERY_TASK_ACKS_ON_FAILURE_OR_TIMEOUT = True
CELERY_TASK_REJECT_ON_WORKER_LOST = False


# Count of hosts that run celery queues.
# Specifies how many replies health check will expect.
# If the number is smaller than actual value, celery health check will fail.
# If the number is bigger, celery health check will be noticeably longer.
# So, if you're unsure, leave it `1` and run `eiq-platform diagnose run`.
# For env with dynamic hosts count like k8s set this to `0`.
CELERY_X_EIQ_HOSTS = 1


# Time limits per task families (in seconds)
#
# Please do not modify this object. If you want to overwrite the values, there is
# CELERY_X_EIQ_TIME_LIMIT_PER_FAMILY variable.
DEFAULT_TIME_LIMIT_PER_FAMILY: Mapping[str, int] = {
    "eiq.enrichers": 2 * 60,  # 2 minutes
    "eiq.incoming-transports": 8 * 60 * 60,  # 8 hours
    "eiq.outgoing-transports": 2 * 60,  # 2 minutes
    "eiq.discovery": 10 * 60,  # 10 minutes
    "eiq.outgoing-feeds": 85 * 60 * 60,  # 85 hours
    "eiq.reindexing": 2 * 60 * 60,  # 2 hours
    "eiq.retention-policies": 120 * 60 * 60,  # 120 hours
    "eiq.utilities": 2 * 60 * 60,  # 2 hours
    "eiq.metrics": 10 * 60,  # 10 minutes
}
# It is only necessary to provide the values you want to override. The eligible values
# are visible in the default object above.
CELERY_X_EIQ_TIME_LIMIT_PER_FAMILY: Mapping[str, int] = {}

# Time limits for specific tasks (in seconds)
#
# Please do not modify this object. If you want to overwrite the values, there is
# CELERY_X_EIQ_TIME_LIMIT_PER_TASK variable.
DEFAULT_TIME_LIMIT_PER_TASK: Mapping[str, int] = {
    "eiq.utilities.update_taxonomy_in_search": 4 * 60 * 60,  # 4 hours
    "eiq.utilities.delete_taxonomy_in_search": 4 * 60 * 60,  # 4 hours
    "eiq.utilities.send_email": 30,
    "eiq.utilities.taxii_discovery": 60,
    "eiq.utilities.postponed_entity_signals": 60,
    "eiq.utilities.create_notifications": 60,
    "eiq.utilities.prune_indices": 4 * 60 * 60,  # 4 hours
    "eiq.discovery.search_discovery": 10 * 60,  # 10 minutes
    "eiq.discovery.delete_discovery": 60,
    "eiq.entity-rules.entity_rule_task": 48 * 60 * 60,  # 48 hours
    "eiq.entity-rules.enrichment_rule_task": 16 * 60 * 60,  # 16 hours
    "eiq.entity-rules.intel_requirement_task": 24 * 60 * 60,  # 24 hours
    "eiq.extract-rules.extract_rule_task": 4 * 60 * 60,  # 4 hours
    "eiq.synchronization.reindex_batch": 2 * 60 * 60,  # 2 hours
}
# It is only necessary to provide the values you want to override. The eligible values
# are visible in the default object above.
CELERY_X_EIQ_TIME_LIMIT_PER_TASK: Mapping[str, int] = {}

CELERY_WORKER_HIJACK_ROOT_LOGGER = False
CELERY_WORKER_LOG_FORMAT = "%(message)s"
CELERY_WORKER_TASK_LOG_FORMAT = "%(message)s"

CELERY_BEAT_SCHEDULER = "eiq.platform.taskrunner.scheduler.DynamicScheduler"

# How long to store results of volatile tasks (in seconds)
CELERY_X_EIQ_VOLATILE_TASK_RESULT_DEFAULT_TTL = 180

# Indicates if potentially lost tasks (with persistent envelope) should
# be marked as failed only after their soft time limit has been exceeded.
CELERY_X_EIQ_UPDATE_PERSISTENT_TASK_WITHIN_TIME_LIMIT = False

# Defines the Cron-like schedule for lost Celery task run reaping.
CELERY_X_EIQ_LOST_TASK_RUN_REAPING_SCHEDULE = "*/10 * * * *"

#
# Application security and network access policies
#

SECRET_KEY = ""
JWT_EXPIRATION_DELTA = 60 * 30  # 30 minutes
JWT_IP_ADDRESS_BINDING = False
ONE_TIME_PASSWORD_EXPIRATION_MINUTES = 60
LOGIN_ATTEMPT_TOO_FAST_SECONDS = 1
MAX_RESET_PASSWORD_PER_DAY = 3

# Multi-factor authentication master password;
# must be produced by Fernet.generate_key().
MFA_MASTER_PASSWORD = ""

# Whether to expose the OpenAPI specs.
EXPOSE_OPENAPI = False

# Custom CA bundle to use for outgoing HTTP requests (optional)
REQUESTS_CA_BUNDLE: Optional[str] = None

# Network address block filtering. This is a list of address blocks that the
# user may not configure the platform to make requests to.
#
# The full list of private networks, consider adding them all:
# [
#     '0.0.0.0/8',  # broadcast network
#     '10.0.0.0/8',
#     '127.0.0.0/8',
#     '169.254.0.0/16',  # link-local
#     '172.16.0.0/12',
#     '192.0.0.0/29',
#     '192.0.0.170/31',
#     '192.0.2.0/24',
#     '192.168.0.0/16',
#     '198.18.0.0/15',
#     '198.51.100.0/24',
#     '203.0.113.0/24',
#     '240.0.0.0/4',
#     '255.255.255.255/32',
#
#     '::1/128',
#     '::/128',
#     '::ffff:0:0/96',
#     '100::/64',
#     '2001::/23',
#     '2001:2::/48',
#     '2001:db8::/32',
#     '2001:10::/28',
#     'fc00::/7',   # unique local addresses
#     'fe80::/10',  # link-local addresses
# ]
#
# More details: https://en.wikipedia.org/wiki/Private_network
USER_CIDR_BLACKLIST = ["0.0.0.0/8", "127.0.0.0/8", "::1/128"]

# a list of trusted API hosts, used to trus multi-host IC deployments
TRUSTED_API_HOSTS: list[str] = []

#
# STIX
#

STIX_DEFAULT_NAMESPACE_PREFIX = "not-yet-configured"
STIX_DEFAULT_NAMESPACE_URI = "http://not-yet-configured.example.org/"


#
# Half-life
#

# Default values
HALF_LIFE = {
    "attack-pattern": 720,
    "campaign": 1000,
    "course-of-action": 182,
    "eclecticiq-sighting": 182,
    "exploit-target": 182,
    "identity": 4000,
    "incident": 182,
    "indicator": 30,
    "infrastructure": 720,
    "intrusion-set": 1000,
    "location": 4000,
    "malware": 720,
    "malware-analysis": 720,
    "note": 30,
    "report": 182,
    "threat-actor": 1000,
    "tool": 720,
    "ttp": 720,
}


#
# Discovery
#

DISCOVERY_SEARCH_PAGE_SIZE = 2500
DISCOVERY_RESULTS_LIMIT = 2500
DISCOVERY_WORKSPACE_SIZE_LIMIT = 2500
DISCOVERY_SEARCH_PIT_TTL = 5 * 60


#
# Ingestion
#

# Package size limits (in bytes)
MAX_BLOB_SIZE = 20 * 1024 * 1024
MAX_UPLOADED_BLOB_SIZE = MAX_BLOB_SIZE / 2

INGESTION_FEED_PRIORITY = 100
INGESTION_UPLOAD_PRIORITY = 1000
INGESTION_BLACKLIST_IDREFS: Sequence[str] = []
INGESTION_FINALIZER_DEPTH = 100

# retry settings for only DB-dependent tasks
# this results in roughly 6 hours worth of retrying a task in sum:
# for i in range(max_attempts):
#     s += min(max_retry_delay, retry_delay * 2 ** i)

INGESTION_TASK_RETRY_SETTINGS = {
    "max_attempts": 10,
    "retry_delay": 60,  # one minute
    "max_retry_delay": 60 * 90,  # 90 minutes
}

# retry settings for external storage dependent tasks
# this results in roughly a week worth of retrying a task in sum
INGESTION_EXTERNAL_TASK_RETRY_SETTINGS = {
    "max_attempts": 25,
    "retry_delay": 60 * 5,  # 5 minutes
    "max_retry_delay": 60 * 60 * 12,  # 12 hours
}

# Batch sizes used for ingestion tasks.
INGESTION_TASK_BATCH_SIZES = {
    "ingest_blob_task": 100,
    "index_extracts_task": 1000,
    "search_synchronize_entity_task": 1000,
    # Backwards compatibility 2.14, functionality dropped in 3.0
    "graph_synchronize_enrichment_task": 1000,
    "graph_synchronize_entity_task": 1000,
    # End backwards compatibility 2.14, functionality dropped in 3.0
    "check_extract_links_task": 1000,
    "reindex_entities_for_extract_task": 100,
    "search_synchronize_entity_extracts_task": 1000,
    "calculate_observable_risk_score_task": 1000,
}

# Please do not modify this object. If you want to overwrite the values, there is
# INGESTION_QUUZ_SCHEDULER_OPTIONS variable.
#
# Quuz scheduler options for the ingestion task runner.
DEFAULT_INGESTION_QUUZ_SCHEDULER_OPTIONS = {
    # Continue with lower depth jobs after executing a partial batch,
    # which should increase the effectiveness of batch tasks.
    "prefer_full_batches": True,
    # The ‘score half time’ influences for how long a long-running worker process should
    # remember how much time was spent on a certain feed. Workers divide their time
    # fairly over all feeds, but a small amount of (slow) outliers in a feed, e.g. due
    # to timeouts, should not result in that feed getting ‘punished’ forever. The value
    # is the number of seconds after which the # It is only necessary to provide the
    # values you want to override. The eligible values are visible in the default object
    # above.score assigned to a feed loses half its value. This should be up to a few
    # orders of magnitude higher than the run time of the slowest jobs.
    "score_half_time": 24 * 60 * 60,  # 24 hours
    # How many similar jobs to execute in a row before switching to jobs with
    # the highest depth, e.g. how many blobs to transform before starting
    # to process their entities.
    "sweep_same_depth_max_jobs": 500,
    # Similar to ‘sweep_same_depth_max_jobs’, but defines how many jobs can be
    # enqueued (not run) before a worker moves to higher depth jobs again.
    "sweep_same_depth_max_enqueued_jobs": 1000,
    # How many seconds to execute similar jobs in a row before switching to
    # jobs with the highest depth, e.g. for how long to keep transforming blobs
    # before processing their entities.
    "sweep_same_depth_max_time_total": 60,
    # If a feed didn't get a chance to run for this amount of time (seconds),
    # interrupt what it was doing and pick the highest depth jobs instead.
    "sweep_same_depth_max_time_between": 30,
    "find_job_query_timeout": 300,  # 5 minutes
}
# It is only necessary to provide the values you want to override. The eligible values
# are visible in the default object above.
INGESTION_QUUZ_SCHEDULER_OPTIONS: Mapping[str, Any] = {}

# Maximum number of entities to be updated with extract meta changes in ingestion.
# Increasing this number will lead to more reindexing tasks scheduled for ES
# which increases load on ES and slows down ingestion overall
# as it will be busy reindexing old entity records in ES instead of processing new data.
EXTRACT_ENTITY_LINK_PROPAGATION_LIMIT = 10000

# Observable source ACL caching. This is usually needed to match
# ingestion performance expectations, especially with hyper-connected
# extracts present, at the cost of seeing potentially outdated data.
SOURCES_ACL_REDIS_CACHE_ENABLED = True
SOURCES_ACL_REDIS_CACHE_TTL = 2 * 3600  # 2 hours


# Cache TTL for split of Intelligence Requirements for LLM matching
INTEL_REQUIREMENT_CACHE_TTL = 3600  # 1 hour


#
# Incoming/outgoing feeds
#

# Enable caching of package counts (pending, failed, ingested) used in the
# incoming feed view. '0' means caching is disabled. A good value when
# enabling caching is 120 seconds or higher.
INCOMING_FEED_BLOB_STATS_CACHE_IN_SECONDS = 120

# Directories that can be accessed from mount point feeds. POLL is for incoming
# feeds, PUSH is for outgoing feeds. Example: ["/mnt/", "/media/"]
MOUNT_POINT_POLL_ALLOWED_DIRECTORIES: Sequence[str] = []
MOUNT_POINT_PUSH_ALLOWED_DIRECTORIES: Sequence[str] = []

# Number of entities per package for outgoing feeds
PACKING_BATCH_SIZE_ENTITIES = 25
PACKING_BATCH_SIZE_RELATIONS = 125

OUTGOING_FEED_FETCHING_ES_PAGE_SIZE = 1000

#
# Retention policies
#

RETENTION_ENTITY_FETCH_CHUNK_SIZE = 1024
RETENTION_ENTITY_DEL_CHUNK_SIZE = 1024
# chunk size used in extract policy
RETENTION_EXTRACT_DEL_CHUNK_SIZE = 1024
# chunk size used in packages policy
RETENTION_PACKAGE_CHUNK_SIZE = 1024

# Retention for files created by bulk export
EXPORT_BLOCK_RETENTION = 10  # days

# Retention for task_run records
TASK_RUN_RETENTION_DAYS = 100  # days
TASK_RUN_RETENTION_COUNT_PER_TASK = 1000

# Retention for observable_risk_score_update records
SCORE_UPDATE_RETENTION_DAYS = 100  # days
SCORE_UPDATE_RETENTION_COUNT_PER_EXTRACT = 500

#
# Platform extensions (feeds, enrichers)
#
# Number of failures before disabling an enricher
ENRICHER_FAILURES_TO_DISABLE = 10

# List of names of extensions that should not be loaded.
DISABLED_EXTENSIONS: Sequence[str] = []

# Extensions' simple storage Redis implementation. Set to None to disable.
EXTENSION_SDK_STASH_REDIS_URL: str | None = "redis://localhost/1"

# Retry behaviour
FILE_DOWNLOAD_RETRIES_LIMIT = 3
FILE_DOWNLOAD_RETRY_TIMEOUT = 10  # in seconds


#
# LDAP
#

LDAP_AUTH_ENABLED = False
LDAP_URI = "ldaps://toolbox.iw"
LDAP_IGNORE_TLS_ERRORS = True
LDAP_BIND_DN = "cn=Manager,dc=ldap,dc=eclecticiq"
LDAP_BIND_PASSWORD = "adminpassword"

# These are 2-tuples of the form (base, filter_template)
LDAP_USERS_FILTER = (
    "ou=Users,dc=ldap,dc=eclecticiq",  # Base
    "(uid={username})",  # Filter template
)
LDAP_GROUPS_FILTER = (
    "ou=EclecticIQGroups,dc=ldap,dc=eclecticiq",
    "(&(memberUid={username})(objectClass=posixGroup))",
)
LDAP_ROLES_FILTER = (
    "ou=EclecticIQRoles,dc=ldap,dc=eclecticiq",
    "(&(memberUid={username})(objectClass=posixGroup))",
)

LDAP_USER_FIRSTNAME_ATTR = "cn"
LDAP_USER_LASTNAME_ATTR = "sn"
LDAP_USER_EMAIL_ATTR = "mail"

LDAP_ROLE_NAME_ATTR = "cn"
LDAP_GROUP_NAME_ATTR = "cn"
LDAP_CASE_SENSITIVE_MATCHING = True

LDAP_USER_IS_ADMIN_ATTR = "isEclecticIQAdmin"
LDAP_ADMIN_ROLE_GROUP_NAME = "EclecticIQAdminsGroup"


#
# SAML
#

SAML_AUTH_ENABLED = False

# SAML can be configures using a live configuration page
# When this is enabled, go to /private/saml/configure to set up SAML.
SAML_CONFIGURE_MODE = False
SAML_TEST_CONFIG_FILE = "/tmp/eiq_platform_saml_test_config.json"  # nosec

# The IDP metadata file should be generated by the IDP and placed on
# the platform instance:
SAML_IDP_METADATA = {
    "url": "https://samltest.id/saml/idp",
    # "file": "/idp-metadata.xml",
}

# IDP EntityID must match the "EntityID" attribute in the idp metadata:
# Keycloack uses a url like 'http://localhost:8080/auth/realms/master'
# but it can be any string:
SAML_IDP_ENTITYID = "https://samltest.id/saml/idp"

# Optionally require users to have a minimum authorization level in order to
# log in. if set to None, there is no minimum level and any user is accepted.
SAML_IDP_MINIMUM_LEVEL: Optional[str] = None

SAML_REQUEST_USE_POST_BINDING = False

SAML_CASE_SENSITIVE_MATCHING = True

# Required attributes
SAML_USER_USERID_ATTR = "uid"
SAML_USER_EMAIL_ATTR = "email"
SAML_USER_GROUPS_ATTR = "eclecticiqGroups"
SAML_USER_ROLES_ATTR = "eclecticiqRoles"

# Optional attributes
SAML_USER_FIRSTNAME_ATTR = "givenName"
SAML_USER_LASTNAME_ATTR = "sn"
SAML_USER_IS_ADMIN_ATTR = "isEclecticIQAdmin"
SAML_ADMIN_ROLE_GROUP_NAME = "EclecticIQAdminsGroup"


SAML_SIGN_AUTHN_REQ = False
SAML_SIGN_LOGOUT_REQ = False
SAML_WANT_ASSERT_SIGNED = True
SAML_WANT_RESPONSE_SIGNED = False

# Key and certificates; these should point to filenames.
SAML_ENC_KEY: Optional[str] = None
SAML_ENC_CERT: Optional[str] = None

# Path to the system's xmlsec1 binary.
SAML_XMLSEC_BIN = "/usr/bin/xmlsec1"

SAML_METADATA_ORG = {
    "name": "Default organization name",
    "display_name": [("Organization display name in English", "en")],
    "url": "http://example.localhost",
}
SAML_METADATA_CONTACT_PERSON = {
    "given_name": "John",
    "sur_name": "Smith",
    "email_address": ["[email protected]"],
    "contact_type": "technical",
}

# In case you want users from SAML to be added to certain groups/roles
# regardless of what groups/roles the SAML server tells us.
# Deprecated. Use `EXTERNAL_` settings instead.
SAML_USER_DEFAULT_GROUPS: Sequence[str] = []
SAML_USER_DEFAULT_ROLES: Sequence[str] = []

# External user settings
# In case you want users from an external identity provider to be added to
# certain groups/roles regardless of what groups/roles the identity provider
# sends.
EXTERNAL_USER_DEFAULT_GROUPS: Sequence[str] = []
EXTERNAL_USER_DEFAULT_ROLES: Sequence[str] = []

#
# OAuth2
#
OAUTH_ENABLED: bool = False
OAUTH_DEBUG: bool = False
OAUTH_TENANT_ID: Optional[str] = None
OAUTH_APPLICATION_ID: Optional[str] = None
OAUTH_CLIENT_APPLICATION_IDS: Sequence[str] = []
OAUTH_CASE_SENSITIVE_MATCHING: bool = True

# Required attributes
OAUTH_USER_USERID_CLAIM: str = "oid"
OAUTH_USER_EMAIL_CLAIM: str = "email"
OAUTH_USER_GROUPS_CLAIM: str = "groups"
OAUTH_USER_ROLES_CLAIM: str = "groups"

# Optional attributes
OAUTH_USER_FIRSTNAME_CLAIM: Optional[str] = "given_name"
OAUTH_USER_LASTNAME_CLAIM: Optional[str] = "family_name"
OAUTH_USER_IS_ADMIN_CLAIM: Optional[str] = "isEclecticIQAdmin"
OAUTH_ADMIN_ROLE_NAME: Optional[str] = "EclecticIQAdminsGroup"


#
# Notifications
#
# The total number notifications can build up over time. If there are too many
# notifications the notification index endpoint — which is heavily used by the
# UI — can become slow. Therefore, a cleanup is done on a regular basis. This
# variable sets after how many new notifications a clean-up should occur.
NOTIFICATIONS_CLEAN_UP_EVERY = 1000
# Set the maximum number of notifications that should be kept for each user.
# The oldest notifications will be removed when this number is exceeded.
NOTIFICATIONS_MAX_PER_USER = 1000

#
# Observable and entity rules
#

# The request timeout in seconds when running an extract rule manually.
# This timeout will be used when running "scroll" queries against Elasticsearch.
OBSERVABLE_RULE_ES_TIMEOUT = 300
OBSERVABLE_RULE_ES_SCROLL_TTL = 3600 * 3  # 3 hours
OBSERVABLE_RULE_BATCH_SIZE = 500

OBSERVABLE_DECAY_ES_TIMEOUT = 3600

# The request timeout in seconds when running an entity rule manually.
# This timeout will be used when running "scroll" queries against Elasticsearch.
ENTITY_RULE_ES_TIMEOUT = 300
ENTITY_RULE_ES_SCROLL_TTL = 3600 * 3  # 3 hours
ENTITY_RULE_BATCH_SIZE = 200

# The request timeout in seconds when running an intel requirement manually.
# This timeout will be used when running "scroll" queries against Elasticsearch.
INTEL_REQUIREMENT_ES_TIMEOUT = 300
INTEL_REQUIREMENT_ES_SCROLL_TTL = 3600 * 6  # 6 hours
INTEL_REQUIREMENT_BATCH_SIZE = 500

# The request timeout in seconds when running an enrichment rule by task.
# This timeout will be used when running "scroll" queries against Elasticsearch.
ENRICHMENT_RULE_ES_TIMEOUT = 300
ENRICHMENT_RULE_ES_SCROLL_TTL = 3600 * 3  # 3 hours
ENRICHMENT_RULE_BATCH_SIZE = 200

# The request timeout in seconds when running an entities update by query task.
# This timeout will be used when running "scroll" queries against Elasticsearch.
BULK_UPDATE_ENTITIES_ES_TIMEOUT = 300
BULK_UPDATE_ENTITIES_ES_SCROLL_TTL = 3600 * 3  # 3 hours
BULK_UPDATE_ENTITIES_BATCH_SIZE = 500

# The request timeout in seconds when running an entities enrich by query task.
# This timeout will be used when running "scroll" queries against Elasticsearch.
BULK_ENRICH_ENTITIES_ES_TIMEOUT = 300
BULK_ENRICH_ENTITIES_ES_SCROLL_TTL = 3600 * 3  # 3 hours
BULK_ENRICH_ENTITIES_BATCH_SIZE = 500

# The request timeout in seconds when running a bulk export by query task.
# This timeout will be used when running "scroll" queries against Elasticsearch.
BULK_EXPORT_ES_TIMEOUT = 300
BULK_EXPORT_ES_SCROLL_TTL = 3600 * 3  # 3 hours
BULK_EXPORT_BATCH_SIZE = 1000
BULK_EXPORT_ATTACHMENTS_BATCH_SIZE = 200


# The chunk size and search page size for running an entities delete by query task.
BULK_DELETE_ENTITIES_CHUNK_SIZE = 500
BULK_DELETE_ENTITIES_ES_PAGE_SIZE = 1000

# The request timeout in seconds when running an extract update by query task.
# This timeout will be used when running "scroll" queries against Elasticsearch.
BULK_UPDATE_EXTRACT_ES_TIMEOUT = 300
BULK_UPDATE_EXTRACT_ES_SCROLL_TTL = 3600 * 3  # 3 hours
BULK_UPDATE_EXTRACT_BATCH_SIZE = 1000

# The request timeout in seconds when running an extract enrich by query task.
# This timeout will be used when running "scroll" queries against Elasticsearch.
BULK_ENRICH_EXTRACT_ES_TIMEOUT = 300
BULK_ENRICH_EXTRACT_ES_SCROLL_TTL = 3600 * 3  # 3 hours
BULK_ENRICH_EXTRACT_BATCH_SIZE = 1000

# The request timeout in seconds when running an extract delete by query task.
# This timeout will be used when running "scroll" queries against Elasticsearch.
BULK_DELETE_EXTRACT_ES_TIMEOUT = 300
BULK_DELETE_EXTRACT_ES_SCROLL_TTL = 3600 * 3  # 3 hours
BULK_DELETE_EXTRACT_BATCH_SIZE = 1000

# The request timeout in seconds when running a policy apply_task.
# This timeout will be used when running "scroll" queries against Elasticsearch.
APPLY_RISK_SCORE_POLICY_ES_TIMEOUT = 300
APPLY_RISK_SCORE_POLICY_ES_SCROLL_TTL = 3600 * 3  # 3 hours
APPLY_RISK_SCORE_POLICY_BATCH_SIZE = 1000


# On average, the system is more stable when it commits immediately after a new entity
# version is ingested. It keeps good performance especially with highly connected
# entities. However, if the behavior is having the opposite effect for any reason, it is
# possible to disable it and to commit only at the end of batch.
ENTITY_RULE_COMMIT_AFTER_NEW_ENTITY_VERSION = True

#
# Miscellaneous
#

# A feature flag to unlock some potentially very destructive operations.
ALLOW_EXTREMELY_UNSAFE_OPERATIONS = False

# A feature flag to enable the subscription module, which was developed by Fusion Center
SUBSCRIPTION_MODULE_ACTIVE = False
SUBSCRIPTION_MODULE_PLATFORM_URL = "https://www.eclecticiq.com/external/"

# Max number of items sent to Elasticsearch during bulk requests
ES_BULK_CHUNK_SIZE = 1000
# Max size of the request body of Elasticsearch requests
ES_BULK_PAYLOAD_MB = 5
# Default timeout for Elasticsearch _update_by_query and _delete_by_query
ES_UPDATE_BY_QUERY_REQUEST_TIMEOUT = 15 * 60  # 15 min
# Default timeout for Elasticsearch index operations
# (open, close, update settings, update mappings, etc)
ES_INDEX_OPERATIONS_TIMEOUT = 3 * 60  # 3 min
# Default timeout for Elasticsearch client http operations
ES_HTTP_REQUEST_TIMEOUT = 60  # 1 min


# Observables retention policies are broken. Disable by default.
DISABLE_OBSERVABLES_RETENTION_POLICIES = False
# History events
WRITE_HISTORY_EVENTS = False

# Opentaxii taxii2 api root config
TAXII2_API_ROOT = {
    "title": "EIQ TAXII 2.1 api root",
    "description": "The EIQ TAXII 2.1 api root for passive feeds",
    "is_public": True,  # set to False to block anonymous feed discovery
}

# Whether to enable PgPool diagnostics check
DIAGNOSTICS_PGPOOL_ENABLE = False
# Max replication delay in WAL bytes, default is 1 WAL segment file size (16 MB)
DIAGNOSTICS_PGPOOL_MAX_REPLICATION_DELAY = 16 * 1024 * 1024

# AI Settings

AI_PROVIDER_OPENAI_MODELS = [
    "gpt-4o",
]
AI_PROVIDER_PERPLEXITY_MODELS = [
    "sonar-pro",
]

# The first 2 parameters can be configured from the API
# as long as both the settings here are kept None.
AI_PROVIDER_OLLAMA_BASE_URL: str | None = None
AI_PROVIDER_OLLAMA_API_KEY: str | None = None
# Only settable here
AI_PROVIDER_OLLAMA_VERIFY_SSL: bool = True
AI_PROVIDER_OLLAMA_CA_PATH: str | None = None
AI_PROVIDER_OLLAMA_PROXY_URL: str | None = None
# API Key should not be set for the following to be used
AI_PROVIDER_OLLAMA_BASIC_AUTH: Tuple[str, str] | None = None  # (user, pwd)

# What AI feature can work on what AI providers. This is in fact
# Mapping[AiProviderEnum, list[AIFeatureNameEnum]]
AI_PROVIDER_FEATURE_AVAILABILITY: Mapping[str, list[str]] = {
    "ai-provider-ollama": [
        "ai_contextual_relevancy",
        "ai_entity_extraction",
        "ai_intel_summary",
        "nlp_to_search",
    ],
    "ai-provider-openai": [
        "ai_contextual_relevancy",
        "ai_entity_extraction",
        "ai_intel_summary",
        "ai_writing_assistant",
        "nlp_to_search",
    ],
    "ai-provider-perplexity": [
        "ai_assistant",
    ],
}

# Malware Sandbox max file upload limit size (15 MB)
DETONATION_MAX_FILE_UPLOAD_LIMIT: int = 15 * 1024 * 1024