crowdstrike / caracara Goto Github PK

Developer enhancements (DX) for FalconPy, the CrowdStrike Python SDK

License: MIT License

Python 99.72% Shell 0.28%

crowdstrike api falcon falconpy crowdstrike-falconpy python3 falconpy-tools toolkit toolbox caracara

caracara's Introduction

Caracara

A friendly wrapper to help you interact with the CrowdStrike Falcon API. Less code, less fuss, better performance, and full interoperability with FalconPy.

Features
Installation
Basic Usage
Examples
Documentation
Contributing

Features

A few of the developer experience enhancements provided by the Caracara toolkit include:

Feature	Details
Automatic pagination with concurrency	Caracara will handle all request pagination for you, so you do not have to think about things like batch sizes, batch tokens or parallelisation. Caracara will also multithread batch data retrieval requests where possible, dramatically reducing data retrieval times for large datasets such as host lists.
Friendly to your IDE (and you!)	Caracara is written with full support for IDE autocomplete in mind. We have tested autocomplete in Visual Studio Code and PyCharm, and will accept issues and patches for more IDE support where needed. Furthermore, all code, where possible, is written with type hints so you can be confident in parameters and return values.
Logging	Caracara is built with the in-box `logging` library provided with Python 3. Simply set up your logging handlers in your main code file, and Caracara will forward over `debug`, `info` and `error` logs as they are produced. Note that the `debug` logs are very verbose, and we recommend writing these outputs to a file as opposed to the console when retrieving large amounts of lightly filtered data.
Real Time Response (RTR) batch session abstraction	Caracara provides a rich interface to RTR session batching, allowing you to connect to as many hosts as possible. Want to download a specific file from every system in your Falcon tenant? Caracara will even extract it from the `.7z` container for you.
Rich and detailed sample code	Every module of Caracara comes bundled with executable, fully configurable code samples that address frequent use cases. All samples are built around a common structure allowing for code reuse and easy reading. Just add your API credentials to `config.yml`, and all samples will be ready to go.
Simple filter syntax	Caracara provides an object-orientated Falcon Query Language (FQL) generator. The `FalconFilter` object lets you specify filters such as `Hostname`, `OS` and `Role`, automatically converting them to valid FQL. Never write a FQL filter yourself again!
Single authentication point of entry	Authenticate once and have access to every module.
100% FalconPy compatibility	Caracara is built on FalconPy, and can even be configured with a FalconPy `OAuth2` object via the `auth_object` constructor parameter, allowing you to reuse FalconPy authentication objects across Caracara and FalconPy. Authenticate once with FalconPy, and access every feature of FalconPy and Caracara.

Installation Instructions

Caracara supports all major Python packaging solutions. Instructions for Poetry and Pip are provided below.

Installing Caracara from PyPI using Poetry (Recommended!)

Poetry: Installation

poetry add caracara

Poetry: Upgrading

poetry update caracara

Poetry: Removal

poetry remove caracara

Installing Caracara from PyPI using Pip

Pip: Installation

python3 -m pip install caracara

Pip: Upgrading

python3 -m pip install caracara --upgrade

Pip: Removal

python3 -m pip uninstall caracara

Basic Usage Examples

"""List Windows devices.

This example will use the API credentials provided as keywords to list the
IDs and hostnames of all systems within your Falcon tenant that run Windows.
"""

from caracara import Client

client = Client(
    client_id="12345abcde",
    client_secret="67890fghij",
)

filters = client.FalconFilter()
filters.create_new_filter("OS", "Windows")

response_data = client.hosts.describe_devices(filters)
print(f"Found {len(response_data)} devices running Windows")

for device_id, device_data in response_data.items():
    hostname = device_data.get("hostname", "Unknown Hostname")
    print(f"{device_id} - {hostname}")

You can also leverage the built in context manager and environment variables.

"""List stale sensors.

This example will use the API credentials set in the environment to list the
hostnames and IDs of all systems within your Falcon tenant that have not checked
into your CrowdStrike tenant within the past 7 days.

This is determined based on the filter LastSeen less than or equal (LTE) to 7 days ago (-7d).
"""

from caracara import Client


with Client(client_id="${CLIENT_ID_ENV_VARIABLE}", client_secret="${CLIENT_SECRET_ENV_VARIABLE}") as client:
    filters = client.FalconFilter()
    filters.create_new_filter("LastSeen", "-7d", "LTE")
    response_data = client.hosts.describe_devices(filters)

print(f"Found {len(response_data)} stale devices")

for device_id, device_data in response_data.items():
    hostname = device_data.get("hostname", "Unknown Hostname")
    print(f"{device_id} - {hostname}")

Examples Collection

Each API wrapper is provided alongside example code. Cloning or downloading/extracting this repository allows you to execute examples directly.

Using the examples collection requires that you install our Python packaging tool of choice, Poetry. Please refer to the Poetry project's installation guide if you do not yet have Poetry installed.

Once Poetry is installed, make sure you run poetry install within the root repository folder to set up the Python virtual environment.

To configure the examples, first copy examples/config.example.yml to examples/config.yml. Then, add your API credentials and example-specific settings to examples/config.yml. Once you have set up profiles for each Falcon tenant you want to test with, execute examples using one of the two options below.

Executing the Examples

There are two ways to use Poetry to execute the examples.

Executing from a Poetry Shell

The poetry shell command will enter you into the virtual environment. All future commands will run within the Caracara virtual environment using Python 3, until you run the deactivate command.

poetry shell
examples/get_devices/list_windows_devices.py

Executing without Activating the Virtual Environment

If you do not want to enter the Caracara virtual environment (e.g., because you are using your system's installation of Python for other purposes), you can use the poetry run command to temporarily invoke the virtual environment for one-off commands.

poetry run examples/get_devices/list_windows_devices.py

All examples are also configured in the pyproject.toml file as scripts, allowing them to be executed simply.

poetry run stale-sensors

To get a complete list of available examples, execute the command util/list-examples.sh from the root of the repository folder.

Documentation

Coming soon!

Contributing

Interested in taking part in the development of the Caracara project? Start here.

Why Caracara?

Simple! We like birds at CrowdStrike, so what better bird to name a Python project after one that eats just about anything, including snakes :)

caracara's People

Contributors

Stargazers

Watchers

Forkers

jlangdev rakhithjk jshcodes classicvalues swedgwood mastouri-academy-inc magespawn freecamel mjleesment hur

caracara's Issues

[ BUG ] Enumerating hosts last "online-state" call has more than 100 ids (500 & more on the last call)

Bug Report Template

Describe the bug

When enumerating devices with caracara.hosts.describe_devices() , caracara first downloads all the host data, then proceeds to review the "online" state of each of these (btw could we have an option to disable that? it's not needed and takes time when we just want to enumerate hosts).

Nothing goes fine since since all these /devices/entities/online-state/v1 calls are shipping 500 ids in the URL every time, always producing a HTTP 200 OK ERROR 400

{
    "errors": [
        {
            "code": 400,
            "message": "request must contain between 0 and 100 ids"
        }
    ],
    "meta": {
        "powered_by": "cs.agentonline",
        "query_time": 0.001833589,
        "trace_id": "905f9d10-0d4e-405d-88bd-649b6fb849f2"
    },
    "resources": []
}

Bonus bug : the last request has more than 500 ids it seems:

$ grep '/dev[^:,]*' req_not_last.dat -aio|tr '&' '\n'|wc -l
500
$ grep '/dev[^:,]*' req_last.dat -aio|tr '&' '\n'|wc -l
822
$ python3 -m pip freeze|grep caracara
caracara==0.3.0

This ends up in a nice little stracktrace when all calls finish:

    hosts = self.caracara.hosts.describe_devices()
  File "/usr/local/lib/python3.10/dist-packages/caracara/filters/decorators.py", line 56, in wrapper
    return func(*_args.args, **_args.kwargs)
  File "/usr/local/lib/python3.10/dist-packages/caracara/modules/hosts/hosts.py", line 153, in describe_devices
    device_state_data = self.get_online_state(device_ids)
  File "/usr/local/lib/python3.10/dist-packages/caracara/modules/hosts/_online_state.py", line 59, in get_online_state
    device_online_state_data = batch_get_data(device_ids, self.hosts_api.get_online_state)
  File "/usr/local/lib/python3.10/dist-packages/caracara/common/batching.py", line 117, in batch_get_data
    raise Exception("At least one thread returned an error: " + str(errors))
Exception: At least one thread returned an error: [{'code': 400, 'message': 'request must contain between 0 and 100 ids'}, {'code': 400, 'message': 'request must contain between 0 and 100 ids'}, {'code': 400, 'message': 'request must contain between 0 and 100 ids'}, {'code': 400, 'message': 'request must contain between 0 and 100 ids'}, {'code': 400, 'message': 'request must contain between 0 and 100 ids'}, {'code': 400, 'message': 'request must contain between 0 and 100 ids'}, {'code': 400, 'message': 'request must contain between 0 and 100 ids'}, {'code': 400, 'message': 'request must contain between 0 and 100 ids'}, {'code': 400, 'message': 'request must contain between 0 and 100 ids'}, {'code': 400, 'message': 'request must contain between 0 and 100 ids'}, {'code': 400, 'message': 'request must contain between 0 and 100 ids'}, {'code': 400, 'message': 'request must contain between 0 and 100 ids'}, {'code': 400, 'message': 'request must contain between 0 and 100 ids'}, {'code': 400, 'message': 'request must contain between 0 and 100 ids'}, {'code': 400, 'message': 'request must contain between 0 and 100 ids'}, {'code': 400, 'message': 'request must contain between 0 and 100 ids'}, {'code': 400, 'message': 'request must contain between 0 and 100 ids'}, {'code': 400, 'message': 'request must contain between 0 and 100 ids'}, {'code': 400, 'message': 'request must contain between 0 and 100 ids'}, {'code': 400, 'message': 'request must contain between 0 and 100 ids'}, {'code': 400, 'message': 'request must contain between 0 and 100 ids'}, {'code': 400, 'message': 'request must contain between 0 and 100 ids'}, {'code': 400, 'message': 'request must contain between 0 and 100 ids'}, {'code': 400, 'message': 'request must contain between 0 and 100 ids'}, {'code': 400, 'message': 'request must contain between 0 and 100 ids'}, {'code': 400, 'message': 'request must contain between 0 and 100 ids'}, {'code': 400, 'message': 'request must contain between 0 and 100 ids'}, {'code': 400, 'message': 'request must contain between 0 and 100 ids'}, {'code': 400, 'message': 'request must contain between 0 and 100 ids'}, {'code': 400, 'message': 'request must contain between 0 and 100 ids'}, {'code': 400, 'message': 'request must contain between 0 and 100 ids'}, {'code': 400, 'message': 'request must contain between 0 and 100 ids'}, {'code': 400, 'message': 'request must contain between 0 and 100 ids'}, {'code': 400, 'message': 'request must contain between 0 and 100 ids'}, {'code': 400, 'message': 'request must contain between 0 and 100 ids'}, {'code': 400, 'message': 'request must contain between 0 and 100 ids'}, {'code': 400, 'message': 'request must contain between 0 and 100 ids'}, {'code': 400, 'message': 'request must contain between 0 and 100 ids'}, {'code': 400, 'message': 'request must contain between 0 and 100 ids'}, {'code': 400, 'message': 'request must contain between 0 and 100 ids'}, {'code': 400, 'message': 'request must contain between 0 and 100 ids'}, {'code': 400, 'message': 'request must contain between 0 and 100 ids'}, {'code': 400, 'message': 'request must contain between 0 and 100 ids'}, {'code': 400, 'message': 'request must contain between 0 and 100 ids'}, {'code': 400, 'message': 'request must contain between 0 and 100 ids'}, {'code': 400, 'message': 'request must contain between 0 and 100 ids'}, {'code': 400, 'message': 'request must contain between 0 and 100 ids'}, {'code': 400, 'message': 'request must contain between 0 and 100 ids'}, {'code': 400, 'message': 'request must contain between 0 and 100 ids'}, {'code': 400, 'message': 'request must contain between 0 and 100 ids'}, {'code': 400, 'message': 'request must contain between 0 and 100 ids'}, {'code': 400, 'message': 'request must contain between 0 and 100 ids'}, {'code': 400, 'message': 'request must contain between 0 and 100 ids'}, {'code': 400, 'message': 'request must contain between 0 and 100 ids'}, {'code': 400, 'message': 'request must contain between 0 and 100 ids'}, {'code': 400, 'message': 'request must contain between 0 and 100 ids'}, {'code': 400, 'message': 'request must contain between 0 and 100 ids'}, {'code': 400, 'message': 'request must contain between 0 and 100 ids'}, {'code': 400, 'message': 'request must contain between 0 and 100 ids'}, {'code': 400, 'message': 'request must contain between 0 and 100 ids'}, {'code': 400, 'message': 'request must contain between 0 and 100 ids'}, {'code': 400, 'message': 'request must contain between 0 and 100 ids'}, {'code': 400, 'message': 'request must contain between 0 and 100 ids'}, {'code': 400, 'message': 'request must contain between 0 and 100 ids'}, {'code': 400, 'message': 'request must contain between 0 and 100 ids'}, {'code': 400, 'message': 'request must contain between 0 and 100 ids'}, {'code': 400, 'message': 'request must contain between 0 and 100 ids'}, {'code': 400, 'message': 'request must contain between 0 and 100 ids'}, {'code': 400, 'message': 'request must contain between 0 and 100 ids'}, {'code': 400, 'message': 'request must contain between 0 and 100 ids'}]

To Reproduce

Call caracara.hosts.describe_devices()

Expected behavior

Devices are described without having everything explode

Environment

Operating System Version

Debian bookworm

Python Version

Python 3.10.5

Poetry Version

1.4.1

Python Package Versions

$ python3 -m pip freeze | grep -iE '(caracara|falcon|crowdstrike)'
caracara==0.3.0
crowdstrike-falconpy==1.2.15
falcon-toolkit==3.1.2

[ BUG ] Queued sessions are not listed - Invalid filter and more problems

Describe the bug

Queued sessions are not described by describe-queued-sessions

To Reproduce

1/ queue-command (on a offline host)
2/ describe-queued-sessions
3/ : 0 results

Expected behavior

Queued session are enumerated.

Environment

Operating System Version

Please provide your operating system type and version. Example: Red Hat Enterprise Linux 8.3

Python Version

Python 3.10.5

Poetry Version

Poetry (version 1.4.1)

Python Package Versions

$ pip freeze | grep -E '(caracara|falconpy)'
-e git+https://github.com/CrowdStrike/caracara/@0dd2bd265889e1421346f4a8ac58df73642c21c9#egg=caracara
crowdstrike-falconpy==1.2.11

Additional context

So, the problem is that in rtr.py _get_queued_session_ids you used "1" as a constant for True, and that makes the whole FQL filter invalid, and it yields nothing.

$ ./test.py RTR_ListAllSessions -p '{"filter":"offline_queued: True+deleted_at: null"}' -q | jq '.body.resources|length'
8
$ ./test.py RTR_ListAllSessions -q | jq '.body.resources|length'
18
$ ./test.py RTR_ListAllSessions -p '{"filter":"offline_queued: 1+deleted_at: null"}' -q | jq '.body.resources|length'
0
$ ./test.py RTR_ListAllSessions -p '{"filter":"offline_queued: zemlkqfjsqdmlkf+deleted_at: null"}' -q | jq '.body.resources|length'
0

Patching the code does show valid output

RTRApiModule: Searching for RTR sessions based on filter string: offline_queued: True+deleted_at: null
..
( before : caracara.common.batching: Batch data retrieval for list_sessions (0 items) )
( now : caracara.common.batching: Batch data retrieval for list_sessions (8 items) )
caracara.common.batching: ThreadPoolExecutor-0_0 | Batch worker started with a list of 8 items. Function: list_sessions

Once this is patched, well, it still shows nothing; but I guess we're facing another issue.

Thanks for the API wrapper btw, sorting out these different pagination methods is not really straightforwards otherwise.

(caracara-py3.10) $ git diff
diff --git a/caracara/modules/rtr/rtr.py b/caracara/modules/rtr/rtr.py
index b4f9e3e..ad88bfb 100644
--- a/caracara/modules/rtr/rtr.py
+++ b/caracara/modules/rtr/rtr.py
@@ -87,7 +87,7 @@ class RTRApiModule(FalconApiModule):
         List[str]: A list of IDs of all queued RTR sessions discovered.
         """
         self.logger.info("Searching for queued RTR sessions")
-        filter_str = "offline_queued: 1+deleted_at: null"
+        filter_str = "offline_queued: True+deleted_at: null"
         session_ids = self._search_sessions(filters=filter_str)
         return session_ids

Migrate to Prompt Toolkit

Right now, we use the bullet library for our examples. This project appears to be unmaintained and, in line with our work to move from pick to prompt_toolkit in Falcon Toolkit, we should also move to Prompt Toolkit here too to reduce the number of overall dependencies and risk.

Support streamed download to a path

There is currently no way either in falconpy or caracara to stream a file down to the disk. That causes 4GB requests to be entirely loaded in memory at some point.

We're using ugly hacks to pass stream=True to a raw requests.request and call it a day, so that we can stream a large file to disk chunk by chunk.

def rtr_session_download_to_path(self, session_id, sha256, destination, known_size = None):
    '''
    Downloads an extracted file straight into a file (7z -pinfected) using
    chunks, so that we don't have a 4GB single http request in memory at
    some point. Or several in parallel.
    '''
    # First, prepare a HTTP request by stealing the self.falcon config for URL & token
    url = f'{self.falcon.base_url}/real-time-response/entities/extracted-file-contents/v1'
    params = {
        'session_id': session_id,
        'sha256': sha256,
    }
    self.logger.debug(f'Getting file sha256={sha256}, session_id={session_id} into {destination}')
    total_written_bytes = 0
    with request(
        'get',url,
        # Here we assume the token is fresh enough, which is usually the case since we just listed the file properties.
        headers = self.falcon.headers(),
        verify = self.falcon.ssl_verify,
        stream = True,
        params = params,
        ) as r:
        if not destination.parent.exists():
            self.logger.info(f'Creating folder {destination.parent}')
            destination.parent.mkdir(parents = True, exist_ok = True)
        with destination.open('wb') as f, tqdm(
            desc=str(destination),
            total=known_size,
            unit='iB',
            unit_scale=True,
            unit_divisor=1024,
        ) as bar:
            self.logger.debug(f'Actual download iteration start')
            for chunk in r.iter_content(chunk_size=10*1024):
                written_bytes = f.write(chunk)
                bar.update(written_bytes)
                total_written_bytes += written_bytes

    return destination, total_written_bytes

Could this be done natively by caracara ? I'm no asyncio expert but there's some http + file magic to be done here imo.

Thanks !

[ BUG ] - Caracara requests a token even if not used

Describe the bug

At instanciation time, caracara.client.Client configures itself, sends numerous logs and then fires a POST to https://api.eu-1.crowdstrike.com/oauth2/token to get an API token. Could it be possible to have lazy authentication ? That would mean preparing offline settings, and only trigger network requests when an API operation is required.

The falconpy behavior is not to request a token unless it's needed ( https://github.com/CrowdStrike/falconpy/blob/main/src/falconpy/api_complete.py#L307 ) ; but I might have read this wrong.

To Reproduce

Instanciate a caracara.client.Client class, there's a network call.

Expected behavior

No network call unless asked to touch the network

Environment

Operating System Version

Debian bookworm

Python Version

3.10.5

Poetry Version

1.4.1

Python Package Versions

$ pip freeze | grep -E '(caracara|falconpy)'
caracara==0.2.2
crowdstrike-falconpy==1.2.12

Additional context

We have scripts that prepare handlers to request data from various locations, and one of the providers is Caracara. For cases where all the details are already cached offline ( so far it's about session data ) ; we end up instanciating a Client object just in case ; then not using it since all the data we need is in our cache ; and instead of having instantaneous results we have to wait for one (1) HTTP call, cause by the understandable need of caracara.client.Client to request a token when instanciated.

Could it be possible to only call self.api_authentication.token() when needed, even if that means remaining unaware of the base_url variable for a while ? (which is fine, because you won't ever need it unless authenticated to answer a query ).

Feel free to say this problem is convoluted :D

Cheers,

[ ENH ] Add default User-Agent string

Describe the enhancement

The User-Agent header currently does not default to specifying Caracara, so it will fall back to crowdstrike-falconpy/version.

Expected behavior

The default for this value should be crowdstrike-caracara/version.
When provided as a keyword, this value should be provided value (crowdstrike-caracara/version).

Environment

Operating System Version

All supported

Python Version

All supported

Poetry Version

All supported

Caracara Version

<= 0.1.0

[ Feature ] - Support enumerating IOCs

As mentioned in #80 , we'd like to benefit from caracara's numerous advantages compared to manually hitting the API with falconpy to enumerate IOC content.

So far, we're hitting indicator_search_v1 to get the identifiers, then iterate over pages of indicator_get_v1. Could it be possible to have a describe_iocs_raw somewhere ? (pretty much like describe_rule_groups_raw)

Thanks !

[ FEATURE ] Support enumerating all policy types.

So far caracara's excellent pagination system only exposes prevention policies and remote response policies. There are a few more (ignore the "global_config" one, it's a setting sent back by the API when querying a host policies (under "device_policies"), but there are no associated APIs, I guess that's some kind of default vendor system-wide per-OS policies or smth.

$ ls ./data/crowdstrike/policies.dev.* -1
./data/crowdstrike/policies.dev.device_control.json
./data/crowdstrike/policies.dev.firewall.json
./data/crowdstrike/policies.dev.global_config.json
./data/crowdstrike/policies.dev.prevention.json
./data/crowdstrike/policies.dev.remote_response.json
./data/crowdstrike/policies.dev.sensor_update.json

Please expose a describe_policies_raw function for all the policy types below :

ptypes = [
    'prevention',
    'sensor_update',
    'device_control',
    # 'global_config', 
    'remote_response',
    'firewall'
]

It's not a critical need of ours, we worked out pagination, but not really properly, and not multithreaded. I'd like to get rid of self.enumerate_paginated_api_endpoint in our own code base , and since there is some bit of knowledge you can't really guess on the type of pagination used behind each API endpoint here I am, asking for per-policy support.

I'm a little bit reluctant to try to hook caracara.common.pagination directly our own code (that does mostly what caracara does, offer management functions over API endpoints) ; mostly because it's tightly integrated with the rest of caracara and that would mean reimplementing a half-baked in-house caracara clone that will get obsolete as soon as these few missing features are implemented.

So far my "developer experience" was enhanced by caracara since I could drop our own pagination function for a few use cases implemented in caracara, assuming pagination is done correctly on caracara's side.

     def refresh_ioa_cache(self):
-        ioas = self.enumerate_paginated_api_endpoint('query_rule_groups_full', limit=100, sort='modified_on.desc')
+        ioas = self.caracara.custom_ioas.describe_rule_groups_raw()
+        ioas = list(ioas.values())
+        #ioas = self.enumerate_paginated_api_endpoint('query_rule_groups_full', limit=100, sort='modified_on.desc')
         self.logger.info(f'Storing {len(ioas)} IOA in {self.ioa_cache_path}')

Here's my shopping list on things I'd like to have enumerated through caracara ( just for reading )

All policy types ( todo ⚒️ ( missing : sensor_update, device_control, firewall )
RTRScripts & Putfiles (implemented) 🥳
Queued sessions ( warning, you're listing them using falconpy.RealTimeResponse.list_sessions not falconpy.RealTimeResponse.list_queued_sessions and they have a different schema and "pwd" as command while it's not the case, that's another issue right; RTR_ListAllSessions does list all ids but then real data for queued sessions has to be fetched from list_queued_sessions. 🙃 )
Users ( todo ⚒️)
Hosts ( implemented 🥳 )
Host groups (implemented 🥳 )
IOC ( todo ⚒️ )
IOA (implemented 🥳 )

I'll go open different issues for :

Users enumeration
IOC enumeration

I won't open an issue for the queued session thingy since I'm not comfortable with the issue diagnostic and my associated needs, so far.

Thanks for reading !

[ BUG ] get_devices example is only returning 500 results

Describe the bug

When using the examples/get_devices example, you are only returned 500 results (the DATA_BATCH_SIZE) regardless of the number of hosts available within the tenant.

To Reproduce

Configure the config.yml file to access a tenant with more than 500 managed hosts.
Execute the examples/get_devices example.

Expected behavior

All hosts within the tenant are returned.

Environment

Operating System Version

All supported.

Python Version

All supported.

Poetry Version

All supported.

Python Package Versions

<= 0.1.0

Falcon Filters Design Issue

Through working on #41 and trying to add filters for IOA Rule Groups, I encountered this issue.

The Problem

The CrowdStrike API consists of lots of fragmented services, which occasionally have inconsistencies. One of the inconsistencies is in choice of filter attributes, for example:

IOA rule groups filter platforms using the FQL attribute platform (source), which should be one of three values windows, linux or mac (case-sensitive).
Hosts filter platforms using the FQL attribute platform_name (source) which the docs state should be one of Windows, Linux or Mac, but testing brings that it can also be Android (which are all also case-sensitive).

Hiding this kind of complexity is what caracara is supposed to do, so we have to design a good abstraction.

Current Implementation (as of `fdb69e1`)

On our side
- Currently, on our side to define a falcon filter attribute, we inherit from FalconFilterAttribute, which helps by providing ways to easily provide validation that a filter is valid, as well as to hide the raw FQL name and use a more use friendly name. For example (here), platform_name is validated to only be one of Windows, Mac or Linux, and the user-facing name is OS.
- These filters, when defined, are separated per module, but are aggregated together in the Client class here, providing a customised version of the FalconFilter class to the user
- In this aggregation process, if two filters have the same user-friendly name, then only one of them survives into the dict passed to FalconFilter.
- This means we can't have two filters with the same user facing name, even if it seems they should be (e.g. the platform and platform_name example above).
On the user's side
- The user, having already constructed a client, creates a new filter with filters = client.FalconFilter()
- The user then adds a filter to the filter (confusing wording?) with something like filters.create_new_filter("OS", "Windows")
- The user probably expects this filter to work for filtering anything with any kind of OS associated with it, but they can only filter hosts with it (and would need to use a different word for different APIs, with our current implementation)

Some proposed solutions

Separate filter 'flavours' per module

For example, instead of having a unified FalconFilter, you would have separate filters, maybe HostsFalconFilter and IoaFalconFilter, so this separation is presented to the user directly. We can still provide unified names (i.e. OS becomes platform_name if used with HostsFalconFilter and becomes platform is used with IoaFalconFilter) to bring a degree of unity, but still expose to the user that filters are different depending on the data you're querying

pros✅
- can still hide some complexity by using consistent naming (as described in paragraph above)
- (subjective) arguably more 'correct'
  - user could be presented with autocomplete options in their IDE if we used methods, and only methods that correspond to FQL attributes that actually mean anything in that context will be shown
  - prevents the user from creating a filter and expecting it to work on one endpoint, when in reality they can't, as this separation could be enforced with Exceptions and IDE warnings using type hints (fail-fast)
cons🚫
- doesn't hide all the complexity

Unified filter, transparently translates to different filter 'flavours' upon use

i.e. the user would still create using filters.create_new_filter("OS", "Windows") or something similar, and if the user used it for filtering hosts, it would translate to platform_name:'Windows' in FQL, but if the user used it for filtering IOAs it would translate to platform:'windows' when used.

pros✅
- hides the most complexity
- doesn't require radically changing the user-facing API
cons🚫
- there may be FQL attribute across APIs that are harder to unify like this (the example of platform_name and platform are easily reconciled, but I haven't scoured the entire API to look for harder examples)

Other improvements

Whilst we're talking about redesigning how filters work, here's some other problems we could solve!

Overloaded terminology: 'filter' referring to both FalconFilter (a kind of collection of filters), and the filters within (added via create_new_filter).
- Could use some kind of term referring to a collection, i.e. FalconFilterGroup (if you can think of a better one let me know!). This also has the plus of fitting better with the variable name commonly used in caracara to refer to an instance of FalconFilter: filters (which is used in place of filter since filter is a python builtin).
- Could also do the same thing in the opposite direction, maintaining the naming of FalconFilter being the filter, but then renaming create_new_filter to be something like create_new_condition.
Passing in an operator as a string
- Okay this isn't really a problem, but it could be fun to explore something similar to how SQLAlchemy does it (i.e. overloading builtin comparison operators to build up queries in a very 'pythonic' way, hard to find a good example but see here)

footnote

Feel free to comment with any ideas!

[ BUG ] Logging scopes do not all start with "caracara"

Describe the bug

Some caracara modules do not carry the "caracara" name in their logging scope. This causes logging.getLogger('caracara').setLevel(logging.WARNING) to only affect some parts of caracara.

2023-04-12 09:24:16,742	CustomIoaApiModule	Initialising API module: CustomIoaApiModule
2023-04-12 09:24:16,742	FlightControlApiModule	Initialising API module: FlightControlApiModule
2023-04-12 09:24:16,742	FlightControlApiModule	Configuring the FalconPy Flight Control API
2023-04-12 09:24:16,743	HostsApiModule	Initialising API module: HostsApiModule
2023-04-12 09:24:16,743	HostsApiModule	Configuring the FalconPy Hosts API
2023-04-12 09:24:16,743	HostsApiModule	Configuring the FalconPy Host Group API
2023-04-12 09:24:16,743	PreventionPoliciesApiModule	Initialising API module: PreventionPoliciesApiModule
2023-04-12 09:24:16,745	PreventionPoliciesApiModule	Configuring the FalconPy Prevention Policies API
2023-04-12 09:24:16,745	ResponsePoliciesApiModule	Initialising API module: ResponsePoliciesApiModule
2023-04-12 09:24:16,745	ResponsePoliciesApiModule	Configuring the FalconPy Response Policies API
2023-04-12 09:24:16,746	RTRApiModule	Initialising API module: RTRApiModule

To Reproduce

Run some code like the following, that just silences the 'caracara' module to a specified level :

# Silence a little bit all this debug noise enabled by default
for caracara_logging_scope in (
    'caracara',
    #'CustomIoaApiModule',
    #'FlightControlApiModule',
    #'HostsApiModule',
    #'PreventionPoliciesApiModule',
    #'RTRApiModule',
    #'ResponsePoliciesApiModule',
):  
    if self.debug:
        pass
    else: 
        l = logging.getLogger(caracara_logging_scope)
        l.setLevel(logging.WARNING)

It's not much, and we have a workaround ready with the list of modules generating logs added to a hardcoded list. That being said, I have a vague feeling that some PEP might suggest prefixing logging scopes.

Expected behavior

All code loaded by caracara receives the same log level

Suggested fixes

Prefix these logging scopes with caracara or caracara.modules. What we do in our code base is just pick the current module and call it a day. Prior to that we even added the class name, but having one class per file makes it simpler. ( I won't comment on my spaghetti code base file lengths :D )

self.logger = logging.getLogger(".".join([
    self.__module__,
    # self.__class__.__name__,
]))

Environment

Operating System Version

Debian bookworm

Python Version

Python 3.10.5

Poetry Version

Poetry (version 1.4.1)

Python Package Versions

$ pip freeze | grep -E '(caracara|falconpy)'
caracara==0.2.2
crowdstrike-falconpy==1.2.12

[ ENH ] Handle multi-cid class

Describe the bug
Ability to develop a class that can be use natively to handle foreign cid with several api keys.

[ Feature ] - Support enumerating Users

As mentioned in #80 , we'd like to benefit from caracara's numerous advantages compared to manually hitting the API with falconpy to enumerate Users.

So far, we're hitting queryUserV1 to get the identifiers, then iterate over pages of retrieveUsersGETV1. Could it be possible to have a describe_users_raw somewhere ?

Thanks !

Users Module: Writable Endpoints

Writable Users Endpoints

Describe the bug

#81 handles pulling information from the API, but it does not allow any changes to be written back to Falcon.

We intend to add the write support here (as a full CRUD implementation of the User Management APIs), so this Issue will track that effort.

crowdstrike / caracara Goto Github PK

caracara's Introduction

Caracara

Features

Installation Instructions

Installing Caracara from PyPI using Poetry (Recommended!)

Poetry: Installation

Poetry: Upgrading

Poetry: Removal

Installing Caracara from PyPI using Pip

Pip: Installation

Pip: Upgrading

Pip: Removal

Basic Usage Examples

Examples Collection

Executing the Examples

Executing from a Poetry Shell

Executing without Activating the Virtual Environment

Documentation

Contributing

Why Caracara?

caracara's People

Contributors

Stargazers

Watchers

Forkers

caracara's Issues

Bug Report Template

Describe the bug

To Reproduce

Expected behavior

Environment

Operating System Version

Python Version

Poetry Version

Python Package Versions

Describe the bug

To Reproduce

Expected behavior

Environment

Operating System Version

Python Version

Poetry Version

Python Package Versions

Additional context

Describe the bug

To Reproduce

Expected behavior

Environment

Operating System Version

Python Version

Poetry Version

Python Package Versions

Additional context

Describe the enhancement

Expected behavior

Environment

Operating System Version

Python Version

Poetry Version

Caracara Version

Describe the bug

To Reproduce

Expected behavior

Environment

Operating System Version

Python Version

Poetry Version

Python Package Versions

The Problem

Current Implementation (as of fdb69e1)

Some proposed solutions

Separate filter 'flavours' per module

Unified filter, transparently translates to different filter 'flavours' upon use

Other improvements

footnote

Describe the bug

To Reproduce

Expected behavior

Suggested fixes

Current Implementation (as of `fdb69e1`)