Import FHIR search results to pandas dataframe

Import resources from FHIR Search API results to pandas dataframe by fhiry:

FHIR search query parameters

For filter options you can set by search_parameters see FHIR search common parameters for all resource types and additional FHIR search parameters for certain resource types like Patient, Condition, Observation, ...

Example: Import all observations from FHIR server

Fetch and import all resources (since empty search parameters / no filter) of type Observation to a pandas dataframe:

from fhiry.fhirsearch import Fhirsearch

fs = Fhirsearch(fhir_base_url = "http://fhir-server:8080/fhir")

df = fs.search(resource_type = "Observation", search_parameters = {})

print(df.info())

Example: Import all conditions with a certain code from FHIR server

Fetch and import all condition resources with Snomed (Codesystem http://snomed.info/sct) Code 39065001 in the FHIR element Condition.code (resource type specific FHIR search parameter code) to a pandas dataframe:

from fhiry.fhirsearch import Fhirsearch

fs = Fhirsearch(fhir_base_url = "http://fhir-server:8080/fhir")

my_fhir_search_parameters = {
    "code": "http://snomed.info/sct|39065001",
}

df = fs.search(resource_type = "Condition", search_parameters = my_fhir_search_parameters)

print(df.info())

Columns

see df.columns

Connection settings

To set connection parameters like authentication, SSL certificates, proxies and so on, set or add standard Python requests keyword arguments to the property requests_kwargs.

Examples:

Authentication

Authentication is set by requests parameter auth.

Example using HTTP Basic Auth:

from fhiry.fhirsearch import Fhirsearch

fs = Fhirsearch(fhir_base_url = "http://fhir-server:8080/fhir")

# Set basic auth credentials (https://requests.readthedocs.io/en/latest/user/authentication/#basic-authentication)
fs.requests_kwargs["auth"] = ('myUser', 'myPassword')

Proxy settings

You can set HTTP(S)-Proxies by requests parameter proxies.

Example:

fs.requests_kwargs["proxies"] = {
    'http': 'http://10.10.1.10:3128',
    'https': 'http://10.10.1.10:1080',
}

Performance

Fetching all found resources from FHIR server

Since such search calls are fetching all found resources of the searched resource type matching the fhir search parameters (if none, fetching all resources of the resource type) from the FHIR server, dependent on the performance of the FHIR Server for example fetching one million resources by FHIR search (page thorough all the search results pages) can take an hour to load the resources into the resulting pandas dataframe which for this example has a RAM usage of few hundred MB RAM.

Decrease RAM usage

If you want to analyze only certain elements, you can decrease RAM usage and network overhead by defining the elements you need for your data analysis by the FHIR search option _elements.

Example:

from fhiry.fhirsearch import Fhirsearch

fs = Fhirsearch(fhir_base_url = "http://fhir-server:8080/fhir")

my_fhir_search_parameters = {

... Other FHIR search parameters / filters ...

    "_elements": "code,verification-status,recorded-date",
}

df = fs.search(resource_type = "Condition", search_parameters = my_fhir_search_parameters)

print(df.info())

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!