mirror of
https://github.com/bellingcat/sugartrail.git
synced 2026-06-08 03:28:31 +03:00
215 lines
5.5 KiB
Plaintext
215 lines
5.5 KiB
Plaintext
{
|
|
"cells": [
|
|
{
|
|
"cell_type": "markdown",
|
|
"id": "177cb892",
|
|
"metadata": {},
|
|
"source": [
|
|
"*In this tutorial we will investigate two seperate companies and check if they are connected.*"
|
|
]
|
|
},
|
|
{
|
|
"cell_type": "markdown",
|
|
"id": "b8cd2e41",
|
|
"metadata": {},
|
|
"source": [
|
|
"There are instances where we may want to see if two companies are connected. We can do this by simply building a network for each company and comparing them to see if there are any common officers, addresses or companies.\n",
|
|
"\n",
|
|
"Lets test this approach with two example companies, Zahawi & Zahawi Ltd (07285998) and Gorgeous Services Limited (05714521):"
|
|
]
|
|
},
|
|
{
|
|
"cell_type": "code",
|
|
"execution_count": null,
|
|
"id": "10e091d0",
|
|
"metadata": {},
|
|
"outputs": [],
|
|
"source": [
|
|
"import sugartrail\n",
|
|
"import pandas as pd"
|
|
]
|
|
},
|
|
{
|
|
"cell_type": "code",
|
|
"execution_count": null,
|
|
"id": "6f5bd44c",
|
|
"metadata": {},
|
|
"outputs": [],
|
|
"source": [
|
|
"sugartrail.api.basic_auth.username = \"\""
|
|
]
|
|
},
|
|
{
|
|
"cell_type": "markdown",
|
|
"id": "b5897b17",
|
|
"metadata": {},
|
|
"source": [
|
|
"Create one network for Zahawi & Zahawi including some limits to reduce the number of possibly irrelevant connections:"
|
|
]
|
|
},
|
|
{
|
|
"cell_type": "code",
|
|
"execution_count": null,
|
|
"id": "8ec6a941",
|
|
"metadata": {},
|
|
"outputs": [],
|
|
"source": [
|
|
"zahawi_connections = sugartrail.base.Network(company_id='07285998')\n",
|
|
"zahawi_connections.hop.officer_appointments_maxsize = 20\n",
|
|
"zahawi_connections.hop.officers_at_address_maxsize = 20\n",
|
|
"zahawi_connections.hop.companies_at_address_maxsize = 20"
|
|
]
|
|
},
|
|
{
|
|
"cell_type": "markdown",
|
|
"id": "7e27a1f1",
|
|
"metadata": {},
|
|
"source": [
|
|
"Create a second network for Gorgeous Services:"
|
|
]
|
|
},
|
|
{
|
|
"cell_type": "code",
|
|
"execution_count": null,
|
|
"id": "eadec2f9",
|
|
"metadata": {},
|
|
"outputs": [],
|
|
"source": [
|
|
"gorgeous_connections = sugartrail.base.Network(company_id='05714521')\n",
|
|
"gorgeous_connections.hop.officer_appointments_maxsize = 20\n",
|
|
"gorgeous_connections.hop.officers_at_address_maxsize = 20\n",
|
|
"gorgeous_connections.hop.companies_at_address_maxsize = 20"
|
|
]
|
|
},
|
|
{
|
|
"cell_type": "markdown",
|
|
"id": "e8fd1658",
|
|
"metadata": {},
|
|
"source": [
|
|
"We can now pass both networks to the `find_network_connections` method which returns any connections found between two networks. The method accepts two networks as input and an optional `max_depth` value (defaults to 5) which sets the maximum depth of network we will build for both. `find_network_connections` builds each network up to the `max_depth` value and completes when connections are found or the `max_depth` is reached."
|
|
]
|
|
},
|
|
{
|
|
"cell_type": "code",
|
|
"execution_count": null,
|
|
"id": "6a311159",
|
|
"metadata": {},
|
|
"outputs": [],
|
|
"source": [
|
|
"# zahawi_connections=sugartrail.base.Network(file=f'{sugartrail.const.networks_path}multinode/connection_check_a.json')"
|
|
]
|
|
},
|
|
{
|
|
"cell_type": "code",
|
|
"execution_count": null,
|
|
"id": "2b621425",
|
|
"metadata": {},
|
|
"outputs": [],
|
|
"source": [
|
|
"# gorgeous_connections=sugartrail.base.Network(file=f'{sugartrail.const.networks_path}multinode/connection_check_b.json')"
|
|
]
|
|
},
|
|
{
|
|
"cell_type": "code",
|
|
"execution_count": null,
|
|
"id": "dae95c03",
|
|
"metadata": {},
|
|
"outputs": [],
|
|
"source": [
|
|
"connections = sugartrail.multinetwork.find_network_connections(zahawi_connections, gorgeous_connections)"
|
|
]
|
|
},
|
|
{
|
|
"cell_type": "markdown",
|
|
"id": "4eee3f43",
|
|
"metadata": {},
|
|
"source": [
|
|
"Looks like a connection was found. We can see by the long string of characters that its an officer ID:"
|
|
]
|
|
},
|
|
{
|
|
"cell_type": "code",
|
|
"execution_count": null,
|
|
"id": "e02ca7c4",
|
|
"metadata": {},
|
|
"outputs": [],
|
|
"source": [
|
|
"connections"
|
|
]
|
|
},
|
|
{
|
|
"cell_type": "markdown",
|
|
"id": "78ec9aab",
|
|
"metadata": {},
|
|
"source": [
|
|
"We can now trace the path from Zahawi & Zahawi to this connection:"
|
|
]
|
|
},
|
|
{
|
|
"cell_type": "code",
|
|
"execution_count": null,
|
|
"id": "fc581d24",
|
|
"metadata": {},
|
|
"outputs": [],
|
|
"source": [
|
|
"pd.DataFrame(zahawi_connections.find_path('g8BmvnpH8blqT87i93sgJeowx7I'))"
|
|
]
|
|
},
|
|
{
|
|
"cell_type": "markdown",
|
|
"id": "efe1add8",
|
|
"metadata": {},
|
|
"source": [
|
|
"... and the path from Gorgeous Connections to the connection:"
|
|
]
|
|
},
|
|
{
|
|
"cell_type": "code",
|
|
"execution_count": null,
|
|
"id": "786d0d23",
|
|
"metadata": {},
|
|
"outputs": [],
|
|
"source": [
|
|
"pd.DataFrame(gorgeous_connections.find_path('g8BmvnpH8blqT87i93sgJeowx7I'))"
|
|
]
|
|
},
|
|
{
|
|
"cell_type": "markdown",
|
|
"id": "b67e8187",
|
|
"metadata": {},
|
|
"source": [
|
|
"Reading both paths tells us how Zahawi & Zahawi connect to Gorgeous Connections. Zahawi & Zahawi has Nadhim Zahawi as an officer who has YOUGOV PLC as an appointment which has Benjamin Elliot as an officer who is also an officer of Gorgeous Services Limited."
|
|
]
|
|
},
|
|
{
|
|
"cell_type": "code",
|
|
"execution_count": null,
|
|
"id": "1cc6c344",
|
|
"metadata": {},
|
|
"outputs": [],
|
|
"source": []
|
|
}
|
|
],
|
|
"metadata": {
|
|
"kernelspec": {
|
|
"display_name": "Python 3 (ipykernel)",
|
|
"language": "python",
|
|
"name": "python3"
|
|
},
|
|
"language_info": {
|
|
"codemirror_mode": {
|
|
"name": "ipython",
|
|
"version": 3
|
|
},
|
|
"file_extension": ".py",
|
|
"mimetype": "text/x-python",
|
|
"name": "python",
|
|
"nbconvert_exporter": "python",
|
|
"pygments_lexer": "ipython3",
|
|
"version": "3.10.14"
|
|
}
|
|
},
|
|
"nbformat": 4,
|
|
"nbformat_minor": 5
|
|
}
|