-
Notifications
You must be signed in to change notification settings - Fork 6
/
DataDictionary.csv
We can make this file beautiful and searchable if this error is corrected: It looks like row 3 should actually have 1 column, instead of 3 in line 2.
25 lines (25 loc) · 2.3 KB
/
DataDictionary.csv
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
Name;Function;Inputs;Output;Match function;ID;Tier;Description;
Shared Domain Name;get_domain_name();URL of webpage;Domain name as string;Exact;- ;0;https://github.com/user as input gives github.com;
IP address;add_ip_address();Domain name from get_domain_name();Returns a list of IPs with a dictionary per IP with indicator type, content, and domain name;Exact;ip;1;Uses socket to retrieve the domain’s IP ;
;;;Everything till the last digits of the IP;Exact;subnet;2;;
;get_who_is();URL of webpage;WhoIs data;dict_exact;whois;1;;
;parse_meta_tags(url, soup);URL of webpage, soup (beautifulSoup) object of html of webpage;Verification IDs;Exact;verification_id;1;Uses Soup attributes ;
;;;Social media IDs;Exact;meta_social;1;;
;;;Content of other meta tags;Exact;meta_generic;2;;
;add_builtwith_indicators();Domain, API Key for built with;Tech used on the page;IOU(exact);techstack;3;Calling BuiltWith API;
;get_tech_identifiers();;IDs of tech used on the page;Exact;tech_identifier;3;Calling BuiltWith API;
;find_uuids();URL and webpage content as text ;Any UUIDs on the page;Exact;uuid;3;RegEx search;
;find_wallets();URL and webpage content as text ;Crypto Wallet IDs (Ethereum);Exact;crypto-wallet;1;RegEx search;
;add_associated_domains_from_cert();;;Exact;certificate;1;;
;;;;Exact;cert-domain;1;;
;find_google_analytics_id(;URL and webpage content as text ;Google analytics ID;Exact;ga_id;1;RegEx search;
;find_google_tag_id();URL and webpage content as text ;Google Tag ID;Exact;ga_tag_id;1;RegEx search;
;add_cdn_domains();;;IOU(exact);cdn-domain;2;;
;find_domain_suffix(;URL;Domain name and suffix;Exact;domain_suffix;0;Split and retrieve;
;find_second_level_domain(;URL;Domain Name;Exact;domain;0;Split and retrieve;
;add_urlscan_indicators(;;;IOU(exact);global_variable;3;;
;;;;dict_exact;urlscan_certificate;1;;
;parse_classes();URL of webpage, soup (beautifulSoup) object of html of webpage;CSS class used;IOU(exact);css-class;3;;
;parse_dom_tree();URL of webpage, soup (beautifulSoup) object of html of webpage;DOM tree of webpage;Similarity ;dom_tree;3;;
;parse_images();URL of webpage, soup (beautifulSoup) object of html of webpage;Returns PHash of images;abs_difference_vs_threshold;image-phash;3;;
;find_yandex_track_id();URL and webpage content as text ;Returns Yandex ID;Exact;yandex_tag_id;1;RegEx search;