A small script that crawls a website for links. It was written to research spammy websites that market themselves as government support.
It takes two parameters:
start_url = 'https://governmentassistanceonline.com/'
max_depth = 2
It returns a dataframe of the URLs found and the main domain of each.
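
A rough sketch of how a crawl like this could look, assuming Python and a pandas dataframe for the output. None of these names come from the script itself: `LinkParser`, `fetch_html`, `main_domain`, `crawl`, and `to_dataframe` are hypothetical. Two further assumptions: only pages on the start site are followed (external links are recorded but not crawled), and the "main domain" is taken to be the last two labels of the hostname, which is wrong for suffixes like .co.uk.

```python
from collections import deque
from html.parser import HTMLParser
from urllib.parse import urldefrag, urljoin, urlparse
from urllib.request import urlopen


class LinkParser(HTMLParser):
    """Collects the href value of every <a> tag in a page."""

    def __init__(self):
        super().__init__()
        self.links = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value:
                    self.links.append(value)


def fetch_html(url):
    """Download a page and return its text, or '' on common network errors."""
    try:
        with urlopen(url, timeout=10) as response:
            return response.read().decode("utf-8", errors="replace")
    except (OSError, ValueError):
        return ""


def main_domain(url):
    """Assumption: the main domain is the last two labels of the hostname."""
    host = urlparse(url).hostname or ""
    return ".".join(host.split(".")[-2:])


def crawl(start_url, max_depth, fetch=fetch_html):
    """Breadth-first crawl; returns one {'url', 'domain'} row per URL found."""
    seen = {start_url}
    queue = deque([(start_url, 0)])
    rows = []
    while queue:
        url, depth = queue.popleft()
        rows.append({"url": url, "domain": main_domain(url)})
        # Assumption: stop at max_depth and do not crawl external sites.
        if depth >= max_depth or main_domain(url) != main_domain(start_url):
            continue
        parser = LinkParser()
        parser.feed(fetch(url))
        for href in parser.links:
            # Resolve relative links and drop '#fragment' parts.
            link = urldefrag(urljoin(url, href)).url
            if link.startswith(("http://", "https://")) and link not in seen:
                seen.add(link)
                queue.append((link, depth + 1))
    return rows


def to_dataframe(rows):
    """Convert crawl rows to a pandas DataFrame (pandas imported lazily)."""
    import pandas as pd
    return pd.DataFrame(rows, columns=["url", "domain"])
```

With this sketch, `to_dataframe(crawl(start_url, max_depth))` would give one row per URL. The `fetch` argument exists only so the crawl can be tried against canned HTML without touching the network.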
Maybe I'll make this into something someday, but there is probably already a package out there that I'm missing.