Skip to content

Conversation

@marshvee
Copy link
Contributor

@marshvee marshvee commented May 2, 2024

Iframes are sometimes used to have parts of sites controlled by a CMS.

Here we add the option of inspecting the iframe's content and for any links that are to the site being indexed, having those included. It takes into account the tag, since if the base tag matches the site being indexed, then all relative URLs should be crawled.

If you want to enable this option, you can just add the flag:

python main.py --fetch-iframes

Resolves #90

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

iframe contents ignored

1 participant