Google Dorking
Crawlers
Name the key term of what a "Crawler" is used to do
What is the name of the technique that "Search Engines" use to retrieve this information about websites?
What is an example of the type of contents that could be gathered from a website?
Search Engine Optimisation
Robots.txt
Keyword
Function
Where would "robots.txt" be located on the domain "ablog.com"?
If a website was to have a sitemap, where would that be located?
How would we only allow "Bingbot" to index the website?
How would we prevent a "Crawler" from indexing the directory "/dont-index-me/"?
What is the extension of a Unix/Linux system configuration file that we might want to hide from "Crawlers"?
Sitemaps
What is the typical file structure of a "Sitemap"?
What real life example can "Sitemaps" be compared to?
Name the keyword for the path taken for content on a website
Google Dorking
What would be the format used to query the site bbc.co.uk about flood defences?
What term would you use to search by file type?
What term can we use to look for login pages?
Last updated