Mini-Google
Internal documentation
URL Server
- Scope
- Distributes URLs to be crawled to crawlers.
- Interface
-
- Throught a declared server socket, it receives message (optional)
GET
from the Crawlers and it responds with
URL _urlid_ _urlstring_
- Throught another declared server socket, it receives messages
CRW _urlid_ _urlstring_
from the URL Resolver.
- Reads a file describing the controlled crawlers and the strategy to distribute the URLs to be parsed.