Tagged crawling | F# Snippets

Home Insert

Snippets tagged crawling

Parallel recursive crawler using agents

The aim here is to demonstrate a method of distributing work using the built in F# agent across multiple nodes in parallel the result of crawling one page might result in finding multiple new pages to fetch. This is a recursive process which will continue until no new URLs are found. The main focus is how to process a potentially indefinite queue of work across a pool of workers, rather than how to parse web pages.
6 people like this
Like the snippet!
Posted: 9 years ago by Daniel Bradley

Web Crawler extensions

The snippet extends a web crawler from snippet http://fssnip.net/3K. It synchronizes all printing using an additional agent (so printed text does not interleave) and the crawling function returns an asynchronous workflow that returns when crawling completes.
0 people like this
Like the snippet!
Posted: 2 years ago by Tomas Petricek

This web site is created using F# and Suave web server. It is hosted on Azure and the source code is on GitHub. Contributions are welcome!

The first version of fssnip.net has been created by @tomaspetricek back in 2010. This web site is a new, open-source and contribution-friendly version.

Check out the source code and contribute!
See the list of issues and suggestions
The syntax highlighting uses F# Formatting