Web content indexation and automatized deployment on OpenStack
Hyphe is a web crawler designed for social scientists, and developped by Sciences-Po médialab.
We added the following features:
- Automatic textual indexation of web corpuses by multiprocess content extraction and indexation inElasticsearch
- Automatic deployment of Hyphe server on OpenStack compatible hosting services
A Open Source and Open Data project