ACHE Crawler
0.8.0
Crawling Strategies
Scope
TODO
Hard-focus vs. Soft-focus
TODO
Link Classifiers
TODO
Online Learning
TODO
Backlink/Bipartite Crawling
TODO