Update 'README.md'

This commit is contained in:
dano 2020-06-18 13:19:40 +00:00
parent 85c0380e16
commit 2ac0b911f5

View File

@ -28,6 +28,38 @@ Install package:
pip install https://git.kemt.fei.tuke.sk/dano/websucker-pip/archive/master.zip
### Initialize and setup database
If you have Cassandra installed, first initialize the database schema using the cqlsh command, the schema can be found in the schema.sql file
You set up the database using an environment variable (if it is on another machine):
export CASSANDRA_HOST=localhost
export CASSANDRA_PORT=9142
## Usage
websuck --help
### Create initial domain list
Save the list of domains to a file, e.g.
echo www.sme.sk > domains.txt
### Visit initial domains in file
websuck --visit file domains.txt
### Visit unvisited domains
websuck --visit unvisited 100