Dependencies

Ruby 2.2.3 (check out rbenv for a ruby version manager)
phantomjs (assuming you have Hombrew, brew install phantomjs)
bundler (gem install bundler)

Getting started

Make sure you have up-to-date gems: bundle
Execute the script by running: EMAIL=<your versionista email> PASSWORD=<your password> N=<number of hours back> INDEX=<starting index of csv> ruby capybara_script.rb
If the script completes successfully, you will have new csvs written in the output/ directory.

Extra

Sometimes the current page the script is scraping does not contain the expected html it is seeking. In these cases, Capybara will wait a set amount of time to see whether the content appears before giving up and throwing an error ( that we gracefully rescue for diff pages). The default time is 2 seconds. This number of seconds can me modified by passing the ENV variable "PAGE_WAIT_TIME" when executing the script. For example: PAGE_WAIT_TIME='1.5' or PAGE_WAIT_TIME=10 Beware that with too little a wait time, pages of the script besides the comparison pages may start failing.

Name		Name	Last commit message	Last commit date
Latest commit History 19 Commits
output		output
.gitignore		.gitignore
.ruby-version		.ruby-version
CONTRIBUTING.md		CONTRIBUTING.md
Gemfile		Gemfile
Gemfile.lock		Gemfile.lock
README.md		README.md
capybara_script.rb		capybara_script.rb
csv_writer.rb		csv_writer.rb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Repository files navigation

Dependencies

Getting started

Extra

About

Uh oh!

Releases

Sponsor this project

Uh oh!

Packages

Contributors 3

Uh oh!

Languages

Uh oh!

edgi-govdata-archiving/versionista-outputter

Folders and files

Latest commit

History

Repository files navigation

Dependencies

Getting started

Extra

About

Resources

Code of conduct

Contributing

Uh oh!

Stars

Watchers

Forks

Releases

Sponsor this project

Uh oh!

Packages 0

Contributors 3

Uh oh!

Languages

Packages