Home > ksh-scraper

ksh-scraper

Ksh-scraper is a project mainly written in PYTHON and JAVASCRIPT, it's free.

scraper for the hungarian statistics office

KSH 2 CSV

is a H.A.C.K. contribution to the 2010 Open Data Hackday. KSH is the Hungarian Central Statistical Office and it publishes it's data in some unprocessable html forms (btw those guys should look into sql injections to their precious Oracle DB).

Usage

console

ksh_scrape.py http://portal.ksh.hu/pls/ksh/docs/hun/xstadat/xstadat_eves/i_onp004b.html

web

Simply prepend the URL from portal.ksh.hu with our online ksh-scrape webservice url http://mx.ctrlc.hu/ksh-scrape?, like this: http://mx.ctrlc.hu/ksh-scrape?http://portal.ksh.hu/pls/ksh/docs/hun/xstadat/xstadat_eves/i_zoi011.html

web + automagic

Using Greasemonkey Firefox, Chrome, IE, you can install also the userscript which will automatically insert "download as CSV" links on the pages with html-encumbered data.

Previous:first_app