Center for Studies in Demography and Ecology

Introduction to Web Scraping Using Helena

Instructor: Cori Mar


Helena is a high-level programming language for automating repetitive interactions with well-structured webpages. A user demonstrates how to scrape the first row of a dataset, and Helena writes a program that scrapes the remaining hundreds or thousands of rows. The name refers both to this web automation language and to the Helena extension that uses it.

See http://helena-lang.org/ for an overview of the Helena project.

In this workshop, participants will draft a Helena program by recording a browser interaction, then use the Helena editor to adapt, extend, and understand it.

Guest Instructor: Sarah Chasins
UC Berkeley Department of Electrical Engineering and Computer Sciences

Materials

Coming soon …

Additional Resources

INTRODUCTION

EDITING TABLES, SAVING, SCHEDULING, AND DEBUGGING PROGRAMS