Skip to content
CSDE News & Events

Making Data Science Training Resources FAIR: John Darrell Van Horn, M.Eng., Ph.D., Associate Professor of Neurology, University of Southern California (Hosted by UW eScience Institute, UW Institute for Neuroengineering, iSchool DataLab, and CSE Interactive Data LabThis, 4/11/2018)

Posted: 3/30/2018 (Local Events)

Join The UW eScience Institute, UW Institute for Neuroengineering, iSchool DataLab, and CSE Interactive Data LabThis for their next Data Science Seminar with John Darrell Van Horn, M.Eng., Ph.D., associate professor of neurology, University of Southern California.

The seminar will be on Wednesday, Apr. 11, from 3:30 to 4:20 p.m., in the Physics/Astronomy Auditorium, room A102.

This seminar will be co-sponsored by the UW Institute for Neuroengineering.

View the flyer and RSVP here.


In our rapidly evolving information era, methods for handling large quantities of data obtained in biomedical research have emerged as powerful tools for confronting critical research questions. These methods are having significant impacts in diverse domains ranging from genomics, to health informatics, to environmental research, and beyond. The NIH’s Big Data to Knowledge (BD2K) Training Consortium, in particular, has worked to empower current and future generations of researchers with a comprehensive understanding of the data science ecosystem, giving them the ability to explore, prepare, analyze, visualize, and interpret Big Data.

To this end, the BD2K Training Coordinating Center (TCC) was funded to facilitate in-person and online learning, and to open the concepts of data science to the widest possible audience. In this presentation, I will describe the activities of the BD2K TCC, particularly the construction of the Educational Resource Discovery Index (ERuDIte). ERuDIte identifies, collects, describes, and organizes over 10,000 data science training resources, including: online data science materials from BD2K awardees; open online courses; and videos from scientific lectures and tutorials. Given the richness of online training materials and the constant evolution of biomedical data science, computational methods applying information retrieval, natural language processing, and machine learning techniques are required.

In effect, data science is being used to inform training in data science where the so-called FAIR principles apply equally to these resources as well as to the datatypes and methods they describe. As a result, the work of the TCC has aimed to democratize novel insights and discoveries brought forth via large-scale data science training. This presentation will be of interest to anyone seeking to personalize their own data science education, craft unique online training curricula, and/or share their own online training content.

Read Full Article

Date: 04/11/2018

Time: 3:30 PM

Location: Physics/Astronomy Auditorium (PAA)