Big Data & Data Science Training series

The National Institutes of Health (NIH) launched their Big Data to Knowledge (BD2K) initiative in 2012. This fall, the BD2K Training Coordinating Center is offering a free weekly webinar series on Fridays at 12:00 ET. The first lecture "Introduction to Big Data and the Data Life Cycle" will be given by Mark Musen, Professor of Biomedical Informatics and Stanford University. This introduction will set the stage for upcoming lectures. Topics for the fall are below.

9/9/16:  Introduction to big data and the data lifecycle (Mark Musen, Stanford)

SECTION 1: DATA MANAGEMENT

9/16/16: Data Management Overview (Bill Hersh, Oregon Health Sciences)

9/23/16: Finding and accessing datasets, Indexing  and Identifiers (Lucila Ohno-Machado, UCSD)

9/30/16: Data curation and Version control (Pascale Gaudet, Swiss Institute of Bioinformatics)

10/7/16: Ontologies (Michel Dumontier, Stanford)

10/14/16: Provenance(Zachary Ives, Penn)

10/21/16: Metadata standards (Susanna-Assunta Sansone, Oxford)

SECTION 2: DATA REPRESENTATION

10/28/16: SECTION 2: Data Representation Overview  (Anita Bandrowski, UCSD)

11/4/16:  Databases and data warehouses, Data: structures, types, integrations (Chaitan Baru, NSF)

11/11/16: No lecture — Veteran’s Day

11/18/16: Social networking data (TBD)

12/2/16:  Data wrangling, normalization, preprocessing (Joseph Picone, Temple)

12/9/16:  Exploratory Data Analysis (Brian Caffo, Johns Hopkins)

12/16/16  Natural Language Processing (Noemie Elhadad, Columbia)

For more details, check out the training site.