• Home
  • Resources by chapter
    • Chapter 1: Why spatio-temporal epidemiology?
    • Chapter 2: Modelling health risks
    • Chapter 3: The importance of uncertainty
    • Chapter 4: Embracing uncertainty: the Bayesian approach
    • Chapter 5: The Bayesian approach in practice
    • Chapter 6: Strategies for modelling
    • Chapter 7: Is `real' data always quite so real?
    • Chapter 8: Spatial patterns in disease
    • Chapter 9: From points to fields: Modelling environmental hazards over space
    • Chapter 10: Why time also matters
    • Chapter 11: The interplay between space and time in exposure assessment
    • Chapter 12: Roadblocks on the way to causality: exposure pathways, aggregation and other sources of bias
    • Chapter 13: Better exposure measurements through better design
    • Chapter 14: New frontiers
  • Courses
    • Statistical methods in epidemiology
    • Spatio-temporal Methods in epidemiology
    • Advanced statistical modelling in space and time
    • BUC1 (CIMAT) When populations and hazards collide: modelling exposures
      and health
    • BUC2 (UNAM) Thinking Globally: The Role of Big Data
    • Detecting Pattens in Space and Time (CMM)
    • BUC4 (Bath) New Frontiers: Advanced Modelling in Space and Time
    • Big Data in Environmental Research
    • Statistics and Data Science in Research: unlocking the power of your data
    • Bayesian Hierarchical Models
    • BUCX (UNAM) Quantifying the Health Impacts of Air Pollution
    • Environmental Health Impact Assessment using R (IOM)
  • Computing resources
    • WinBUGS
    • INLA
    • EnviroStat
  • Book's webpage @ CRC

Data Science and Statistics in Research:

unlocking the power of your data

TO REGISTER FOR THE COURSE, PLEASE CLICK HERE
.

course outline

The ability to extract information from data is increasingly important in all areas of research and is an essential part of
evidence based decision making in the world of big data. The aim of the course is provide an interactive experience for
researchers from a variety of disciplines to enable them to use data science and statistical techniques to answer questions
in their own research.

The course will include both traditional style presentations and ‘hands-on’ computer practical sessions in which participants
will be guided through the analysis of a series of case studies from a number of different disciplines. These practical sessions
will involve training in R, a free software environment for statistical computing and graphics. During the last decade, the
momentum coming from both academia and industry has lifted R to become the single most important tool for statistics,
visualisation and data science. Participates will learn how to get their data into R, get it into the most useful structure,
transform it, visualise it and model it.

The course will be delivered in Ulaanbaatar Mongolia between 20th-24th November 2016 by a team of statisticians and
data scientists from University of Bath. The presenters are Gavin Shaddick, Daniel Simpson, Matthew Thomas, Aoibheann
Brady, Robbie Peck and Adwaye Rambojun.

The slides, practical sheets, and other relevant resources can be found below.

Slides

Session 1.1

Session 1.2

Session 1.4

Session 1.6

Session 2.2

Session 2.4

Session 2.6

Session 3.2

Session 3.4


Practical Sessions

Session 1.3

Session 1.5

Session 1.7

Session 2.3

Session 2.5

Session 2.7

Session 3.3

Session 3.5


Required R Packages

ggplot2

raster

foreign

Rmisc


Other Resources

How to install R and RStudio

  • ©Gavin Shaddick and James V. Zidek 2015
  • Chapter 1: Why spatio-temporal epidemiology?
  • Chapter 2: Modelling health risks
  • Chapter 3: The importance of uncertainty
  • Chapter 4: Embracing uncertainty: the Bayesian approach
  • Chapter 5: The Bayesian approach in practice
  • Chapter 6: Strategies for modelling
  • Chapter 7: Is `real' data always quite so real?
  • Chapter 8: Spatial patterns in disease
  • Chapter 9: From points to fields: Modelling environmental hazards over space
  • Chapter 10: Why time also matters
  • Chapter 11: The interplay between space and time in exposure assessment
  • Chapter 12: Roadblocks on the way to causality: exposure pathways, aggregation and other sources of bias
  • Chapter 13: Better exposure measurements through better design
  • Chapter 14: New frontiers
  • Statistical methods in epidemiology
  • Spatio-temporal Methods in epidemiology
  • Advanced statistical modelling in space and time
  • BUC1 (CIMAT) When populations and hazards collide: modelling exposures
    and health
  • BUC2 (UNAM) Thinking Globally: The Role of Big Data
  • Detecting Pattens in Space and Time (CMM)
  • BUC4 (Bath) New Frontiers: Advanced Modelling in Space and Time
  • Big Data in Environmental Research
  • Statistics and Data Science in Research: unlocking the power of your data
  • Bayesian Hierarchical Models
  • BUCX (UNAM) Quantifying the Health Impacts of Air Pollution
  • Environmental Health Impact Assessment using R (IOM)
  • WinBUGS
  • INLA
  • EnviroStat