Dr Miles Benton
Senior Scientist Bioinformatics
Human Genomics, Institute of Environmental Science and Research (ESR)
.large[HGSA Wellington, 3
rd-6
th August 2019]
.center[
]
---
class: middle
# .center[Advocate for reproducible research...]
## .center[Where possible my presentations and code are available online]
<br />
<p>
.center[
<img src="images/github_logo.png" style="width: 520px; margin-right: 1%; margin-top: 1.5em;"/>
<img src="images/sirselim_qrcode.png" style="width: 202px; margin-right: 1%; margin-top: 1.5em;"/>
]
</p>
<br />
.center[[sirselim.github.io/presentations](http://sirselim.github.io/presentations)]
---
class: middle inverse
# QUT (February 2014 - June 2018)
## Brisbane (QLD), Australia
### <span style="color:#3498DB">PostDoc Research Fellow</span>
<p>
.center[
<img src="images/qut_ihbi.jpg" style="width: 240px; margin-right: 1%; margin-top: 1.5em; border: 3px solid white;"/>
<img src="images/qut_gardens.jpg" style="width: 240px; margin-right: 1%; margin-top: 1.5em; border: 3px solid white;"/>
]
</p>
---
layout: false
background-image: url("images/proton2.jpg")
background-size: cover
---
class: middle
# Diagnostics Lab
.large[Gene panel and Exome Sequencing (NATA accredited - neurological disorders)]
<br>
## Sanger -> Gene panel -> Exome
* move to WES -> increase in variants
* manual approach through 30-60K rows of data in Excel
* increased time to identify and validate variant(s)
---
class: middle inverse
# VCF-DART - what it's not...
It is **NOT** a variant calling and quality control pipeline
... it's about data
---
# Overview
<p>
.center[<img src="images/Pipeline_flowchart_v2.png" style="width: 95%; margin-right: 1%; margin-top: -0.5em; border: 3px solid white;"/>]
</p>
---
# Variant tiers
<p>
.center[<img src="images/tiers.png" style="width: 70%; margin-right: 1%; margin-top: -0.5em; border: 3px solid white;"/>]
</p>
---
class: middle inverse
# "Module 1" - VCF-DART
.large[(<span style="color:#3498DB">**V**</span>ariant <span style="color:#3498DB">**C**</span>all <span style="color:#3498DB">**F**</span>ormat - <span style="color:#3498DB">**D**</span>iagnostic <span style="color:#3498DB">**a**</span>nd <span style="color:#3498DB">**R**</span>eporting <span style="color:#3498DB">**T**</span>ool)]
---
layout: false
background-image: url("images/vcfdart_1.png")
background-size: contain
---
layout: false
background-image: url("images/vcfdart_2.png")
background-size: contain
---
class: middle
# Run-time
.huge[30-50K variants (avg Proton exome VCF):]
- 6-10 mins (24 core 256GB RAM)
- 17-23 mins (4 core 12GB RAM)
---
class: middle inverse
# "Module 2" - VCF-DART Viewer
---
class: middle center
.huge[[...live demonstration...](https://130.216.216.85/sample-apps/WES_ShinyDiscover/)]
<br>
.huge[**?**]
---
layout: false
background-image: url("images/backup_1.png")
background-size: contain
---
layout: false
background-image: url("images/backup_2.png")
background-size: contain
---
layout: false
background-image: url("images/backup_3.png")
background-size: contain
---
layout: false
background-image: url("images/backup_4.png")
background-size: contain
---
layout: false
background-image: url("images/backup_5.png")
background-size: contain
---
layout: false
background-image: url("images/backup_6.png")
background-size: contain
---
class:
<p>
.center[<img src="images/nectarcloud.png" style="width: 70%; margin-right: 1%; margin-top: -0.5em; border: 3px solid white;"/>]
</p>
Spun up VM to give reviewers a chance to ~~break~~ test things.
* using 6 public exomes (1000G)
<br>
--
.large[**Allocated VM:**]
* Ubuntu 18.08
* 4 cores
* 12GB RAM
* 30GB root
* 120GB ephemeral disk
--
<br>
...so it can scale down to quite reasonable specs...
---
class: middle
## VCF-DART in the lab
* much quicker time to variant shortlist and curation
* can get to validation sooner (if required)
* meaning report delivered faster
## Summary
* open source
* modular (tools and databases)
* scalable (laptops -> servers)
* easy to deploy and use [soon to become easier]
* versioning
---
class: middle inverse
## Current state of play
* exciting times!
* cancer genomics - Auckland University (Prof Cristin Print), Genomics Aotearoa
* merging 'pipelines' to offer a flexible, unified and freely distributed system across NZ
* accreditation
<p>
.center[
<img src="images/UoA.png" style="width: 272px; margin-right: 1%; margin-top: 1.5em; border: #FFFFFF 3.5px outset;"/>
<img src="images/GA-Wide-Colour-1200px.jpg" style="width: 315px; margin-right: 1%; margin-top: 1.5em; border: #FFFFFF 3.5px outset;"/>
]
</p>
---
class: middle
## Availability / Deployment
<br>
.large[Working on a wide range of deployment options:]
<br>
<p>
.center[
<img src="images/github.jpg" style="width: 200px; margin-right: 2%;"/>
<img src="images/aws.png" style="width: 215px; margin-right: 2%;"/>
<img src="images/singularity-logo.svg" style="width: 135px; margin-right: 2%; margin-left: 2%;"/>
<img src="images/NESI-logo.jpg" style="width: 300px; margin-right: 2%;"/>
]
</p>
---
class: middle inverse
# to-do
* ✔ ~~submit~~ manuscript - accepted (The Journal of Molecular Diagnostics)
* ✔ [release to the wild - GitHub (anyone interested can help out)](https://github.com/sirselim/WES_ShinyDiscover)
* finish implementing ClinGen & ClinVar links
* option for GnomAD-beta
* refactoring main code base (snakemake)
* complete docker/singularity version
* continue developing documentation!
* work with clinicians to further develop and refine
---
class: top
# acknowledgements
<div id="images">
<a href="http://xyz.com/hello">
<img src="images/robert.jpg" width="180px" height="250px">
<div class="caption">Dr Robert Smith (QUT)</div>
</a>
<a href="http://xyz.com/hi">
<img src="images/donia.jpg" width="180px" height="250px">
<div class="caption">Dr Donia Macartney-Coxson (ESR)</div>
</a>
</div>
.pull-left[.medium[
Prof Lyn Griffiths
Craig Windell (QUT)
QUT lab group
]]
.pull-right[.medium[
Prof Cristin Print (UoA)
Dr Ben Curran (UoA)
Wellington Regional Genetics Lab
QRISCloud / Nectar
]]
---
class: middle inverse
<p>
.huge[.center[<b>Thank you!</b>]]
<br>
</p>