Git Product home page Git Product logo

Hi there 👋

I am a lead statistical programming analyst and lead software engineer at Roche. My daily job involves wide range of data analysis activities in pharmaceutical industry and developing software packages to support other data scientist at Roche.

Before joining Roche, I was a Senior Data Scientist at Sensyne Health. I also worked as a research associate at the Oxford Big Data Institute and Wellcome Centre for Human Genetics, at where my job mainly focused on malaria parasite genomes and statistical genomic researches using a lot of coalescent theory. On the side, I also worked as a research consultant at the BGI, at where I contributed and developed methods for data storage using DNA.

I love programming! Check out some of open source tools at this page.

GitHub Streak

朱砂于2006年新西兰坎特伯雷大学破格录取直接升入大学二年级,在校成绩优异突出(全校前15%),连续在2008,2009获得数学统计系奖学金。本科毕业后留校直博,连续三年获得坎大一等博士学奖学金,于2013年取得统计博士学位,同年荣获由**国家留学基金委颁发的国家优秀自费留学生奖(新西兰两位获奖者之一),并且被牛津大学录用并开展博士后科研工作, 立志于基因组溯祖模型的研究,在人类种群分布以及恶性疟疾疟原虫分型的运用。先后发表17篇学术论文,其中一作文章10篇,通讯作者文章4篇。公开发表6个开源软件包和软件库,支持多种编程语言下载(其中软件scrm在cran的下载量超过35000次)。并多次受邀在国际学术会议和论坛发言,尤其是在2019年1月,作为牛津大学的五位学生代表之一在新加坡参加了全球科学家峰会,并向多位诺贝尔奖,菲尔兹奖,千年科技奖获奖者交流学习。在牛津大数据研究所博后期间,被深圳华大基因公司特聘为海外专家顾问,联合开发DNA存储技术,指导并研发代码转录及压缩的算法和软件,并发表3篇学术论文(其中一篇被Nature Computational Science接收)。2019年,加入人工智能制药公司Sensyne Health, 作为“血液抗凝剂对心力衰竭患者的益处研究”技术项目负责人,设计并使用生存分析来检验假设。研究结果在股东大会和伦敦交易所展示。自2020年加入罗氏制药,先后在三个产品研发团队担任技术带头人, 主要成果包括

  1. 通过软件开发,填补了数据处理软件的空缺,并拓展运用在14个课题中,实现了数据处理的一致性,并减少了十倍的软件维护工作量。
  2. 通过云计算改善程序流程构架,运行时间从4小时缩短到半小时,显著提高了团队工作效率。
  3. 特发性肺纤维化与生物标志物的专利申请书。
  4. 2021年评选罗氏产品部的最高奖项“卓越突破奖”,本人带领的高级软件团队在180个团队参选评比中,为14个获奖团队之一。

Joe Zhu's Projects

brits icon brits

Code of NIPS18 Paper: BRITS: Bidirectional Recurrent Imputation for Time Series

deeplearning-500-questions icon deeplearning-500-questions

深度学习500问,以问答形式对常用的概率知识、线性代数、机器学习、深度学习、计算机视觉等热点问题进行阐述,以帮助自己及有需要的读者。 全书分为15个章节,近20万字。由于水平有限,书中不妥之处恳请广大读者批评指正。 未完待续............ 如有意合作,联系[email protected] 版权所有,违权必究 Tan 2018.06

deepvariant icon deepvariant

DeepVariant is an analysis pipeline that uses a deep neural network to call genetic variants from next-generation DNA sequencing data.

dh-tern icon dh-tern

Table, Listings, and Graphs (TLG) library for common outputs used in clinical trials

dplyr icon dplyr

dplyr: A grammar of data manipulation

enterprise-registration-data-of-chinese-mainland icon enterprise-registration-data-of-chinese-mainland

**大陆 31 个省份1978 年至 2019 年一千多万工商企业注册信息,包含企业名称、注册地址、统一社会信用代码、地区、注册日期、经营范围、法人代表、注册资金、企业类型等详细资料。This repository is an dataset of over 10,000,000 enterprise registration data of 31 provinces in Chinese mainland from 1978 to 2019.【工商大数据】、【企业信息】、【enterprise registration data】。

fingertipsr icon fingertipsr

R package to interact with Public Health England’s Fingertips data tool

formatters icon formatters

A framework for creating listings of raw data that include specialized formatting, headers, footers, referential footnotes, and pagination.

ghi icon ghi

GitHub Issues on the command line. Use your $EDITOR, not your browser.

handson-ml icon handson-ml

A series of Jupyter notebooks that walk you through the fundamentals of Machine Learning and Deep Learning in python using Scikit-Learn and TensorFlow.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.