Carnegie Mellon University

Past Lane Fellows


Nam Nguyen

Advisor: Ziv Bar-Joseph

Nam D. Nguyen received his Ph.D. in Computer Science at Stony Brook University, under the guidance of Professor Daifeng Wang. His doctoral research includes multiview learning for integrating and understanding functional multiomics. He is especially interested in machine learning on graphs and manifolds, and their applications for understanding molecular mechanisms and improving genotype-phenotype predictions in complex biological systems. Prior to that, he received B.Eng. degree in Computer Science at Hanoi University of Science and Technology. Overall, his research interests include computational biology, machine learning and explainable AI.


Jose Lugo-Martinez

Assistant Professor, Ray and Stephanie Lane Computational Biology Department, School of Computer Science, Carnegie Mellon University

Advisors: Robert F. Murphy and Ziv Bar-Joseph

Jose received his Ph.D. in Computer Science with a minor in Bioinformatics from Indiana University (IU) under the supervision of Predrag Radivojac. His doctoral research focused on the development of robust kernel methods for learning and mining on noisy and complex graph and hypergraph data. Prior to that, he received dual B.S. degrees in Computer Science and Mathematics at the University of Puerto Rico-Rio Piedras and M.S. degree in Computer Science at the University of California-San Diego. Additionally, he worked as a postdoctoral fellow in the Precision Health Initiative at IU where he developed computational approaches towards understanding protein function and how disruption of protein function leads to disease. Overall, Jose’s research interests include computational biology, machine learning and data mining.

Arash Gholami-Davoodi wearing sunglasses with trees in background

Arash Gholami-Davoodi

Advisor: Hosein Mohimani

Arash received his Ph.D. in Electrical Engineering from University of California Irvine under the supervision of Syed Ali Jafar. His doctoral research includes multiuser information theory and network coding. He received his B.Sc. and M.Sc. degrees in Electrical Engineering from Sharif University of Technology. Arash received a best paper award at IEEEE GLOBECOM 2014. His research is focused on analyzing metagenomics data using tools from information theory and machine learning and statistics.

Hamim Zafar

Assistant Professor, Computer Science & Engineering Department and Biosciences & Bioengineering Department, Indian Institute of Technology, Kanpur, India

Advisor: Ziv Bar-Joseph

Hamim received his Ph.D. in Computer Science from Rice University in 2018 under the supervision of Luay Nakhleh in the field of Computational Biology and Bioinformatics. His dissertation work focused on the development of novel statistical models for elucidating tumor heterogeneity and evolution from single-cell genomic data. As a part of his Ph.D. program, he also received M.S. in Computer Science from Rice University in 2015 and during this time he worked on developing a novel variant caller for single-cell DNA sequencing data. Prior to joining Rice, Hamim completed his undergraduate studies from Jadavpur University, Kolkata, India in 2012 with a B.E. in Electronics and Telecommunication Engineering. During his undergraduate studies, he conducted research on the development and application of stochastic optimization algorithms. Hamim’s research interests lie at the intersection of novel computational innovations and applications in biology and draw upon ideas from probabilistic graphical models, statistical inference, genomics-transcriptomics, evolution, cancer, and single-cell biology.


Dan DeBlasio

Assistant Teaching Professor, Ray and Stephanie Lane Computational Biology Department, School of Computer Science, Carnegie Mellon University

Advisor: Carl Kingsford

Dan received his PhD in Computer Science from the University of Arizona in 2016 with a minor in Ecology and Evolutionary Biology. His dissertation work focused on algorithms for multiple sequence alignment of proteins, in particular how to find program parameter choices that can produce better alignments for specific inputs using existing alignment tools. Before moving to Arizona Dan received his B.S. and M.S. in Computer Science from the University of Central Florida in 2007 and 2009 respectively. His thesis work also involved multiple sequence alignment, but concentrated on lowering memory consumption for structure-aided sequence alignment of RNA. Dan also worked as a research scientist at the Sanford-Burnham Medical Research Institute at Lake Nona where he studied the interactions of microRNA in melanoma. Dan’s long-term research interests are in algorithm design and analysis with inspiration for new problems coming from biology.

Heewook Lee

Assistant Professor, Computer Science & Biodesign Institute, Arizona State University

Advisor: Carl Kingsford

Ph.D.: Indiana University, Thesis Advisor: Haixu Tang

Heewook received his B.S. in Computer Science from Columbia University in 2004. He then went on to work as a software engineer, developing firmwares for cable modems and residential IP gateways with VoIP capability. Because he always had a great passion for life sciences and medicine, he decided to explore the field of computational biology/bioinformatics by taking a new position at a sequencing center/genomics company in 2005. During his time there, he was involved in various microbial genome projects and Korean Human Genome project. After working for several years, he decided to pursue graduate studies at Indiana University Bloomington, where he earned his M.S. and Ph.D. in Computer Science. During his doctoral training, he worked on the problem of detecting genomic rearrangement involving repeats to calculate the spontaneous mutation rates by applying his method to bacterial mutation accumulation experiments. His research interest lies broadly in computational biology with a focus on genetic variation.

Weihua Pan photo

Weihua Pan

Principal Investigator, Shenzhen Agricultural Genome Research Institute, Chinese Academy of Agricultural Sciences

Advisor: Jian Ma

Ph.D.: University of California, Riverside

Weihua received his Ph.D. in Computer Science with a master degree in statistics from University of California, Riverside (UCR) in 2019. At UCR, he developed three algorithms OMGS, Novo&Stitch and Chimericognizer to accurately and efficiently improve contiguity and correctness of de novo genome assembly with BioNano optical maps. Additionally, he studied SV detection with optical maps and nucleosome movement. Prior to that, he received his M.E. and B.E. degrees in Computer Science from University of Science and Technology of China and Nanjing Normal University. In China, he created algorithms and tools for haplotype phasing and metagenome reads binning. Overall, Weihua’s research interests include computational biology, bioinformatics, machine learning and combinatorial optimization.

Mingfu Shao

Assistant Professor, Department of Computer Science and Engineering, Pennsylvania State University

Advisor: Carl Kingsford

Ph.D: Swiss Federal Institute of Technology Lausanne, Thesis Advisor: Bernard M.E. Moret

Mingfu Shao received his B.S. in computer science from Beijing Institute of Technology in 2008, and his M.S. in computer science from Institute of Computing Technology, Chinese Academy of Sciences in 2011. He obtained his PhD degree in computer science from Swiss Federal Institute of Technology Lausanne (EPFL), Switzerland, in 2015.

Mingfu Shao’s research interests lie in algorithm design and various areas in computational biology. During his master study, he worked on protein folding. He designed algorithms for protein structure prediction using both combinatorial optimization and machine learning techniques. During his PhD study, he mainly focused on comparative genomics. He designed algorithms for various edit distance problems and median problems in whole-genome comparison, and also applying these algorithms in genome annotation.


Armaghan "Rumi" Naik

CEO, Sentiome, LLC

Lane Fellow Advisors: Christopher Langmead and Robert F. Murphy

Ph.D.: Carnegie Mellon University, Thesis Advisor: Robert F. Murphy

After graduating with a BS in Computational Biology, Rumi worked at Intel Corporation in pre-silicon validation teams in both Portland, OR and Hudson, MA where he specialized in writing machine checked formal proofs of correctness of control and datapath circuits from Pentium 4s (130nm and later) and several server processors. After returning to academia, Rumi has focused on problems in the automation of biological research, spanning from machine learning method development in active learning, to laboratory automation of mammalian cell culture and microscopy. His research interests focus primarily on problems in cell biology: understanding how the cell organizes and maintains cytoplasmic structures and in developing machine learning techniques for describing these.

A. Ercument Cicek

Assistant Professor, Bilkent University

Lane Fellow Advisors: Ziv Bar-Joseph and Kathryn Roeder

Ph.D.:  Case Western Reserve University, Thesis Advisor: Gultekin Ozsoyoglu

Ercument received his B.S. and M.S. degrees in Computer Science and Engineering at Sabanci University. He earned his Ph.D. in Computing and Information Sciences from Case Western Reserve University in 2013. During his Ph.D. training, he worked on algorithms to detect signatures in omics data for complex diseases using information theory and  biochemical networks. He also worked on design and development of online systems and online biological databases such as PathCase. After working mainly on the metabolome and the transcriptome, currently, he is interested in developing methods to analyze the genome to understand the mechanisms of genomic perturbations leading to disease.


Junming Yin

Assistant Professor, Department of Managment Information Systems, Eller College of Management, University of Arizona

Lane Fellow Advisor: Eric P. Xing

Ph.D.: University of California, Berkeley, Thesis Advisor: Michael Jordan

Junming obtained his Bachelor's degree in computer science from Fudan University, China. He then received his M.Sc. (honor's degree) from International Max Planck Research School for Computer Science (IMPRS-CS) at Saarbruecken, Germany, where he grew interests in computational biology. In 2005, he started his Ph.D. studies at UC Berkeley and his doctoral research mainly focused on developing new statistical methods and models for analyzing population genetics data. Junming is interested in problems that bring together statistical, computational and biological themes. In particular, he is interested in the challenges of analysis and interpretation posed by large-scale and high-dimensional genomic data sets.

Xin He

Associate Professor, Department of Human Genetics, University of Chicago

Lane Fellow Advisors:  Ziv Bar-Joseph and Kathryn Roeder

Ph.D: University of Illinois at Urbana-Champaign, Thesis Advisor: Saurabh Sinha

Xin started his career as an experimental biologist: he obtained his B.S. in Biochemistry at University of Science and Technology at China, followed by two years’ graduate training at Northwestern University. Xin moved to the area of computational biology afterwards, and earned his Ph.D. in Computer Science from University of Illinois at Urbana-Champaign in 2009, under the supervision of Dr. Saurabh Sinha. In his thesis work, Xin developed quantitative models of how regulatory sequences drive spatial gene expression patterns, and how these sequences evolve during evolution. His first postdoc is with Dr. Hao Li's at UC San Francisco. During this period, Xin became interested in the genetics of human diseases, and invented a novel method to use gene expression QTL to extract insights from genetic association data. As a Lane Fellow, Xin worked with Dr. Kathryn Roeder on methods for analyzing exome sequencing data from family and case-control studies, with particular emphasis on mapping the genetics of autism. At CMU, Xin also worked with Dr. Ziv Bar-Joseph on several topics, from constructing transcriptional regulatory networks to understanding network design principles.

As an independent investigator, Xin’s research focuses on developing and employing computational/statistical tools to identify genes and regulatory elements involved in complex diseases and to understand the mechanisms of their functions. He takes highly integrative approaches combining whole exome/genome sequencing data, expression and epigenomic data, and gene networks.


Marcel Schulz

Independent Research Group Leader, Saarland University, Saarbrücken, Germany

Lane Fellow Advisor: Ziv Bar-Joseph

Ph.D.: Freie Universitat von Berlin, Thesis Advisors: Martin Vingron, Knut Reinert, and Hugues Richard

Marcel gained his B.S. in Bioinformatics from Freie Universität Berlin, Germany. Subsequently, he got accepted to do his PhD studies at the International Max Planck Research School for Computational Biology and Scientific Computing at the Max Planck Institute for Molecular Genetics in Berlin. During his PhD he worked on algorithms and methods for Next-Generation Sequencing technologies including the detection of alternative splicing, de novo transcriptome assembly, and basepair-precision detection of structural variations. He was also involved in the design of computational methods for clinical diagnosis of patients and sequence learning with Markov models. Marcel is interested in problems where the interplay between wet lab biology, computer science, and statistical learning is crucial for success. In particular, his goal is to improve methods for analysing the transcriptome to understand gene regulatory networks of human systems.

In his time as a Lane Fellow, Marcel Schulz worked extensively with Ziv Bar-Joseph and engaged in a diverse set of projects. He worked on a new formulation of Input-Out-Hidden Markov models to analyze the dynamics of transcriptional and posttranscriptional regulation in mouse lung development to understand similarities between dysfunctions in patients with the lung disease idiopathic pulmonary fibrosis (IPF) together with people in Naftali Kaminski's Lab (then at UPMC). The first method was devised to predict time specific regulatory events of both types. Importantly, new regulators of lung development were discovered that are deregulated in patients with IPF. The new method was published in PNAS and appeared on the PNAS cover.

Further, he developed the first method for RNA-seq error correction, named SEECER, which was shown to significantly improve sequencing data quality. In a joint project with Veronica Hinman, SEECER was used to produce the first transcriptome of the developing sea cucumber. Together they developed a new approach to analyze binding patterns of transcription factors from ChIP-seq data without known reference sequence, which will enable researcher to explore transcriptional regulation in novel model organisms.

He also cooperated with Kausik Chakrabarti and Michael Widom from CMU and discovered small RNAs that are produced in the malaria causing parasite Plasmodium falciparum and potentially involved in host-pathogen interactions.

• Otto Hahn Medal 2010 by the Max Planck Society (Phd thesis award, 26 medals representing the top 3% of all Phd students from 2010 in ~80 Max Planck institutes )
• Best paper selection Decision Support, IMIA Yearbook of Medical Informatics 2010


Cheemeng Tan

Associate Professor, Department of Biomedical Engineering, University of California, Davis

Lane Fellow Advisor: Philip R. LeDuc

Ph.D.: Duke University, Thesis Advisor: Lingchong You

Cheemeng received his B.Eng. degree (first class honors) from National University of Singapore and his M.S. degree in High Performance Computing from Singapore-MIT Alliance. In 2005, he started his doctoral research in the Department of Biomedical Engineering at Duke University, where he evolved into a hybrid computational and microbial biologist. His Ph.D. thesis focused on implications of bacterial growth on antibiotic treatment and synthetic gene circuits. He published his research in journals such as Nature Chemical Biology and Biophysical Journal and was awarded the Medtronic Fellowship. His career goal is to improve the rational engineering of synthetic biological systems by tightly integrating both experiments and computational algorithms. At Carnegie Mellon University, he works on the engineering of artificial cells that carry synthetic gene circuits, which have potential impact on drug delivery and bioremediation.

Dr. Tan's research aims to engineer artificial cells using an integrated synthetic biology and computational approach. These artificial cells consist of phospholipid bilayers encapsulating cell-free expression systems that are molecularly crowded. Molecular crowding is a natural state of cells that is highly packed with macromolecules. In his research, Dr. Tan discovered that molecular crowding can uniquely affect dynamics and maintain robustness of gene expression. In this work that was published in Nature Nanotechnology, he applied mathematical modeling, single molecule and single cell imaging, and experimental biology as a multi-scale synthetic biology approach. His findings have broad implications on the engineering of artificial cellular systems for drug delivery, bioenergy, and biosensors. His career goal is to establish a synthetic biology foundation towards the construction of autonomous detect-and-respond artificial cells by harnessing functioning principles of nature cells.

• 2012-2017 Society-in-Science: Branco Weiss Fellow
• 2011 q-bio5 Travel Award


Xin Gao

Assistant Professor, Computer Science, King Abdullah University of Science and Technology, Thuwal, Saudi Arabia

Lane Fellow Advisor: Christopher Langmead

Ph.D.: University of Waterloo, Thesis Advisor: Ming Li

Xin Gao received his Bachelor of Science degree from the Computer Science and Technology Department at Tsinghua University, China in 2004. He then applied to David R. Cheriton School of Computer Science at the University of Waterloo where he began working on his doctoral thesis in the area of bioinformatics and algorithms. His doctoral work mainly focuses on fully automated NMR protein structure determination and protein structure prediction. Xin’s research interests include computational methods and machine learning techniques in structural biology, sequence analysis, and system biology. He is particularly interested in developing highly-efficient algorithms and high-quality systems that really work on noisy and large-scale biological data sets.

During his time as a Lane Fellow, Dr. Gao worked on developing computational methods to study protein structures and functions. He developed efficient and effective algorithms to automate nuclear magnetic resonance (NMR) data-based protein three-dimensional structure determination. He also worked on studying the dynamics at the protein native state through molecular dynamics simulation and associating it with experimental data.


Hiroyuki Kuwahara

Research Scientist, King Abdullah University of Science and Technology, Thuwal, Saudi Arabia

Lane Fellow Advisors: Veronica Hinman and Russell Schwartz

Ph.D.: University of Utah, Thesis Advisor: Chris J. Meyers

Hiroyuki Kuwahara obtained his B.S. and Ph.D. in computer science from the University of Utah. His Ph.D. thesis describes systematic and automatic model abstraction methodology to efficiently estimate temporal behaviors of genetic regulatory networks. To further pursue his research in the multidisciplinary field of computational biology, He worked for Microsoft Research – University of Trento Centre for Computational and Systems Biology. Among Hiro’s research interests are stochastic modeling, analysis, and control of biochemical systems. In particular, he is currently interested in analysis of rare deviant behaviors in the presence of stochastic fluctuations and analysis of reliable behaviors with unreliable elements.

Arvind Rao

Associate Professor, Department of Computational Medicine and Bioinformatics, MIDAS, Radiation Oncology, University of Michigan

Lane Fellow Advisor: Robert F. Murphy

Ph.D.: University of Michigan, Thesis Advisor: Alfred Hero, David States, and James Engel

Arvind received his Bachelor of Engineering degree (with distinction) in Electronics and Communications from Bangalore University, India in 2001. In 2003, he received the Master of Science in Engineering degree from the Electrical and Computer Engineeringg department at the University of Texas at Austin, with a specialization in Communications, Networks and Systems. He earned a A.M. in Statistics from the University of Michigan in 2007 and was a Rackham Predoctoral Fellow. For his doctoral work at the University of Michigan, he worked on understanding long -range transcriptional regulation in higher eukaryotes. His research interests lie at the intersection of signal processing, machine learning, experimental and computational systems biology.


Le Song

Professor and Deputy Department Chair, Mohamed bin Zayed University of Artificial Intelligence, UAE

Lane Fellow Advisor: Eric P.Xing

Ph.D.: University of Sydney, Thesis Advisor: Alex Smola

Le Song completed his B.S. degree majoring in computer science from the South China University of Technology. He then went to Australia and did his Master's and PhD studies both at the Univeristy of Sydney. He was also an endorsed student from the National ICT Australia. His main research interests are statistical machine learning, kernel methods, information visualization and their applications to biological and social problems. He has worked on various projects such as visual analysis of complex networks, identification of discriminative neuromarkers from EEG data, and selection of informative genes from microarray data. His goal is to bring modern machine learning tools into biology and generate real impact in the biology community.

Peter Huggins

Department of Statistics, University of Kentucky

Lane Fellow Advisor: Ziv Bar-Joseph

Ph.D.: University of California, Berkeley, Thesis Advisors: Lior Pachter and Bernd Sturmfels

A mathematician turned computational biologist, Peter earned his PhD in math at UC Berkeley with a designated emphasis in computational/genomic biology.  While at Berkeley, Peter applied polyhedral geometry to analyze Needleman--Wunsch sequence alignment, fitness landscapes, and the performance of neighbor-joining.  In particular he helped construct the first genome-wide parametric alignment.  Peter's current research interests include sequence analysis, proteomics and phylogenetics, with a focus on probabilistic models and machine learning techniques. He's particularly interested in applications to HIV sequence analysis and disease association studies.  In his spare time, Peter enjoys poker, backgammon, and fishing.