Big Biological Data: Challenges and Opportunities
Yixue Li,Luonan Chen
In ‘‘Omics’’ era of the life sciences, data is presented in many
forms, which represent the information at various levels of biological
systems, including data about genome, transcriptome,
epigenome, proteome, metabolome, molecular imaging, molecular
pathways, different population of people and clinical/medical
records. The biological data is big, and its scale has already
been well beyond petabyte (PB) even exabyte (EB). Nobody
doubts that the biological data will create huge amount of values,
if scientists can overcome many challenges, e.g., how to
handle the complexity of information, how to integrate the data
from very heterogeneous resources, what kind of principles or
standards to be adopted when facing with the big data. Tools
and techniques for analyzing big biological data enable us to
translate massive amount of information into a better understanding
of the basic biomedical mechanisms, which can be further
applied to translational or personalized medicine.
Application of Next-generation Sequencing
Technology in Forensic Science
Yaran Yang,Bingbing Xie,Jiangwei Yan
Next-generation sequencing (NGS) technology, with its high-throughput capacity and low cost, has developed rapidly in recent years and become an important analytical tool for many genomics researchers. New opportunities in the research domain of the forensic studies emerge by harnessing the power of NGS technology, which can be applied to simultaneously analyzing multiple loci of forensic interest in different genetic contexts, such as autosomes, mitochondrial and sex chromosomes. Furthermore, NGS technology can also have potential applications in many other aspects of research. These include DNA database construction, ancestry and phenotypic inference, monozygotic twin studies, body fluid and species identification, and forensic animal, plant and microbiological analyses. Here we review the application of NGS technology in the field of forensic science with the aim of providing a reference for future forensics studies and practice.
Pharmacogenomics of Cisplatin Sensitivity in Non-small Cell Lung Cancer
Maimon C. Rose,Elina Kostyanovskaya,R. Stephanie Huang
Cisplatin, a platinum-based chemotherapeutic drug, has been used for over 30 years in a wide variety of cancers with varying degrees of success. In particular, cisplatin has been used to treat late stage non-small cell lung cancer (NSCLC) as the standard of care. However, therapeutic outcomes vary from patient to patient. Considerable efforts have been invested to identify biomarkers that can be used to predict cisplatin sensitivity in NSCLC. Here we reviewed current evidence for cisplatin sensitivity biomarkers in NSCLC. We focused on several key pathways, including nucleotide excision repair, drug transport and metabolism. Both expression and germline DNA variation were evaluated in these key pathways. Current evidence suggests that cisplatin-based treatment could be improved by the use of these biomarkers.
Pathway-based Analysis Tools for Complex Diseases
Lv Jin,Xiao-Yu Zuo,Wei-Yang Su,Xiao-Lei Zhao,Man-Qiong Yuan,Li-Zhen Han,Xiang Zhao,Ye-Da Chen,Shao-Qi Rao
Genetic studies are traditionally based on single-gene analysis. The use of these analyses can pose tremendous challenges for elucidating complicated genetic interplays involved in complex human diseases. Modern pathway-based analysis provides a technique, which allows a comprehensive understanding of the molecular mechanisms underlying complex diseases. Extensive studies utilizing the methods and applications for pathway-based analysis have significantly advanced our capacity to explore large-scale omics data, which has rapidly accumulated in biomedical fields. This article is a comprehensive review of the pathway-based analysis methods—the powerful methods with the potential to uncover the biological depths of the complex diseases. The general concepts and procedures for the pathway-based analysis methods are introduced and then, a comprehensive review of the major approaches for this analysis is presented. In addition, a list of available pathway-based analysis software and databases is provided. Finally, future directions and challenges for the methodological development and applications of pathway-based analysis techniques are discussed. This review will provide a useful guide to dissect complex diseases.
Identification of A Novel SBF2 Frameshift Mutation in Charcot–Marie–Tooth Disease Type 4B2 Using Whole-exome Sequencing
Meiyan Chen,Jing Wu,Ning Liang,Lihui Tang,Yanhua Chen,Huishuang Chen,Wei Wei,Tianying Wei,Hui Huang,Xin Yi,Ming Qi
Charcot–Marie–Tooth disease type 4B2 with early-onset glaucoma (CMT4B2, OMIM 604563) is a genetically-heterogeneous childhood-onset neuromuscular disorder. Here, we report the case of a 15-year-old male adolescent with lower extremity weakness, gait abnormalities, foot deformities and early-onset glaucoma. Since clinical diagnosis alone was insufficient for providing pathogenetic evidence to indicate that the condition belonged to a consanguineous family, we applied whole-exome sequencing to samples from the patient, his parents and his younger brother, assuming that the patient’s condition is transmitted in an autosomal recessive pattern. A frame-shift mutation, c.4571delG (P.Gly1524Glufs∗42), was revealed in the CMT4B2-related gene SBF2 (also known as MTMR13, MIM 607697), and this mutation was found to be homozygous in the proband and heterozygous in his parents and younger brother. Together with the results of clinical diagnosis, this case was diagnosed as CMT4B2. Our finding further demonstrates the use of whole-exome sequencing in the diagnosis and treatment of rare diseases.
Computational Prediction of MicroRNAs from Toxoplasma gondii Potentially Regulating the Hosts’ Gene Expression
MÃ¼Åerref Duygu SaÃ§ar,Caner BaÄcÄ±,Jens Allmer
MicroRNAs (miRNAs) were discovered two decades ago, yet there is still a great need for further studies elucidating their genesis and targeting in different phyla. Since experimental discovery and validation of miRNAs is difficult, computational predictions are indispensable and today most computational approaches employ machine learning. Toxoplasma gondii, a parasite residing within the cells of its hosts like human, uses miRNAs for its post-transcriptional gene regulation. It may also regulate its hosts’ gene expression, which has been shown in brain cancer. Since previous studies have shown that overexpressed miRNAs within the host are causal for disease onset, we hypothesized that T. gondii could export miRNAs into its host cell. We computationally predicted all hairpins from the genome of T. gondii and used mouse and human models to filter possible candidates. These were then further compared to known miRNAs in human and rodents and their expression was examined for T. gondii grown in mouse and human hosts, respectively. We found that among the millions of potential hairpins in T. gondii, only a few thousand pass filtering using a human or mouse model and that even fewer of those are expressed. Since they are expressed and differentially expressed in rodents and human, we suggest that there is a chance that T. gondii may export miRNAs into its hosts for direct regulation.
A Comprehensive Transcriptomic Analysis
of Infant and Adult Mouse Ovary
Linlin Pan,Wei Gong,Yuanyuan Zhou,Xiaonuan Li,Jun Yu,Songnian Hu
Ovary development is a complex process involving numerous genes. A well-developed ovary is essential for females to keep fertility and reproduce offspring. In order to gain a better insight into the molecular mechanisms related to the process of mammalian ovary development, we performed a comparative transcriptomic analysis on ovaries isolated from infant and adult mice by using next-generation sequencing technology (SOLiD). We identified 15,454 and 16,646 transcriptionally active genes at the infant and adult stage, respectively. Among these genes, we also identified 7021 differentially expressed genes. Our analysis suggests that, in general, the adult ovary has a higher level of transcriptomic activity. However, it appears that genes related to primordial follicle development, such as those encoding Figla and Nobox, are more active in the infant ovary, whereas expression of genes vital for follicle development, such as Gdf9, Bmp4 and Bmp15, is upregulated in the adult. These data suggest a dynamic shift in gene expression during ovary development and it is apparent that these changes function to facilitate follicle maturation, when additional functional gene studies are considered. Furthermore, our investigation has also revealed several important functional pathways, such as apoptosis, MAPK and steroid biosynthesis, that appear to be much more active in the adult ovary compared to those of the infant. These findings will provide a solid foundation for future studies on ovary development in mice and other mammals and help to expand our understanding of the complex molecular and cellular events that occur during postnatal ovary development.
nuMap: A Web Platform for Accurate Prediction
of Nucleosome Positioning
Bader A. Alharbi,Thamir H. Alshammari,Nathan L. Felton,Victor B. Zhurkin,Feng Cui
Nucleosome positioning is critical for gene expression and of major biological interest. The high cost of experimentally mapping nucleosomal arrangement signifies the need for computational approaches to predict nucleosome positions at high resolution. Here, we present a web-based application to fulfill this need by implementing two models, YR and W/S schemes, for the translational and rotational positioning of nucleosomes, respectively. Our methods are based on sequence-dependent anisotropic bending that dictates how DNA is wrapped around a histone octamer. This application allows users to specify a number of options such as schemes and parameters for threading calculation and provides multiple layout formats. The nuMap is implemented in Java/Perl/MySQL and is freely available for public use at http://numap.rit.edu. The user manual, implementation notes, description of the methodology and examples are available at the site.