Big data to the bench: Transcriptome analysis for undergraduates
Procko C.; Morrison S.; Dunar C.; Mills S.; Maldonado B.; Cockrum C.; Peters N.E.; Huang S.-S.C.; Chory J.
2019
CBE Life Sciences Education
1
10.1187/cbe.18-08-0161
Next-generation sequencing (NGS)-based methods are revolutionizing biology. Their prevalence requires biologists to be increasingly knowledgeable about computational methods to manage the enormous scale of data. As such, early introduction to NGS analysis and conceptual connection to wet-lab experiments is crucial for training young scientists. However, significant challenges impede the introduction of these methods into the undergraduate classroom, including the need for specialized computer programs and knowledge of computer coding. Here, we describe a semester-long, course-based undergraduate research experience at a liberal arts college combining RNA-sequencing (RNA-seq) analysis with student-driven, wet-lab experiments to investigate plant responses to light. Students derived hypotheses based on analysis of RNA-seq data and designed follow-up studies of gene expression and plant growth. Our assessments indicate that students acquired knowledge of big data analysis and computer coding; however, earlier exposure to computational methods may be beneficial. Our course requires minimal prior knowledge of plant biology, is easy to replicate, and can be modified to a shorter, directed-inquiry module. This framework promotes exploration of the links between gene expression and phenotype using examples that are clear and tractable and improves computational skills and bioinformatics self-efficacy to prepare students for the “big data” era of modern biology. © 2019 C. Procko et al. CBE-Life Sciences Education.
American Society for Cell Biology
Article
All Open Access; Gold Open Access; Green Open Access
Scopus