Expanding undergraduate exposure to computer science subfields: Resources and lessons from a hands-on computational biology workshop
Oesper L.; Vostinar A.
2020
SIGCSE 2020 - Proceedings of the 51st ACM Technical Symposium on Computer Science Education
2
10.1145/3328778.3366909
Computational biology is an exciting and ever-widening interdisciplinary field. Expanding the participation of undergraduate students in this field will help to inspire and train the next generation of scientists necessary to support this growing area. However, students at smaller institutions, such as those focused on undergraduate education, may not have access to courses related to or even faculty interested in computational biology. Providing more opportunities for such undergraduate students to be exposed to computational biology, or other subfields within computer science, will be important for ensuring these students are included in the pipeline of scientists contributing to these diverse fields. To this end, we hosted a computational biology workshop that brought together undergraduate students from three different liberal arts colleges. The goal of the workshop was to provide an introduction to how computer science can be used to help answer important problems in biology. A diverse set of six faculty members from different institutions each created and taught a hands-on module as an introduction to a different area of computational biology at the workshop. We describe how we went about organizing this undergraduate workshop, summarize the workshop materials that are freely available, and discuss the outcomes and lessons learned from the workshop. We further propose that the workshop structure used is adaptable to other subfields of computer science. Workshop materials are available at the workshop website: https://sites.google.com/carleton.edu/compbioworkshop2018/home. © 2020 Copyright held by the owner/author(s). Publication rights licensed to ACM.
Computational biology; Undergraduate education; Workshop
Conference paper
All Open Access; Bronze Open Access
Scopus