Integrating Data Science Ethics Into an Undergraduate Major: A Case Study
Baumer B.S.; Garcia R.L.; Kim A.Y.; Kinnaird K.M.; Ott M.Q.
2022
Journal of Statistics and Data Science Education
8
10.1080/26939169.2022.2038041
We present a programmatic approach to incorporating ethics into an undergraduate major in statistical and data sciences. We discuss departmental-level initiatives designed to meet the National Academy of Sciences recommendation for integrating ethics into the curriculum from top-to-bottom as our majors progress from our introductory courses to our senior capstone course, as well as from side-to-side through co-curricular programming. We also provide six examples of data science ethics modules used in five different courses at our liberal arts college, each focusing on a different ethical consideration. The modules are designed to be portable such that they can be flexibly incorporated into existing courses at different levels of instruction with minimal disruption to syllabi. We connect our efforts to a growing body of literature on the teaching of data science ethics, present assessments of our effectiveness, and conclude with next steps and final thoughts. © 2022 The Author(s). Published with license by Taylor & Francis Group, LLC.
Case studies; Data ethics; Education; Undergraduate curriculum
Aust F., Barth M., (2021); Baumer B.S., “A Data Science Course for Undergraduates: Thinking with Data, The American Statistician, 69, pp. 334-342, (2015); Baumer B.S., Kaplan D.T., Horton N.J., Modern Data Science with R, (2021); Bender E.M., Gebru T., McMillan-Major A., Shmitchell S., “On the Dangers of Stochastic Parrots: Can Language Models Be Too Big?, FAccT ’21: Proceedings of the 2021 ACM Conference on Fairness, Accountability, and Transparency, pp. 610-623, (2021); Benjamin R., Race After Technology: Abolitionist Tools for the New Jim Code, (2019); Blair J.R., Jones L., Leidig P., Murray S., Raj R.K., Romanowski C.J., “Establishing ABET A accreditation Criteria for Data Science, Proceedings of the 52nd ACM Technical Symposium on Computer Science Education, pp. 535-540, (2021); Bloom B.S., Engelhart M.D., Furst E.J., Hill W.H., Krathwohl D.R., Taxonomy of Educational Objectives, Handbook 1: Cognitive Domain), (1956); Bruce K.B., “Five Big Open Questions in Computing Education, ACM Inroads, 9, pp. 77-80, (2018); Burton E., Goldsmith J., Mattei N., How to Teach Computer Ethics Through Science Fiction, Communications of the ACM, 61, pp. 54-64, (2018); Cai F., (2020); Canney N., Bielefeldt A., “A Framework for the Development of Social Responsibility in Engineers,”, International Journal of Engineering Education, 31, pp. 414-424, (2015); Carter L., Crockett C., An Ethics Curriculum for CS with Flexibility and Continuity, 2019 IEEE Frontiers in Education Conference (FIE), pp. 1-9, (2019); Chivukula S.S., Li Z., Pivonka A.C., Chen J., Gray C.M., “Surveying the Landscape of Ethics-Focused Design Methods,”, arXiv preprint arXiv:2102.08909, (2021); (2018); (2018); On Being a Scientist: A Guide to Responsible Conduct in Research, (2009); (2019); Davies H., (2015); D'Ignazio C., Klein L.F., “Data Feminism, (2020); Donoho D., “50 Years of Data Science,”, Journal of Computational and Graphical Statistics, 26, pp. 745-766, (2017); Dwork C., McSherry F., Nissim K., Smith A., “Calibrating Noise to Sensitivity in Private Data Analysis,”, Theory of Cryptography, pp. 265-284, (2006); Edmondson A., “Psychological Safety and Learning Behavior in Work Teams, Administrative Science Quarterly, 44, pp. 350-383, (1999); Elisa Raffaghelli J., “Is Data Literacy a Catalyst of Social Justice? A Response from Nine Data Literacy Initiatives in Higher Education, Education Sciences, 10, (2020); Elliott A.C., Stokes S.L., Cao J., “Teaching Ethics in a Statistics Curriculum with a Cross-Cultural Emphasis, The American Statistician, 72, pp. 359-367, (2018); Eubanks V., Automating Inequality: How High-Tech Tools Profile, Police, and Punish the Poor, (2018); (2018); Fiesler C., Garrett N., Beard N., “What Do We Teach When We Teach Tech Ethics? A Syllabi Analysis, Proceedings of the 51st ACM Technical Symposium on Computer Science Education, pp. 289-295, (2020); Fitzpatrick J., (2010); Floridi L., Taddeo M., “What is Data Ethics?, Philosophical Transactions of the Royal Society A, 374, (2016); Fry H., Hello World: Being Human in the Age of Algorithms, (2018); Gebru T., “Race and Gender,”, The Oxford Handbook of Ethics of AI, pp. 251-269, (2020); Gershkoff A., Therriault A., Satyanarayan A., Jones B., Burg B., Hurt B., Granger B., Jacob B., Doig C., Fryar C., Ramanan D., Bhargava D., Perez F., Greenleigh I., Feng J., Loyens J., Morgan J., Ram K., Green L., Barba L., Colaco M., Rocklin M., Jamei M., Horn M., Harris N.E., Elprin N., Kaldero N., Chopra N., McGarry P., Todkar R., Jurney R., Brener S., Couture T., Thibodeaux T., McKinney W., (2019); Giroux M.E., Coburn P.I., Connolly D.A., Bernstein D.M., Perspective-Taking Abilities Across the Lifespan: A Review of Hindsight Bias and Theory of Mind, Individual Differences in Judgement and Decision-Making, pp. 157-175, (2016); Gotterbarn D., Wolf M.J., Flick C., Miller K., “Thinking Professionally: The Continual Evolution of Interest in Computing Ethics,”, ACM Inroads, 9, pp. 10-12, (2018); Grosz B.J., Grant D.G., Vredenburgh K., Behrends J., Hu L., Simmons A., Waldo J., “Embedded EthiCS: Integrating Ethics Across CS Education, Communications of the ACM, 62, pp. 54-61, (2019); Gunaratna N.S., Tractenberg R.E., “Ethical Reasoning with the 2016 Revised ASA Ethical Guidelines for Statistical Practice, in American Statistical Association, Proceedings of the Joint Statistical Meetings, (2016); Hand D.J., “Aspects of Data Ethics in a Changing World: Where are We Now?, Big Data, 6, pp. 176-190, (2018); Hardin J., Hoerl R., Horton N.J., Nolan D., Baumer B.S., Hall-Holt O., Murrell P., Peng R., Roback P., Temple Lang D., Ward M.D., “Data Science in Statistics Curricula: Preparing Students to ’Think with Data, The American Statistician, 69, pp. 343-353, (2015); Heggeseth B., “Intertwining Data Ethics in Intro Stats,”, Symposium on Data Science and Statistics, (2019); Hicks S.C., Irizarry R.A., “A Guide to Teaching Data Science, The American Statistician, 72, pp. 382-391, (2018); Hoffmann A.L., Cross K.A., (2021); Huff D., How to Lie with Statistics, (1954); James G., Witten D., Hastie T., Tibshirani R., An Introduction to Statistical Learning: With Applications in R, (2013); James G., Witten D., Hastie T., Tibshirani R., (2021); James G., Witten D., Hastie T., Tibshirani R., ISLR2: Introduction to Statistical Learning, (2022); Kantayya S., Buolamwini J., (2020); Kaplan D., “Teaching Stats for Data Science, The American Statistician, 72, pp. 89-96, (2018); Kerr N.L., “Harking: Hypothesizing After the Results are Known, Personality and Social Psychology Review, 2, pp. 196-217, (1998); Kim A.Y., Escobedo-Land A., “OKCupid Data for Introductory Statistics and Data Science Courses, Journal of Statistics Education, 23, (2015); Kim A.Y., Escobedo-Land A., “Correction to OkCupid Data for Introductory Statistics and Data Science Courses,”, Journal of Statistics and Data Science Education, 29, (2021); Kirkegaard E.O., Bjerrekaer J.D., “The OKCupid Dataset: A Very Large Public Dataset of Dating Site Users, Open Differential Psychology, 46, (2016); Kolaczyk E.D., Csardi G., Statistical Analysis of Network Data with R, 65, (2014); Kramer A.D.I., Guillory J.E., Hancock J.T., “Experimental Evidence of Massive-Scale Emotional Contagion Through Social Networks, Proceedings of the National Academy of Sciences, 111, pp. 8788-8790, (2014); Langkjaer-Bain R., “Trials of a Statistician, Significance, 14, pp. 14-19, (2017); Levin S., (2017); Loukides M., Mason H., Patil D., Ethics and Data Science, (2018); Loukides M., Mason H., Patil D., Of Oaths and Checklists, (2018); Meyer R., (2014); Milner Y., (2019); Muradova L., “Seeing the Other Side? Perspective-Taking and Reflective Political Judgements in Interpersonal Deliberation, Political Studies, 69, pp. 644-664, (2021); Data Science for Undergraduates: Opportunities and Options, (2018); (1978); Neff G., Tanweer A., Fiore-Gartland B., Osburn L., “Critique and Contribute: A Practice-Based Framework for Improving Critical Data Studies and Data Science, Big Data, 5, pp. 85-97, (2017); Noble S.U., Algorithms of Oppression: How Search Engines Reinforce Racism, (2018); O'Neil C., Weapons of Math Destruction: How Big Data Increases Inequality and Threatens Democracy, (2016); Pierson S., Wilkinson L., (2021); Poulsen K., (2014); R: A Language and Environment for Statistical Computing, (2021); Rosenberg M., Confessore N., Cadwalladr C., (2018); Saltz J., Skirpan M., Fiesler C., Gorelick M., Yeh T., Heckman R., Dewar N., Beard N., “Integrating Ethics Within Machine Learning Courses, ACM Transactions on Computing Education (TOCE), 19, pp. 1-26, (2019); Schlenker L., (2019); Shapiro B.R., Meng A., O'Donnell C., Lou C., Zhao E., Dankwa B., Hostetler A., Re-shape: A Method to Teach Data Ethics for Data Science Education, Proceedings of the 2020 CHI Conference on Human Factors in Computing Systems,”, pp. 1-13, (2020); Shook N.J., Fazio R.H., “Interracial Roommate Relationships: An Experimental Field Test of the Contact Hypothesis, Psychological Science, 19, pp. 717-723, (2008); Skirpan M., Beard N., Bhaduri S., Fiesler C., Yeh T., “Ethics Education in Context: A Case Study of Novel Ethics Activities for the CS Classroom,”, Proceedings of the 49th ACM Technical Symposium on Computer Science Education, pp. 940-945, (2018); Solow B., (2021); Sweeney L., “k-Anonymity: A Model for Protecting Privacy, International Journal of Uncertainty, Fuzziness and Knowledge-Based Systems, 10, pp. 557-570, (2002); Tarran B., “German Commission Calls for Risk-Based Regulation of Algorithmic Systems, Significance, 16, pp. 4-5, (2019); Tractenberg R.E., (2019); Tractenberg R.E., (2019); Utts J., “Enhancing Data Science Ethics Through Statistical Education and Practice, International Statistical Review, 89, pp. 1-17, (2021); Vakil S., “Ethics, Identity, and Political Vision: Toward a Justice-Centered Approach to Equity in Computer Science Education, Harvard Educational Review, 88, pp. 26-52, (2018); Wang M.Q., Yan A.F., Katz R.V., “Researcher Requests for Inappropriate Analysis and Reporting: A US Survey of Consulting Biostatisticians, Annals of Internal Medicine, 169, pp. 554-558, (2018); Washington A.L., Kuo R., “Whose Side are Ethics Codes on? Power, Responsibility and the Social Good,”, Proceedings of the 2020 Conference on Fairness, Accountability, and Transparency, pp. 230-240, (2020); Wasserstein R.L., Lazar N.A., Et al., “The ASA’s Statement on p-Values: Context, Process, and Purpose, The American Statistician, 70, pp. 129-133, (2016); Wasserstein R.L., Schirm A.L., Lazar N.A., “Moving to a World Beyond ‘p < 0.05’,”, The American Statistician, 73, pp. 1-19, (2019); Wender B., Kloefkorn T., (2017); Wickham H., Chang W., Henry L., Pedersen T.L., Takahashi K., Wilke C., Woo K., Yutani H., Dunnington D., (2021); Xiao T., Ma Y., A Letter to the Journal of Statistics and Data Science Education—A Call for Review of “OkCupid Data for Introductory Statistics and Data Science Courses, Journal of Statistics and Data Science Education, 29, pp. 214-215, (2021); Zimmer M., ‘But the Data is Already Public’: On the Ethics of Research in Facebook, Ethics and Information Technology, 12, pp. 313-325, (2010); Zuboff S., The Age of Surveillance Capitalism: The Fight for a Human Future at the New Frontier of Power, (2018)
Taylor and Francis Ltd.
Article
All Open Access; Gold Open Access; Green Open Access
Scopus