Barriers to integration of bioinformatics into undergraduate life sciences education: A national study of US life sciences faculty uncover significant barriers to integrating bioinformatics into undergraduate instruction
Authors:
Jason J. Williams aff001; Jennifer C. Drew aff002; Sebastian Galindo-Gonzalez aff003; Srebrenka Robic aff004; Elizabeth Dinsdale aff005; William R. Morgan aff006; Eric W. Triplett aff002; James M. Burnette, III aff007; Samuel S. Donovan aff008; Edison R. Fowlks aff009; Anya L. Goodman aff010; Nealy F. Grandgenett aff011; Carlos C. Goller aff012; Charles Hauser aff013; John R. Jungck aff014; Jeffrey D. Newman aff015; William R. Pearson aff016; Elizabeth F. Ryder aff017; Michael Sierk aff018; Todd M. Smith aff019; Rafael Tosado-Acevedo aff020; William Tapprich aff021; Tammy C. Tobin aff022; Arlín Toro-Martínez aff023; Lonnie R. Welch aff024; Melissa A. Wilson aff025; David Ebenbach aff026; Mindy McWilliams aff026; Anne G. Rosenwald aff027; Mark A. Pauley aff028
Authors place of work:
Cold Spring Harbor Laboratory, Cold Spring Harbor, NY, United States of America
aff001; Microbiology and Cell Science Department, University of Florida, Gainesville, FL, United States of America
aff002; Department of Agricultural Education and Communication, University of Florida, Gainesville, FL, United States of America
aff003; Department of Biology, Agnes Scott College, Decatur, GA, United States of America
aff004; Department of Biology, San Diego State University, San Diego, CA, United States of America
aff005; Department of Biology, College of Wooster, Wooster, OH, United States of America
aff006; University of California, Riverside, Riverside, CA, United States of America
aff007; Department of Biological Sciences, University of Pittsburgh, Pittsburgh, PA, United States of America
aff008; Department of Biological Sciences, Hampton University, Hampton, VA, United States of America
aff009; Department of Chemistry and Biochemistry, California Polytechnic State University, San Luis Obispo, CA, United States of America
aff010; Department of Teacher Education, University of Nebraska at Omaha, Omaha, NE, United States of America
aff011; Department of Biological Sciences, North Carolina State University, Raleigh, NC, United States of America
aff012; Department of Biological Sciences, Bioinformatics Program, St. Edward’s University, Austin, TX, United States of America
aff013; Departments of Biological Sciences and Mathematical Sciences, University of Delaware, Newark, DE, United States of America
aff014; Department of Biology, Lycoming College, Williamsport, PA, United States of America
aff015; Department of Biochemistry and Molecular Genetics, University of Virginia School of Medicine, Charlottesville, VA, United States of America
aff016; Biology and Biotechnology Department, Worcester Polytechnic Institute, Worcester, MA, United States of America
aff017; Bioinformatics Program, Saint Vincent College, Latrobe, PA, United States of America
aff018; Digital World Biology, PMB, Seattle, WA, United States of America
aff019; Department of Natural Sciences, Inter American University of Puerto Rico, Metropolitan Campus, San Juan, PR, United States of America
aff020; Department of Biology, University of Nebraska at Omaha, Omaha, NE, United States of America
aff021; Department of Biology, Susquehanna University, Selinsgrove, PA, United States of America
aff022; Department of Biology, Chemistry, and Environmental Sciences, Inter American University of Puerto Rico, San Germán Campus, San Germán, PR, United States of America
aff023; Department of Computer Science, Ohio University, Athens, OH, United States of America
aff024; School of Life Sciences, Arizona State University, Tempe, AZ, United States of America
aff025; Center for New Designs in Learning and Scholarship, Georgetown University, Washington, DC, United States of America
aff026; Department of Biology, Georgetown University, Washington, DC, United States of America
aff027; School of Interdisciplinary Informatics, University of Nebraska at Omaha, Omaha, NE, United States of America
aff028
Published in the journal:
PLoS ONE 14(11)
Category:
Research Article
doi:
https://doi.org/10.1371/journal.pone.0224288
Summary
Bioinformatics, a discipline that combines aspects of biology, statistics, mathematics, and computer science, is becoming increasingly important for biological research. However, bioinformatics instruction is not yet generally integrated into undergraduate life sciences curricula. To understand why we studied how bioinformatics is being included in biology education in the US by conducting a nationwide survey of faculty at two- and four-year institutions. The survey asked several open-ended questions that probed barriers to integration, the answers to which were analyzed using a mixed-methods approach. The barrier most frequently reported by the 1,260 respondents was lack of faculty expertise/training, but other deterrents—lack of student interest, overly-full curricula, and lack of student preparation—were also common. Interestingly, the barriers faculty face depended strongly on whether they are members of an underrepresented group and on the Carnegie Classification of their home institution. We were surprised to discover that the cohort of faculty who were awarded their terminal degree most recently reported the most preparation in bioinformatics but teach it at the lowest rate.
Keywords:
Biology and life sciences – Bioinformatics – Computer and information sciences – Surveys – Graduates – Undergraduates – Workshops – Colleges
Introduction
Bioinformatics, an interdisciplinary field that combines aspects of biology, statistics, mathematics, and computer science, is becoming increasingly important for research efforts in all areas of biology [1,2]. Biology students graduating with bioinformatics experience have more employment opportunities available to them [3] and are better prepared for graduate studies in life sciences fields. It has also been suggested that students graduating with degrees in molecular biology and biochemistry should have some familiarity with bioinformatics [4]. With the growing emphasis on “big data” in biology, there is more demand for researchers in the life sciences with training in bioinformatics. However, many life sciences students earn their degrees with little exposure to it [5–7].
The Network for Integrating Bioinformatics into Life Sciences Education (NIBLSE, “nibbles”; https://niblse.org), a National Science Foundation Research Coordination Network, is a group of US education and private sector professionals in biology, bioinformatics, and computer science dedicated to making bioinformatics an integral component of instruction in the life sciences nationwide. Our approach involves developing instructional strategies for undergraduates to gain experience in bioinformatics, working to address barriers to the implementation of those strategies, and designing assessment instruments to evaluate the impact on student preparation [8].
In the US, bioinformatics instruction has predominately been provided at the graduate level [9–11]. Although we are aware that undergraduate bioinformatics courses are becoming more common, there has been little effort to integrate this interdisciplinary field broadly into undergraduate biology curricula. To further this integration, a better understanding of the barriers preventing its inclusion is necessary. We thus surveyed life sciences faculty at two- and four-year institutions across the US. Part of the survey consisted of open-ended, free-response questions that probed barriers to the integration of bioinformatics. Individual answers to these questions were qualitatively analyzed for specific barriers that deductively arose from the overall set of responses. (Example responses are provided in S1 Responses) The number of answers that were judged to refer to these key concepts were counted, and the counts were analyzed with respect to other data collected in the survey (see Materials and Methods). Given the number of valid responses to the survey—1,231; 1% to 2% of all US biological sciences faculty [12]—our findings provide a national consensus view. Below we discuss the major barriers uncovered and then describe efforts we and others are taking to address them.
Results
NIBLSE was founded on the premise that bioinformatics is and will continue to be essential for undergraduate biology education. One of the first questions in the survey asked whether respondents shared this view. Approximately 95% of survey respondents (Fig 1) agreed with the statement “Bioinformatics should be integrated into undergraduate life sciences education.” At the same time, however, only a third, 32%, said that they currently teach courses with at least some bioinformatics content.
The survey included four open-ended, free-response questions that asked faculty about the barriers they face in including bioinformatics in their teaching (Table 1). As described in Materials and Methods, the responses to these questions were analyzed qualitatively for specific barriers (e.g., “Lack of expertise/training” and “Lack of time”) that arose deductively from the overall set of responses. The categories Question 1 generated are given in Table 2. The categories were then combined into super-categories. Responses generated eight super-categories: “Faculty Issues,” “Student Issues,” “Curriculum Issues,” “Facilities Issues,” “Resource Issues,” “Institutional Issues,” “State Issues,” and “Accreditation Issues.” The number of responses that mentioned a given category of barrier was then counted. Although not every respondent answered all the open-ended questions and some didn’t answer any, there were almost 2,000 responses to the four questions (Table 3). Here, we describe our findings with respect to the two sets of barriers, “Faculty Issues” and “Student Issues,” that came up the most frequently, then describe others that were also commonly reported.
As shown in Figs 2 and 3, items in the super-category faculty issues were the most commonly reported barriers faculty face. This was true whether the respondent data were stratified by sex, race, ethnicity, institutional Carnegie Classification (institution type), minority-serving institution status, size of the undergraduate population, or geographic region (Fig 1). Under faculty issues, “Lack of expertise/training” was by far the most common barrier at all institution types except for doctoral-granting institutions; at doctoral institutions, one of the student issues, “Lack of skills/knowledge” was the most frequently reported (Fig 4).
We hypothesized that faculty who had earned their terminal degree most recently would report the highest amount of formal training in bioinformatics. Nearly 50% of faculty who earned their highest degree in 2010–2016 reported some kind of formal training (undergraduate or graduate courses and/or certificates), compared to 35% of the 2000–2009 cohort and decreasing thereafter (Table 4) (n = 968). Despite this level of formal training, faculty who earned their degrees most recently were the least likely (P = 0.003) (n = 908) to report teaching dedicated bioinformatics courses or teaching courses with some bioinformatics content (Fig 5). This is the case even though faculty from the 2010–2016 cohort teach at all types of institutions at about the same percentages (Table 5).
When we looked closely at who is integrating bioinformatics into their teaching—either teaching a dedicated course or incorporation into other courses—those who described themselves as self-taught are the most likely group to integrate at just over 18%. Thirteen percent of those with workshop or bootcamp training reported integration, and only 11% of respondents with formal training integrate bioinformatics into their teaching. Only a single individual with no training reported any form of integration (n = 877).
With respect to sex, females and males (n = 842) reported integrating bioinformatics at similar rates (20% female, 23% male). Females are more likely to be teaching at associate’s institutions (12% female vs. 7% male) and less likely to be teaching at doctoral-granting institutions (15% female vs. 22% male) (n = 929). The number of females obtaining terminal degrees has increased—7% of respondents who reported earning their terminal degree in the 1980s were female compared to 20% who graduated in the 2000s—with the latest cohort (2010–2016) having nearly equal numbers of males (7%) and females (9%) (n = 929). Females did not report training as a barrier significantly more than males did (30% vs. 26%) (n = 1013) but reported lack of access to computer labs at double the percentage of males (Question 4, Table 1; Fig 6). Slightly fewer females than males reported being self-taught in bioinformatics (20% female vs. 25% male), but both sexes are nearly evenly split in the other forms for training (workshops—12% female, 10% male; formal training—11% female, 12% male) or no training (5% female, 4% male) (n = 1013).
To determine if the barriers faculty face depend on whether they are members of an underrepresented minority (URM) in science, technology, engineering, and mathematics (STEM), we compared the responses of URM to non-URM faculty. (For this study, we considered the following groups to be underrepresented in STEM: Blacks, Hispanics, American Indians and Alaska Natives, and Native Hawaiians and other Pacific Islanders [13–15].) Because the number of respondents identifying as URMs was small—less than 7% of the total, a result that mirrors the lack of diversity in US life sciences faculty reported elsewhere [16]—we combined these respondents into a single group for analysis. We found that URM faculty reported training as a barrier much more frequently than non-URMs—42% vs. 28% (n = 961), respectively. Comparing faculty at minority-serving institutions (MSIs) with those at non-MSIs, MSI faculty report faculty issues as a barrier at a slightly lower rate than faculty at non-MSIs.
Faculty described several ways in which time was a barrier, including lack of instructional time to teach more material, lack of time for additional training, and lack of time for course development or restructuring. These responses were captured in the category “Lack of time,” a subcategory of faculty issues (Fig 2 and Table 2).
The student issues super-category was the second most frequently mentioned set of barriers after faculty issues (Fig 2). Two particular issues were commonly reported: students’ lack of background skills and knowledge, mentioned most frequently by faculty at doctoral-granting institutions, and students’ lack of interest, mentioned most frequently by faculty at master’s institutions (Fig 4). When we delved more deeply into the individual responses, we found that faculty at different institution types had different concerns, likely reflecting different expectations of their students. For example, faculty at doctoral-granting institutions were most concerned about their students’ lack of statistics knowledge and programming skills, whereas those at associate’s colleges mentioned their students’ lack of basic mathematics skills most often. In addition, we found that faculty teaching a dedicated bioinformatics course reported that their students lack the appropriate background at a much higher rate than those not teaching a dedicated course (Fig 7).
Many respondents reported barriers we grouped under the super-category curriculum issues (Fig 2). The two most frequently mentioned issues were “Communication difficulties,” specifically differences in the way biologists and computer scientists approach problems and communicate, and “Too much content,” referring to the difficulties inherent in including additional material in existing courses. Many respondents also mentioned “Quickly changing technologies,” alluding to the difficulties in keeping up with this rapidly changing field both in terms of training and access to software. This barrier was especially problematic at baccalaureate colleges (Fig 4), where faculty often have higher teaching loads across a wider range of subjects and fewer resources than those at research institutions. Interestingly, this barrier seemed to be less of a problem at associate’s-granting colleges, possibly reflecting the prescribed curriculum found at many two-year schools. Finally, respondents also mentioned “Institutional support issues,” including fellow faculty who do not feel that bioinformatics has a place in life sciences curricula and lack of support from administrators for resources such as training for faculty or hiring faculty with the appropriate training.
A multiple correspondence analysis (MCA) of responses was stratified by the Carnegie Classification of the respondent’s home institution (Fig 8). As can be seen, faculty at associate’s-granting colleges are markedly different from those at the other three institution types in a number of ways. These faculty are the least likely to be including bioinformatics in their teaching and more likely to report little to no training in bioinformatics, even though bioinformatics skills would contribute to the workforce readiness of their students. In contrast, faculty at doctoral-granting institutions are more likely to have formal training in bioinformatics and to teach dedicated courses in this discipline. They are also the most likely to mention higher-level student issues, such as poor computer science and statistics preparation. Finally, faculty at baccalaureate colleges and master’s institutions are more likely to have obtained training via informal modes, such as workshops and boot camps. When a multiple correspondence analysis of responses is stratified by the extent of bioinformatics integration, the three groups are almost completely separated from one another indicating that they are distinctly different (Fig 9).
Discussion
To the best of our knowledge, this is the first study to examine barriers US life sciences faculty face in integrating bioinformatics into undergraduate biology education, and as noted above, it provides a national consensus view on this issue. In our analysis, surveyed faculty overwhelmingly agreed that bioinformatics should be integrated into biology instruction, but only about a third did so. Our work thus provides direct evidence to support the commonly held tenet that a significant majority of life science students earn their degrees without exposure to bioinformatics. Training was reported as the most significant barrier, a finding that held whether the respondent data were stratified by sex, race and ethnicity, Carnegie Classification, MSI- status, the size of the undergraduate population, or geographic region.
We identified several other important trends in our data. First, faculty also often mentioned time as a barrier, although it was clear from the comments in the survey that this meant different things to different people—time for training, time for instruction (i.e., because there was a great deal of content to cover, it was difficult to find time for instruction on bioinformatics), as well as time for restructuring the curriculum. We plan to explore these issues further in a future study.
Second, faculty with the most training, the youngest cohort, teach bioinformatics the least. Although faculty at associate’s-granting institutions are less likely to integrate bioinformatics in general, we cannot conclude from this that faculty placement is sufficient to explain why the 2010–2016 cohort is the least-likely group to report integrating bioinformatics into their teaching despite better training (Table 5). A potential explanation is that as new faculty they are unable to shape the overall curriculum and/or are not yet tasked with teaching courses that best match their skills. We predict this discrepancy will lessen as this cohort becomes more senior in status and as additional cohorts of PhD trainees become faculty. However, we also note that as long ago as 1998, there were calls for the development of graduate programs in bioinformatics and computational biology [17]. While many such programs at the graduate level have been developed since then [18,19], graduates from these programs appear to have made little impact on biology education at the undergraduate level thus far. It is possible academia is less attractive to individuals fully trained in bioinformatics, who perhaps find better opportunities elsewhere. Preparing faculty that are equally well-trained in the biology, mathematics, computer science, and statistics necessary to teach the breadth of bioinformatics is a long-standing dilemma, although initiatives such as QUBES (Quantitative Undergraduate Biology Education and Synthesis) are making efforts to address this gap [20,21]. However, our findings illustrate more broadly the difficulties inherent in teaching interdisciplinary topics like bioinformatics.
Third, many faculty indicated that students were underprepared to engage in bioinformatics instruction. While faculty at doctoral institutions most often mentioned lack of high-level training in computer science and statistics, faculty at other institutions, especially community colleges, instead cited lack of preparation in basic mathematics skills. Lack of preparedness for college-level mathematics is a longstanding issue for students aspiring to college. In a recent review of the topic, McCormick and Lucas [22] cite a number of studies that describe the scope of the problem. For example, a study from 2001 by Morgan and Michaelides [23] determined that approximately 50% of first-year students were engaged in a remedial mathematics course. These findings suggest that creative ways to include basic mathematics skills in the context of a bioinformatics course are necessary.
Fourth, consistent with percentages of such faculty at institutions around the country [16], our study gathered relatively few respondents (81) who identified as members of groups underrepresented in STEM. Although we are aware that members of individual groups likely have different needs, responses from underrepresented groups were binned together for analysis. Previous reports have noted that at many historically black colleges and universities, bioinformatics courses have not been widely implemented due to a number of factors similar to those outlined here for the wider range of faculty, including lack of faculty training and lack of resources [24]. These trends with regard to faculty at MSIs and URM faculty suggest that serious attention to equity in training opportunities is necessary.
We found a few other trends based on demographics in our data that we need more information to interpret. Faculty at master’s institutions were more likely to cite lack of student interest as a barrier (Fig 4). Faculty teaching dedicated courses in bioinformatics more frequently reported that students lack needed background skills and knowledge and are intimidated by the topic. On the other hand, faculty attempting to integrate bioinformatics reported a lack of access to software at higher rates (Fig 7). Some barriers are experienced at higher rates by females than males (Fig 6). We plan to investigate some of these trends in a second study, including the finding that faculty at MSIs experience barriers at a slightly lower rate than non-MSI faculty. In this instance, the difference may be explained by the lower number of faculty at MSIs who are integrating bioinformatics: only 15% of the faculty at MSIs are integrating bioinformatics into their teaching in some way compared to 27% of faculty at non-MSIs (n = 638), but we intend to explore this point further.
Other studies have also investigated faculty, student, and institutional barriers to the integration of bioinformatics into life sciences education. Barone, Williams, and Micklos [25], surveying 704 National Science Foundation investigators from the Directorate for Biological Sciences, also found that training was the top unmet need within the research community. Cummings and Temple [19] describe three general categories of challenges for broader incorporation of bioinformatics in education: 1) required infrastructure and logistics; 2) instructor knowledge of bioinformatics and continuing education; and 3) the breadth of bioinformatics and the diversity of students and educational objectives. Barriers we uncovered here with faculty in the United States are also felt by faculty in the United Kingdom [9], as well as in emerging areas more globally [26], specifically in some African countries [10] and in India [11].
What can be done to alleviate barriers? Although a few institutions, such as the University of Wisconsin-La Crosse [27], Kalamazoo College [28], Muhlenberg College [29], and Drake University [30], have reported successful integration of bioinformatics into their life sciences programs [31], the majority of institutions appear not to have done so. Clearly, given that we and others [19,32] have found that lack of faculty training is a major problem, providing faculty with opportunities for training is important, as is giving faculty time to take advantage of these opportunities.
At present, there are many opportunities for faculty training available in the United States and elsewhere. Some of the opportunities include workshops provided by groups such as BioQUEST (http://bioquest.org); Data Carpentry (http://datacarpentry.org) [33]; DNA Subway (http://dnasubway.cyverse.org); Genome Consortium for Active Teaching (GCAT)-Seek (http://gcat-seek.weebly.com) [34]; Genomics Education Partnership (http://gep.wustl.edu) [35,36]; Genome Solver (http://genomesolver.qubeshub.org) [37]; Integrated Microbial Genomes Annotation Collaboration Toolkit [38,39]; SEA-PHAGES (http://seaphages.org) [40]; Software Carpentry (http://software-carpentry.org); QUBES (http://qubeshub.org); the National Center for Biotechnology Information at the National Institutes of Health (http://ncbi.nlm.nih.gov); the European Bioinformatics Institute (http://www.ebi.ac.uk); the Global Organisation for Bioinformatics Learning, Education, and Training (GOBLET) [9]; and ELIXIR [26]. Such groups are important not only for conveying information and knowledge but for building community. In addition, many schools offer bioinformatics graduate courses and certificates, either in person or online. There are also numerous courses offered in bioinformatics and computer science through Coursera (https://coursera.org) and EdX (https://edx.org). However, finding these training opportunities is left to individual faculty. NIBLSE plans to serve as a clearinghouse for such opportunities. One of our key findings is that faculty who have participated in informal training like workshops or boot camps report the need for training more than faculty with no training or faculty with formal training. This result is similar to that reported by Feldon et al., who suggest that boot camps and short workshops are not very effective for PhD students in the life sciences [41]. It thus may be useful to conduct a follow-up survey to address the deficits expressed by faculty with informal training.
Cummings and Temple [19] recommend “using transformative computer-requiring learning activities, assisting faculty in collecting assessment data on mastery of student learning outcomes, as well as creating more faculty development opportunities that span diverse skill levels, with an emphasis placed on providing resource materials that are kept up-to-date as the field and tools change.” NIBLSE is developing a set of teaching tools in its Learning Resource Collection that will help contextualize bioinformatics in light of the fundamentals of biology (http://niblse.org). We also point to the increasing number of resources in the Bioinformatics course on the CourseSource website (https://coursesource.org). These two centers of collected resources will also address the concern exhibited by respondents about the difficulty of finding tested curricula to use in their classrooms. We also note that important fundamental concepts in biology, including evolution and the central dogma, could be taught in the context of bioinformatics, helping to alleviate the “too-full curriculum” barrier expressed by some respondents.
To conclude, our results indicate that life sciences faculty overwhelmingly agree that bioinformatics should be integrated into the undergraduate life sciences curriculum, but many barriers exist that prevent them from doing so, a lack of training being the most significant. In addition, our study reveals that the barriers faculty face depend on demographic and other factors. Needs are especially great for members of underrepresented groups in STEM and for faculty at associate’s-granting institutions. While many questions about the landscape of bioinformatics education remain, moving forward, NIBLSE seeks to address the challenges uncovered in the present analysis in order to achieve integration of bioinformatics into the life sciences curriculum. The goals articulated by NIBLSE resonate with the recommendations stated in A New Biology for the 21st Century to create a community of researchers dedicated to solving a broad range of scientific and societal issues with interdisciplinary approaches and training students to be able to converse across disciplinary boundaries [42].
Materials and methods
The survey of life sciences faculty was collaboratively developed by a subgroup of NIBLSE members, the Core Competencies Working Group (CCWG). Faculty from a range of educational institutions were represented in the CCWG, including faculty at baccalaureate-, master’s-, and doctoral-granting institutions with various levels of research activity. One of the members of the CCWG was from industry. All members of the working group have extensive experience teaching bioinformatics to undergraduate biology students. Development and deployment of the survey is discussed in more detail by Sayres et al. [12]; the survey in its entirety is provided there as a supplementary document. Approval for the study was obtained from the University of Nebraska at Omaha Institutional Review Board (IRB # 161-16-EX) before the survey was distributed.
The survey was administered in April 2016 using Qualtrics with assistance from the Center for New Designs in Learning and Scholarship at Georgetown University; 1,264 responses were collected. The branched survey design included five-point Likert and free-response questions. As described by Sayres et al. [12], the survey was e-mailed to the more than 11,000 addresses in a mailing list of US biology faculty purchased from MDR (http://schooldata.com) and to members of networks of faculty with interests in life sciences education. Given 75,000 to 100,000 biological sciences faculty in the United States [12] and the total number of responses (1% to 2%), we estimate that the mean margin of error for the survey questions described in this paper is ± 3% at the 95% confidence interval [43]. For the results described here, we analyzed barriers to teaching bioinformatics through four free-response questions (Table 1). The responses were subjected to qualitative analysis by two groups, one at Georgetown University (AGR, using the classic content analysis method outlined in Leech and Onweugbuzie [44]) and one at the University of Florida (JCD, SG, and EWT, using a modification of the coding and thematic analysis process described by Harding [45]). In both analyses, categories of barriers—e.g., “No expertise/training,” “Time,” “Not enough faculty”—were deductively identified and then combined into super-categories (e.g., “Faculty Issues,” “Student Issues,” and “Resource Issues”) as shown in Table 2 for Question 1. The number of responses that described a given barrier was then counted. Although similar results were obtained from the two analyses, the authors decided to use the data from the University of Florida quantification for detailed analyses because the way in which it was formatted made subsequent analyses easier.
Survey data were exported to CSV-formatted files for analysis in R. Data were cleaned to eliminate multiple column headers and to transform Qualtrics numerical coding of responses into decoded values. During this step, responses from outside the US were eliminated, leaving n = 1,231 valid responses. Unless otherwise indicated, we used this number in all calculations. Values smaller than 1,231 occur in two cases: 1) For the four free-response questions, values of n are always the largest number of respondents who could have answered that question (some questions were only asked in particular branches of the survey). Blank responses were conservatively assumed to be intentionally unanswered as it was not possible to tell if a question was simply skipped or if the individual experienced no barriers. 2) Where a statistic involved a multiple-choice question, null responses (i.e., blank, unsure, or “rather not say” responses) were removed from the analysis. In some cases (e.g., respondent race/ethnicity, level of bioinformatics training, and degree year), responses were binned to achieve sufficient numbers for analysis. For example, the responses from respondents who identified as being from a race/ethnic background underrepresented in STEM were analyzed together.
Analysis methods
The reported barriers were analyzed with respect to a number of demographic criteria—sex, race/ethnicity, highest degree earned, year of highest degree, level of bioinformatics training, extent of current bioinformatics teaching, institutional Carnegie Classification, MSI vs. non-MSI status, size of school by undergraduate enrollment, and geographic region—to determine differences within these demographics and association of demographics and barriers. For a given demographic, respondents who did not answer, or indicated they did not know or were unsure, were dropped from analysis of that demographic category.
The MCA packages in R were used to visualize the correspondence of several categorical demographic factors [46,47]. Similar to a principle component analysis, MCA allows associations between categorical variables (e.g., our demographic categories) to be visualized. In our analysis, individuals for which we had complete demographic data were used to display relationships in two-dimensional space.
Proportion tests within demographics
A proportion test was used to calculate the χ2 statistic for differences between sub-demographics (H0 assuming faculty within all the sub-demographics report barriers equally). The margin of error (as the interval estimate of population proportion) was calculated at the 95% confidence level and is represented on Figs 4, 6 and 7 as error bars. Expected effect sizes detectable were calculated assuming 80% power. Selected findings are described in Results. Additional findings as well as the full data set and R scripts used for analyses and plotting can be found on the NIBLSE GitHub repository available at https://github.com/niblse.
Supporting information
S1 Responses [pdf]
This file contains example responses to the survey questions that probed the barriers life sciences faculty face in integrating bioinformatics.
Zdroje
1. Greengard S. How computers are changing biology. Commun. ACM. 2014;57: 21–23. doi: 10.1145/2591230
2. Marx V. Biology: the big challenges of big data. Nature. 2013;498: 255–260. doi: 10.1038/498255a 23765498
3. Levine A. An explosion of bioinformatics careers. Science. 2014 Jun 13. doi: 10.1126/science.opms.r1400143
4. White HB, Benore MA, Sumter TF, Caldwell BD, Bell E. What skills should students of undergraduate biochemistry and molecular biology programs have upon graduation? Biochem. Mol. Biol. Educ. 2013;41: 297–301. doi: 10.1002/bmb.20729 24019246
5. Wingreen N, Botstein D. Back to the future: education for systems-level biologists. Nat. Rev. Mol. Cell Biol. 2006;7: 829–832. doi: 10.1038/nrm2023 16990789
6. Pevzner P, Shamir R. Computing has changed biology—biology education must catch up. Science. 2009;325: 541–542. doi: 10.1126/science.1173876 19644094
7. Stefan MI, Gutlerner JL, Born RT, Springer M. The quantitative methods boot camp: teaching quantitative thinking and computing skills to graduate students in the life sciences. PLoS Comput. Biol. 2015;11(4): e1004208. doi: 10.1371/journal.pcbi.1004208 25880064
8. Dinsdale E, Elgin SCR, Grandgenett N, Morgan W, Rosenwald A, Tapprich W, et al. NIBLSE: A Network for Integrating Bioinformatics into Life Sciences Education. CBE Life Sci. Educ. 2015;14: 1–4. doi: 10.1187/cbe.15-06-0123 26466989
9. Hack C, Kendall G. Bioinformatics: current practice and future challenges for life science education. Biochem. Mol. Biol. Educ. 2005;33: 82–85. doi: 10.1002/bmb.2005.494033022424 21638550
10. Karikari TK, Quansah E, Mohamed WMY. Developing expertise in bioinformatics for biomedical research in Africa. Appl. Transl. Genom. 2015;6: 31–34. doi: 10.1016/j.atg.2015.10.002 26767162
11. Kulkarni-Kale U, Sawant S, Chavan V. Bioinformatics education in India. Brief Bioinform. 2010;11: 616–625. doi: 10.1093/bib/bbq027 20705754
12. Wilson Sayres MA, Hauser C, Sierk M, Robic S, Rosenwald AG, Smith TM, et al. Bioinformatics core competencies for undergraduate life sciences education. PLoS ONE. 2018;13(6): e0196878. doi: 10.1371/journal.pone.0196878 29870542
13. National Science Board (NSB). Science and Engineering Indicators 2018. Publication NSB-2018-1, National Center for Science and Engineering Statistics. Available from https://www.nsf.gov/statistics/indicators/
14. Estrada E, Burnett M, Campbell AG, Campbell PB, Denetclaw WF, Gutiérrez CG, et al. Improving underrepresented minority student persistence in STEM. CBE Life Sci. Educ. 2016;15: es5, 1–10. doi: 10.1187/cbe.16-01-0038 27543633
15. Kerr JQ, Hess DJ, Smith CM, Hadfield MG. Recognizing and reducing barriers to science and math education and STEM careers for Native Hawaiians and Pacific Islanders. CBE Life Sci. Educ. 2018;17: mr1, 1–10. doi: 10.1187/cbe.18-06-0091 30496031
16. Snyder TD, Dillow SA. Digest of Education Statistics 2012, Table 291, p. 419. Publication NCES 2014–015, National Center for Education Statistics. Available from: https://nces.ed.gov/pubs2014/2014015.pdf
17. Altman RB. A curriculum for bioinformatics: the time is ripe. Bioinformatics 1998;14: 549–550. doi: 10.1093/bioinformatics/14.7.549 9841111
18. Zauhar RJ. University bioinformatics programs on the rise. Nat. Biotechnol. 2001;19: 285–286. doi: 10.1038/85758 11231569
19. Cummings MP, Temple GG. Broader incorporation of bioinformatics in education: opportunities and challenges. Brief Bioinform. 2010;11: 537–543. doi: 10.1093/bib/bbq058 20798182
20. Jungck JR, Donovan SS, Weisstein AE, Khiripet N, Everse SJ. Bioinformatics education dissemination with an evolutionary problem solving perspective. Brief Bioinform. 2010;11: 570–581. doi: 10.1093/bib/bbq028 21036947
21. Jungck JR, Weisstein AE. Mathematics and evolutionary biology make bioinformatics education comprehensible. Brief Bioinform. 2013;14: 599–609. doi: 10.1093/bib/bbt046 23821621
22. McCormick N, Lucas M. Exploring mathematics college readiness in the United States. Curr. Issues Educ. 2011;14(1). Available from: http://cie.asu.edu/ojs/index.php/cieatasu/article/view/680
23. Morgan DL, Michaelides MP. Setting cut scores for college placement. New York: College Board. Research Report No. 2005–9; 2005. Available from: https://eric.ed.gov/?id=ED562865
24. Holtzclaw JD, Eisen A, Whitney EM, Penumetcha M, Hoey JJ, Kimbro KS. Incorporating a new bioinformatics component into genetics at a historically black college: outcomes and lessons. CBE Life Sci. Educ. 2006;5: 52–64. doi: 10.1187/cbe.05-04-0071 17012191
25. Barone L, Williams J, Micklos D. Unmet needs for analyzing biological big data: a survey of 704 NSF principal investigators. PLoS Comput. Biol. 2017;13(11): e1005755. doi: 10.1371/journal.pcbi.1005755 29049281
26. Crosswell LC, Thornton JM. ELIXIR: a distributed infrastructure for European biological data. Trends in Biotech. 2012;30: 241–241. doi: 10.1016/j.tibtech.2012.02.002 22417641
27. Howard DR, Miskowski JA, Grunwald SK, Abler ML. Assessment of a bioinformatics across life science curricula initiative. Biochem. Mol. Biol. Educ. 2005;35: 16–23. doi: 10.1002/bmb.13 21591051
28. Furge LL, Stevens-Truss R, Moore DB, Langeland JA. Vertical and horizontal integration of bioinformatics education. Biochem. Mol. Biol. Educ. 2009;37: 26–36. doi: 10.1002/bmb.20249 21567685
29. Wightman B, Hark AT. Integration of bioinformatics into an undergraduate biology curriculum and the impact on development of mathematical skills. Biochem. Mol. Biol. Educ. 2012;40: 310–319. doi: 10.1002/bmb.20637 22987552
30. Honts JE. Evolving strategies for the incorporation of bioinfomatics within the undergraduate cell biology curriculum. Cell Biol. Educ. 2003;2: 233–247. doi: 10.1187/cbe.03-06-0026 14673489
31. Magana AJ, Taleyarkhan M, Rivera Alvarado D, Kane M, Springer J, Clase K, et al. A survey of scholarly literature describing the field of bioinformatics education and bioinformatics educational research. CBE-Life Sci. Educ. 2014;13: 607–623. doi: 10.1187/cbe.13-10-0193 25452484
32. Ranganathan S. Bioinformatics education—perspectives and challenges. PLoS Comput. Biol. 2005;1(6): e52. doi: 10.1371/journal.pcbi.0010052 16322761
33. Teal T, Cranston K, Lapp H, White E, Wilson G, Ram K, et al. Data Carpentry: Workshops to increase data literacy for researchers. IJDC. 2015;10: 135–153. Available from: http://dx.doi.org/10.2218/ijdc.v10i1.351
34. Buonaccorsi V, Peterson M, Lamendella G, Newman J, Trun N, Tobin T, et al. Vision and change through the Genome Consortium for Active Teaching using Next-Generation Sequencing (GCAT-SEEK). CBE Life Sci. Educ. 2014;13: 1–2. doi: 10.1187/cbe.13-10-0195 24591495
35. Shaffer CD, Alvarez CJ, Bednarski AE, Dunbar D, Goodman AL, Reinke C, et al. A course-based research experience: how benefits change with increased investment in instructional time. CBE Life Sci. Educ. 2014;13: 111–130. doi: 10.1187/cbe-13-08-0152 24591510
36. Shaffer CD, Alvarez C, Bailey C, Barnard D, Bhalla S, Chandrasekaran C, et al. The Genomics Education Partnership: successful integration of research into laboratory classes at a diverse group of undergraduate institutions. CBE Life Sci. Educ. 2010;9: 55–69. doi: 10.1187/09-11-0087 20194808
37. Rosenwald AG, Russell J, Arora G. The Genome Solver website: a virtual space fostering high impact practices for undergraduate biology. J. Microbiol. Biol. Educ. 2012;13: 188–190. doi: 10.1128/jmbe.v13i2.444 23653812
38. Ditty JL, Williams KM, Keller MM, Chen GY, Liu X, Parales RE. Integrating grant-funded research into the undergraduate biology curriculum using IMG-ACT. Biochem. Mol. Biol. Educ. 2013;41: 16–23. doi: 10.1002/bmb.20662 23382122
39. Ditty JL, Kvaal CA, Goodner B, Freyermuth SK, Bailey C, Britton RA, et al. Incorporating genomics and bioinformatics across the life sciences curriculum. PLoS Biol. 2010;8(8): e1000448. doi: 10.1371/journal.pbio.1000448 20711478
40. Jordan TC, Burnett SH, Carson S, Caruso SM, Clase K, DeJong RJ, et al. A broadly implementable research course in phage discovery and genomics for first-year undergraduate students. MBio. 2014;5: e01051–13. doi: 10.1128/mBio.01051-13 24496795
41. Feldon DF, Jeong S, Peugh J, Roksa J, Maahs-Fladung C, Shenoy A, et al. Null effects of boot camps and short-format training for PhD students in life sciences. PNAS. 2017;114: 9854–9858; published ahead of print August 28, 2017. doi: 10.1073/pnas.1705783114 28847929
42. National Research Council. 2009. A New Biology for the 21st Century. Washington, DC: The National Academies Press. https://doi.org/10.17226/12764
43. Dillman DA. Mail and internet surveys: the tailored design method, 2nd ed. Hoboken (NJ): John Wiley & Sons; 2007
44. Leech NL, Onweugbuzie AJ. An array of qualitative data analysis tools: A call for data analysis triangulation. School Psych. Quart. 2007;22: 557–584. doi: 10.1037/1045-3830.22.4.557
45. Harding J. Qualitative data analysis from start to finish. Thousand Oak (CA): Sage; 2013
46. Greenacre M, Blasius J, Eds. Multiple correspondence analysis and related methods. New York: Chapman and Hall/CRC; 2006
47. Lê S, Josse J, Husson F. FactoMineR: an R package for multivariate analysis. J. Stat. Softw. 2008;25(1): 1–18. Available from: https://www.jstatsoft.org/article/view/v025i01
Článek vyšel v časopise
PLOS One
2019 Číslo 11
- S diagnostikou Parkinsonovy nemoci může nově pomoci AI nástroj pro hodnocení mrkacího reflexu
- Je libo čepici místo mozkového implantátu?
- Pomůže v budoucnu s triáží na pohotovostech umělá inteligence?
- AI může chirurgům poskytnout cenná data i zpětnou vazbu v reálném čase
- Nová metoda odlišení nádorové tkáně může zpřesnit resekci glioblastomů
Nejčtenější v tomto čísle
- A daily diary study on maladaptive daydreaming, mind wandering, and sleep disturbances: Examining within-person and between-persons relations
- A 3’ UTR SNP rs885863, a cis-eQTL for the circadian gene VIPR2 and lincRNA 689, is associated with opioid addiction
- A substitution mutation in a conserved domain of mammalian acetate-dependent acetyl CoA synthetase 2 results in destabilized protein and impaired HIF-2 signaling
- Molecular validation of clinical Pantoea isolates identified by MALDI-TOF