15 Epistemic Standards and the Next Generation of Scholars

Andreea Musulan, Université de Montréal, andreea.musulan@gmail.com
Jean-François Godbout, Université de Montréal, jean-francois.godbout@umontreal.ca

Abstract: Artificial Intelligence (AI) is decoupling research production from methodological capacity development, creating new challenges for the development of human expertise and the evaluation of knowledge production. As AI becomes increasingly integrated into research workflows, maintaining the integrity of research will hinge on establishing methodological standards and training practices for current and future generations of scholars.

AI usage statement: AI tools were used to assist with editing and manuscript preparation. The authors take full responsibility for the content of this chapter.

15.1 Introduction

Artificial Intelligence (AI) is fundamentally changing the practice of quantitative social science. While the discipline has long been defined by the technical demands of data collection, processing, and analysis, the tools now available to researchers and students have simultaneously raised expectations for knowledge production and lowered the barriers to producing it. AI is not just increasing productivity; it is decoupling the ability to produce analysis from the ability to understand and evaluate it, allowing for research contributions that are not directly built on a foundation of human ingenuity.¹

With the social science behavioural revolutions (Dahl 1961), many departments required integrated methodological training, including statistics and programming. This led to a relatively small number of experts, along with specialized journals and conferences. Similarly, AI appears to be following this pattern, with methods-oriented scholars most likely responsible for teaching and developing these tools. However, because AI is especially useful for working with language data and programming, the years of training that previously constrained the pool of quantitative scholars no longer limit the size of the community. It also opens up the discipline to other subfields, particularly through text-as-data, the automation of research assistant tasks, and accessible coding and statistical guidance.

With respect to knowledge output, AI has expanded our ability to collect and process data, perform analyses, and report results (Karjus 2025; Filimonovic et al. 2025).² In principle, this creates opportunities to contribute more efficiently to the advancement of the different social science disciplines. At the same time, it introduces new pressures on scholars to produce work that is meaningful and methodologically sound, while also increasing the number of scholars able to do this work. These pressures are related to the rate and quantity of research production, as well as technical sophistication and the amount of data being analyzed. While these may be manageable for experienced researchers, for students, they are reshaping the relationship between learning and skill acquisition.

15.2 Changing barriers

At present, the barrier to conducting quantitative social science research outside the domain of specialists is relatively low. Even undergraduate students can use Large Language Models (LLMs) to generate code, explain complex concepts, and produce analytical outputs.³ Tasks that previously required continued effort to understand and internalize — such as programming and statistical reasoning — can now be completed with limited direct engagement. Rather than developing these foundational skills over time, students are increasingly able to compress the learning process through reliance on LLMs. This compression operates through reduced engagement with failure at all stages of the research process, including debugging and testing assumptions. Emerging evidence suggests that this reliance may be associated with weaker performance in tasks that require sustained critical thinking (Jošt et al. 2024). The ability to recognize when an analysis is inappropriate, when assumptions are violated, or when results are misleading depends on the quality of a researcher’s training and practice. Without this, reliance on AI-generated outputs risks shifting the role of the researcher from active analyst to passive evaluator, with limited capacity to assess the quality of what is produced. Furthermore, given that the development of AI systems continues to depend on human expertise for data curation, annotation, and evaluation — which remain integral to model development, performance, and reliability (Gebru et al. 2021) — this decoupling may eventually affect not only knowledge production, but also the quality of the AI systems that increasingly support it.

Historically, methodological skill in quantitative social science has been defined by statistical and programming expertise. These skills were instrumental for processing observational and experimental data and understanding the assumptions and limitations underlying empirical analysis that enable interpretation and validation. For example, preprocessing text data required familiarity with both the structure of the source data and the implications of different formatting decisions. Today, AI tools can facilitate, and in some cases automate, many of these steps for researchers with minimal programming experience.

This shift has important implications for the development and evaluation of expertise. AI systems can assist with research implementation and output generation, but they do not substitute for the ability to formulate strong research questions, critically assess the appropriateness of the analysis and the validity of its results. Consequently, it becomes possible to produce work that appears methodologically sophisticated without a corresponding depth of understanding. This can lead to the creation of research outputs that involve misinterpretation of results, incorrect application of assumptions, or inappropriate tool selection. Although experienced researchers may be well positioned to preemptively identify and address these potential issues, this is less likely to be the case for students and emerging scholars, who are still in the process of acquiring foundational skills.

These dynamics will transform the scale of social science research. AI will increase both the number of scholars and their capacity to produce outputs, likely leading to a substantial expansion in research volume. While this may compromise quality, that outcome is not inevitable. However, excessive cognitive offloading introduces the risk that research capacity will expand more quickly than underlying competence. While statistical analysis and natural language processing also automated aspects of research, they were limited to structured, quantifiable data. In contrast, AI now functions as a highly capable research assistant, able to produce and analyze massive amounts of multimodal data at a scale unprecedented in social science.

As AI becomes increasingly integrated into subfields across the discipline, it will extend into qualitative research, making it increasingly difficult to distinguish from quantitative approaches. Textual analysis is an important part of the methodological toolkit for both quantitative and qualitative scholars, and as a result, will play a central role in their convergence. As with the behavioural revolution of the 1960s, disciplines and departments will need to redesign training and teaching approaches. New methods will emerge alongside or in place of traditional tools such as surveys and regression. This will also prompt foundational ontological and epistemological debates, shifting focus toward the role of AI itself rather than the traditional quantitative–qualitative divide.

Quantitative social science is thus well positioned to help define epistemic standards in this context and to coordinate such efforts across disciplines. This involves outlining expectations around the use of AI tools, such as prompt documentation, model selection, and reproducibility (Munafò et al. 2017; Gebru et al. 2021). Scholars can develop frameworks for assessing methodological quality in an environment where tools are widely accessible but unevenly understood. The quality of research is at risk of decline without quantitative social science assuming this role, as few other disciplines are equipped with the tools and experience to evaluate AI-assisted research with the same level of rigor.

15.3 Concluding remarks

Although considerable attention has been given to how AI can be incorporated into research workflows with appropriate human oversight, less attention has been paid to how these tools are shaping the development of future scholars. This raises important questions for educational institutions.⁴ In particular, there is a need to consider how foundational skills can continue to be cultivated in an environment where many of the tasks associated with learning can be readily automated. Encouraging sustained engagement with core concepts, and ensuring that students develop the capacity to critically evaluate analytical outputs, will be essential for maintaining the integrity of the different social science disciplines. This has implications not only for training, but for the reliability and interpretability of research outputs at scale.

As a cross-cutting field, computational social science should play a central role in outlining foundational skills in programming, model understanding, and advanced statistical analysis. These skills are necessary to ensure that scholars retain the ability to lead AI-assisted research outputs. Research integrity in the current technological context will depend on individual expertise and discipline’s ability to define the terms for the use of AI tools at scale, ensuring that scholars retain the capacity for knowledge production.

Brodeur, Abel, David Valenta, Alexandru Marcoci, et al. 2025. Comparing Human-Only, AI-Assisted, and AI-Led Teams on Assessing Research Reproducibility in Quantitative Social Science. I4R Discussion Paper Series. https://www.iza.org/publications/dp/17645/comparing-human-only-ai-assisted-and-ai-led-teams-on-assessing-research-reproducibility-in-quantitative-social-science.

Dahl, Robert A. 1961. “The behavioral approach in political science: Epitaph for a monument to a successful protest.” American Political Science Review 55 (4): 763–72. https://doi.org/10.2307/1952525.

Filimonovic, Dragan, Christian Rutzer, and Conny Wunsch. 2025. Can GenAI Improve Academic Performance? Evidence from the Social and Behavioral Sciences. https://arxiv.org/abs/2510.02408.

Gao, Jian, and Dashun Wang. 2024. “Quantifying the use and potential benefits of artificial intelligence in scientific research.” Nature Human Behaviour 8 (12): 2281–92. https://doi.org/10.1038/s41562-024-02020-5.

Gebru, Timnit, Jamie Morgenstern, Briana Vecchione, et al. 2021. “Datasheets for datasets.” Communications of the ACM 64 (12): 86–92. https://doi.org/10.1145/3458723.

Jošt, Gregor, Viktor Taneski, and Sašo Karakatič. 2024. “The impact of large language models on programming education and student learning outcomes.” Applied Sciences 14 (10): 4115. https://doi.org/10.3390/app14104115.

Karjus, Andres. 2025. “Machine-assisted quantitizing designs: augmenting humanities and social sciences with artificial intelligence.” Humanities and Social Sciences Communications 12 (1): 1–18.

Lei, Ya-Wen, and Rachel Kim. 2024. “Automation and augmentation: artificial intelligence, robots, and work.” Annual Review of Sociology 50 (1): 251–72. https://doi.org/10.1146/annurev-soc-090523-050708.

Munafò, Marcus R, Brian A Nosek, Dorothy VM Bishop, et al. 2017. “A manifesto for reproducible science.” Nature Human Behaviour 1 (1): 0021. https://doi.org/10.1038/s41562-016-0021.

Xu, Haotian, and Wenqin Shen. 2026. “From tool to partner: generative AI usage patterns and research performance among doctoral students.” Studies in Higher Education, 1–16. https://doi.org/10.1080/03075079.2026.2625386.

Zhang, Jiaxing, and Shuaishuai Feng. 2021. “Machine learning modeling: A new way to do quantitative research in social sciences in the era of AI.” Journal of Web Engineering 20 (2): 281–302.

It is prudent to make the distinction between the augmentation of work using AI and automation for the central argument of this paper. While reliance on AI introduces “adverse consequences” (for example, through the lack of “domain expertise”) (Lei and Kim 2024, 251), the incorporation of AI as an augmentation tool has been shown to at least match the level of human performance in quantitative social science research (Brodeur et al. 2025, 3–4). Research incorporating AI in general has also been shown to be “more likely to be cited both within” and across scientific disciplines, but that there is a “misalignment between AI use and AI education” (i.e., there is insufficient training relative to the extent of AI application in research) (Gao and Wang 2024, 2286–87).↩︎
The application of AI in quantitative social science has helped the field confront one of its most widespread challenges, overfitting, through “cross validation and regularization” (Zhang and Feng 2021, 281).↩︎
Recent research on the use of generative AI use by doctoral students, particularly in more quantitative fields, such as economics, suggests that its incorporation results in higher quality and quantity of outputs (Xu and Shen 2026). However, the fundamental concepts and skills are developed prior to doctoral research.↩︎
While not the primary focus of our discussion, pedagogical approaches and methodological standards developed within social science may have implications that extend beyond higher education. As AI tools increasingly become accessible to younger students, similar concerns regarding foundational skill development and critical thinking may emerge earlier in the educational pipeline.↩︎