Social Science Theories in Software Engineering Research
Social Science Theories in Software Engineering Research
1994
ICSE 2022, May 21–29, 2022, Pittsburgh, PA, USA Lorey et al.
This paper is structured as follows: Section 2 presents relevant include the technology acceptance model [17], cognitive fit
background and related work on theories in social sciences and SE, theory [81], and the theory of task-technology fit [25].
including an overview of the classification criteria employed by this (5) Theories for design and action provide detailed instructions on
paper. Sections 3 and 4 describe the research method and present how to do something, like recipes. These are less common in
the results. Section 5 elaborates on the theories used most often in the social sciences than in software engineering. Multiview,
SE research. Section 6 discusses the study’s implications, limitations the methodology and contingency framework, is a good
and avenues for future research. Finally, Section 7 concludes the example [3].
paper with a summary of its contributions.
Theories also exist on a kind of generalizeability spectrum from
2 BACKGROUND observations and empirical generalizations to middle-range theo-
This section explains our operationalization of social science and ries, general theories, grand theories, and theories of everything [61,
the classification schemes we later employ. 72]. SE researchers are mainly concerned with empirical generaliza-
We consider a theory a social science theory if it originates in one tions, middle-range theories and general theories [72]. An empirical
of the disciplines listed by The Social Science Encyclopedia [36]: an- generalization is “an isolated proposition summarizing observed
thropology, cognitive science, criminology and law, cultural studies, uniformities of relationships between two or more variables” [44];
demography, economics, education, evolution, gender, geography, for instance, Moore’s law. Middle-range theories (e.g. COCOMO [8];
health and medicine, history, industrial relations and management, the software engineering principles of Davis [16]) “lie between the
language, linguistics and semiotics, mental health, methods of so- minor but necessary working hypothesis that evolve in abundance
cial research, philosophy, political theory, politics and government, during day-to-day research and the all-inclusive systematic efforts
psychology, social problems and social welfare, and sociology. Some to develop a unified theory that will explain all the observed unifor-
notes on this list are warranted: mities” [44]. A general theory, meanwhile, “applies across [a] field
and unifies existing empirical and theoretical work” [33]. No gen-
• Some of these fields (e.g. geography) mix social, natural or
eral theory of software engineering is widely accepted, but a few
applied science.
have been proposed (e.g. the tarpit [31]; sensemaking-coevolution-
• Some fields intersect several broader social sciences (e.g.
implementation theory [55]).
communication studies intersects cultural studies, linguistics,
Researchers also distinguish between variance theories, process
management, and sociology).
theories, and taxonomies. Variance theories explain and predict
• Each field has many subfields (e.g. information systems,
the variance in one or more dependent variables using indepen-
which shares many similarities with SE, is a subfield of man-
dent, moderating and mediating variables [14]. Process theories
agement).
explain how an entity changes and develops [78]. Taxonomies (also
• We are not sure that philosophy is a social science, per se,
called theories for understanding, descriptive theories, frameworks
but tentatively include it anyway.
and typologies) organize instances (individual things) into classes
Theories can be classified on numerous dimensions including (abstract descriptions) [58].
what they are for, their scope and their structure, as well as the way Variance theories typically posit causal relationships among con-
they are used and where in an article they are used. structs [63]; process theories and taxonomies typically do not [58].
Gregor [26] proposed a taxonomy for classifying information Process theories usually focus on non-causal relationships between
systems theories according to their purpose. She argued that theo- entities (e.g. actors, objects) while taxonomies focus on similari-
ries can serve four purposes (analysis and description, explanation, ties between entities. Variance theories are often associated with
prediction, and prescription), leading to five types: positivist, quantitative, statistical research while process theories
(1) Theories for analyzing, also called frameworks or taxonomies, and taxonomies are often associated with interpretivist, qualitative,
are relatively simple, descriptive theories. A good example non-statistical research [45]. In many scientific disciplines, includ-
is Iivari et al.’s dynamic four-tiered framework for classify ing information systems, variance theories are more common than
information systems development methodologies and ap- process theories. However, process theories better explain the re-
proaches [28]. lationships between inputs and outcomes [14]. Variance theories
(2) Theories for explaining, also called theories for understanding, and process theories can sometimes be combined [48].
attempt to explain what, how, when, why or where phenom- Meanhile, theories can serve different purposes within a research
ena occur. They do not include causal hypotheses or make article [27]:
numerical predictions. Darwinian evolution is a theory for
explaining, as is dual coding theory [49]. • Design: The theory influences a study’s research method
(3) Theories for predicting predict future events without speci- including research questions, hypotheses, tasks, materials,
fying causal relationships (e.g. weather forecasting models). simulation parameters, or research models and frameworks.
These theories have testable propositions but no prescrip- • Explanation: The theory explains a study’s results. There
tions. may be a fine line between a theory that explains results and
(4) Theories for explaining and predicting posit causal explana- a theory that analyzes results.
tions and testable propositions (e.g. quantitative hypotheses). • Applied: The theory is used for its intended purpose (e.g.
Prescriptions may be included but are not the focus. This predicting technology adoption) within a study, or to analyze
type is common in information systems, where examples study results.
1995
Social Science Theories in Software Engineering Research ICSE 2022, May 21–29, 2022, Pittsburgh, PA, USA
• Motivation: The theory inspires the article’s topic (but its con- Methodology (TOSEM), Empirical Software Engineering (EMSE), the
cepts are not used explicitly to formulate the study design; Journal of Systems and Software (JSS), and Information and Soft-
that would be Design). ware Technology (IST). We chose these journals because they are
• Tested: The article evaluates the theory using primary data widely considered to be among the most prestigious, influential
(e.g. an experiment) or secondary data (e.g. a systematic and academically rigorous journals that sit squarely in SE.
literature review). To keep the review recent, tractable and reproducible, we limit
• Modified: The article incrementally modifies, adapts or ex- our search to articles published between 1 January 2007 and 31
tends the theory. December 2019. Within this thirteen-year period, the five selected
• Basis: The article proposes a new theory, with its own char- journals published more than 5,900 articles. Of these, we manually
acter and concepts, partly based on an existing theory. excluded editorials, introductions, book reviews, and other non-
Hannay et al. [27] include an eighth role—proposed—however, research articles.
theories proposed by SE articles are beyond the scope of this review We scanned the remaining research articles for theory use ac-
because they do not originate in a social science. We added the role cording to the following process:
applied to accommodate theories that are used for their intended (1) We performed an electronic search for the term “theory”
purpose. Furthermore, we follow Hannay et al. [27] in excluding within each entire article, which can serve as an initial indi-
tangential references to theories and descriptions of theories, as in cator for theory use.
a related work section, and focus on substantial uses of theories. (2) An electronic search for the names of the most commonly
Finally, we can classify theories according to where in an article used theories in information systems according to the Theo-
they are used. The IMRaD pattern—“Introduction, Method, Result, ries Used in IS Research Wiki1 or Dwivedi et al. [19]. This
and Discussion”—is the de-facto standard for scientific writing step was included to avoid missing theories that do not have
in many fields [6]. (In IMRaD, introduction includes related work the word “theory” in the title (e.g. Task Technology Fit) and
while discussion includes implications, limitations, future work and potentially explore similarities between SE and IS.
conclusions.) While each article is unique, most can be mapped to
the IMRaD structure. We can ignore front and back matter such as Each paper returned by these searches was then manually checked
the title, abstract, keywords, references, acknowledgements, and against the following inclusion criteria:
appendices as these sections rarely contain substantive uses of (1) the “theory” meets our definition of a collection of ideas for
theories. understanding, analyzing, explaining or predicting;
(2) the theory takes on one of Hannay et al.’s roles as adapted
3 METHODOLOGY for our study (Section 2);
We conducted a critical review. Critical reviews [e.g. 4, 73, 83] are (3) the theory originates in one of the social science subfields
similar in execution to systematic reviews [cf. 34], except that criti- listed in the Social Science Encyclopedia [36]. (If not obvious,
cal reviews analyze a sample of papers to make a point (often about we examined the reference list or searched for the theory in
theory or methods), whereas systematic reviews aim to synthesize relevant scientific databases).
all relevant evidence. We exclude theories that originated in software engineering,
Briefly, we identified clear and upfront analysis questions, se- mere observations, grand theories, theories of everything, and the-
lected a search strategy including explicit inclusion and exclusion ories that were mentioned but not substantively used (Fig. 1).
criteria, retrieved a sample of articles from prominent SE journals,
extracting instances of theory use from the articles, and analyzing
the theories and their uses by applying a prior coding scheme.
3.3 Data Extraction and Analysis
Next, we extracted relevant data from the articles. We extracted
3.1 Objective and Questions metadata including title, authors, journal, volume, issue (if applica-
ble) and publication. In addition to metadata, we extracted data on
This study aims to provide a comprehensive summary of the state
the theories and articles themselves. For theories, we extracted the
of social science theory use in software engineering research. We
originating discipline, theory role, theory usage, theory nature, and
refine this goal into five specific analysis questions (AQs) all of
Gregor’s theory type. For articles, we extracted the corresponding
which end with an implied ‘in SE research’:
SWEBOK classification.
(AQ1) To what extent are social science theories used? To gather this data, we analyzed each theory to determine its orig-
(AQ2) What social science theories are used? inating discipline, type (according to Gregor [26]), scope (middle-
(AQ3) Which social sciences provide the theories? range, etc.) and nature (process, variance or taxonomy). Then we
(AQ4) Which types of theories are used? analyzed each theory usage to determine the theory’s role and where
(AQ5) For which purposes are theories used? it was used (introduction, methods, results or discussion).
(AQ6) Where in the papers are theories used? Finally, we classified each article according to the fourteen knowl-
(AQ7) In what knowledge areas are theories used? edge areas defined by the software engineering body of knowledge
(SWEBOK) [10]—design, testing, etc.
3.2 Journal, Article and Theory Selection
We selected five SE journals to study: IEEE Transactions on Software
Engineering (TSE), ACM Transactions on Software Engineering and 1 https://is.theorizeit.org/
1996
ICSE 2022, May 21–29, 2022, Pittsburgh, PA, USA Lorey et al.
3.4 Validation
We validated our analysis in two ways. First, the second and third
TSE TOSEM EMSE JSS IST authors performed a detailed audit of the first author’s classifica-
(839) (299) (710) (2,567) (1,515) tions. This review produced the edge case decisions described in
Section 3.3.1 and many improvements to the classification of theo-
ries. Next, we extracted the authors’ contact information from each
article that applied one or more theories and sent the authors a ques-
Total retrieved: 5,930 tionnaire regarding their article, the theory (or theories) they used
and their experience with social science theories. Questionnaire
results are reported in Section 4.8
1997
Social Science Theories in Software Engineering Research ICSE 2022, May 21–29, 2022, Pittsburgh, PA, USA
1998
ICSE 2022, May 21–29, 2022, Pittsburgh, PA, USA Lorey et al.
1999
Social Science Theories in Software Engineering Research ICSE 2022, May 21–29, 2022, Pittsburgh, PA, USA
context of construction (4), SE process (4), testing (3), design (3), • Design/method (29%) and design/introduction (23%) are the
maintenance (3), and quality (1). No articles used theories in the most common combinations. That is, the theory informs
context of other knowledge areas. the design of the study, and this is explained in either the
However, 19 articles did not map well into the SWEBOK knowl- introduction or the method section. For example, Babar et
edge areas. Of these, twelve can be best categorized as technology al. [P68] formulate their hypothesis based on task-media fit
adoption (e.g. investigating technology adoption and acceptance in the methodology section. Marsan et al. extend concepts
in Botswana [P16]). Seven are about research methods (e.g. us- of institutional theory and rhetorical theory to propose a
ing the garbage can model to find a research method for studying research question in their introduction [100].
open-source software [P2]). • Explanation/discussion is also common (11%). For example,
Naturally, social science theories are used more in the primarily Hyrynsalmi et al. [102] use cultural lag theory to explain
social knowledge areas than in the primarily technical knowledge value creation mechanisms for mobile applications in their
areas. The more technical knowledge areas may use more mathe- discussion section.
matical and natural-science foundations, or may be less theoretical • Motivation/introduction, applied/method, and basis/method
than the more social knowledge areas. That social theories are used each make up another approximately 3%. The first is self
so rarely in software design is puzzling. explanatory. As an example of the second, Yu and Petter
Perhaps more concerning is that the SWEBOK knowledge area [P69] used shared-mental models theory to analyze different
taxonomy appears deficient for classifying either SE research or SE kinds of agile practices in their methods section. Moody [P1]
practice. Much of the research we reviewed does not fit in any of the introduces a design theory on the basis of various existing
categories, and many common practices resist classification. How theories, as an example for the third.
would one classify, for example, research on continuous integration The number of articles contributed by each journal varies greatly.
practices? TOSEM and ESE contributed only 274 and 642 papers respectively,
while IST and JSS contributed 1419 and 2453. TSE was in the middle
4.8 Member checking results with 792. This basically reflects publication numbers—JSS and IST
Approximately 20% of the authors completed our questionnaire, publish a lot more articles.
leading to minor changes to the classification of some theories, However, even after accounting for their relative contribution
mainly in how theories are used within the articles. A sample issue to the sample, journals vary significantly in their use of theories.
that was discussed upon analyzing the questionnaire was the origin Less than 1% of the TOSEM papers used theories, followed by TSE
of Adaptive Structuration Theory (MIS) compared to Structuration with 1.51%, JSS with 1.59%, ESE with 1.87%, and 2.60% IST papers.
Theory (sociology). Benefits of using social science theories in SE This suggests that either IST is most sympathetic to research using
research mentioned by respondents include: social science theories, or that researchers who use social science
• improving understanding of phenomena occurring in SE; theories prefer submitting to IST.
• theoretical grounding of their own research; and We might expect more articles using social science theories in
• connecting technical and human aspects such as the way of recent years, as SE has become more concerned with the human
thinking, teamwork, and failure. and social aspects of development. However, no particular trends
are evident in the data overall, or within any specific journal.
Challenges mentioned include:
• understanding the theoretical background as social science 5 MOST USED THEORIES
theories are often more abstract; This section describes in more detail some of the most used theories
• finding corresponding theories and mapping them to SE; in our sample. While space limitations prevent comprehensive
• theories that seem useful but lack necessary precision. descriptions of all 87 theories, elaborating a few prevalent ones
Some authors are sympathetic to theories, but find difficulty should give the reader a better sense of the sort of social science
in publishing due to reviewers not understanding social science theories being used in SE research.
theories. One response mentioned the psychological uncertainty
when using qualitative constructs and the challenge of accepting 5.1 The Technology Acceptance Model
that “everyone does it differently”. The most used theory in this review is the technology acceptance
model (TAM). TAM is a middle-range variance theory for explaining
4.9 Exploratory findings and predicting organizational adoption of new technologies [17].
Several findings beyond the original analysis questions emerged. TAM posits that an organization’s intention to adopt a technology
This section briefly reports the more interesting incidental findings. depends on perceived usefulness and perceived ease of use. It is
Software engineering is closely related to information systems, based on Fishbein and Ajzen’s theory of reasoned action [23].
so we might expect a large overlap in theory use. However, only 28 TAM is arguably the most widely used theory in information
(32%) of the theories used in our sample are listed in the Theories system research. It has been extensively tested, refined and ex-
Used in IS Research Wiki or Dwivedi et al.’s [19] survey of theories tended, most notably by the Unified Theory of Acceptance and Use
commonly used in IS research. of Technology (UTAUT; Fig. 3) [80].
Meanwhile, cross-referencing where theories are used with the However, TAM has also been widely criticized [12]. For example,
role the theory plays highlights a few combinations: Benbasat and Barki [5] argue that TAM “has created an illusion
2000
ICSE 2022, May 21–29, 2022, Pittsburgh, PA, USA Lorey et al.
Performance
Expectancy
Eort Behavioural
Expectancy Intention
Use
Behavior
Social
Inuence
Facilitating
Conditions
Voluntariness
Gender Age Experience
of Use
2001
Social Science Theories in Software Engineering Research ICSE 2022, May 21–29, 2022, Pittsburgh, PA, USA
June 2021 issue of MIS Quarterly appear to use one or more social
Internal Mental
Problem Solving theories. At least three possible explanations are evident:
Representation of Representation
Performance
Problem Domain for Task Solution (1) SE research primarily uses homegrown theories;
(2) SE research primarily uses natural science theories;
(3) SE research is primarily atheoretical.
External Problem Problem-Solving Distinguishing between these three explanations would require a
Representation Task different kind of study, with a different sampling strategy. However,
based on our experience with initiatives aimed at encouraging
theory development and use, we suspect SE research is largely less
Figure 5: Cognitive fit theory (adapted from [67]) concerned with theory (more atheoretical) than other similar fields.
Compared to management, for example, SE researchers appear
more concerned with making tools than theories.
Additionally, some of the theories that are being used are prob-
5.4 Coordination Theory lematic. TAM, for example, is outdated [80], does not explain a core
Coordination theory is a general theory for explaining “how the SE phenomenon, and has been widely criticized [12].
activities of separate actors can be coordinated” [41]. It includes a Broadly considered, the results of this study suggest that SE
framework for analyzing coordination in complex processes, as well research is not making good use of social science theory. Failing to
as a typology of dependencies and coordination mechanisms [42]. incorporate social theories is problematic in several ways.
It posits that in an organization, some tasks depend on other tasks, Theories underpin almost all natural, social and applied sciences.
which creates coordination problems. Actors solve these problems Sound theoretical foundations are essential for maturing into a
using coordination mechanisms. Coordination theory posits different fully developed research discipline [70]. Without a collection of
kinds of coordination problems and mechanisms. core theories, a discipline struggles to generate, accumulate and
Coordination theory is used to design not only human organiza- preserve knowledge [62, 68, 70].
tions but also technologies (e.g. tools for facilitating coordination, Second, SE is intrinsically interdisciplinary—it intersects com-
distributed computer systems) [41]. puter science, economics, management, mathematics, philosophy,
psychology, and sociology. The social and technical aspects of cre-
5.5 Cognitive Fit Theory ating software are inextricably entwined; they cannot be cleaved
Cognitive fit is a middle range, variance theory for explaining and apart and understood independently. More than two percent of SE
predicting task performance. It “describes the relationships between research needs to draw on social science theories to have any hope
graphical and tabular representations and the types of tasks they of understanding the full scope of software development.
support” [81]. It posits that matching information representations In our experience, this inattention to social theory already harms
to problem solving tasks will increase task performance (Fig. 5). SE research, practice, and education. Some SE research tends to
For example, to navigate, we might want a map, but to calculate oversimplify and over-rationalize social phenomena. For exam-
distance traveled, we might want a table showing distances between ple, where an SE researcher asks “what is the problem the system
destinations. should solve?” a sociologist sees multiple stakeholders, who do
Cognitive fit theory has evolved over time. For example, Samuel not agree on the nature of the problem or the aims of the system
et al. [66] extend cognitive fit theory to introduce the dual-domain [11], jockeying for control of a project’s agenda—there is no “the
problem solving framework. While uncontroversial, cognitive fit is problem”. Where a software professional sees “requirements elicita-
only one of multitudinous antecedents of task performance, and tion”, a psychologist sees a prospective user and an analyst analyst
may be overwhelmed by other, more important factors. co-constructing preferences specific to that moment and context
[37]—the outcomes are neither “requirements” nor “ellicited” [54].
6 DISCUSSION Moreover, while students of other applied sciences learn to design
innovative systems based on ill-defined opportunities, SE students
6.1 Implications primarily learn to construct routine, fully-specified systems, as if
Our analysis produced several surprising findings: figuring out what to build is someone else’s job [cf. 53].
(1) Less than 2% of the articles use a social science theory in a Furthermore, infrequent use of social theory is surprising given
substantive way. two trends in SE research:
(2) SE research rarely tests a social science theory, for example, (1) The SE academic community is midway through a shift from
to see if it actually applies to SE. rationalism (it works because it intuitively makes sense) to
(3) Many relevant social sciences are not referenced at all (e.g. empiricism (it works because the results of this empirical
criminology). study demonstrate that it works) [57].
(4) Process theories are used more often than variance theories, (2) The SE community appears increasingly receptive to soci-
which shows that process theories are much more common ological research methods including grounded theory [73]
in SE than generally assumed [cf. 58]. and ethnography [83].
Intuitively, less than two percent of papers using social theories As the community demands more rigorous research, demands
seems low. For comparison, at least two thirds of the papers in the more empirical studies, and attempts more qualitative and social
2002
ICSE 2022, May 21–29, 2022, Pittsburgh, PA, USA Lorey et al.
research, we expect researchers increasingly to draw on reference Additionally, SE researchers lean toward theories from psychol-
disciplines for both supporting foundations, evidence and method- ogy (to understand professionals) and management (to understand
ological guidance. teams and organizations). We rarely draw on education (to un-
derstand professional development), criminology (to understand
deviant behavior), health (to understand professionals’ wellbeing),
6.2 Limitations history (to understand the evolution of SE), or philosophy (to un-
The results of this study should be weighed against several limita- derstand a great many things). Exploiting the vast array of relevant
tions. It is very likely that some articles in the sample were missed knowledge from myriad disciplines may help launch many fruitful
because they used a theory, but not the word “theory.” We mitigated research programs.
this threat by searching for a list of specific theories by name, but More specifically, this study investigates use of social science
that list too is incomplete. theories. Similar reviews of homegrown SE theories and natural
Similarly, our search process may not have been sensitive to em- science theories used in SE are needed.
pirical generalizations. These generalizations would not be listed in
the IS Theory Wiki or or Dwivedi et al.’s [19] review, and would not 6.4 Comparison to previous reviews
normally have “theory” in their names, if they even have names. Several studies have examined the status of theory use in SE re-
Therefore, the fact that we did not uncover any empirical general- search. Hannay et al. [27] present a systematic literature review of
izations from social science does not mean none were used. theories used in software engineering experiments. Stol & Fitzger-
The five journals and 13-year period we studied are not rep- ald [70] argue that SE researchers often use “theory fragments” for
resentative of all SE outlets across all time. We used purposive, analysis. Lim et al. [38] examine the theoretical basis of information
search-based sampling [4] to select good, influential journals rather systems research. Ralph et al. [59] describe seven theories that, they
than random journals and focused on more recent years because argue, ought to be core to SE research.
we want to understand the trajectory of the field, which is mostly Like Hannay et al., we find that theories are mostly used to moti-
set by recent papers in top outlets. vate and design studies; fewer articles test or modify theories. Like
Meanwhile, researchers do not agree on the definition of theory. Lim et al., we find that TAM is widely-used, and that psychology
Different researchers might have included or excluded different and economics are popular reference disciplines. Like Ralph et al.,
theories. We mitigate this threat by providing a complete list of the we argue that SE researchers should use more social theory.
included theories (Section 4.2) and discussing edge cases (Section However, this review extends previous work in several ways.
3.3.1). We also tried searching for “model”, but this produced too Stol & Fitzgerald [70] is more of a position paper with a limited
many false positives. review. Hannay et al. [27] only consider experiments, and their
We validated our classifications through audits (i.e. the second review is getting dated. Lim et al. [38] looked at a different field.
and third authors extensively reviewing the classifications) and Ralph et al. [59] recommend social theories rather than studying
member checking (i.e., sending a questionnaire to primary study which are currently used. In summary, we extend these reviews by
authors). This led to significant recategorization of Gregor’s theory studying a larger, broader, more recent sample of primary studies,
types and a handful of minor changes. However, different analysts leading to somewhat different recommendations.
may classify theories, usages and topic areas differently.
7 CONCLUSION
6.3 Areas for future research The presented study examined substantive uses of social science
SE researchers should consider using more social theories from theories in more than 5500 articles published in five leading soft-
reference disciplines. Using social theories more often, and more ef- ware engineering journals over a thirteen-year period. Despite
fectively, will help SE researchers produce more rigorous, nuanced, several calls for more attention to theory in SE research, increasing
insightful research. Furthermore, SE researchers should move be- attention to the social aspects of development, and increasing use
yond using theories to motivate their papers or specify the research of behavioral and qualitative research, only about two percent of the
methodology. We need to test social theories in SE contexts and reviewed articles use social science theories in any substantial way.
theorize about SE phenomena using concepts from the social sci- In other words, SE research tends to ignore relevant theories from
ences. For example, iteratively evaluating and adapting the theory social sciences.
of boundary objects [69] could help us better understand product When social science theories are used, they are most commonly:
backlogs, user stories, and other common non-code artifacts. • middle-range theories;
Similarly, social theories are often used to examine project man- • process theories;
agement, requirements engineering, modeling, software develop- • theories for explaining (but not predicting);
ment methods and technology adoption. SE researchers tend not • drawn from psychology or management;
to use them to examine designing, constructing, testing, evaluating • used in some way related to designing the study;
and configuring software. However, the latter are socio-technical • used in the introduction or methods section of the paper;
processes in which technical phenomena are entangled with social • used in research concerning professional practice, technol-
phenomena. Again, social theories should help. For example, nu- ogy adoption or models and methods of software develop-
merous theories concerning power and influence could help explain ment; and
interpersonal conflict during peer programming. • published in Information and Software Technology.
2003
Social Science Theories in Software Engineering Research ICSE 2022, May 21–29, 2022, Pittsburgh, PA, USA
In contrast, SE research tends not to: [13] Ronald H Coase. 1937. The nature of the firm. Economica 4 (1937).
[14] Kevin Crowston. 2000. Process as theory in information systems research. In
• test theories being used; Organizational and Social Perspectives on Information Technology. Springer, 149–
• use theories to analyze data; 164.
• apply theories to software design, testing, evaluation or con- [15] Richard L Daft and Robert H Lengel. 1983. Information richness. A new approach
to managerial behavior and organization design. Technical Report. Texas A and
figuration; M Univ College Station Coll of Business Administration.
• draw theories from relevant reference disciplines including [16] Alan M Davis. 1995. 201 Principles of Software Development. McGraw-Hill, Inc.
[17] Fred D Davis. 1989. Perceived usefulness, perceived ease of use, and user accep-
health, criminology or history. tance of information technology. MIS Quarterly (1989), 319–340.
Applied sciences benefit from balancing more concrete tech- [18] Alan R Dennis, Robert M Fuller, and Joseph S Valacich. 2008. Media, tasks, and
communication processes: A theory of media synchronicity. MIS Quarterly 32, 3
niques and more abstract theories. Techniques encapsulate practical, (2008), 575–600.
specific, contextualized, often short-term knowledge that practi- [19] Yogesh K Dwivedi, Michael R Wade, and Scott L Schneberger. 2011. Information
tioners can readily apply. Theories encapsulate more abstract, gen- systems theory: Explaining and predicting our digital society. Vol. 1. Springer.
[20] Yrjö Engeström et al. 1999. Activity theory and individual and social transforma-
eralizable, long-term knowledge. They insulate a field against fads, tion. Perspectives on Activity Theory (1999), 19–38.
guide research programs and help us understand novel situations. [21] Iaakov Exman, Dewayne E. Perry, Balbir Barn, and Paul Ralph. 2016. Separability
For SE research to mature and improve, researchers therefore principles for a general theory of software engineering: Report on the GTSE
2015 workshop. SIGSOFT Software Engineering Notes 41, 1 (Feb. 2016), 25–27.
need to make better use of theory: social science theories to un- https://doi.org/10.1145/2853073.2853093
derstand social phenomena, natural science theories to understand [22] Fred E Fiedler. 1964. A contingency model of leadership effectiveness1. In
Advances in Experimental Social Psychology. Vol. 1. Elsevier, 149–190.
physical and mathematical phenomena, and more homegrown the- [23] M. Fishbein and I. Ajzen. 1975. Belief, attitude, intention, and behavior: an intro-
ories to understand SE-specific phenomena. duction to theory and research. Addison-Wesley Pub. Co., Reading, MA, USA.
No one is saying that SE should abandon developing and evalu- [24] Anthony Giddens. 1986. The Constitution of Society: Outline of the Theory of
Structuration. Vol. 349. Univ of California Press.
ating new technologies or techniques. Rather, SE has twin aims of [25] Dale L Goodhue and Ronald L Thompson. 1995. Task-technology fit and individ-
understanding software development and creating tools and tech- ual performance. MIS Quarterly 19, 2 (1995), 213–236.
niques for developing software better. Our abilities to build better [26] Shirley Gregor. 2006. The nature of theory in information systems. MIS Quarterly
30, 3 (2006), 611–642.
systems and to explain software development phenomena necessar- [27] Jo Hannay, Dag Sjøberg, and Tore Dyba. 2007. A systematic review of theory use
ily co-evolve. Many of our hardest problems lie at the intersection in software engineering experiments. IEEE Transactions on Software Engineering
33, 2 (Feb. 2007), 87–107. https://doi.org/10.1109/TSE.2007.12
of social phenomena, technical phenomena, understanding, and cre- [28] Juhani Iivari, Rudy Hirschheim, and Heinz K Klein. 2000. A dynamic framework
ating. Only by accepting the fundamental interconnections among for classifying information systems development methodologies and approaches.
these four dimensions can we address our greatest challenges. Journal of Management Information Systems 17, 3 (2000), 179–218.
[29] Michael C Jensen and William H Meckling. 1976. Theory of the firm: Managerial
behavior, agency costs and ownership structure. Journal of Financial Economics
DATA AVAILABILITY 3, 4 (1976), 305–360.
[30] Michael John, Frank Maurer, and Bjørnar Tessem. 2005. Human and social factors
The anonymized replication package includes the questionnaire of software engineering: workshop summary. ACM SIGSOFT Software Engineering
template, the list of primary studies, and a spreadsheet containing Notes 30, 4 (2005), 1–6.
all extracted data on articles and theories being used, as well as [31] Pontus Johnson and Mathias Ekstedt. 2016. The Tarpit–A general theory of
software engineering. Information and Software Technology 70 (2016), 181–203.
decision rules for inclusion and exclusion criteria. It is available at [32] Pontus Johnson, Mathias Ekstedt, and Ivar Jacobson. 2012. Where’s the theory
https://doi.org/10.5281/zenodo.6036076. for software engineering? IEEE Software 29, 5 (2012), 94–96. https://doi.org/10.
1109/MS.2012.127
[33] Pontus Johnson, Paul Ralph, Michael Goedicke, Pan-Wei Ng, Klaas-Jan Stol, K.
REFERENCES Smolander, Iaakov Exman, and Dewayne E. Perry. 2013. Report on the second SE-
[1] Gordon W Allport. 1954. The Nature of Prejudice. Addison-Wesley. MAT workshop on general theory of software engineering (GTSE 2013). SIGSOFT
[2] Chris Argyris. 1995. Action science and organizational learning. Journal of Software Engineering Notes 38, 5 (2013), 47–50.
Managerial Psychology 10, 6 (1995), 20–26. [34] Staffs Keele et al. 2007. Guidelines for performing systematic literature reviews in
[3] D E Avison and A T Wood-Harper. 1986. Multiview&Mdash;an Exploration in software engineering. Technical Report. Citeseer.
Information Systems Development. Aust. Comput. J. 18, 4 (Nov. 1986), 174–179. [35] Laurie J Kirsch. 1996. The management of complex tasks in organizations:
[4] Sebastian Baltes and Paul Ralph. in press. Sampling in software engineering Controlling the systems development process. Organization Science 7, 1 (1996),
research: A critical review and guidelines. Empirical Software Engineering (in 1–21.
press). [36] Adam Kuper and Jessica Kuper (Eds.). 2004. The Social Science Encyclopedia (3rd
[5] Izak Benbasat and Henri Barki. 2007. Quo vadis TAM? Journal of the Association ed.). Routledge, Chippenham, UK.
for Information Systems 8, 4 (2007), 7. [37] Sarah Lichtenstein and Paul Slovic. 2006. The Construction of Preference. Cam-
[6] Marc Bertin and Iana Atanassova. 2014. A study of lexical distribution in citation bridge University Press, Cambridge, UK.
contexts through the IMRaD standard. PloS Negl. Trop. Dis 1, 200,920 (2014), [38] Sanghee Lim, Terence JV Saldanha, Suresh Malladi, and Nigel P Melville. 2013.
83–402. Theories used in information systems research: Insights from complex network
[7] Elizabeth Bjarnason, Kari Smolander, Emelie Engström, and Per Runeson. 2016. A analysis. JITTA: Journal of Information Technology Theory and Application 14, 2
theory of distances in software engineering. Information and Software Technology (2013), 5.
70 (2016), 204–219. [39] Edwin A Locke and Gary P Latham. 1990. A Theory of Goal Setting & Task
[8] Barry W Boehm, Chris Abts, A Winsor Brown, Sunita Chulani, Bradford K Clark, Performance. Prentice-Hall, Inc.
Ellis Horowitz, Ray Madachy, Donald J Reifer, and Bert Steece. 2009. Software [40] Kalle Lyytinen and Jan Damsgaard. 2001. What’s wrong with the diffusion of
Cost Estimation with COCOMO II. Prentice Hall PTR. innovation theory?. In Working Conference on Diffusing Software Product and
[9] Barry W. Boehm and Rony Ross. 1989. Theory-W software project management Process Innovations. Springer, 173–190.
principles and examples. IEEE Transactions on Software Engineering 15, 7 (1989), [41] Thomas W Malone et al. 1988. What is coordination theory? (1988).
902–916. [42] Thomas W Malone and Kevin Crowston. 1994. The interdisciplinary study of
[10] Pierre Bourque and Richard E. Fairley. 2014. Guide to the Software Engineering coordination. ACM Computing Surveys (CSUR) 26, 1 (1994), 87–119.
Body of Knowledge (SWEBOK (R)): Version 3.0. IEEE Computer Society Press. [43] RE Mayer. 2001. Multimedia Learning. 2001.
[11] P. Checkland. 1999. Systems Thinking, Systems Practice. Wiley, Chichester. [44] Robert King Merton and Robert K Merton. 1968. Social Theory and Social Structure.
[12] Mohammad Y Chuttur. 2009. Overview of the technology acceptance model: Ori- Simon and Schuster.
gins, developments and future directions. Sprouts: Working Papers on Information [45] Lawrence B Mohr. 1982. Explaining Organizational Behavior. Jossey-Bass.
Systems 9, 37 (2009).
2004
ICSE 2022, May 21–29, 2022, Pittsburgh, PA, USA Lorey et al.
[46] Stewart C Myers. 1977. Determinants of corporate borrowing. Journal of Financial [65] Everett M Rogers. 1983. Diffusion of Innovations. The Free Press.
Economics 5, 2 (1977), 147–175. [66] Binny M Samuel, Linwood Watkins, Andrew Ehle, and Vijay Khatri. 2015. Cus-
[47] Ikujiro Nonaka. 1994. A dynamic theory of organizational knowledge creation. tomizing the representation capabilities of process models: understanding the
Organization Science 5, 1 (1994), 14–37. effects of perceived modeling impediments. IEEE Transactions on Software Engi-
[48] Ana Ortiz de Guinea and Jane Webster. 2014. Overcoming variance and process neering 41, 1 (2015), 19–39.
distinctions in information systems research. (2014). [67] Teresa M Shaft and Iris Vessey. 2006. The role of cognitive fit in the relationship
[49] Allan Paivio. 1990. Mental Representations: A Dual Coding Approach. Oxford between software comprehension and modification. MIS Quarterly 30, 1 (2006),
University Press. 29–55.
[50] Ananthanarayanan Parasuraman, Valarie A Zeithaml, and Leonard L Berry. 1988. [68] Dag IK Sjoberg, Tore Dyba, and Magne Jorgensen. 2007. The future of empirical
Servqual: A multiple-item scale for measuring consumer perc. Journal of Retailing methods in software engineering research. In 2007 Future of Software Engineering.
64, 1 (1988), 12. IEEE Computer Society, 358–378.
[51] Jeffrey Pfeffer and Gerald R Salancik. 2003. The External Control of Organizations: [69] Susan Leigh Star and James R. Griesemer. 1989. Institutional ecology, ’translations’
A Resource Dependence Perspective. Stanford University Press. and boundary objects: Amateurs and professionals in Berkeley’s museum of
[52] Peter Pirolli and Stuart Card. 1995. Information foraging in information access vertebrate zoology, 1907-39. Social Studies of Science 19, 3 (1989), 387–420.
environments. In Proceedings of the SIGCHI Conference on Human Factors in [70] Klaas-Jan Stol and Brian Fitzgerald. 2013. Uncovering theories in software
Computing Systems. ACM Press/Addison-Wesley Publishing Co., 51–58. engineering. In 2013 2nd SEMAT Workshop on a General Theory of Software
[53] Paul Ralph. 2012. Improving coverage of design in information systems education. Engineering (GTSE). IEEE, 5–14.
In Proceedings of the International Conference on Software Engineering. IEEE, [71] Klaas-Jan Stol and Brian Fitzgerald. 2015. Theory-oriented software engineering.
Orlando, Florida, USA. Science of Computer Programming 101 (2015), 79–98.
[54] Paul Ralph. 2013. The illusion of requirements in software development. Require- [72] Klaas-Jan Stol, Michael Goedicke, and Ivar Jacobson. 2016. Introduction to the
ments Engineering 18, 3 (2013), 293–296. special section—general theories of software engineering: New advances and
[55] Paul Ralph. 2015. The sensemaking-coevolution-implementation theory of implications for research. Information and Software Technology 70 (2016), 176–
software design. Science of Computer Programming 101 (2015), 21–41. https: 180.
//doi.org/10.1016/j.scico.2014.11.007 [73] Klaas-Jan Stol, Paul Ralph, and Brian Fitzgerald. 2016. Grounded theory in
[56] Paul Ralph. 2016. Software engineering process theory: A multi-method com- software engineering research: A critical review and guidelines. In Proceedings
parison of Sensemaking-Coevolution-Implementation Theory and Function- of the International Conference on Software Engineering. IEEE, Austin, TX, USA,
Behavior-Structure Theory . Information and Software Technology 70 (2016), 120–131.
232–250. https://doi.org/10.1016/j.infsof.2015.06.010 [74] John Sweller. 1988. Cognitive load during problem solving: Effects on learning.
[57] Paul Ralph. 2018. The two paradigms of software development research. Science Cognitive Science 12, 2 (1988), 257–285.
of Computer Programming 156 (2018), 68–89. [75] Mary Tate, Joerg Evermann, and Guy Gable. 2015. An integrated framework for
[58] Paul Ralph. 2018. Toward methodological guidelines for process theories and theories of individual attitudes toward technology. Information & Management
taxonomies in software engineering. IEEE Transactions on Software Engineering 52, 6 (2015), 710–727.
45, 7 (2018), 712–735. [76] Nigel Thomas. 2014. Dual coding and common coding theories of memory. In
[59] Paul Ralph, Mike Chiasson, and Helen Kelley. 2016. Social theory for software Stanford Encyclopedia of Philosophy. Stanford University Press. https://plato.
engineering research. In Proceedings of the International Conference on Evaluation stanford.edu/entries/mental-imagery/theories-memory.html
and Assessment in Software Engineering. ACM, Limerick, Ireland. https://doi.org/ [77] Stephen E Toulmin. 2003. The Uses of Argument. Cambridge University Press.
10.1145/2915970.2915998 [78] Andrew H. Van de Ven. 2007. Engaged Scholarship: A Guide for Organizational
[60] Paul Ralph, Gregor Engels, Ivar Jacobson, and Michael Goedicke. 2015. 4th and Social Research. Oxford University Press, Oxford, UK.
SEMAT workshop on general theory of software engineering (GTSE 2015). (May [79] Viswanath Venkatesh, Michael G Morris, Gordon B Davis, and Fred D Davis.
2015), 983–984. 2003. User acceptance of information technology: Toward a unified view. MIS
[61] Paul Ralph, Iaakov Exman, Pan-Wei Ng, Pontus Johnson, Michael Goedicke, Quarterly 27, 3 (2003), 425–478.
Alper Tolga Kocata, and Kate Liu Yan. 2014. How to develop a general theory of [80] Viswanath Venkatesh, Michael G. Morris, Gordon B. Davis, and Fred D. Davis.
software engineering: Report on the GTSE 2014 workshop. SIGSOFT Software 2003. User acceptance of information technology: Toward a unified view. MIS
Engineering Notes 39, 6 (Dec. 2014), 23–25. https://doi.org/10.1145/2674632. Quarterly 27, 3 (2003), 425–478.
2674647 [81] Iris Vessey. 1991. Cognitive fit: A theory-based analysis of the graphs versus
[62] Paul Ralph, Pontus Johnson, and Howell Jordan. 2013. Report on the first SEMAT tables literature. Decision Sciences 22, 2 (1991), 219–240.
workshop on a general theory of software engineering (GTSE 2012). SIGSOFT [82] Claes Wohlin, Per Runeson, Martin Höst, Magnus C Ohlsson, Björn Regnell, and
Software Engineering Notes 38, 2 (2013), 26–28. Anders Wesslén. 2012. Experimentation in Software Engineering. Springer Science
[63] Paul Ralph and Ewan Tempero. 2018. Construct validity in software engineering & Business Media.
research and software metrics. In Proceedings of the 22nd International Conference [83] He Zhang, Xin Huang, Xin Zhou, Huang Huang, and Muhammad Ali Babar. 2019.
on Evaluation and Assessment in Software Engineering. ACM Press, New York, Ethnographic research in software engineering: a critical review and checklist. In
New York, USA, 13–23. https://doi.org/10.1145/3210459.3210461 Proceedings of the 2019 27th ACM Joint Meeting on European Software Engineering
[64] James Reason. 1990. Human Error. Cambridge University Press. Conference and Symposium on the Foundations of Software Engineering. 659–670.
2005