Using Havel Hakimi To Graph Classroom Networks
Using Havel Hakimi To Graph Classroom Networks
Bachelor of Science in Applied Mathematics from Middle Georgia State College. Currently a graduate
student in the Department of Mathematical Sciences at Georgia Southern University with a concentration
in Applied Mathematics and research in Graph Theory.
Page 26.1666.1
Gathered Information
The data was gathered from engineering students attending a research university
in the southeast. Students from two classrooms participated in the survey: a smaller
upper level course (N = 12) and a large lower level course (N = 52). Over 80 percent of
the students in each course completed the survey. Course titles and numbers are held
back to protect the identities of the instructors. Part of the survey asked students to
respond to questions about the type of instruction the professor uses and their levels
of collaboration and cognition (defined below).
Students in the large lower level course reported hearing lectures over 70 percent
of the time. Students in the smaller upper level course reported hearing lectures
roughly 40 percent of the time. After answering questions about their classroom, the
students were asked to report the number of students they typically work with during
Page 26.1666.2
1
from 0 to 5 (n = 55). We gathered up the numbers for all student respondents and
treated them as a degree sequence.
In graph theory, an area of discrete mathematics focused on the study of graphs,
a degree sequence is defined as follows:
Definition. For a graph G with vertices {v1 , · · · , vn } with degrees di = deg(vi ). De-
fine the degree sequence of G to be the non-increasing sequence {di1 , · · · , din }.
For example, Figure 1 shows a graph whose degrees sequence is [2,2,2,1,1]. The
graph contains 5 vertices. The number of edges, |E|, can be calculated,
X
deg(vi ) = 2|E|.
Classroom Networks
A degree sequence for a classroom is a list individual students and the number of
peers with which a student studied or collaborated to determine the degree. Consider
the degree sequence {n11 , n12 , · · · , n1i , n21 , n22 , · · · , n2j , nk , · · · , nkl }, where npx and
npy represents students who have the same ingroup number. Theoretically, all class-
room networks are graphical if a perfect representation of the network is attained,
but incomplete classroom network data may not be graphical by Havel-Hakimi re-
quirements. This can be caused by students who have reported the wrong number
of ingroup peers or by not having all students in the class take the survey, or some
Page 26.1666.3
2
Thus, although we had large samples from each of the test courses (i.e. greater
than 80 percent), we needed to examine the trustworthiness of the data. We began
by examining the degrees sequence data to determine if any outliers existed in the
data (i.e. students who reported questionable numbers of partners). No outliers were
found in the data. Next, we used the Havel-Hakimi Theorem to determine if the
degree sequences were graphical. Raw data from both classrooms proved graphical
with no modification. The Havel-Hakimi algorithm we used to determine if a simple
graph existed for the degree sequence data was based on the following theorem:
Theorem (Havel (1955) Hakimi (1962)). Let S = (d1 , · · · , dn ) be a finite list of
nonnegative integers that is nonincreasing. List S is graphic if and only if the finite
list S 0 = (d2 − 1, d3 − 1, · · · , dd1 +1 − 1, dd1 +2 , · · · dn ) has nonnegative integers and is
graphic.
The algorithm then follows a recursive method where the theorem is used with
S = S 0 until S 0 = {0}.
3
Additionally, Gephi has the capability of analyzing network structures and produc-
ing statistics that describe individual nodes and network characteristics. We choose
to examine the number of modularity classes in the networks, and measures of close-
ness centrality and betweenness centrality for the individual nodes. The modularity
algorithm identifies the number of likely ”communities” in a graph, in this case the
number of working groups. Betweeness centrality measures how often a node appears
on shortest paths between nodes in the network. Closeness Centrality measures the
avergage distance from a given starting node to all other nodes in the network. In
this case, both are measures of information sharing among classmates.
Results
Our inspection of the graph layout provided face validity evidence for the network
structure of the classroom. The small, upper division course formed a network with
four modularity classes, or communities of students. The large lower division course,
on the other hand, formed a network with 36 modularity classes, 32 of which were
comprised of single students (students with zero degrees were treated as single mod-
ularity classes). See figure 2 for the graphs that were generated from the data, with
modularity classes represented in gray scale. Thickness of edge weight represents edge
frequency from random simulation.
Student closeness scores were significantly and positively correlated with self re-
ported collaboration (r = .33; p < .01, n = 55) and cognition (r = .20; p <
.05; n = 55). Student betweenness measures were significantly and positively corre-
lated with self reported collaboration (r = .27; p < .01, n = 55) but not cognition
(r = .15; p = .08; n = 55). Collaboration was assessed with a self reported Likert
response (7 point agree or disagree scale) to the item, ”My classmates and I actively
worked together to complete assignments.” Cognition was assessed using a self re-
port Likert response (7 point agree or disagree scale) to the item, ”I thought about
different approaches or strategies I could use for studying the assignments.”
Figure 2: Generated graphs for small course and large course respectively
Page 26.1666.5
Discussion
4
Classroom network analysis may be a useful way of examining collaboration in
engineering education classrooms. Network layouts and measures such as closeness
centrality and betweenness centrality statistics have the potential to provide impor-
tant information about impact of instruction at the group level of analysis. Our pilot
comparison suggested that in the large lecture course we surveyed, a tight network
– with the same number of modulartity classes as the small upper division course –
emerged in the classroom. The face validity of these networks was supported with
our statistical evidence that students embedded in these networks were more likely
to collaborate (evidence of concurrent validity) and strategically learn (evidence of
predictive validity).
These findings are impactful because of the large number of students who were
not embedded in the classroom network. Our preliminary evidence suggests these
students may be less likely to consider different approaches or strategies for learning,
though some of these students may be high performing and prefer not to collaborate.
Perhaps encouraging networks with multiple diads or triads, or collections of distinct
modularity classes, in larger courses will engage a larger percentage of the students
who might benefit from collaboration. It is not clear what types of instructional
strategies might be related to different network structures, or how much variation
in student learning classroom networks might predict, but these small scale findings
suggest it might be worth examining further using survey research.
Our findings also suggest that expanding the Havel Hakimi approach to include
an edge swapping technique is an effective method for generating random graphs from
student degree sequences. One potential limitation here, that we continue to explore,
is variation in isomorphic class. Our edge swapping technique creates a random graph,
choosen by statistical probability. Thus, the isomorphic class of the best fitting graph
is chosen at random, and it is not clear if small differences in isomorphic classes have
theoretical implications for classroom instruction. Variation in isomorphic classes for
large graphs continues to be a challenge for mathematicians, and we plan to continue
to run monte carlo studies to examine the problem from a educational, theoretical
perspective.
Network analysis has provided new insights into a wide variety of social phenom-
ena, particularly with regard to understanding the underlying structures of complex
systems that produce emergent outcomes. Classrooms have been described in the
extant literature as complex adaptive systems (Sawyer, 2004), and the emergent na-
ture of educational environs has been established (Davis and Sumara, 2006). The
results of our pilot study, presented here, suggest there may be value in systemati-
cally studying classrooms using network analysis to examine the relationship between
1) instructional categories and network structure, and 2) network and structure and
student cognition for mediational relationships.
References
Borrego, M., Karlin, J., McNair, L. D., and Beddoes, K. (2013). Team effec-
Page 26.1666.6
5
student project teams: A research review. Journal of Engineering Education. 102(4),
472-512.
Davis, B. and Sumara, D. (2006). Complexity and Education: Inquires Into
Learning, Teaching, and Research. Lawrence Erlbaum and Associates, Mahwah, NJ.
Gross, J. and Yellen, J. (2006). Graph Theory and its Applications. Taylor and
Francis. Boca Raton, Fl.
Hakimi, S. L. (1962). On realizability of a set of integers as degrees of the vertices
of a linear graph. Journal of the Society for Industrial and Applied Mathematics 10:
496506, MR 0148049.
Havel, V. (1955) A remark on the existence of finite graphs. asopis pro pstovn
matematiky (in Czech) 80: 477480.
Sawyer, R. K. (2001). Emergence in Sociology: Contemporary Philosophy of Mind
and Some Implications for Sociological Theory American Journal of Sociology, Vol.
107, No. 3, pp. 551-585.
Sawyer, R. K. (2004). Improvised Lessons: Collaborative Discussion in the Con-
structivist Classroom. Teaching Education. 15(2), 189-201.
Watts, D. J. (1999). Small Worlds: The Dynamics of Networks Between Order
and Randomness. Princeton University Press.
Page 26.1666.7