This document summarizes Pablo Puñal Pereira's doctoral thesis which proposes an efficient Internet of Things (IoT) framework for industrial applications. The framework includes features for secure communication, authentication, fine-grained access control, zero-configuration networking, and run-time reconfiguration. Pereira tests the framework on two industrial case studies - mobile machinery monitoring and smart rock bolts. Experimental results validate the feasibility and energy efficiency of the proposed battery-operated IoT concept for industrial systems of systems. The results also identify the most critical areas for performance improvement.

DOCTORAL THESIS

Department of Computer Science, Electrical and Space Engineering

Division of EISLAB

Efficient IoT Framework for Industrial Applications

Pablo Puñal Pereira

Industrial Electronics

ISSN 1402-1544
ISBN 978-91-7583-665-2 (print)
ISBN 978-91-7583-666-9 (pdf)

Luleå University of Technology 2016

Pablo Puñal Pereira

EISLAB
Luleå University of Technology
Luleå, Sweden

Supervisors:
Jens Eliasson and Jerker Delsing
Printed by Luleå University of Technology, Graphic Production 2016

www.ltu.se
To my family

Abstract

The use of low-power wireless sensors and actuators with networking support in in-
dustry has increased over the past decade. New generations of microcontrollers, new
hardware for communication, and the use of standardized protocols such as the Internet
Protocol have resulted in more possibilities for interoperability than ever before. This in-
creasing interoperability allows sensors and actuator nodes to exchange information with
large numbers of peers, which is beneficial for creating advanced, flexible and reusable
systems.
The increase in interoperability has resulted in an increase in the number of possible
attacks from malicious devices or users. For this reason, the use of encryption techniques
to protect client and server communications has become mandatory. However, even with
state-of-the-art encryption mechanisms, there is no protection that can control access
to each particular service with fine-grained precision. The nodes within an industrial
network of wireless sensors and actuators are resource-constrained embedded devices,
and increasing interoperability therefore requires a higher level of computation capabil-
ities. The nodes’ intrinsic limitations of memory and processing exert an adverse effect
on power consumption and communication delays, resulting in a shorter battery life-
time. Therefore, the standard computing solutions for Internet communications are not
directly applicable, and new mechanisms to achieve security, scalability, dependability,
interoperability and energy efficiency are needed.
Sensor and actuator networks can transmit sensed data, but they also offer access
to the actuators. Such accesses, presumably provided via services, require an access
protection scheme. For this reason, the use of access control mechanisms is mandatory.
Access control assists in the creation of customized services and access policies. These
access policies can isolate access permissions to devices with different roles, such as
production and maintenance.
The main contribution of this thesis is a novel, efficient IoT framework for industrial
applications, including design, implementation, and experimental validation. The frame-
work includes features for communication protection, authentication, fine-grained access
control, zero-configuration networking, and run-time reconfiguration. These technologies
and their corresponding energy consumption data clearly demonstrate the feasibility of
integrating a battery-operated IoT concept into a functional System of Systems. The
provided data also pinpoint the most critical areas for improvement.

Contents
Part I
Chapter 1 – Introduction
1.1 Problem formulation
1.2 Methodology
1.3 Thesis scope
1.4 Thesis outline
Chapter 2 – Internet of Things
2.1 Historical (r)evolution
2.1.1 Software
2.1.2 Hardware
2.2 Wireless Sensor and Actuator Networks
2.3 Constrained Application Protocol
2.4 Service Oriented Architecture (SOA)
Chapter 3 – Security
3.1 Secure communications
3.1.1 Standard end-to-end security mechanisms
3.1.2 Access control analysis
3.2 Access control
3.2.1 Standard solutions
3.2.2 Ticket-based access control
3.2.3 Alternatives under development
Chapter 4 – Efficient Industrial IoT Framework
4.1 Network architecture
4.2 Services
4.2.1 Bootstrapping
4.2.2 Configuration
4.2.3 Device management
4.2.4 Authentication and authorization
4.3 Case studies
4.3.1 Mobile machinery monitoring
4.3.2 Smart rock bolts
4.4 Experiments and results
4.4.1 Test setup
4.4.2 Results
4.4.3 Summary
Chapter 5 – Contributions
Chapter 6 – Discussion
6.1 Conclusions
6.2 Future work
References

Part II
Paper A
1 Background and Related work
2 Architecture
3 Performed experiments
4 Results
5 Future work
6 Conclusion
7 Acknowledgment
Paper B
1 Introduction
2 Background
3 EXI Processor Design and Implementation
4 EXI data binding
5 CoAP/EXI/XHTML Web page engine
6 Conclusions
2A Acknowledges
Paper C
1 Introduction
2 Background and Related work
3 Framework
4 Authentication Process
5 Security Analysis
6 Experiments and results
7 Future work
8 Conclusion
9 Acknowledgment
Paper D
1 Introduction
2 Background and Related Work
3 Problem Definition
4 Proposed Solution
5 Application Scenario
6 Implementation and Results
7 Conclusion
8 Future Work
9 Acknowledgment
Paper E
1 Introduction
2 Related work
3 Proposed approach
4 Use cases and evaluation
5 Discussion
6 Conclusion
7 Future work
8 Acknowledgment
Paper F
1 Introduction
2 Background and Related work
3 Network infrastructure
4 System Architecture
5 Results
6 Discussion
7 Future work
8 Conclusion
9 Acknowledgment
Paper G
1 Introduction
2 Background and Related work
3 Proposed Industrial IoT framework
4 Test and Results
5 Discussion
6 Future work
7 Conclusions
8 Acknowledgment

Acknowledgments

This thesis is the result of more than four years of continuous research, development, and
learning. During this time I met many people who helped me become the researcher I am
now; without their advice this thesis would not have been possible. I would like to extend
my gratitude to my supervisors, Associate Professor Jens Eliasson and Professor Jerker
Delsing, who trusted me with this Ph.D. position and invested time and effort in
guiding me to stay on the right track. I would also like to express my gratitude to my mentor,
Dr. Rumen Kyusakov, for his unconditional help, collaboration, and discussions along
this long road.
I would like to thank all my colleagues at LTU because, in one way or another, they
contributed to this work by providing a comfortable working environment. Special thanks go to Dr.
Miguel Castaño for his valuable guidance during my first year, to Dr. Blerim Emruli for
all the time that we spent teaching together and for our coding discussions, to Miguel Gómez,
Lara Lorna, and Emilio Rodríguez for their support and points of view, and to Dr. Arash
Mousavi for sharing his experiences and perspectives on life.
This thesis is the last step of a long educational road, which started in 2002. It was
a tough road that has come to an end today. During these fourteen years, I have received
support from many people, but especially from Professor Julio Martos, Martín Piazzon,
Iván Leiva, Dr. Georgy Kornakov and Marisol Robles.
I would also like to thank all the Arrowhead partners that supported this thesis,
with particular mention of Per-Erik Larsson.
A challenge is always a fruitful source of knowledge and motivation; for this reason, I
need to mention the Smart Rock Bolt team (Jens, Henrik, Mikael, Claudia, Joakim, and
Hasan). Thanks for all the time and support, especially during the stressful moments.
The last part of this thesis was done remotely. Therefore, I need to say thanks
to Ulf Bodin, Jerker Delsing, and Jens Eliasson for trusting me. Also, I would like to
thank Artemis, Arrowhead, EMC², and CASTT for funding, and thereby making my
Ph.D. possible.
Last but not least, I would like to thank my family for all the time together,
even when thousands of miles away, with special mention of my wife, Dr. Paloma
Díaz Fernández, who kept me alive after I spent weekends, nights, and vacations working.
Thanks for all your time, discussions, suggestions, guidance, patience, and help.

Luleå, September 2016

Pablo Puñal Pereira

Part I

Chapter 1

Introduction

“Never send a human to do a machine’s job.”

- Agent Smith

Analyzing the physical environment is something that humanity has been doing for
thousands of years, including measuring distance, time, temperature, etc. At first, rudi-
mentary methods based on references such as the sizes of body parts or the positions of
the Sun were used. However, with the standardization of measurement units, the first me-
chanical systems capable of measuring certain physical variables began to appear; these
were the first sensors. At present, humanity is in what is known as the Silicon Age, and
thanks to the electronic revolution, we can measure almost any physical variable using electronic
sensors. In the 1950s, the United States Navy introduced the capability of communication
with a group of sensors as part of the Sound Surveillance System (SOSUS) project, as
described by Silverstein [1], which was a network of submerged microphones (hydrophones)
for detecting Soviet submarines in the Atlantic and Pacific Oceans. Thirty years later,
in 1980, the United States Defense Advanced Research Projects Agency (DARPA), an
agency of the United States Department of Defense, developed the Distributed Sensor Network
(DSN), as described by Chong et al. in [2]. The DSN project explored the implementation
of distributed wireless sensor networks and was the predecessor to the Wireless Sensor
Network (WSN).
An actuator is an artifact that is able to modify a physical variable; examples include
LEDs, motors, heaters, and valves. The incorporation of actuators and sophisticated mechanisms
into a WSN turns it into a Wireless Sensor and Actuator Network (WSAN). Today, with
the use of the Internet Protocol (IP), each node in a WSAN can be transformed into
an Internet of Things (IoT) device. IoT technology maximizes interoperability, which
makes it possible to connect any device at any time to another device anywhere
in the world. IoT technology for computers and big data centers is not new, but when the
area of application is a WSAN, the scope of the
problem changes. The definition of an IoT device used in this thesis is as follows: “An
IoT device is a resource-constrained embedded system with the capability to perform
a number of well-defined tasks, such as sensing, signal processing, and networking. It
usually has wireless communication capabilities and is powered by batteries.” Therefore,
according to this definition, an IoT device must be energy efficient.
Currently, the IoT concept is applied not only for industrial usage but also in many
examples of domestic applications, such as smartwatches and GPS-based pet trackers.
This thesis focuses on the growing area of the Industrial IoT (IIoT), which has many
potential applications; however, the complexity of and the requirements for industrial
applications are greater than in the case of domestic applications. Therefore, research in
this field requires deeper knowledge and the development of more sophisticated technology
to meet these requirements. The IIoT requires communication among hundreds
of devices on the same wireless network, creating issues of scalability, and the trans-
ferred data require a higher level of security to prevent data leaks and data injection.
Interoperability requires appropriate control of the access to IoT devices. Therefore, a
fine-grained access control mechanism is needed. Other requirements include robustness
and stability.
The use of the IoT concept for industrial applications increases the complexity of the
problem, and at first glance, the efficiency of this approach may be questionable. The
goal of this thesis is to design and analyze an efficient framework for the Industrial IoT,
providing a state-of-the-art approach for industrial applications.

1.1 Problem formulation

The use of IoT technologies in industrial Wireless Sensor and Actuator Networks enables
a high level of interoperability, perhaps the highest possible level if the network is
connected to the Internet. According to statistics from Cisco [3] and Gartner [4], there
are approximately 6 billion connected IoT devices in the world today (2016). Therefore,
this level of interoperability enhances the ultimate utility of a WSAN; the possibility of
consuming data produced by a sensor using a mobile phone on the other side of the world
was unthinkable only a few years ago. The IoT for industrial applications offers great
potential for research over the coming decades, as shown by Khan et al. [5].
However, IoT devices are subject to resource constraints regarding their memory and
processing capabilities. Hence, the use of standard protocols leads to increased overheads
for delay and energy consumption, and in some cases, these overheads are a technological
barrier. Delays can break communications connectivity, and energy overheads can dras-
tically reduce battery life, making a given application impossible in a realistic scenario.
This problematic situation gives rise to the first research question addressed in this thesis:

1. Is it feasible to use IoT-SOA technology in WSANs for industrial applications?

The answer to this question is not simple, and it encompasses two further questions:

1.1. What are the benefits of adding IoT technology to industrial WSANs?
1.2. Is it possible to increase interoperability while mitigating performance impact?

Interoperability is an obvious benefit, but an increase in the number of possible
connections also increases the number of possible malicious users. Thus, the second
research question arises:

2. How can access to exposed IoT nodes be protected and controlled while maintaining
performance?

The goal is to analyze the feasibility of using IoT technology in industrial applications,
in which the need to manually configure each device should be avoided. This
requirement leads to the third research question:

3. How can zero-configuration operation be achieved for an IoT node?

Answering all of these questions requires a complete analysis of time and energy consumption
in industrial WSANs as well as a study of the communication overheads and
memory footprints. The answers to questions two and three can be obtained through
two independent analyses, but answering question one requires a more detailed analysis
of a complete IoT framework. Such an analysis should highlight all factors that create
larger overheads in the framework, either to solve them directly or to recommend them
as topics for future work.

1.2 Methodology

[Figure 1.1 is a cycle diagram; labels recoverable from the extracted text: Premises, Hypothesis, Experiment, Evaluation, Synthesis.]

Figure 1.1: Research methodology

Experimental science is “a science that requires the use of tests or prototypes under
controlled conditions to demonstrate a known truth, examine the validity of a hypothesis,
or determine the efficacy of something previously untried”. In 1991, at the Workshop
on Research in Experimental Computer Science (Palo Alto, California), Bob Taylor
presented the following principle for a good experimental study: “you should build what
you design and use what you build, as only through the extensive use of an artifact do
you truly understand the implications of your work” [6].
The methodology used in the work described in this thesis is based on the iterative
process illustrated in Figure 1.1. This iterative process begins with a real-world problem
to be solved, for which a set of premises is provided that enables the formulation of the
initial research questions (research question 1 in this thesis). After a few iterations, with
deeper knowledge of the problem, additional research questions can emerge (research
questions 1.1, 1.2, 2 and 3). Iteration continues until the evaluation step is successful.

1.3 Thesis scope

The Internet of Things is a research area that has undergone two decades of constant
evolution. It is, therefore, a broad area of study that involves a combination of many disci-
plines, such as wireless communications, networking protocols, machine learning, sensors,
actuators, hardware design, information security, cloud computing, and big data. The
multi-disciplinary nature of IoT requires collaborative efforts from people with different
backgrounds; such collaboration also contributed to the work described in this thesis,
which is the fruitful result of a collaboration with many other researchers and industry
partners.
This thesis focuses on a feasibility study of IoT technologies for industrial applica-
tions. In greater detail, the thesis investigates, proposes, and analyzes an efficient IoT
framework that enables the use of cutting-edge IoT technology for industrial applica-
tions, thereby updating the previous prevailing design for industrial Wireless Sensor and
Actuator Networks. The aspects of the research that are focused on the application layer
also overlap with other research areas that are not considered in this thesis, such as
encryption and link-layer protocols.
This thesis represents improvements in the IoT field in aspects such as scalability,
dependability, security, interoperability, and energy efficiency. This thesis offers two
relevant contributions. The first involves research on novel mechanisms for the control of
access to IoT resource-constrained devices, resulting in the proposal of an energy-efficient
access control scheme that enables fine-grained access control for CoAP-based networks.
The second relevant contribution involves research and analysis on all of the mechanisms
involved in the efficiency of an IoT network with regard to energy and delays.

1.4 Thesis outline

This is a compilation thesis that consists of two parts. Part I serves as an introduction
to the research area, research methodology, and research questions. It also includes
descriptions of the solutions proposed to answer the research questions, the experimental
evaluation of the proposed solutions, and a discussion of and conclusions obtained from
the experimental results. Part II consists of four peer-reviewed papers that have been
published in the proceedings of various conferences, one published journal paper, and two
submitted journal papers. All articles fall under the umbrella of the research performed
as part of this thesis work; they have been reformatted to follow the thesis layout, but
their contents have not been modified.
The remaining chapters in Part I are organized as follows. Chapter 2 offers a brief
introduction to the Internet of Things and its historical evolution up through its integra-
tion with Wireless Sensor and Actuator Networks. It discusses the benefits and provides
examples of the application of this technology; it also describes the new issues that must
be addressed for IoT technology to be applicable in industry. Chapters 3 and 4 attempt
to solve these problems to make IoT technology feasible for industrial application, fo-
cusing on issues of security and efficiency, respectively. Chapter 5 describes the research
contributions of this thesis and explains the evolution of how the research questions were
addressed during the thesis work. Chapter 6 summarizes the results presented in this
thesis, answers the research questions, and describes newly identified issues and directions
for future studies.
Chapter 2
Internet of Things

Defining the concept of the ‘Internet of Things’ is a difficult task, considering that
this concept varies from one research area to another. The IEEE Internet of Things group
compiled definitions from various Internet associations and research groups in the publi-
cation “Towards a definition of the Internet of Things.” [7]. The following are the most
relevant definitions for this thesis:

“The basic idea is that IoT will connect objects around us (electronic, elec-
trical, non-electrical) to provide seamless communication and contextual ser-
vices provided by them. Development of RFID tags, sensors, actuators, mo-
bile phones make it possible to materialize IoT which interact and co-operate
each other to make the service better and accessible anytime, from anywhere.”

– Internet Engineering Task Force (IETF), 2010

“A network of items—each embedded with sensors—which are connected to
the Internet.”

– Institute of Electrical and Electronics Engineers (IEEE), 2014

“The Internet of Things refers to the unique identification and ‘Internetization’
of everyday objects. This allows for human interaction and control of
these ‘things’ from anywhere in the world, as well as device-to-device interaction
without the need for human involvement.”

– HP, 2014

In this thesis, the following definition of the IoT concept is adopted: “An IoT device
is a resource-constrained embedded system with the capability to perform a number of
well-defined tasks, such as sensing, signal processing, and networking. It usually has
wireless communication capabilities and is powered by batteries.”

The IoT concept, by definition, changes with the evolution of hardware and software
(as do many other concepts related to electronics and/or computation). For this reason,
this chapter provides a brief introduction to the historical development of IoT devices.
It also presents a description of Wireless Sensor Networks and an overview of possible
application areas.

2.1 Historical (r)evolution

Taking the creation of the Internet Protocol as the starting point of the evolution of
the IoT concept, this section reviews the evolution of the technologies involved in IoT
hardware and software up to the present time (2016).
The Internet of Things has been an active research topic, and a number of projects, such as
SOCRADES, IoT-A, SODA, and IMC-AESOP, have addressed more industrial
uses of the IoT.

2.1.1 Software
The IoT concept involves several software components, but some of the most important
progress has been made in application and link-layer protocols and in operating systems
(OSs). This section presents an overview of a few of the most relevant advances.

Application protocols
In the OSI model, the application layer is the abstraction layer responsible for interfacing
between communications and the application running on the host. The following is a
chronological list of the most representative application protocols for IoT technology:

1996 RESTful HTTP The first acknowledged IoT protocol was the Hypertext Transfer
Protocol [8], which is a Request/Response protocol for a client-server model and
is mainly used to deploy web-based services. The transport layer used is TCP.
The general usage of XML makes it overly complex and inefficient for low-power
purposes. The recent changes made in HTTP/2 [9] enable header compression to
improve the performance of the HTTP protocol, but it is still not suitably efficient
for a resource-constrained device.

1999 MQTT IBM created the MQ Telemetry Transport protocol based on a client-broker-server
architecture, with two types of communication procedures: Request/Response,
as in HTTP, and Publish/Subscribe. This protocol is more efficient than
HTTP but still uses TCP as the transport layer.

1999 Jabber An open-source community developed this protocol for real-time instant
messaging (IM). Communication is based on XML, and similarly to MQTT, it
supports both Publish/Subscribe and Request/Response communications over a
client-server model. This protocol also uses TCP as the transport layer.

2004 XMPP The IETF decided to modify the Jabber project by adding TLS for com-
munication encryption and SASL for authentication, renaming the protocol to the
Extensible Messaging and Presence Protocol (XMPP) [10].
2007 MQTT-SN IBM created a new, more efficient UDP-based version of MQTT
named MQTT for Sensor Networks (MQTT-SN) [11].
2011 WebSockets This protocol was designed to improve communications between web
browsers and web servers, but it can also be used as an independent client-server
application protocol. It also uses TCP as the transport layer [12].
2014 CoAP The Constrained Application Protocol [13] was created to optimize the
efficiency of communications in Wireless Sensor Networks. This RESTful
protocol allows services (resources) to be deployed directly on the network nodes.
Based on a client-server model, it supports the Request/Response and Observe
methods. In contrast with the previous protocols, CoAP was designed to use UDP
as the transport layer.
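
To make the compactness of CoAP concrete, the following is a minimal sketch of how a confirmable GET request could be encoded by hand, assuming the fixed four-byte header and the short option format of the CoAP specification. The function name `encode_coap_get`, the resource path `/temp`, and the message ID are hypothetical and chosen only for illustration; this is not the implementation used in the thesis.

```python
import struct

COAP_VERSION = 1     # protocol version carried in the top two bits
TYPE_CON = 0         # Confirmable message type
CODE_GET = 0x01      # method code 0.01 (GET)
OPT_URI_PATH = 11    # option number of Uri-Path


def encode_coap_get(path: str, message_id: int, token: bytes = b"") -> bytes:
    """Encode a confirmable GET request (hypothetical helper for illustration).

    Only the short option format is covered: each path segment must be
    at most 12 bytes long.
    """
    if len(token) > 8:
        raise ValueError("CoAP tokens are at most 8 bytes long")
    # Byte 0: version (2 bits) | type (2 bits) | token length (4 bits)
    first = (COAP_VERSION << 6) | (TYPE_CON << 4) | len(token)
    message = bytearray(struct.pack("!BBH", first, CODE_GET, message_id))
    message += token
    previous_option = 0
    for segment in filter(None, path.split("/")):
        data = segment.encode("utf-8")
        delta = OPT_URI_PATH - previous_option
        if delta > 12 or len(data) > 12:
            raise ValueError("extended option encoding is outside this sketch")
        # Option header: delta (4 bits) | length (4 bits), then the value
        message.append((delta << 4) | len(data))
        message += data
        previous_option = OPT_URI_PATH
    return bytes(message)


request = encode_coap_get("/temp", message_id=0x1234)
print(request.hex(" "), f"({len(request)} bytes)")
```

Under these assumptions the whole request fits in a single small UDP datagram, whereas a plain-text HTTP request line and headers for the same resource would typically be several times larger before TCP overhead is counted.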

Link-layer protocols
The use of wireless technologies such as Bluetooth and WiFi is quite common for the
creation of Wireless Local Area Networks and may be the best solution for mobile sensing
platforms because any smartphone can serve as an Internet gateway for both technologies.
The barrier hindering the use of both Bluetooth and WiFi is their power consumption. At
present, hardware implementations are available that can reduce this power consumption,
such as the CC3000 from Texas Instruments [14], which has a transmission consumption
of 936 mW and a reception consumption of 331 mW, or the ESP8266 from Adafruit [15],
which consumes 561 mW for transmission and 185 mW for reception. Wireless commu-
nications represent a large percentage of the total power consumption of an IoT device,
meaning that its battery life will directly depend on the selected wireless technology.
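
To make the dependence of battery life on the radio concrete, the following minimal sketch estimates the time-weighted average radio power and the resulting lifetime of a duty-cycled node. The transmission and reception figures are the ones quoted above; the duty-cycle fractions, the sleep power, and the battery energy are hypothetical values chosen only for illustration, and the model ignores the microcontroller and the sensors.

```python
# Illustrative estimate only. The duty cycle, sleep power and battery energy
# below are assumed values, not measurements from this thesis.

def average_power_mw(tx_mw, rx_mw, sleep_mw, tx_fraction, rx_fraction):
    """Time-weighted average radio power in mW."""
    sleep_fraction = 1.0 - tx_fraction - rx_fraction
    if sleep_fraction < 0:
        raise ValueError("duty-cycle fractions must sum to at most 1")
    return tx_mw * tx_fraction + rx_mw * rx_fraction + sleep_mw * sleep_fraction


def battery_life_hours(battery_mwh, avg_power_mw):
    """Lifetime in hours for a battery holding battery_mwh of usable energy."""
    return battery_mwh / avg_power_mw


# (transmission mW, reception mW) as quoted in the text.
RADIOS = {"CC3000": (936.0, 331.0), "ESP8266": (561.0, 185.0)}

ASSUMED_SLEEP_MW = 0.1        # hypothetical sleep power
ASSUMED_TX_FRACTION = 0.001   # hypothetical: 0.1% of the time transmitting
ASSUMED_RX_FRACTION = 0.001   # hypothetical: 0.1% of the time receiving
ASSUMED_BATTERY_MWH = 7500.0  # hypothetical usable battery energy

if __name__ == "__main__":
    for name, (tx_mw, rx_mw) in RADIOS.items():
        avg = average_power_mw(tx_mw, rx_mw, ASSUMED_SLEEP_MW,
                               ASSUMED_TX_FRACTION, ASSUMED_RX_FRACTION)
        hours = battery_life_hours(ASSUMED_BATTERY_MWH, avg)
        print(f"{name}: {avg:.2f} mW average, about {hours / 24:.0f} days")
```

Under these assumptions the average power is dominated by the fraction of time the radio is active, which is why the choice of wireless technology and its duty cycle largely determine the battery life.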
In 2007, the IETF created 6LoWPAN, a link-layer protocol with encapsulation and
header compression, to allow the use of IPv6 networks over IEEE 802.15.4 [16]. This
improvement was probably the greatest step forward for IoT implementation in WSNs,
enabling the creation of IP wireless networks for low-power devices (Section 4.4 presents
an empirical analysis of the power consumption for this protocol).

Operating systems
An IoT device can function without an operating system; it requires only a functional
IP stack. However, to create smart and sophisticated IoT devices, the usage of operating
systems is highly recommended. The following is a chronological list of the most common
open-source operating systems for resource-constrained devices:
2000 TinyOS [17] Not considered an OS in the traditional sense, it used to be defined as
a framework for embedded systems with a set of components to enable the deployment
of IoT applications, as shown by Levis et al. [18]. It is programmed in nesC
and is widely used for scheduled applications for extremely resource-constrained
devices. It requires less than 1 kB of RAM and less than 15 kB of ROM.

2002 FreeRTOS [19] Its name stands for Free Real-Time Operating System; this OS,
with only 3 C ﬁles, is extremely easy to port, read and maintain. FreeRTOS
provides a full set of tools for the creation of complex applications with multiple
threads, semaphores, and timers. It is supported for more than forty diﬀerent
microcontrollers. A simple application requires less than 2 kB of RAM and less
than 12 kB of ROM.

2003 Contiki [20] This OS allows multitasking with the use of protothreads, and it is
officially supported for more than fifty different microcontrollers. Contiki includes
a built-in full IP stack with UDP and TCP support, and it is able to use wireless
low-power communications by means of ContikiMAC and 6TiSCH. The OS includes
several applications for easily creating servers and clients. Contiki can run in Cooja,
a simulation environment, for the easy testing and debugging of applications and
communications. A simple application requires less than 2 kB of RAM and less
than 30 kB of ROM. The programming language is C. The feasibility of Contiki
was presented by Dunkels et al. [21].

2006 Embedded Linux [22] This OS is a lightweight version of the Linux kernel that
is intended for use on hardware with clear limitations; however, it is not suitable
for use on resource-constrained devices because it requires approximately 1 MB
of RAM and 1 MB of ROM, speciﬁcations that a low-power resource-constrained
device cannot meet. It can run any programming language, including Java and
Python. Moreover, it can use most of the available programs for desktop versions
of Linux.

2011 OpenWSN [23] It is not an OS by itself, but it must be included on this list.
OpenWSN is an open-source implementation that provides a complete protocol
stack based on IoT standards, supporting both UDP and TCP connections. It
runs on top of OpenOS, FreeRTOS, and RIOT. A simple application requires ap-
proximately 14 kB of RAM and 50 kB of ROM. The programming language is
C.

2013 RIOT [24] The newest OS on this list, it was designed to improve real-time oper-
ation, modularity, and multithreading. It focuses on the use of CoAP and CBOR,
thereby reducing memory usage and allowing simple applications to require less
than 2 kB of RAM and less than 6 kB of ROM. RIOT supports the C and C++
programming languages.

2.1.2 Hardware
Over the past decade, the explosion in the use of embedded devices for industrial pur-
poses and in many commercial products, such as mobile phones, smartwatches, and
miniaturized computers, has motivated the development of many diﬀerent types of mi-
crocontrollers, sensors, radio modules, systems-on-a-chip, etc. This section describes the
evolution of some of these hardware technologies.

Microcontrollers and microprocessors

The technological development of both microcontrollers and microprocessors is the same;
their evolution involves improving their computational power while expending less energy
and reducing their size. The diﬀerences between microprocessors and microcontrollers
are not clear-cut. Usually, a microprocessor is an integrated circuit that includes only a
processing unit, whereas a microcontroller also incorporates RAM and ROM memories
and many input/output interfaces.
IoT devices have historically used microcontrollers because their power consumption
is lower than that of microprocessors and their computing capabilities have been suﬃcient
for the intended applications. Companies such as Atmel, Microchip Technology, Texas
Instruments, ARM and Intel produce some of the most widely used microcontrollers for
IoT applications, namely, the AVR32, PIC32, MSP430, Cortex-M7, and Quark, respec-
tively. Figure 2.1 shows the historical and forecasted market for microcontrollers, which
shows a continuous increase over the coming years.

Figure 2.1: Evolution of demand for MCUs based on IC Insights

Recent improvements in energy consumption reduction have led to the introduction
of microprocessors with low power consumption and high performance, such as the Intel
Atom and the ARM Cortex-M73. More powerful devices are suitable for use in applications
in which the nodes require a high level of processing power, mitigating the overhead
for communications.

Wireless technologies
Communication in wireless environments requires more energy than in wired environ-
ments; in other words, the use of wireless communications increases the power consump-
tion of a system. This limitation has motivated extensive research in this area, leading to
the creation of several diﬀerent wireless communication technologies. The selection of one
of these technologies usually depends on the bandwidth, range and power consumption
requirements, of which power consumption is typically the most critical factor.

Sensors
Sensor technologies have been under constant development in recent years, and at present,
sensors are available for almost every conceivable purpose, such as temperature, proxim-
ity, acoustic, chemical, position, and optical measurements, among many others. How-
ever, the feasibility of these sensors for IoT applications depends on their power con-
sumption, and currently, some of the greatest improvements are primarily motivated by
smartphones. Smartphones are embedded platforms that require energy eﬃciency, and
they include many sensors, such as GPS units, microphones, accelerometers, gyroscopes,
and magnetometers. Moreover, with the emergence of smartwatches, this application
area is witnessing even greater expansion.

2.2 Wireless Sensor and Actuator Networks

Since the first Wireless Sensor Network was developed in 1950, a WSN has been under-
stood to consist of a group of embedded nodes with connected sensors that are able to
measure physical variables, perform data analysis, and communicate with a centralized
data collector, or server, for data transmission (see Figure 2.2). The benefit of this archi-
tecture is that the nodes do not require a high level of complexity to function; generally,
data are communicated from the nodes to the data collector. Implicit acknowledgment
mechanisms are not suitably energy efficient for battery-powered devices; the use of an
explicit acknowledgment mechanism instead can solve this problem, as shown by Blago-
jevic et al. [25] and Gonzalez et al. [26]. To simplify the tasks performed by the nodes,
WSNs are pulling-based systems, which means that the measurement and transmission
processes run periodically, usually with static timeouts.
The term Wireless Sensor and Actuator Network (WSAN) was born with the incor-
poration of actuators into industrial and domestic WSNs. However, the introduction
of actuators requires significant structural changes to a WSN. An actuator requires in-
formation about the action to be performed. Therefore, an actuator node needs to be
able to receive that information, and to implement this feature, the architecture must
be able to communicate in both directions (node to server and server to node). To increase
the benefits of using actuators, a WSAN node can use sensor information from
one or multiple nodes to specify actions for its actuator; thus, a WSAN also requires the
implementation of a Machine-to-Machine (M2M) communication capability.
The incorporation of the Internet Protocol into a WSAN turns each node into an IoT
device; however, according to some researchers, even without the Internet Protocol, a
WSAN node can be regarded as an IoT device.

Figure 2.2: Comparison between the WSN and WSAN architectures. (a) WSN architecture; (b) WSAN architecture.

2.3 Constrained Application Protocol

The IETF Constrained Application Protocol (CoAP) [13] is an application-layer proto-
col designed to provide web services that work with resource-constrained devices. It is
eﬃcient for devices with microcontrollers with small amounts of ROM and RAM and can
run over 6LoWPAN network stacks (see Figure 2.3) with high packet error rates. The
protocol is designed for low-power networking, allowing nodes to switch into sleep mode
to extend their battery life.
CoAP provides a Request/Response interaction model between application endpoints.
It supports the built-in discovery of services (resources) and includes key Web concepts
such as URIs [27], RESTful interactions [28], and extensible header options. CoAP can
easily interface with HTTP for integration with the Web while meeting specialized re-
quirements for constrained environments, such as multicast support, very low overhead
and simplicity. CoAP runs over UDP (see Figure 2.3), unlike HTTP, which uses TCP.
Currently, there are several groups of researchers porting CoAP to run over TCP, includ-
ing Bormann et al. [29].
Several relevant features of CoAP are as follows:

• Two types of request messages. A Confirmable Message (CON) is retransmitted
(a maximum of four times) with an exponentially increasing timeout while waiting for an
Acknowledgment Message (ACK) or the correct response from the server. By contrast,
a Non-confirmable Message (NON) is sent without requiring an acknowledgment.

• The URI format allows the use of both standard and specialized service endpoints.
One example is the resource discovery scheme defined in RFC 5785 [30], which uses
the /.well-known/core path and the CoRE Link Format.

• CoAP also allows the sending of large messages with a stop-and-wait mechanism
called “blockwise transfers”.

Figure 2.3: OSI model for a CoAP-based application (layers from top to bottom: Application, CoAP, UDP, 6LoWPAN, IEEE 802.15.4)
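
The retransmission behaviour of Confirmable messages described in the list above can be sketched as follows. This is a minimal illustration, not the implementation used in this thesis: the constants are the default transmission parameters of RFC 7252 (they are not stated in the text), and `send` and `wait_for_ack` are hypothetical callbacks standing in for the real network layer.

```python
import random

# Default transmission parameters from RFC 7252 (assumed here, not from the text).
ACK_TIMEOUT = 2.0         # seconds
ACK_RANDOM_FACTOR = 1.5
MAX_RETRANSMIT = 4        # "a maximum of four times"


def initial_timeout(rng=random):
    """Pick the first timeout randomly between ACK_TIMEOUT and 1.5 * ACK_TIMEOUT."""
    return rng.uniform(ACK_TIMEOUT, ACK_TIMEOUT * ACK_RANDOM_FACTOR)


def timeout_schedule(first_timeout, max_retransmit=MAX_RETRANSMIT):
    """Timeouts waited after the initial transmission and after each retransmission.

    The timeout doubles after every unanswered transmission.
    """
    return [first_timeout * (2 ** attempt) for attempt in range(max_retransmit + 1)]


def send_confirmable(send, wait_for_ack, first_timeout=None):
    """Send a CON message; return True if it was acknowledged, False otherwise.

    send() transmits the message once; wait_for_ack(timeout) returns True if an
    ACK (or the response) arrived within `timeout` seconds. Both are hypothetical.
    """
    if first_timeout is None:
        first_timeout = initial_timeout()
    for timeout in timeout_schedule(first_timeout):
        send()
        if wait_for_ack(timeout):
            return True
    return False  # gave up after 1 transmission + MAX_RETRANSMIT retransmissions
```

With the default parameters and a first timeout of 2 s, the node waits 2, 4, 8, 16 and 32 s in turn, which illustrates why a lost Confirmable message can keep a battery-powered node awake for a long time.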

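 0                   1                   2                   3
 0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
| V | T |  TKL  |     Code      |          Message ID           |
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
|   Token (if any, TKL bytes) ...
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
|   Options (if any) ...
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
|1 1 1 1 1 1 1 1|    Payload (if any) ...
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+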

Figure 2.4: CoAP packet format

The packet format of CoAP (see Figure 2.4) includes several parameters that are
relevant for the understanding of Chapter 3. These parameters are as follows:
• Version: Indicates the CoAP version number.

• Type: Indicates whether the message is of the Confirmable, Non-confirmable,
Acknowledgment or Reset type.

• Token length (TKL): Indicates the length of the variable-length Token field.

• Code: Indicates the Request Method or the Response Code.

• Message-ID: Unique ID used to detect duplicate messages.

• Option: Indicates the options declared in the message.

| Option Number | Policy [31] |
|---------------|-------------|
| 0...255 | IETF Review or IESG Approval |
| 256...2047 | Specification Required |
| 2048...64999 | Expert Review |
| 65000...65535 | Experimental use (no operational use) |

Table 2.1: CoAP option policy

The option field allows more information to be included in each CoAP communication,
such as a Max-Age for Observe, Uri-Host, or Content-Type. The value of the option
field can be empty, an opaque group of bytes, unsigned integers or strings. There are
many possible option numbers, as specified in the CoAP option policy (see Table 2.1).
Currently, fewer than twenty option numbers are standardized.
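
As an illustration of the fields listed above, the following minimal sketch decodes the fixed four-byte CoAP header and the Token, and maps an option number to its registration policy from Table 2.1. It assumes the header layout of Figure 2.4; the function names and the returned dictionary layout are hypothetical, and options and payload are deliberately not parsed.

```python
import struct

MESSAGE_TYPES = {0: "CON", 1: "NON", 2: "ACK", 3: "RST"}


def parse_coap_header(datagram):
    """Decode Version, Type, TKL, Code, Message ID and Token of a CoAP message."""
    if len(datagram) < 4:
        raise ValueError("a CoAP message has at least a 4-byte header")
    first, code, message_id = struct.unpack("!BBH", datagram[:4])
    token_length = first & 0x0F
    if token_length > 8:
        raise ValueError("token lengths 9-15 are reserved")
    if len(datagram) < 4 + token_length:
        raise ValueError("datagram shorter than header plus token")
    return {
        "version": first >> 6,                        # 2 bits
        "type": MESSAGE_TYPES[(first >> 4) & 0x03],   # 2 bits
        "token_length": token_length,                 # 4 bits
        # Code is written as class.detail, e.g. 0.01 for GET.
        "code": "%d.%02d" % (code >> 5, code & 0x1F),
        "message_id": message_id,                     # 16 bits, network byte order
        "token": datagram[4:4 + token_length],
    }


def option_policy(number):
    """Return the registration policy of Table 2.1 for an option number."""
    if not 0 <= number <= 65535:
        raise ValueError("option numbers are 16-bit values")
    if number <= 255:
        return "IETF Review or IESG Approval"
    if number <= 2047:
        return "Specification Required"
    if number <= 64999:
        return "Expert Review"
    return "Experimental use (no operational use)"
```

For example, a datagram starting with the bytes 0x44 0x01 0x12 0x34 is a version-1 Confirmable GET request with a four-byte token and Message ID 0x1234.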

Resources
CoAP allows each device to consume and provide resources. A resource is deﬁned as a
simple service that requires the resource provider to perform simple tasks, e.g., transmit
a sensor value or turn on an LED. This method of using CoAP resources is supplemented
by the IPSO Alliance and its Smart Object Guidelines [32], in which a resource’s URI is
used as a tag to identify the type, instance, and value of a sensor or actuator.
In this thesis, the terms resource and service have the same meaning given above. A
CoAP resource that requires processing or other external resources to provide a response
can be considered a service.

2.4 Service Oriented Architecture (SOA)

A Service Oriented Architecture (SOA) is a design architecture for the creation and
development of a system based on distributed subsystems. Each system can offer and
consume one or more services to perform its tasks. The main benefits of this design are
scalability, reusability, and flexibility. A complex system can be divided into many simple
subsystems, thus making development and testing faster and easier because each sub-
system can be developed individually. Expanding or updating a system simply requires
increasing the number of subsystems.
A Service Oriented Architecture for the Industrial IoT makes the best possible use of
interoperability and scalability. Industrial monitoring and control is a complex task that
can be divided into simple subsystems (IoT devices), with each IoT device performing a
simple task, such as measuring a variable or oﬀering a service to an actuator. Critical
components for an industrial application can be duplicated or, in case of failure, replaced.
A Service Oriented Architecture requires many complex mechanisms to function, in-
cluding Quality of Service (QoS), orchestration, and conﬁguration. The work reported
in this thesis was conducted under the auspices of the Arrowhead project, a European
project responsible for researching, developing and improving interoperability in complex
industrial environments.
Chapter 3
Security

In industrial applications, interoperability is an advantage. Interoperability reduces
the costs of operation and maintenance because upgrading a framework with a high level
of interoperability requires less investment and effort than upgrading a non-interoperable
framework. Two different yet interoperable platforms can be integrated; they can share
resources, data, and services without the need for duplication. An interoperable frame-
work that supports various device types can exploit the best features of each device
for each situation, e.g., collecting data from a resource-constrained device and process-
ing it on a high-performance server. All these beneficial characteristics are useful in
industry, but in environments in which the devices are transmitting sensitive data or
offering access to actuators, interoperability poses increased risk. Security is therefore a
key concern when deploying an IoT framework in industry. It is especially critical for
resource-constrained devices, particularly battery-powered devices. Implementing a se-
curity mechanism will inevitably increase power consumption; therefore, for applications
in which the battery life is a concern, the design must strike a suitable balance between
security and power consumption. In an SOA architecture, each node acts as a service
provider, and services are accessible to anyone on the network. To protect these services
against malicious access, certain mechanisms are needed. For these reasons, security
issues can be divided into two different areas: secure communication and access control.
This chapter presents an overview of current methods of securing communications
between IoT devices and proposes a new mechanism to enable efficient, fine-grained
access control.

3.1 Secure communications

All computer communications require protection against many diﬀerent types of attacks,
such as packet injection, eavesdropping, replay attacks, and DoS attacks. The framework
presented in this thesis is based on CoAP as the application protocol and is especially
suitable for 6LoWPAN networks (see Figure 4.2). To maintain interoperability, the se-
curity mechanisms must be standardized; otherwise, interoperability will be reduced.

An analysis of the possible standard technologies that can be used over a 6LoWPAN-
CoAP stack is therefore needed; such an analysis performed by Hennebert et al. [33] is
summarized in Table 3.1.

| Layer | Security mechanism | Header overhead | Requirement achieved | Attack |
|-------|--------------------|-----------------|----------------------|--------|
| Physical | CSMA-CA | None | Availability | Jamming / collision / flooding |
| | Secure firmware | None | | Node tampering |
| | Secure element | None | | Cloning |
| Link | MIC | 6-26 bytes | Authentication and integrity | Packet injection |
| | AES encryption only | 7-15 bytes | Confidentiality | Eavesdropping |
| | AES-CCM Nonce | 11-29 bytes | Authentication, integrity, confidentiality and freshness | Replay attack |
| | Address filtering | None | Energy efficient | DoS / battery exhaustion |
| Adaptation | Hash chain | 8 bytes | Integrity | Fragmentation attack |
| | Split buffer | None | Availability | DoS / buffer saturation |
| Network | IPsec AH | 16 bytes | Authentication of the emitter and network integrity, resiliency, robustness, and resistance | Packet injection, replay attacks |
| | IPsec ESP | 28 bytes | Confidentiality between two peers | Eavesdropping, replay attacks |
| | Secure routing | - | Availability | Routing attacks |
| | Secure neighbor discovery | - | Protection of network services | Intrusion |
| Application layer | DTLS | Compressed and ciphered 16 bytes | Authorization through a token and authentication of the emitter and integrity and confidentiality between two peers using a given application network, resiliency, robustness, and resistance | Aggregation, data peeking, packet injection |
| | IDS | - | Network services | Every intrusion |

Table 3.1: Security mechanisms for 6LoWPAN networks

3.1.1 Standard end-to-end security mechanisms

Interoperability enables communication among many devices. Usually, such communica-
tion requires the use of other intermediary devices, such as routers, switches, and servers.
Therefore, the priority is to ensure end-to-end communications, and according to Table
3.1, the main families of mechanisms that are able to provide end-to-end protection are
IPsec and DTLS.

Internet Protocol security (IPsec)

Internet Protocol security (IPsec) [34] is the secure evolution of the Internet Protocol
(IP). It consists of a collaboration among several different protocols and supports various
types of encryption [35]. IPsec includes two mechanisms, Authentication Header (AH)
and Encapsulating Security Payload (ESP). AH provides data origin authentication,
protection against replay attacks and connectionless data integrity, whereas ESP provides
confidentiality. If guaranteeing confidentiality is a priority, then IPsec-ESP is a reasonable
choice. ESP encrypts the original IP packet into the payload of a new IPsec packet,
which can be decrypted only using the correct previously deployed or negotiated keys.
To negotiate these keys, IPsec supports the Internet Key Exchange protocol, version 2
(IKEv2) [36]. This protocol is useful for avoiding the use of static and long-term keys,
thereby increasing security.

Datagram Transport Layer Security (DTLS)

Datagram Transport Layer Security (DTLS) is primarily a UDP evolution of Transport
Layer Security (TLS), which runs over TCP. It provides data origin authentication, au-
thorization, data integrity and confidentiality. This protocol initially consisted of two
phases, the first being a handshake between the two communicating machines, during
which both must authenticate themselves and validate the other using certificates, and
the second phase being the transmission of the encrypted information. However, the use
of the default version of DTLS is not efficient for resource-constrained devices because
the overhead and the use of certificates degrade low-power performance. These problems
motivated the development of compressed DTLS, as reported by Raza et al. [37], and
the replacement of the certificates with keys, as proposed by Fossati et al. [38], to create
a standard and efficient version of DTLS.

3.1.2 Access control analysis

The standard mechanisms described above for securing (end-to-end) communication pro-
vide several features for controlling access to each device. The problem is the lack of
granularity of these mechanisms. Access can be achieved at several diﬀerent levels: ad-
dress (IP address), ID, service, and method. Table 3.2 summarizes which types of access
control are provided by IP, IPsec, Black Lists, IKEv2, DTLS, and CoAP. None of these
technologies enables control at all four levels.

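| Technology | Address (coarse-grained) | ID (coarse-grained) | Service (fine-grained) | Method (fine-grained) |
|------------|--------------------------|---------------------|------------------------|-----------------------|
| IP | | | | |
| Black Lists | | | | |
| IPsec | | | | |
| IPsec+IKEv2 | | | | |
| DTLS | | | | |
| CoAP | | | | |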

Table 3.2: Access control comparison for diﬀerent security technologies

The ability to control access by address and ID also provides control over who can
communicate with the service provider. Access control at the service level also provides
the ability to create custom services for each user and user type. Finally, control over
access based on method enables the provision of services with diﬀerent functionalities
depending on user type, e.g., a time service that regular users can access only to obtain the
time but that administrators can also access to update it.
Therefore, the use of ﬁne-grained access control mechanisms is required.
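
The time-service example above can be sketched as a lookup keyed by user type, service and method. This is only an illustration of what method-level granularity means; the policy table, the user types and the `/time` path are hypothetical and do not describe the mechanism proposed later in this chapter.

```python
# Hypothetical policy table: (user type, service path) -> allowed CoAP methods.
POLICY = {
    ("user", "/time"): {"GET"},
    ("admin", "/time"): {"GET", "PUT"},
}


def is_allowed(user_type, service, method):
    """Return True if `user_type` may call `method` on `service`.

    Unknown (user type, service) pairs are denied by default.
    """
    return method in POLICY.get((user_type, service), set())
```

A regular user can therefore read the time with GET, whereas only an administrator can update it with PUT; control by address or ID alone could not express this difference.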

3.2 Access control

Access control is a mechanism for monitoring service requests issued to a service provider
and managing when a communication must or must not be approved. Access control can
also enable the identification of a consumer of a service and the provision of relevant
information about that consumer to the service, enabling the possibility of providing
customized services.
As in the previous section, the use of standard mechanisms is recommended to maintain
interoperability. However, an exception arises in this case because the standard
solutions have shortcomings that require the implementation of a new, more efficient
access control mechanism for IoT applications. This section describes the most common
access control standards and their shortcomings and proposes such an efficient access
control mechanism.

3.2.1 Standard solutions

The two most popular standard protocols that provide access control functionality are
Kerberos and RADIUS. The working principles of the two are different, each offering
certain benefits and disadvantages, which will be used as the basis for the creation of an
efficient access control method in the following sections.

Kerberos
Kerberos is an access control mechanism that runs over UDP; it uses ticket granting to
validate a service consumer to a service provider. The access control process requires a
service provider (SP), a service consumer (SC), and a Key Distribution Center (KDC).
Each entity has its own key, with the exception of the KDC, which possesses all keys. The
process begins with the SC requesting a ticket. To do so, it sends a partially encrypted
message with accessible information regarding ID, time and other parameters. The KDC
can use the SC’s key to decrypt the message; if decryption is successful, the KDC trusts
the SC and sends back a packet with timeouts and other parameters that is completely
encrypted using the SC’s key. The SC must store that packet and use it again to
request a valid ticket to contact the SP. In response to such a request, the KDC generates the ticket
that will be utilized by the SC to access the SP; the content of this ticket includes
information about the SC, timeouts, etc., and it is completely encrypted using the SP’s
key. Then, the SC uses this ticket to request access, and the SP can use its key to decrypt
it and extract all information about the SC and the access control policies.
Kerberos provides password protection. There is no password communication, which
protects each particular SC and the SPs accessible by that SC. In other access control
mechanisms, each SC must have a database of credentials or passwords for its accessible
SPs. Kerberos requires the use of a centralized KDC, which provides the convenience of
database maintenance. For an IoT device, however, Kerberos is not optimal. Kerberos
does not require communication between an SP and the KDC, which is a clear advantage
for reducing power consumption, but Kerberos tickets contain all access control information
in an encrypted form, which poses a limitation because of the ticket size and
processing complexity.

Remote Authentication Dial-In User Service (RADIUS)

The Remote Authentication Dial-In User Service (RADIUS) is a UDP-based centralized
Authentication, Authorization and Accounting (AAA) protocol for the management of
users that connect to and use a particular service. The access control process requires
a service provider (SP), a service consumer (SC), and a RADIUS server (RS). The SC
requests a service from the SP, the SP requests authentication information from the SC,
and this information is then used by the SP to check the access status with the RS
through an access request. There are three possible responses from the RS: access-
accept, access-reject, and access-challenge. In the case of the third response, a challenge
request response from the SC is required to provide more information to the RS via the
SP.
RADIUS authentication and authorization processes do not require complex crypto-
graphic operations, and the communications require the transfer of a negligible amount
of data; the RADIUS packet size is 20 bytes plus the attributes (see Figure 3.1). For an
IoT device, this mechanism is suitably eﬃcient with regard to processing and complex-
ity, but it requires the exchange of several communications, especially for the SP, which
consequently compromises the low-power criterion.

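 0                   1                   2                   3
 0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
|     Code      |  Identifier   |            Length             |
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
|                                                               |
|                         Authenticator                         |
|                                                               |
|                                                               |
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
|   Attributes (if any) ...
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+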

Figure 3.1: RADIUS packet format
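
The 20-byte fixed part of the packet in Figure 3.1 can be illustrated with a short sketch that packs and unpacks the Code, Identifier, Length and Authenticator fields. The code values are those assigned in RFC 2865 (they are not listed in the text); the function names are hypothetical and attributes are treated as an opaque byte string.

```python
import struct

# Packet codes as assigned in RFC 2865 (assumed here, not taken from the text).
CODES = {
    "Access-Request": 1,
    "Access-Accept": 2,
    "Access-Reject": 3,
    "Access-Challenge": 11,
}

# Code (1 byte), Identifier (1 byte), Length (2 bytes), Authenticator (16 bytes).
HEADER = struct.Struct("!BBH16s")


def build_packet(code, identifier, authenticator, attributes=b""):
    """Serialize a RADIUS packet; Length covers the header and the attributes."""
    if len(authenticator) != 16:
        raise ValueError("the Authenticator must be exactly 16 bytes")
    length = HEADER.size + len(attributes)
    return HEADER.pack(code, identifier, length, authenticator) + attributes


def parse_packet(packet):
    """Split a RADIUS packet into its header fields and raw attributes."""
    if len(packet) < HEADER.size:
        raise ValueError("packet shorter than the 20-byte header")
    code, identifier, length, authenticator = HEADER.unpack(packet[:HEADER.size])
    if length < HEADER.size or len(packet) < length:
        raise ValueError("invalid Length field")
    return code, identifier, authenticator, packet[HEADER.size:length]
```

A packet without attributes is exactly 20 bytes long, which matches the packet size stated above.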

Lack of eﬃcient standard solutions for IoT devices

IoT devices are resource-constrained embedded devices, and their processing capability
is limited. An increase in processing results in an increase in power consumption, and for
battery-powered devices, this is a critical limitation. Moreover, the use of wireless com-
munications further increases the power consumption, which means that if a mechanism
requires additional communications, it will represent additional power consumption.
Kerberos requires the use of large encrypted tickets (containing information about
the client, time, services, etc.). The transmission and processing of such a ticket requires
a large amount of energy, and in a framework with a low rate of data transmission, this
type of solution is not efficient. On the other hand, RADIUS requires the exchange
of many messages, especially on the service provider’s side, also increasing the power
consumption of the device.
Thus, neither standard solution is suitably efficient for deployment in an IoT frame-
work. Therefore, a new, more efficient access control mechanism is required to provide
energy-efficient and fine-grained access control.

3.2.2 Ticket-based access control

The granularity of an access control mechanism depends on the level(s) at which it is
capable of controlling access. In this thesis, the application protocol used is CoAP, which
oﬀers services (resources) that can be accessed using a variety of possible methods, such
as GET, POST, PUT, and DELETE (see Section 2.3). Therefore, in this context, a
ﬁne-grained access control scheme must allow accesses to be controlled by user, service
and method. As shown in Table 3.2, this level of control is not possible with current
technology. To address this issue, I started working on a new solution [39], which is also
described in the Arrowhead book [40] and summarized in this section.
The goal of this access control scheme is to limit the additional communication over-
heads of the original CoAP protocol, which degrade low-power performance or increase
communication delays. To this end, the ticket design of Kerberos and the authentica-
tion/authorization mechanisms of RADIUS together are the keys to designing a new
mechanism for access control over CoAP. CoAP supports many packet options; the idea
is to use one of these options to send the ticket information. The ticket, unlike in Kerberos,
does not include any additional information; it consists of a group of bytes to
identify each identity in the network (clients and servers). The size of the ticket depends
on the final application and represents a compromise between the level of security and
the power consumption performance.
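
To illustrate how a ticket could travel in a CoAP option, the sketch below encodes a single option using the delta/length format of RFC 7252. The option number 65000 (taken from the experimental range of Table 2.1) and the 8-byte ticket are hypothetical choices made for this example only; the option number and ticket size actually used by the framework are not assumed here.

```python
import struct

# Hypothetical choice: an option number from the experimental range of Table 2.1.
TICKET_OPTION_NUMBER = 65000


def _nibble_and_extension(value):
    """Encode a delta or length as a 4-bit nibble plus optional extension bytes."""
    if value < 13:
        return value, b""
    if value < 269:
        return 13, bytes([value - 13])
    if value <= 0xFFFF + 269:
        return 14, struct.pack("!H", value - 269)
    raise ValueError("value too large for a CoAP option field")


def encode_option(number, value, previous_number=0):
    """Encode one CoAP option; options must be emitted in increasing number order."""
    if number < previous_number:
        raise ValueError("options must be sorted by option number")
    delta_nibble, delta_ext = _nibble_and_extension(number - previous_number)
    length_nibble, length_ext = _nibble_and_extension(len(value))
    first_byte = (delta_nibble << 4) | length_nibble
    return bytes([first_byte]) + delta_ext + length_ext + value
```

Under these assumptions, an 8-byte ticket costs 11 bytes on the wire when it is the only option (one header byte, two bytes of extended delta, and the ticket itself), which shows how the ticket size trades security against transmission overhead.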
The use of tickets over CoAP allows the framework to centralize authentication and
ticket verification for distributed services. This implementation allows either multiple
access control mechanisms for individual systems or a centralized mechanism to prevent
inconsistencies, thereby improving its scalability. The authentication and authorization
processes are implemented specifically as CoAP services to reduce the overheads on the
IoT devices.

Requirements
The proposed access control method assumes the existence of the following information
for the IoT devices and the AAA server:

• Secret key (SK): a group of 16 bytes that both an IoT device and the AAA server
know.

• ID: the ID of the IoT device.

• Password: the password must be a shared property of the IoT device and the AAA
server, and it is never sent during either the Authentication or the Authorization
process.

Description
The proposed access control mechanism consists of two diﬀerent steps, namely, Authen-
tication and Authorization, and each step is implemented via a diﬀerent service on the
AAA server. The Accounting function can characterize access to a service in terms of
time duration or number of accesses; this Accounting mechanism is described in this
section as an attractive feature for business models, but it has not been implemented
or evaluated. All data are presented in JSON [41] format for human readability during
development, but for enhanced performance, they can also be encoded in CBOR [42].

Authentication The Authentication process must be performed for every IoT device
that must be managed by the access control mechanism, including both service
(resource) providers and consumers. This step must guarantee the identification
of each IoT device by the AAA server. The server must provide a single ticket
to the IoT device, which will be used as an identification tag in all subsequent
communications between other entities and the AAA server.
As Figure 3.2 shows, the IoT device acting as a client starts the process with a GET
request to the AAA server; then, the server possesses the necessary IP and MAC
addresses, and the Challenge Request/Response process begins. The AAA server
sends back an authenticator generated specifically for those parameters (a group of
16 bytes), expecting a response in the next 15 seconds; otherwise, the authenticator
is not valid (see Code 3.1). When the client receives the authenticator, it must
encrypt the password using the same Challenge Request/Response process used in
RADIUS (described below).

Figure 3.2: Authentication process (message sequence between the Client and the AAA Server: Authentication Request, Authenticator; Challenge Request-Response with the encrypted password; Ticket)

Code 3.1: GET response to initiate the Challenge Request/Response process

{
  "version": 1,
  "timeout": 15000,
  "authenticator": "00112233445566778899AABBCCEEDDFF"
}

JSON: 93 bytes - CBOR: 69 bytes
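The server side of this step can be sketched as follows. This is an illustrative sketch under stated assumptions rather than the implementation evaluated here: the class and method names are hypothetical, the authenticator is assumed to be 16 random bytes stored per (IP, MAC) pair, and the clock is passed in explicitly in milliseconds so that the 15-second validity window is easy to follow.

```python
import secrets

CHALLENGE_LIFETIME_MS = 15000  # "timeout" value carried in Code 3.1


class ChallengeStore:
    """Keeps one pending authenticator per (IP, MAC) pair (hypothetical helper)."""

    def __init__(self):
        self._pending = {}  # (ip, mac) -> (authenticator, deadline_ms)

    def issue(self, ip: str, mac: str, now_ms: int) -> dict:
        """Create a 16-byte authenticator and return the GET response body."""
        authenticator = secrets.token_bytes(16)
        self._pending[(ip, mac)] = (authenticator, now_ms + CHALLENGE_LIFETIME_MS)
        return {
            "version": 1,
            "timeout": CHALLENGE_LIFETIME_MS,
            "authenticator": authenticator.hex().upper(),
        }

    def take(self, ip: str, mac: str, now_ms: int):
        """Return the authenticator if it is still valid, consuming it; else None."""
        entry = self._pending.pop((ip, mac), None)
        if entry is None:
            return None
        authenticator, deadline_ms = entry
        return authenticator if now_ms <= deadline_ms else None
```

Consuming the authenticator on first use means that a late or repeated response forces the client to request a fresh challenge.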

The secret key (SK) and the authenticator (A) must each be 16 bytes long; if the SK is
shorter, it must be padded with zeros to 16 bytes. The password must be split into
16-byte chunks, $p_1$, $p_2$, etc., with the last one padded with zeros to maintain
the chunk size.

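\begin{align*}
b_1 &= \mathrm{MD5}(SK + A),       & c_1 &= p_1 \oplus b_1 \\
b_2 &= \mathrm{MD5}(SK + c_1),     & c_2 &= p_2 \oplus b_2 \\
    &\;\;\vdots                    &     &\;\;\vdots       \\
b_n &= \mathrm{MD5}(SK + c_{n-1}), & c_n &= p_n \oplus b_n
\end{align*}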

The encrypted password can then be expressed as $c_1 + c_2 + \dots + c_n$, where $+$ denotes
concatenation. It is sent back to the AAA server together with the entity ID (see Code 3.2);
then, the AAA server repeats the process and compares the two results.
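The chained computation above can be sketched in a few lines. This is a minimal illustration that assumes the byte-level conventions of the RADIUS password-hiding scheme; the function names are hypothetical, and rejecting a secret key longer than 16 bytes is an assumption, since the text only covers shorter keys.

```python
import hashlib
import hmac

BLOCK = 16  # MD5 digest size, which is also the chunk size in bytes


def encrypt_password(password: bytes, secret_key: bytes, authenticator: bytes) -> bytes:
    """Return c_1 + c_2 + ... + c_n for the given password."""
    if len(authenticator) != BLOCK:
        raise ValueError("authenticator must be exactly 16 bytes")
    if len(secret_key) > BLOCK:
        raise ValueError("secret key longer than 16 bytes (assumed unsupported)")
    sk = secret_key.ljust(BLOCK, b"\x00")          # zero-pad SK to 16 bytes
    chunks = max(1, -(-len(password) // BLOCK))    # ceiling division, at least one chunk
    padded = password.ljust(chunks * BLOCK, b"\x00")

    previous = authenticator                        # A is used for b_1, then c_(i-1)
    encrypted = bytearray()
    for offset in range(0, len(padded), BLOCK):
        b_i = hashlib.md5(sk + previous).digest()
        c_i = bytes(p ^ b for p, b in zip(padded[offset:offset + BLOCK], b_i))
        encrypted += c_i
        previous = c_i
    return bytes(encrypted)


def verify_password(received: bytes, stored_password: bytes,
                    secret_key: bytes, authenticator: bytes) -> bool:
    """Server side: repeat the process and compare the two results."""
    expected = encrypt_password(stored_password, secret_key, authenticator)
    return hmac.compare_digest(received, expected)
```

The comparison uses `hmac.compare_digest` so that the time taken does not depend on how many leading bytes match.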

Code 3.2: Response with the encrypted password

{
  "name": "example name",
  "password": "00112233445566778899AABBCCEEDDFF"
}

JSON: 78 bytes - CBOR: 62 bytes

If the encrypted passwords are the same, then the AAA server creates a ticket with a
timeout and sends it back to the entity, completing the Authentication process (see Code
3.3).

Code 3.3: AAA server response with the ticket in a successful authentication
{
  "name": "example name",
  "password": "00112233445566778899AABBCCEEDDFF"
}

JSON: 56 bytes - CBOR: 38 bytes

Authorization Authorization is a process that must be implemented for a service provider
to recognize a service consumer as a valid entity or to perform double authentication, in
which a consumer also uses this process to verify that the provider is valid and trustworthy.
Before beginning to process a request or response, entity A must ask the AAA server
about the validity of the ticket from entity B. This request includes the IP address and
the ticket from B in the payload and the ticket from A in the CoAP options (see Figure
2.4). An Authorization request is illustrated in Code 3.4.
Figure 3.3: Authorization process (message sequence between the Client and the AAA
Server: Authorization Request, Check Ticket and Policies)

Code 3.4: Authorization request

{
  "remote_address": "fdfd::AB",
  "remote_ticket": "0011223344556677"
}

JSON: 73 bytes - CBOR: 56 bytes

If the request succeeds, then the AAA server will send back a validity confirmation along
with B’s name, last login, timeout, protocol and ticket expiration time, as shown in Code
3.5. At this point, there are two possible means of addressing the permissions: different
types of users with different privileges may be defined, or the relevant policies may be
included in the Authorization response.

Code 3.5: Authorization response

{
  "valid": true,
  "name": "example_name",
  "login": 1468521292,
  "expire": 1468522292,
  "protocols": "CoAP",
  "timeout": 60000
}

JSON: 135 bytes - CBOR: 75 bytes

An implementation based on the definition of users with different privileges has been
tested, as discussed in the results section (Section 3.2.2 - Figure 3.8). This solution is
more efficient but less flexible than a policy-based implementation. An example of the
use of policies is provided in Code 3.6.

Code 3.6: Authorization response with policies

{
  "valid": true,
  "name": "example_name",
  "login": 1468521292,
  "expire": 1468522292,
  "protocols": "CoAP",
  "timeout": 60000,
  "policies": [
    {
      "service": "service_name1",
      "allow": [
        "GET",
        "POST",
        "PUT",
        "DELETE"
      ]
    },
    {
      "service": "service_name2",
      "allow": [
        "GET"
      ]
    },
    {
      "service": "service_name3",
      "allow": [
        "GET",
        "POST",
        "PUT"
      ]
    }
  ]
}

JSON: 480 bytes - CBOR: 212 bytes
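A service provider that receives such a response could evaluate it as sketched below. The function name and the default-deny behaviour for services that are not listed are assumptions made for illustration; the field names are those of Code 3.6.

```python
def is_allowed(authorization: dict, service: str, method: str) -> bool:
    """Check a CoAP method against the policies of an Authorization response."""
    if not authorization.get("valid", False):
        return False
    for policy in authorization.get("policies", []):
        if policy.get("service") == service:
            return method.upper() in policy.get("allow", [])
    return False  # assumed default: deny services that have no policy entry


# A shortened version of the response in Code 3.6.
response = {
    "valid": True,
    "name": "example_name",
    "policies": [
        {"service": "service_name1", "allow": ["GET", "POST", "PUT", "DELETE"]},
        {"service": "service_name2", "allow": ["GET"]},
    ],
}
```

For example, `is_allowed(response, "service_name2", "POST")` evaluates to `False` because only GET is listed for that service.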

Accounting The Accounting mechanism operates in two different modes: accounting by time
and accounting by access instances. Accounting by time allows the service provider to
provide services to a consumer for a certain amount of time, after which the access autho-
rization expires and the service provider must notify the AAA server. An example of the
transferral of such accounting information from the AAA server during the authorization
process is shown in Code 3.7.

Code 3.7: Accounting by time

{
  "accounting": {
    "type": "time",
    "timeout": 60000
  }
}

JSON: 57 bytes - CBOR: 34 bytes

Accounting by access instances limits the number of accesses to a service that can be
made during a particular time window, e.g., access to a service may be allowed forty
times in one hour. This accounting method requires a report to the AAA server either
when the number of accesses reaches the limit or when the timeout window expires. An
example of the transferral of such accounting information from the AAA server during
the authorization process is shown in Code 3.8.

Code 3.8: Accounting by access instances

{
  "accounting": {
    "type": "access",
    "timeout": 6000000,
    "accesses": 40
  }
}

JSON: 79 bytes - CBOR: 49 bytes
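Because the Accounting function has not been implemented or evaluated, the following is only a sketch of how a service provider might track accounting by access instances. The class name, the method names and the assumption that "timeout" is expressed in milliseconds are all illustrative.

```python
from dataclasses import dataclass


@dataclass
class AccessAccounting:
    """Tracks accesses against a grant such as the one in Code 3.8 (hypothetical)."""
    limit: int       # value of "accesses"
    window_ms: int   # value of "timeout", assumed to be in milliseconds
    start_ms: int    # time at which the grant was received
    used: int = 0

    def expired(self, now_ms: int) -> bool:
        return now_ms - self.start_ms >= self.window_ms

    def try_access(self, now_ms: int) -> bool:
        """Count one access if the grant still permits it."""
        if self.expired(now_ms) or self.used >= self.limit:
            return False
        self.used += 1
        return True

    def must_report(self, now_ms: int) -> bool:
        """True when the limit is reached or the window has expired."""
        return self.expired(now_ms) or self.used >= self.limit


def from_grant(grant: dict, now_ms: int) -> AccessAccounting:
    accounting = grant["accounting"]
    if accounting["type"] != "access":
        raise ValueError("not an accounting-by-access grant")
    return AccessAccounting(accounting["accesses"], accounting["timeout"], now_ms)
```

Passing the current time explicitly keeps the sketch independent of any particular clock source on the device.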

Ticket information
The purpose of using tickets is to reduce the communication overhead and power con-
sumption as much as possible by using non-complex processing methods. For this reason,
the current implementation of a ticket is essentially a randomized 64-bit number provided
by the AAA server. Each ticket must be unique; a duplication (two clients with the same
ticket) on the network could compromise the authorization process. Each entity in the
network can be identified by the information on its ticket. In this thesis, the ticket
content is represented as a hexadecimal number to make it human readable. Every time that
an entity requests a new ticket from the AAA server, if the Challenge Request/Response
process succeeds, the server responds with the ticket and a timeout. This timeout rep-
resents the valid lifetime of the ticket; in other words, it describes for how long all other
devices will provide services to the ticketed device or regard it as a trusted entity in the
network.
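Generating such a ticket is straightforward. The sketch below assumes that the AAA server keeps the set of active tickets in memory and draws again on a collision; the function name and this bookkeeping are illustrative assumptions.

```python
import secrets


def new_ticket(active_tickets: set) -> str:
    """Return a unique random 64-bit ticket as 16 hexadecimal digits."""
    while True:
        ticket = f"{secrets.randbits(64):016X}"
        if ticket not in active_tickets:   # enforce uniqueness on the network
            active_tickets.add(ticket)
            return ticket


active = set()
first = new_ticket(active)
second = new_ticket(active)
```

The `secrets` module is used instead of `random` because tickets act as credentials and should not be predictable.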
The Authentication mechanism requires the use of an encrypted channel (IPsec,
DTLS, TLS, etc.). Thus, the use of a static ticket is not a problem; moreover, the
reduced number of communication messages and the timeout related to the ticket help
to protect it. However, if the final application were to require increased security, then
the ticket could be dynamic. This means that in each communication between a service
consumer and the provider, the ticket would need to consist of the hashed data of
the original ticket and another variable parameter, such as the CoAP message ID; see
Figure 2.4. All applications considered for the work reported in this thesis assume trust
of the confidentiality at the IPsec level; for this reason, dynamic tickets have not been
implemented.
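Since dynamic tickets have not been implemented, the following is only one possible reading of the idea. It assumes SHA-256 truncated to 64 bits as the hash and the 16-bit CoAP message ID as the variable parameter; neither choice is specified in the text, and the function names are hypothetical.

```python
import hashlib
import hmac


def dynamic_ticket(ticket: bytes, message_id: int) -> bytes:
    """Derive a per-message ticket from the static ticket and the CoAP message ID."""
    mid = message_id.to_bytes(2, "big")             # CoAP message IDs are 16 bits
    return hashlib.sha256(ticket + mid).digest()[:8]  # truncated to the 64-bit ticket size


def check_dynamic_ticket(received: bytes, ticket: bytes, message_id: int) -> bool:
    """Receiver side: recompute the value from the known static ticket and compare."""
    return hmac.compare_digest(received, dynamic_ticket(ticket, message_id))


static = bytes.fromhex("0011223344556677")
sent = dynamic_ticket(static, message_id=0x1A2B)
```

With this construction, a value captured from one message does not match a message with a different ID, while the message size stays the same as with a static ticket.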

Distributed access control

The simplest scenario for the access control mechanism is a network with a service
provider (CoAP Server), a service consumer (CoAP Client) and the AAA Server.

Figure 3.4: All possible access control scenarios (message sequences among the CoAP
Client, CoAP Server and AAA Server: a standard request without access control; a first
request with access control, in which the CoAP Server gets the ticket, asks the AAA
Server to check it and receives a valid-ticket reply; and a non-first request with access
control, in which the CoAP Server gets the ticket and checks it itself)

Figure 3.4 shows the three possible scenarios for a successfully requested service. The
first case is a request for a service without access control, in which the service
provider provides the service without any additional processing.
The second case represents a situation in which the service consumer has never attempted
to consume the service before or its ticket has expired. Then, the service provider
must validate the consumer’s ticket with the AAA Server before providing the service.
The third case is similar to the first one and corresponds to a service request in which
the provider already holds the consumer’s ticket and that ticket is still valid. Then, the
provider needs only to check the ticket’s timeout and provide the service.
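The second and third cases can be sketched as a small cache on the service provider. The class name, the callable used to reach the AAA server and the interpretation of "expire" as a Unix timestamp in seconds are assumptions made for illustration.

```python
class TicketCache:
    """Provider-side cache of validated consumer tickets (illustrative sketch)."""

    def __init__(self, validate_with_aaa):
        # validate_with_aaa(ticket, address) is assumed to return a dict shaped
        # like the Authorization response in Code 3.5.
        self._validate = validate_with_aaa
        self._expiry = {}  # ticket -> expiration time (assumed Unix seconds)

    def is_authorized(self, ticket: str, address: str, now: int) -> bool:
        expiry = self._expiry.get(ticket)
        if expiry is not None and now < expiry:
            return True                       # third case: only a local timeout check
        self._expiry.pop(ticket, None)        # unknown or expired ticket
        reply = self._validate(ticket, address)   # second case: ask the AAA server
        if reply.get("valid") and now < reply.get("expire", 0):
            self._expiry[ticket] = reply["expire"]
            return True
        return False
```

Only the first request from a consumer, or the first one after its ticket expires, costs a round trip to the AAA server; subsequent requests are decided locally.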

Compatibility with existing standardized access control solutions

Figure 3.5: Proposed packet solutions for the integration of the RADIUS protocol with
CoAP. (a) Without compression: the CoAP header (V, T, TKL, Code, Message ID), Token
(if any, TKL bytes), Options (if any) and payload marker (1 1 1 1 1 1 1 1) are followed
by the RADIUS Code, Identifier, Length, Authenticator and Attributes (if any).
(b) With compression: the same CoAP fields are followed only by the RADIUS
Authenticator and Attributes (if any).

The possibility of integrating the RADIUS protocol with the CoAP protocol gives the
proposed framework a flexible authentication method that can be used with a standard
RADIUS server. This approach requires no support for the RADIUS protocol on the
client. The overhead and the required resources are smaller compared with the use of
both protocols at the same time. This is especially important for resource-constrained
sensor nodes. In fact, conversion between a RADIUS packet and a CoAP-RADIUS packet
is possible. In this thesis, two solutions are proposed: a CoAP packet with a RADIUS
payload (see Figure 3.5(a)) and a compressed CoAP-RADIUS packet. The compression
omits redundant information such as the Code, Identifier and Length fields, which can
be integrated directly into the CoAP ID and Code fields (see Figure 3.5(b)).
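The saving from compression can be illustrated at the payload level. The sketch below uses the standard RADIUS header layout (1-byte Code, 1-byte Identifier, 2-byte Length, 16-byte Authenticator); how the omitted fields are mapped onto the CoAP ID and Code fields is not modelled, so they are simply carried in a dictionary, which is an assumption made for illustration.

```python
import struct

# RADIUS header: Code (1 byte), Identifier (1 byte), Length (2 bytes), Authenticator (16 bytes)
RADIUS_HEADER = struct.Struct("!BBH16s")


def radius_packet(code: int, identifier: int, authenticator: bytes,
                  attributes: bytes = b"") -> bytes:
    """Build an uncompressed RADIUS packet, usable as the payload of Figure 3.5(a)."""
    length = RADIUS_HEADER.size + len(attributes)
    return RADIUS_HEADER.pack(code, identifier, length, authenticator) + attributes


def compress(packet: bytes):
    """Split a RADIUS packet into the omitted fields and the payload of Figure 3.5(b)."""
    code, identifier, length, authenticator = RADIUS_HEADER.unpack_from(packet)
    carried = {"code": code, "identifier": identifier}   # placeholder for the CoAP fields
    return carried, authenticator + packet[RADIUS_HEADER.size:length]


def decompress(carried: dict, payload: bytes) -> bytes:
    """Rebuild the RADIUS packet; Length is recomputed from the payload size."""
    authenticator, attributes = payload[:16], payload[16:]
    return radius_packet(carried["code"], carried["identifier"], authenticator, attributes)


original = radius_packet(1, 42, bytes(range(16)), attributes=b"\x01\x05bob")
carried, compressed = compress(original)
```

In this sketch the compressed payload is four bytes shorter than the original packet, which corresponds to the Code, Identifier and Length fields.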

Multi-protocol support

The research presented in this thesis was motivated primarily by the Arrowhead project
[43], a project focused on maximizing interoperability for industrial applications. The
goal is to offer a smart approach to exchanging services between devices with differ-
ent characteristics, communication protocols, semantics, etc. in a transparent way. For
example, in the case of a high-performance machine consuming services from a resource-
constrained device, one may communicate with MQTT, whereas the other may commu-
nicate with CoAP. Using current standards, communication between these two devices is
not possible, and the available access control mechanisms are not effective for both tech-
nologies. The problem of translation between protocols or semantics is not addressed in
this thesis, and it is not relevant to the results presented herein. Therefore, a translator
is treated as a black box that is able to translate from one protocol to another. See the
appended paper D [44] for more information.

Figure 3.6: Authentication, Authorization, and Accounting server architecture for
multi-protocol communications (the AAA Server accepts CoAP, MQTT, XMPP, HTTP and other
protocols, and comprises Ticket Validation, Ticket Generation, a RADIUS Client, a
RADIUS Server, and Accounting per access and per time)

The access control mechanism presented in this thesis is suitable for application in
multi-protocol communication with the use of a translator, where the translator is a
trusted third party. The proposed AAA server is designed to be able to handle requests
from different protocols, as shown in Figure 3.6. In an instance of communication between
two entities, one using protocol A (in blue color) and the other using protocol B
(in red), the access control mechanism functions as in the case of single-protocol
communication, as explained in the previous sections, but with the additional use of a
translator in direct communications between the two entities, as shown in Figure 3.7. The
implementation and development of multi-protocol communications will be addressed in future
work, along with research on mechanisms for preventing man-in-the-middle attacks when
translators are used.

Figure 3.7: All possible access control scenarios for multi-protocol communication
(message sequences among the Service Consumer, Translator, Service Provider and AAA
Server: a standard request without access control; a first request with access control,
in which the provider gets the ticket, asks the AAA Server to check it and receives a
valid-ticket reply; and a non-first request, in which the provider gets the ticket and
checks it itself. In every case, the request passes through the Translator.)

Results

Implementations of this access control mechanism have been proven and tested in many
different scenarios and for various purposes, such as mobile machine monitoring (Arrow-
head), smart rock bolts (IPSO Challenge), and mining conveyor belts.
The code developed for this access control mechanism includes versions for libcoap
4.1.1 [45], Copper 1.0.0 [46], Erbium [47] and Californium 1.0.4 [48]. All of these are
implementations of CoAP RFC 7252 [13] for servers and clients except for Copper, which
is only a client implementation.
To demonstrate this fine-grained access control mechanism, the test setup included a
resource-constrained device and a laptop. The selected IoT platform for the experiment
was a Mulle mk4 from Eistec AB, which is equipped with a 100 MHz ARM Cortex-M4
microcontroller and an IEEE 802.15.4 module capable of communicating via 6LoWPAN.
It has 2 MB of flash memory onboard and runs Contiki OS. The Mulle was config-
ured to offer various services subject to access control, including public services (“Pub-
lic Service”), services for non-authorized users (“NoAuth Service”), services by user type
(“Group Name”), services for specific users (“User Name”) and services for administra-
tors (“Admin”). The laptop used a Copper client to display the results via an intuitive
GUI. During the tests, various user types were tested, as shown in Figure 3.8. In all
cases, a “.well-known/core” request was made, which returned different lists of services
for different types of users, namely, for (a) a user requesting access without authorization,
(b) a normal user, and (c) an administrator.

Figure 3.8: Screenshots of the services available to different types of users, taken during
an experiment when performing a CoAP Discovery process: (a) non-authorized user,
(b) authorized user, (c) administrator

The use of the proposed access control method produces only a small overhead on the
message size because of the use of tickets (T). This size can be modified, but during the
work conducted for this thesis, it was fixed at 64 bits. Table 3.3 shows the overheads for
each CoAP message type in two scenarios: simple access control and access control with
dual authentication. For example, in a GET request process with a confirmable response,
three messages are exchanged, namely, the request, the response and the acknowledgment.
The overhead size for all messages in the proposed access control mechanism is 64 bits
(a single ticket), whereas that for dual authentication is 192 bits (three tickets).

                    RFC 7252              Access Control        Dual Auth
                    request   response    request   response    request   response
GET                 N         N           N+T       N           N+T       N+T
POST                N         N           N+T       N           N+T       N+T
PUT                 N         N           N+T       N           N+T       N+T
DELETE              N         N           N+T       N           N+T       N+T
OBSERVE             N         N           N+T       N           N+T       N+T
ACK                 N         N           N         N           N+T       N+T
RST                 N         N           N+T       N           N+T       N+T
.well-known/core    N         N           N+T       N           N+T       N+T

N: Normal size
T: Ticket size (64 bits)

Table 3.3: Message sizes for normal CoAP vs. CoAP with access control vs. CoAP with
access control and dual authentication
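The GET example given above can be checked with a short calculation. The sketch assumes, in line with that example, that only requests carry a ticket under simple access control, whereas every message carries one under dual authentication; the function name is illustrative.

```python
TICKET_BITS = 64  # ticket size used in this thesis


def overhead_bits(messages, dual_auth: bool = False) -> int:
    """Total ticket overhead for a sequence of 'request', 'response' and 'ack' messages."""
    if dual_auth:
        carrying = len(messages)                                  # one ticket per message
    else:
        carrying = sum(1 for m in messages if m == "request")     # ticket on requests only
    return carrying * TICKET_BITS


# GET request with a confirmable response: request, response and acknowledgment.
exchange = ["request", "response", "ack"]
simple = overhead_bits(exchange)
dual = overhead_bits(exchange, dual_auth=True)
```

For this exchange, `simple` is 64 bits and `dual` is 192 bits, in agreement with the figures stated for the two scenarios.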

3.2.3 Alternatives under development

The IETF Authentication and Authorization for Constrained Environments (ACE) group
is an active group that is developing access control solutions for resource-constrained
devices. Its work includes two solutions: an access control solution based on OAuth 2.0,
and OSCOAP, an end-to-end security mechanism. To reduce the size of messages,
ACE uses CBOR as a semantic protocol. Both solutions are based on CBOR
Object Signing and Encryption (COSE) [49], a specification for creating and processing
signatures, message authentication codes, and encryption. COSE also specifies how
cryptographic keys must be represented in CBOR format.

OAuth 2.0
OAuth 2.0 [50] is an Authentication and Authorization framework that allows a client
to obtain restricted access to a particular resource offered by a resource provider. The
framework requires a trusted third-party server that provides two resources, “token” and
“introspect”. These have many similarities to the “authentication” and “authorization”
resources for ticket-based access control. One notable difference is the ticket concept
itself; OAuth 2.0 uses access tokens instead of tickets. An access token contains encrypted
data that are readable only by the resource provider and the AA server; for this reason,
it is also called a Proof-of-Possession (PoP) token. Another significant difference is that
no communication between the resource provider and the AA server is needed because
the permissions are encoded in the token. The client requests an access token for the
particular resource to be accessed and the access type; if the access is valid, the AA
server generates the access token, which is encrypted using a key known by the resource
provider. Code 3.9 shows an example of the information encoded in such an access token
request.
Code 3.9: Example request for an access token bound to an asymmetric key
Header: POST (Code=0.02)
Uri-Host: "server.example.com"
Uri-Path: "token"
Content-Type: "application/cbor"
Payload:
{
  "grant_type" : "token",
  "aud" : "lockOfDoor0815",
  "client_id" : "myclient",
  "token_type" : "pop",
  "alg" : "ES256",
  "profile" : "coap_oscoap",
  "cnf" : {
    "COSE_Key" : {
      "kty" : "EC",
      "kid" : h'11',
      "crv" : "P-256",
      "x" : b64'usWxHK2PmfnHKwXPS54m0kTcGJ90UiglWiGahtagnv8',
      "y" : b64'IBOL+C3BttVivg+lSreASjpkttcsz+1rb7btKLv8EX4'
    }
  }
}

Object Security of CoAP (OSCOAP)

Object Security of CoAP (OSCOAP) [51] is a mechanism that adds an additional
layer of protection to CoAP communications. It provides end-to-end security, encryption,
and replay protection and also checks the integrity of messages.
The idea of OSCOAP is to encapsulate the CoAP payload, header, and various options
into a COSE object. This COSE object corresponds to the payload of a new CoAP packet,
all content of which is encrypted.
Chapter 4
Efficient Industrial IoT Framework

Industrial applications of IoT technology usually require long battery lifetime, in
many cases as long as years. These low-power requirements are especially stringent for
wireless devices. Battery replacement is usually not easy in the industrial environment.
To reduce the number of replacements or to avoid them entirely, the IoT devices must be
efficient. Domestic IoT applications do not have the same high requirements as industrial
IoT applications; the key differences are the following:

• Scalability - Industrial applications can include tens of thousands of entities.

• Security - A security breach in a factory can result in damage to the environment
and/or human personnel as well as enormous costs.

• Interoperability - Industrial applications most often use multiple different systems
and technologies, which complicates information exchange and necessitates the use
of mediators or translators.

Today, there is no common, widely used standardized solution for networks of this type
that require low-power mechanisms. The Arrowhead project, which provided funding for
this thesis, is focused on improving the use of IoT for industrial applications. Other
organizations, including OMA, IPSO, the IETF (T2TRG, ACE, etc.), and the ZigBee
Alliance, are also investigating some of these issues. Some companies have also focused on
creating cloud-based IoT platforms, such as Cumulocity, Xively, ThingWorx, Microsoft
Azure Cloud or VxWorks (Intel). Others have chosen a peer-to-peer approach, such as
IPSO, Thingsquare, IzoT, AllJoyn, or IoTivity. A good comparison of these platforms is
presented by Derhamy et al. [52]. This thesis addresses this technological gap to make IoT
SOA-based technologies feasible for industrial applications. This chapter presents new
results in the following areas: a) device life-cycle management, including bootstrapping
and configuration; b) the efficiency of security mechanisms; and c) the feasibility of using
standard IoT technologies in industrial applications.


4.1 Network architecture

The proposed framework was designed for resource-constrained IoT Wireless Sensor and
Actuator Networks (see Figure 4.1). The communication between each gateway and its
nodes follows a tree with the minimal number of hops. Multi-hop routes are allowed
because the wireless range between a gateway and a node is sometimes insufficient, in
which case it is better to have an additional hop between them. The number of hops is
nevertheless kept to a minimum because each intermediate node consumes more power: it
must handle its own traffic and the traffic of its dependent nodes.
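A tree with the minimal number of hops can be obtained with a breadth-first search from the gateway. The sketch below is only an illustration of that idea, not the routing mechanism of the network stack; the topology, the node names and the function name are hypothetical.

```python
from collections import deque


def min_hop_tree(links: dict, gateway: str):
    """Return (parent, hops) for the minimum-hop tree rooted at the gateway."""
    parent = {gateway: None}
    hops = {gateway: 0}
    queue = deque([gateway])
    while queue:
        current = queue.popleft()
        for neighbour in sorted(links.get(current, ())):
            if neighbour not in parent:       # first visit is along a shortest path
                parent[neighbour] = current
                hops[neighbour] = hops[current] + 1
                queue.append(neighbour)
    return parent, hops


# Hypothetical radio links: N3 is out of range of G1 and must relay through N2.
links = {
    "G1": {"N1", "N2"},
    "N1": {"G1", "N2"},
    "N2": {"G1", "N1", "N3"},
    "N3": {"N2"},
}
parent, hops = min_hop_tree(links, "G1")
```

In this example N2 becomes a relay for N3, so N2 carries its own traffic plus that of N3, which is the extra load described above.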

Figure 4.1: Network architecture (clients and external servers connect to the industrial
network, which contains the internal servers, gateways G1 to G3 and nodes N1 to N8
distributed over the area to cover)

The communication between gateways and nodes is achieved through a wireless net-
work based on the network stack shown in Figure 4.2. It consists of a 6LoWPAN layer
over IEEE 802.15.4 to enable the IPv6 protocol; it supports IPsec to protect commu-
nications, but it may also include other security mechanisms such as TLS, DTLS, or
MAC encryption. The application protocol is CoAP, which provides the means for ser-
vice (resource) deployment on each node. The communication between the gateways
and the servers is not addressed in this research; in the proposed network, it can be
achieved through either wireless or cable technology because the power consumption is
not relevant at the gateway level.

Figure 4.2: Network stack (from top to bottom: Application; JSON/CBOR; CoAP and NTP;
UDP; IP / IPsec; 6LoWPAN; IEEE 802.15.4)

Each type of element in the topology has a different role; thus, an individual descrip-
tion of each is needed.
Nodes are resource-constrained embedded devices with wireless connectivity and are
usually powered by batteries. The wireless connectivity provided by the gateway
allows them to communicate with other nodes, servers or clients. The essential
task of a node is to sense a physical variable and use an actuator to produce a
change in the physical environment. With advancements in microcontrollers, such
devices have become capable of performing complex tasks, such as analyzing and
evaluating data using filters, finding relevant profiles, and applying adaptive trig-
gers. Manually configuring each node is not feasible in an industrial environment.
Therefore, each node can ask for its configuration during boot time; even if the
configuration changes during run time, the node must be able to reconfigure itself.
Nodes are not merely clients. In fact, with the use of CoAP, each node can dynam-
ically create services and provide customized services. To do so, they must use an
access control mechanism.
In the proposed platform, a single node can provide services based on aggregate
data, such as the average temperature of nearby nodes, or take actions based on
data from other nodes. In other words, the nodes can create systems of systems.
To this end, direct machine-to-machine (M2M) communication is mandatory.
Gateways are embedded computers with two main tasks: providing wireless connectiv-
ity to the nodes (thus acting as a standard gateway) and running a Supervisory
Control and Data Acquisition (SCADA) system. The SCADA system is respon-
sible for registering each connected node with the Device Manager (DM) and for
providing bootstrapping and configuration services as well as the authentication
and authorization services for access control. All these services are replicas of the
services hosted on the internal server, to decentralize the system and ensure that it
continues to function even if the connection to the internal server goes down. Each
service is described in detail in the following sections.
Gateways create 6LoWPAN connections to the nodes and Ethernet, WiFi, 4G or
5G connections to the internal server. Therefore, the communication between a
gateway and the internal server is conducted using the HTTP/HTTPS protocol.
Gateways also extend the IPv6 network of the nodes to the internal network, al-
lowing the internal server to also communicate directly with each node.

Servers are not addressed in the research reported in this thesis, but a basic definition
of their functionality is needed for a detailed understanding of the platform. There
are no differences between internal and external servers (see Figure 4.1); even an
external server can act as an internal server with the use of a VPN connection to
the internal network. The names are simply labels to distinguish between servers
that are exposed or not exposed to external links such as the Internet. Servers act
as gateways with greater computational power and memory and more connections.
The greatest difference between a server and a gateway is that a server can provide
communication between nodes corresponding to different gateways.

4.2 Services
This section describes all of the mandatory services that the proposed framework must
provide to cover all requirements for an IIoT platform: bootstrapping, configuration,
device management and access control.

4.2.1 Bootstrapping
The bootstrapping service is compliant with LWM2M OMA Bootstrapping [53], which
provides information to IoT devices regarding instances of essential services such as access
control, configuration, and the LWM2M server.
The bootstrapping service must run on the gateway and should use a predefined port
because this port is information that a node is required to have during its first boot. In
other words, the bootstrapping service must be readily available to any IoT device that
joins the network.
When a device initiates a bootstrapping request, it can include additional information
in the request, such as its serial number, MAC address, and internal software name or
version. Using this information, the framework can distribute the IoT devices among
different access control, configuration or LWM2M servers. This can help to balance the
loads among different servers, ensure a variety of devices with different versions, and
improve availability.
Bootstrapping increases the stability and robustness of the framework. If a service is
down, it can be replaced by another simply by changing the IP address or port in the
bootstrapping response. It supports multiple endpoints for the same service, and when
one is busy, the device can use the next. Moreover, in the case that a node is connected
to another wireless network with different services or different network routes, it ensures
that everything will function as usual. Code 4.1 shows an example of a bootstrapping
response.
Code 4.1: Bootstrapping example
{
  "auth": {
    "ip": "fdfd::0A",
    "port": 5683,
    "v": 1,
    "res": "/Authentication",
    "resAlt": "/Authorization"
  },
  "conf": {
    "ip": "fdfd::0B",
    "port": 5682,
    "v": 1,
    "res": "/Conf"
  },
  "dev": {
    "ip": "fdfd::0C",
    "port": 5681,
    "v": 1,
    "res": "/rd"
  }
}

JSON: 305 bytes - CBOR: 147 bytes

This service can also be used for simple time synchronization or to deploy other relevant
information.
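A node could turn such a bootstrapping response into service endpoints as sketched below. The function name and the choice to express each endpoint as a `coap://` URI are assumptions made for illustration; the field names are those of Code 4.1.

```python
import json


def service_endpoints(bootstrap: dict) -> dict:
    """Map each service in a bootstrapping response to its CoAP URIs."""
    endpoints = {}
    for name, entry in bootstrap.items():
        base = f"coap://[{entry['ip']}]:{entry['port']}"   # IPv6 literal in brackets
        paths = [entry["res"]]
        if "resAlt" in entry:                               # second resource on the same server
            paths.append(entry["resAlt"])
        endpoints[name] = [base + path for path in paths]
    return endpoints


response = json.loads("""
{
  "auth": {"ip": "fdfd::0A", "port": 5683, "v": 1,
           "res": "/Authentication", "resAlt": "/Authorization"},
  "conf": {"ip": "fdfd::0B", "port": 5682, "v": 1, "res": "/Conf"},
  "dev":  {"ip": "fdfd::0C", "port": 5681, "v": 1, "res": "/rd"}
}
""")
endpoints = service_endpoints(response)
```

Because the addresses and ports come from the response rather than from the firmware, moving a service only requires changing the bootstrapping data.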

4.2.2 Configuration
The role of a sensor is to take measurements of a physical variable, and the role of an actuator
is to take actions on physical variables. The configuration service sets the parameters for
how those measurements must be performed, how the data should be analyzed, and which
actions must be taken under certain conditions. For example, this service sets sampling rates
and triggers, sets filters dynamically, sends alerts to other devices, and manages collaborative
analysis, among other tasks.
The configuration service also sets the services that must be active on each device depending
on the task to be performed. This on-demand creation of services improves the performance
and reusability of each node. In addition, it can be used as a security countermeasure and as
a way to reduce power consumption.
The configuration service is a CoAP observable resource, which means that a device can
observe the configuration service and, during run time, receive a new configuration to optimize
its performance or battery life.
The configuration service has several benefits, most of which are focused on optimizing each
device and creating collaborative applications. The power consumption of each device can be
reduced, but at the cost of increasing the complexity of how the services are programmed, which
has a direct negative impact on the program’s size.

Code 4.2: Configuration example for an IoT temperature measurement device

{
  "Services": [
    {
      "name": "TempService",
      "type": "temperature",
      "source": "sens1",
      "interface": {
        "GET": {
          "active": true,
          "return": "sens1"
        },
        "POST": {
          "active": false
        },
        "PUT": {
          "active": true,
          "receive": "trigger",
          "return": "trigger"
        },
        "DELETE": {
          "active": false
        },
        "OBSERVABLE": {
          "active": true,
          "period": 120,
          "return": "sens1"
        }
      }
    }
  ],
  "Actuators": [],
  "Sensors": [
    {
      "name": "sens1",
      "period": 60000,
      "triggered": "yes"
    }
  ]
}

JSON: 692 bytes - CBOR: 268 bytes
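On the node, a configuration of this form has to be translated into the set of interfaces to expose. The following sketch shows one way to extract that information; the function names are hypothetical, and the configuration is a shortened version of Code 4.2.

```python
def active_methods(config: dict) -> dict:
    """Return, per service, the sorted list of interface entries marked as active."""
    return {
        service["name"]: sorted(
            method
            for method, options in service.get("interface", {}).items()
            if options.get("active", False)
        )
        for service in config.get("Services", [])
    }


def sensor_periods(config: dict) -> dict:
    """Return the configured period of each sensor, keyed by sensor name."""
    return {sensor["name"]: sensor["period"] for sensor in config.get("Sensors", [])}


config = {
    "Services": [{
        "name": "TempService",
        "source": "sens1",
        "interface": {
            "GET": {"active": True, "return": "sens1"},
            "POST": {"active": False},
            "PUT": {"active": True, "receive": "trigger", "return": "trigger"},
            "DELETE": {"active": False},
            "OBSERVABLE": {"active": True, "period": 120, "return": "sens1"},
        },
    }],
    "Actuators": [],
    "Sensors": [{"name": "sens1", "period": 60000, "triggered": "yes"}],
}
methods = active_methods(config)
```

Interfaces that are not active never need to be instantiated, which is how on-demand service creation can reduce both the attack surface and the power consumption.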


4.2.3 Device management

In a large network, having all entities registered in a server is useful for management and
administration. The idea of having a Device Manager is that during the boot process, each
device registers itself with the Device Manager service, which will notify the SCADA system
and start the acquisition process. The Open Mobile Alliance (OMA) has proposed the OMA
Lightweight Machine to Machine (LWM2M) protocol [54] for standardized device management.
At present, there are two widely used solutions: Leshan [55] and Wakaama [56]. Both support a
broad range of standard LWM2M features, and both were tested during the research described
in this thesis; however, the presented framework uses a customized version based on Leshan.

4.2.4 Authentication and authorization

The authentication and authorization services are part of the access control mechanism pre-
sented in the previous chapter (see Chapter 3).

4.3 Case studies

The work presented in this thesis was tested during the development of various projects directly
related to industry. For this reason, a description of each of these case studies is needed.

4.3.1 Mobile machinery monitoring

High-load working vehicles are expensive to maintain. A common problem is the maintenance
of the ball bearings: changing them too soon results in a useless expenditure of money, whereas
changing them too late can cause damage to the wheels, engine, and other parts. Therefore,
finding the right moment at which to change them, before they break, saves time and money.
This project was conducted as part of the Arrowhead project, in collaboration with SKF.
To address this problem, each wheel is assigned a node, with an accelerometer on the axis
and a temperature sensor in the lubricant oil of the ball bearing. Using these sensors, we can
measure the number of rotations per wheel, the direction, impacts, changes in the temperature
of the ball bearing, etc. The nodes are connected to a gateway on the vehicle, with 4G Internet
access for communication with the Arrowhead servers. This gateway can collect and analyze
data and can transmit the results to Arrowhead when it has connectivity.
Figure 4.3 shows accelerometer data collected for one wheel in a wheel loader ﬁeld test.
Data analysis can be performed to extract the speed of the vehicle, any impacts or vibrations,
and, in combination with the Z-axis data, even the angle of the wheel.
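As a rough illustration of how rotations and direction could be extracted from two-axis accelerometer data, the sketch below tracks the direction of gravity as seen by a sensor rotating with the wheel. It assumes that gravity dominates the X and Y readings (low speed, negligible vibration), which is a simplification and not the analysis used in the project; the function name and the synthetic data are hypothetical.

```python
import math


def count_rotations(ax, ay):
    """Return the signed number of rotations from X/Y acceleration samples.

    The angle of the gravity vector in the sensor frame is unwrapped and
    accumulated; the sign of the result indicates the direction of rotation.
    """
    total = 0.0
    prev = math.atan2(ay[0], ax[0])
    for x, y in zip(ax[1:], ay[1:]):
        angle = math.atan2(y, x)
        delta = angle - prev
        # Unwrap jumps across the +/- pi boundary.
        if delta > math.pi:
            delta -= 2 * math.pi
        elif delta < -math.pi:
            delta += 2 * math.pi
        total += delta
        prev = angle
    return total / (2 * math.pi)


# Synthetic data: three full rotations, 50 samples per rotation.
n = 150
ax = [math.cos(2 * math.pi * 3 * i / n) for i in range(n + 1)]
ay = [math.sin(2 * math.pi * 3 * i / n) for i in range(n + 1)]
forward = count_rotations(ax, ay)
backward = count_rotations(ax, [-v for v in ay])
```

Dividing the rotation count by the elapsed time and multiplying by the wheel circumference would then give an estimate of the vehicle speed.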

4.3.2 Smart rock bolts

Historically, there have been many dramatic accidents related to the mining industry. For
example, 29 workers died in a coal mine in New Zealand in 2010, 33 men were trapped for over
two months in a copper-gold mine in Chile in 2010, and 301 workers died in a coal mine in Turkey
in 2014. In all these situations, the mine partially collapsed. Rock bolts are metal bars of 3
meters in length that are placed in the wall of a mine to reinforce its structure. However, in
some situations, the rock bolts can bend or even break because of the work in the mine or

Figure 4.3: Acceleration of one wheel on the X and Y axes during a test

seismic vibrations; under these conditions, the rock bolts can lose their reinforcing properties.
Detecting such situations is critical because they can easily turn into collapse scenarios, which
pose a high risk to workers and cause economic problems for mining companies.

(a) Smart Rock Bolt prototype (b) Position and sensor distribution in a tunnel
installation

Figure 4.4: Smart Rock Bolt

The objective of the Smart Rock Bolt project is to add sensors and electronics to a standard
rock bolt (see Figure 4.4), endowing it with sensing and communication capabilities. The sensors
can measure the pressure on the bar and detect a breakdown situation; moreover, they can also
detect small vibrations in the walls. The data are analyzed locally, and if a dangerous situation
is detected, the data are transmitted to a gateway, which collects the data and analyzes all
possible alarms in the mine. The system can alert the corresponding authority to take actions
such as evacuating or reinforcing the tunnel.
This project was presented as part of the IPSO Challenge 2015 [57] and won ﬁrst place in
the contest.
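The local analysis described above, in which a node transmits only when it detects a dangerous situation, can be sketched as follows. The threshold logic, the message fields, and the numeric values are hypothetical and serve only to illustrate the event-based behavior; the detection algorithm actually used in the Smart Rock Bolt is not given here.

```python
def analyze_window(strain, baseline, threshold):
    """Return the largest deviation from the baseline if it exceeds the threshold, else None."""
    peak = max(abs(s - baseline) for s in strain)
    return peak if peak > threshold else None


def node_cycle(strain, baseline, threshold, send):
    """Analyze one window of samples locally and transmit only on an alarm."""
    peak = analyze_window(strain, baseline, threshold)
    if peak is None:
        return False                      # nothing relevant: the radio stays off
    send({"event": "strain_alarm", "peak_deviation": peak})
    return True


sent = []                                 # stands in for the link to the gateway
quiet = node_cycle([100.0, 101.0, 99.5], baseline=100.0, threshold=5.0, send=sent.append)
alarm = node_cycle([100.0, 112.0, 99.0], baseline=100.0, threshold=5.0, send=sent.append)
```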

4.4 Experiments and results

This section provides a description of the experimental setup for all performed experiments and
an overview of the obtained results.

4.4.1 Test setup

The experimental scenario was a realistic environment outside the laboratory, with a gateway
running the SCADA software and multiple nodes connected to it. The experimental results
could therefore be affected by radio transmissions from other nodes and by network traffic
effects. The goal of the experiments was to determine the energy consumption and delays
generated by the use of the various elements of the proposed framework, including secure com-
munications, the access control mechanism, and the bootstrapping and configuration services.
The experimental configuration relied on measurements of battery current and voltage that
were performed externally to the device by using a 16-bit ADC operating at 1840 Hz to capture
rapid events such as radio transmissions, wakeups, etc. These measurements were combined with
8 digital inputs that could be used to gain detailed information on the power consumption of
each software component. The selected IoT platform was a Mulle (as previously described)
running the Contiki OS; all recorded measurements therefore include the activity of the OS,
which makes the data as realistic as possible. However, running an OS produces peaks caused
by internal processing queues, communications, internal timeouts, events, etc., which can
increase the error levels of the measurements.
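A minimal sketch of how energy could be derived from such samples is given below. It assumes that voltage and current are sampled simultaneously at 1840 Hz and that a digital input marks the samples during which a given software component is active; the function name and the sample values are hypothetical.

```python
SAMPLE_RATE_HZ = 1840.0   # ADC sampling rate stated in the test setup


def energy_mj(voltage_v, current_a, active=None, sample_rate_hz=SAMPLE_RATE_HZ):
    """Integrate V * I over time and return the energy in millijoules.

    If `active` is given, only samples whose flag is True are counted, which
    attributes the energy to the component marked by that digital input.
    """
    dt = 1.0 / sample_rate_hz
    if active is None:
        active = [True] * len(voltage_v)
    joules = sum(v * i * dt for v, i, on in zip(voltage_v, current_a, active) if on)
    return joules * 1000.0


# One second of hypothetical data: 3.7 V and 50 mA, component active half the time.
volts = [3.7] * 1840
amps = [0.05] * 1840
flags = [k < 920 for k in range(1840)]
total = energy_mj(volts, amps)
component = energy_mj(volts, amps, active=flags)
```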

4.4.2 Results
IPsec
Security is a crucial feature for IoT communication. To analyze the energy consumption and
delays of IPsec-ESP, various conﬁgurations were tested. The ESP settings were AES128-CTR
and AES-XCBC. To obtain relevant data, the analysis was performed at two CPU speeds (96
MHz and 48 MHz), with and without hardware acceleration for AES128, and for diﬀerent pay-
loads. Between 20 and 40 measurements were obtained for each payload. Figure 4.5 compares
the energy consumption (a) and delays (b) with and without security.
A detailed analysis of the data revealed the overheads in terms of energy and delays for
transmission (Figure 4.6) and for reception (Figure 4.7), which demonstrate the feasibility of
using this technology for communication protection.
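The overheads can be read as relative increases over the unprotected case. The following sketch assumes that definition, which is not stated explicitly in the text, and uses made-up numbers rather than measured values.

```python
def overhead_percent(secured, baseline):
    """Relative increase of a secured measurement over the unsecured baseline, in percent."""
    if baseline <= 0:
        raise ValueError("baseline must be positive")
    return (secured - baseline) / baseline * 100.0


# Hypothetical example: 4.0 mJ without IPsec and 6.0 mJ with IPsec.
example = overhead_percent(6.0, 4.0)
```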

Other services
The conﬁguration of each service is as follows:

Internet Key Exchange v2 (IKEv2) The key exchange process has two steps: initializa-
tion and authentication. Therefore, both were analyzed separately. The IKEv2 conﬁgu-
ration is AES128-CTR + AES-XCBC + SHA1 + ECP192.

Bootstrapping The bootstrapping analysis included the overheads for the bootstrapping re-
quest and the parsing of the response, yielding information about the Device Manager,
access control, and conﬁguration services (see Code 4.1).

Configuration The configuration analysis included the configuration request, the parsing of
the response, the configuration of a sensor and the creation and deployment of a new
service (see Code 4.2).

Authentication The authentication analysis included the authentication request, the Chal-
lenge Request/Response process, and the parsing of the ticket and attributes.

Authorization The authorization analysis included the ﬁrst authorization request and the
parsing of timeouts and permissions.

Device Manager The Device Manager analysis included the registration of a node in the
OMA LWM2M Server.

Figure 4.8 shows the energy consumption and delays for each service at two CPU speeds
(96 MHz and 48 MHz), and Table 4.1 presents the values obtained in the experiments in more
detail.

Service          Power (mW)         Delay (ms)          Energy (mJ)
                 96 MHz   48 MHz    96 MHz    48 MHz    96 MHz   48 MHz
IKE INIT          195.7    158.6    2145.6    3789.2     419.9    601.1
IKE AUTH          209.8    168.1    3916.5   10650.1     821.8   1791.2
Bootstrapping      68.0     65.2      56.4      55.8       3.8      3.6
Configuration     170.9    134.7      81.9      84.2      14.0     11.3
Authentication    197.9    158.6     188.4     232.8      37.3     36.9
Authorization      74.8     71.7      56.9      56.4       4.8      4.0
Dev. Manager      113.7     90.8      72.8      71.8       8.2      6.5

Table 4.1: Analysis of power consumption, delays and energy overheads for each service
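The three column groups of Table 4.1 appear to be related by energy = mean power × delay. The sketch below checks that assumption against the tabulated values; the relation itself is an inference and is not stated in the text. Under this assumption most rows agree to within about 1%, while the Authorization row at 96 MHz deviates by roughly 11%, which may reflect rounding or averaging in the original measurements.

```python
# (power mW, delay ms, energy mJ) per service and CPU speed, copied from Table 4.1.
TABLE_4_1 = {
    "IKE INIT":       {96: (195.7, 2145.6, 419.9), 48: (158.6, 3789.2, 601.1)},
    "IKE AUTH":       {96: (209.8, 3916.5, 821.8), 48: (168.1, 10650.1, 1791.2)},
    "Bootstrapping":  {96: (68.0, 56.4, 3.8),      48: (65.2, 55.8, 3.6)},
    "Configuration":  {96: (170.9, 81.9, 14.0),    48: (134.7, 84.2, 11.3)},
    "Authentication": {96: (197.9, 188.4, 37.3),   48: (158.6, 232.8, 36.9)},
    "Authorization":  {96: (74.8, 56.9, 4.8),      48: (71.7, 56.4, 4.0)},
    "Dev. Manager":   {96: (113.7, 72.8, 8.2),     48: (90.8, 71.8, 6.5)},
}


def energy_from_power_and_delay(power_mw, delay_ms):
    """mW * ms gives microjoules; divide by 1000 to obtain millijoules."""
    return power_mw * delay_ms / 1000.0


def relative_deviations(table):
    """Relative difference between computed and tabulated energy for every entry."""
    result = {}
    for service, speeds in table.items():
        for mhz, (power, delay, energy) in speeds.items():
            computed = energy_from_power_and_delay(power, delay)
            result[(service, mhz)] = abs(computed - energy) / energy
    return result


deviations = relative_deviations(TABLE_4_1)
worst = max(deviations, key=deviations.get)
```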

4.4.3 Summary
For smaller payloads, communication incurs lower power consumption and shorter delays. The
effect of having the Contiki OS running on the device during the measurements is larger
for smaller payloads. For this reason, the data in Figure 4.6 and Figure 4.7 appear
inconsistent, with high overheads and large error bars for small payloads. However, the overhead
values for payloads over 200 bytes seem to be consistent and stable. Therefore, those are the
values that should be used to compare the overheads between different configurations.
[Plots of (a) energy consumption (mJ) and (b) delay (ms) against payload (0 to 500 bytes). Each panel shows transmission and reception at 96 MHz and 48 MHz for three series labeled None, Hardware, and Software.]

(a) Energy consumption for transmission and reception at 96 MHz and 48 MHz
(b) Delays for transmission and reception at 96 MHz and 48 MHz

Figure 4.5: Analysis of IPsec-ESP communication in terms of energy consumption and delays

[Plots of (a) energy overhead (%) and (b) delay overhead (%) against payload (0 to 500 bytes) for transmission at 96 MHz and 48 MHz, with series labeled Hardware and Software.]

(a) Energy overheads for transmission at 96 MHz and 48 MHz
(b) Delay overheads for transmission at 96 MHz and 48 MHz

Figure 4.6: Analysis of IPsec-ESP communication in terms of overheads for transmission

[Plots of (a) energy overhead (%) and (b) delay overhead (%) against payload (0 to 500 bytes) for reception at 96 MHz and 48 MHz, with series labeled Hardware and Software.]

(a) Energy overheads for reception at 96 MHz and 48 MHz
(b) Delay overheads for reception at 96 MHz and 48 MHz

Figure 4.7: Analysis of IPsec-ESP communication in terms of overheads for reception

[Bar charts of (a) energy consumption (mJ) and (b) delay (ms) for the services IKE_INIT, IKE_AUTH, Bootstrapping, Configuration, Authentication, Authorization, and Manager at 96 MHz and 48 MHz.]

(a) Energy consumption for each service at 96 MHz and 48 MHz
(b) Delays for each service at MCU frequencies of 96 MHz and 48 MHz

Figure 4.8: Analysis of each service in terms of energy consumption and delays
Chapter 5

Contributions

This work was performed within the framework of the Arrowhead project, a European
project to improve interoperability in industrial environments. Some of the focuses of this
project include service orchestration, system orchestration, multi-protocol communications,
zero-conﬁguration operation, and Quality of Service.

[Diagram relating Papers A to G to five research areas: security, energy efficiency, scalability, interoperability, and dependability.]

Figure 5.1: Contribution of each paper to each IoT research area

In developing this thesis, the author has contributed to various interconnected areas of
research related to IoT-WSNs: interoperability, scalability, dependability, efficiency
and security (see Figure 5.1). These areas have been described and discussed in previous
chapters (see Chapter 2, Chapter 3 and Chapter 4).
This chapter presents a summary of the appended papers and an outline of the author’s
main contributions to each paper.

Paper A: A Feasibility Study of SOA-enabled Networked Rock Bolts

Authors: Jens Eliasson, Pablo Puñal Pereira, Henrik Mäkitaavola, Jerker Delsing, Joakim
Nilsson and Joakim Gebart
Published in: Proceedings of 2014 IEEE 19th International Conference on Emerging Tech-
nologies & Factory Automation (ETFA 2014): Barcelona, Spain
In this paper, research concerning the use of IoT rock bolts in mines is presented. Each
rock bolt can measure strain and seismic activity; each node provides its data as a service over
a wireless mesh network. Using the collected data, the ability to detect falling rocks and the
presence of mobile machinery is demonstrated.
The author’s main research contribution to this paper was a study of the possible systems that
could be used to protect the network against potential attacks.
This paper was presented at the IEEE 19th International Conference on Emerging Technologies
& Factory Automation (ETFA) in Barcelona, Spain, in September 2014.

Paper B: EXIP: A Framework for Embedded Web Development

Authors: Rumen Kyusakov, Pablo Puñal Pereira, Jens Eliasson and Jerker Delsing
Published in: ACM Transactions on the Web, 2014
This paper presents the design and implementation techniques of the EXIP framework for
embedded Web development. The framework consists of a highly efficient EXI processor, a
tool for EXI data binding based on templates, and a CoAP/EXI/XHTML Web page engine.
A prototype implementation of the EXI processor is presented and evaluated herein. It can be
applied to Web browsers or thin server platforms using XHTML and Web services for supporting
human-machine interactions in the Internet of Things.
This paper presents four major results: (1) theoretical and practical evaluations of the use of
binary protocols for embedded Web programming; (2) a novel method for the generation of EXI
grammars based on XML Schema definitions; (3) an algorithm for grammar concatenation that
produces normalized EXI grammars directly, consequently reducing the number of iterations
during grammar generation; and (4) an algorithm for the efficient representation of possible
deviations from the XML schema.
The author’s main research contributions to this paper were the integration of the EXIP library
with CoAP in an embedded system and the demonstration of direct interaction between a mobile
IoT device and a web browser, with the direct provision of EXIP data from the resource-
constrained device.
This paper was published in ACM Transactions on the Web in October of 2014.

Paper C: An Authentication and Access Control Framework for CoAP-based Internet of Things

Authors: Pablo Puñal Pereira, Jens Eliasson and Jerker Delsing
Published in: Proceedings of the 40th Annual Conference of the IEEE Industrial Electron-
ics Society (IECON 2014), Dallas, USA
The necessity of a fine-grained access control method for CoAP networks is described and
justified in this paper. It presents an analysis of other security mechanisms that can be useful
in combination with CoAP in constrained embedded systems, identifying the shortcomings of
these mechanisms and the reasons to create a new access control mechanism for CoAP systems.
The author’s main research contribution to this paper was the design of a fine-grained access
control mechanism consistent with the power-efficient CoAP concept for small devices to reduce
overhead and the implementation of a small network to demonstrate its performance.
This paper was published in the Proceedings of the 40th Annual Conference
of the IEEE Industrial Electronics Society (IECON) in Dallas, USA, November 2014.

Paper D: Translation Error Handling for Multi-Protocol SOA Systems

Authors: Hasan Derhamy, Jens Eliasson, Jerker Delsing, Pablo Puñal Pereira and Pal
Varga
Published in: Proceedings of the IEEE 20th International Conference on Emerging Tech-
nologies & Factory Automation (ETFA 2015): Luxembourg, Luxembourg
The problem of networks using multiple protocols is addressed in this paper. In an attempt
to increase interoperability in networks of this type, this paper proposes a solution based on
protocol translation and a study of how to handle speciﬁc protocol error messages.
The author’s main research contribution to this paper was the analysis of the security aspects
involved in translating between protocols.
This paper was presented at the IEEE 20th International Conference on Emerging Technologies
& Factory Automation (ETFA) in Luxembourg, Luxembourg, in September 2015.

Paper E: The Arrowhead Framework Conﬁguration Approach

Authors: Oscar Carlsson, Pablo Puñal Pereira, Jens Eliasson, Jerker Delsing, Bilal Ah-
mad, Robert Harrison and Ove Jansson
Published in: Proceedings of the 42nd Annual Conference of the IEEE Industrial Elec-
tronics Society (IECON 2016), Florence, Italy
The purpose of the Arrowhead project is to provide services for configuration, bootstrapping,
and deployment to enhance dependability and zero-configuration capabilities. Several use cases
for these services are presented in this paper, including building automation, the manufacturing
industry, IoT devices and the process industries.
The author’s main research contributions to this paper were the research, development and
testing of low-power services that can be used by a resource-constrained IoT device to implement
bootstrapping and configuration for sensors, actuators, and services.
This paper has been accepted for presentation at the 42nd Annual Conference of the IEEE
Industrial Electronics Society (IECON) in Florence, Italy, in November 2016.

Paper F: Using Internet of Things for Industrial Applications: A Feasibility Check

Authors: Jens Eliasson, Pablo Puñal Pereira and Jerker Delsing
Submitted to IEEE Journal of Sensors
This paper presents a condition monitoring architecture for industrial applications based
on IoT devices using the OMA LWM2M protocol, IPSO Smart Objects, and the Arrowhead
Framework. The paper analyzes the feasibility of applying this technology to rock bolts in
mines, with a focus on performance and lifetime.
The author’s main research contributions to this paper were the technical aspects: the research,
development, and implementation of the architecture, with particular attention to reducing the
power consumption of the IoT devices.

Paper G: An Eﬃcient IoT Framework for Industrial Applications

Authors: Pablo Puñal Pereira, Jens Eliasson and Jerker Delsing
Submitted to IEEE Internet of Things Journal
The use of IoT technology for condition monitoring has gained relevance over the past five
years; this paper presents an Industrial IoT platform for condition monitoring with the capabil-
ity to adapt itself to the power consumption of each node, improving the nodes’ measurements
or lifetimes when needed. The paper also studies the impact of each applied technology to
analyze the power consumption and delays; these technologies include communication features
such as access control and encryption as well as functional features such as zero-configuration
networking, device management, and reconfiguration at run time.
The author’s main research contributions to this paper were the design and implementation of
the platform as well as the execution of the power consumption and delay analysis to optimize
the platform performance and lifetime.
Chapter 6
Discussion

To conclude this thesis, some reﬂection on the obtained results and the research questions
presented in the Introduction (Chapter 1) is required. As is illustrated in Figure 6.1, the thesis
research began with a real-world problem that motivated these Ph.D. studies, i.e., the current
lack of a mechanism for implementing eﬃcient Wireless Sensor and Actuator Networks with
high interoperability for industrial applications.

[Diagram of the research cycle with the elements real-world problem, research motivation, research question, hypothesis, method, and results, linked by the steps "raise a question", "develop a hypothesis", "test the hypothesis", "answer the question", "evaluate the results", and "discussions: results' impact on the initial problem".]

Figure 6.1: Research methodology for this thesis

This real-world problem of interoperability was addressed close to twenty years ago with
the adoption of Internet technology in embedded devices, as discussed by Delsing et al. [58]
and Sveda et al. [59]. However, the inefficient use of IP caused power consumption
to increase, which prevented the widespread introduction of this technology into industrial
WSANs, especially for battery-powered nodes. In recent years, international alliances such
as the Industrial Internet Consortium (IIC) [60], the IPSO Alliance [61], the Open Mobile
Alliance (OMA) [62], and the Internet Engineering Task Force (IETF) have been investigating
the standardization and promotion of the use of the Internet Protocol. The development of
6LoWPAN finally enabled the use of IP (IPv6) in wireless low-power networks, and the use of
CoAP as an application protocol allows an HTTP-like protocol to be used to deploy services
on resource-constrained devices.
The initial hypothesis of this thesis was that CoAP could be used over 6LoWPAN to cre-
ate a Service Oriented Architecture for industrial WSANs. The subsequent research, based
primarily on experimental work, yielded the results reported in Paper A. Evaluation of this
research motivated the following question: “Is it feasible to use IoT-SOA technology in WSANs
for industrial applications?” considering all of the requirements of IoT devices, i.e., bootstrap-
ping, zero-configuration networking, access control, etc. The hypothesis-results-evaluation loop
required several iterations to evaluate the impact of the results on the initial problem. This
final evaluation is the reason for this chapter.
The accumulated results demonstrate that enabling IoT technology with services for
bootstrapping, configuration, access control, device management, etc. increases the computational
complexity on the nodes but allows their constrained resources to be optimized to reduce
inefficient overheads as much as possible. As demonstrated in Chapter 4 (see Figure 4.5), the
highest power consumption occurs when a node is using wireless communication, especially
during transmission. Thus, at this point, a comparison between polling-based and event-based
WSANs, as considered in this thesis, is warranted.

Communication efficiency
There are two perspectives from which communication efficiency can be evaluated: data effi-
ciency and energy efficiency.
• Data efficiency (see equation 6.3) is the capability of a system to acquire only valuable
data; in other words, it is a measure of how well a system reacts when the measured value
of a source changes. If the data efficiency is 1, the system can obtain all valuable data.
If it is lower than 1, this means that the system is losing some relevant data.

\[ \mathrm{Data}_{\text{acquired}} = f \cdot t \]

where $f$ is the frequency at which the system acquires data.

\[ \mathrm{Data}_{\text{relevant}} = f_{\text{source}} \cdot t \]

where $f_{\text{source}}$ is the frequency of the measured source, or how rapidly the measured
physical variable can change. A change to a measured physical variable is considered a
relevant datum; for example, if the temperature in a room is stable, then only one of all
recorded measurements is a relevant datum, whereas if the temperature is not stable, all
varying values are relevant data.

\[ \mathrm{Data}_{\text{acquired and relevant}} =
\begin{cases}
f \cdot t, & \text{if } f < f_{\text{source}} \\
f_{\text{source}} \cdot t, & \text{if } f \geq f_{\text{source}}
\end{cases} \tag{6.1} \]

\[ \mathrm{Data}_{\text{non-acquired and relevant}} =
\begin{cases}
(f_{\text{source}} - f) \cdot t, & \text{if } f < f_{\text{source}} \\
0, & \text{if } f \geq f_{\text{source}}
\end{cases} \tag{6.2} \]

\[ \mu_{\text{data}} = \frac{\mathrm{Data}_{\text{acquired and relevant}}}{\mathrm{Data}_{\text{relevant}}} =
\begin{cases}
f / f_{\text{source}}, & \text{if } f < f_{\text{source}} \\
1, & \text{if } f \geq f_{\text{source}}
\end{cases} \tag{6.3} \]

• Energy efficiency (see equation 6.5) is the capability of a system to acquire only the
important data and discard all irrelevant values, i.e., a measure of how good the system
is at obtaining relevant values. If the energy efficiency is 1, the system acquires only
appropriate values. If it is lower than 1, this means that the system acquires some
irrelevant values.

\[ \mathrm{Data}_{\text{acquired and non-relevant}} =
\begin{cases}
0, & \text{if } f < f_{\text{source}} \\
(f - f_{\text{source}}) \cdot t, & \text{if } f \geq f_{\text{source}}
\end{cases} \tag{6.4} \]

\[ \mu_{\text{energy}} = 1 - \frac{\mathrm{Data}_{\text{acquired and non-relevant}}}{\mathrm{Data}_{\text{acquired}}} =
\begin{cases}
1, & \text{if } f < f_{\text{source}} \\
f_{\text{source}} / f, & \text{if } f \geq f_{\text{source}}
\end{cases} \tag{6.5} \]

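The total efficiency $\mu$ must therefore include both energy and data efficiency, as shown in
equation 6.6.

\[ \mu = \mu_{\text{data}} \cdot \mu_{\text{energy}} =
\begin{cases}
f / f_{\text{source}}, & \text{if } f < f_{\text{source}} \\
f_{\text{source}} / f, & \text{if } f \geq f_{\text{source}}
\end{cases} \tag{6.6} \]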
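Equations 6.3, 6.5, and 6.6 translate directly into a few lines of code. The sketch below is only a numerical illustration; the function names and the example frequencies are chosen here and do not come from the thesis.

```python
def mu_data(f, f_source):
    """Equation 6.3: fraction of the relevant data that is actually acquired."""
    return f / f_source if f < f_source else 1.0


def mu_energy(f, f_source):
    """Equation 6.5: fraction of the acquired data that is relevant."""
    return 1.0 if f < f_source else f_source / f


def mu(f, f_source):
    """Equation 6.6: total efficiency as the product of both terms."""
    return mu_data(f, f_source) * mu_energy(f, f_source)


# Acquisition at 10 Hz against three hypothetical source frequencies.
matched = mu(10.0, 10.0)       # source changes exactly as fast as the system samples
oversampled = mu(10.0, 5.0)    # half of the acquired samples are irrelevant
undersampled = mu(10.0, 20.0)  # half of the relevant changes are missed
```

With these inputs the total efficiency is 1 only when the acquisition frequency matches the source, and it drops to 0.5 when the two differ by a factor of two in either direction.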
An energy-efficient system is usually a data-inefficient system and vice versa, i.e., an energy-
efficient system acquires data at a low rate to save energy, meaning that the system will likely
lose some relevant data, whereas a system with a high data acquisition rate can measure all
relevant data but will also likely measure some data that are not relevant and is therefore an
energy-inefficient system. The source of the relevant data is usually a physical variable that the
system can sense, and with the exception of time, there is no physical magnitude that exhibits
a constant frequency of change. Thus, upon adding temporal variability to equation 6.6, the
following result is obtained:

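\[ \mu(t) =
\begin{cases}
f / f_{\text{source}}(t), & \text{if } f < f_{\text{source}}(t) \\
f_{\text{source}}(t) / f, & \text{if } f \geq f_{\text{source}}(t)
\end{cases} \tag{6.7} \]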
In a polling-based system, the polling rate may be either constant or even adaptive¹, but
in either case, the polling frequency can be considered constant in time. Therefore, such a
system cannot be perfectly efficient:

\[ \mu(t)\big|_{\text{polling}} \neq 1 \]

As illustrated in Figure 6.2, polling-based systems are not efficient even for small changes
in $f_{\text{source}}$ over time. The efficiency falls to one half or less when
$f_{\text{source}} \leq f_{\text{polling}}/2$ or $f_{\text{source}} \geq 2 \cdot f_{\text{polling}}$.
By contrast, an event-based system acquires data only when the source changes; thus, its
frequency of acquisition is variable over time, and an event-based system can potentially be
fully efficient.

\[ \mu(t)\big|_{\text{event}} = 1 \]

¹ An adaptive polling-based system can modify its polling rate depending on external conditions. For
example, if the system is measuring wind and the wind is more stable during the night than during the
day, the polling rate may differ between day and night.

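[Plot of μ(t)|polling against fsource(t), with axis marks at f/2, f, and 2f and an efficiency mark at 0.5. The region below f is labeled energy inefficiency and the region above f is labeled data inefficiency.]

Figure 6.2: Evolution of the efficiency between fsource(t) and f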

The inefficiency of polling-based systems is a strong argument for implementing WSNs as
event-based systems, especially when the nodes use wireless technology and
are powered by batteries.
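The difference between the two approaches can be illustrated with a short simulation. The sketch below averages the total efficiency over a source whose frequency changes in time, once for a constant polling rate and once for an idealized event-based system whose acquisition rate follows the source exactly. The source profile and the polling rate are hypothetical.

```python
def mu(f, f_source):
    """Total efficiency (equation 6.6), written compactly as min/max."""
    return min(f, f_source) / max(f, f_source)


def mean_efficiency(f_acquisition, f_source):
    """Average efficiency over a sequence of time steps."""
    pairs = list(zip(f_acquisition, f_source))
    return sum(mu(f, fs) for f, fs in pairs) / len(pairs)


# Hypothetical source frequency over eight time steps, in Hz.
f_source = [1.0, 2.0, 4.0, 8.0, 4.0, 2.0, 1.0, 0.5]

polling = [2.0] * len(f_source)   # constant polling rate
event = list(f_source)            # idealized event-based acquisition

polling_mean = mean_efficiency(polling, f_source)
event_mean = mean_efficiency(event, f_source)
```

A real event-based node would not track the source perfectly, so the value obtained for the event-based case should be read as an upper bound.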
The crypto layer of communication has not yet been discussed. There are two standard
solutions for encrypting CoAP communications: IPsec and DTLS. IPsec was used for the work
presented in this thesis; the main reason for this choice was the performance of IPsec
implementations compared with that of DTLS implementations four years ago (as shown by De Rubertis
et al. [63]). At present, the performances of both are similar, and thus, the work presented in
this thesis could be repeated using DTLS instead of IPsec. The differences between the two
have been discussed by many authors, such as Hennebert et al. [33] and Alghamdi et al. [64].

The presented methods of authentication and authorization have been successfully tested in
several IoT applications as part of the Arrowhead project as well as in the IPSO Challenge 2015.
The results presented in this thesis demonstrate the efficiency of the proposed method as a
fine-grained access control mechanism; it can be used to control access at the service and
method levels and can even provide control with regard to a parameter of the CoAP URI.
Concerning overheads, the method does not increase the power consumption by more than 13.3%
in the worst case (a CoAP message without a payload).
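The three levels of granularity mentioned above (service, method, and URI parameter) can be illustrated with a simple permission check. This is a hypothetical sketch: the data structure, the paths, and the parameter names are invented for the example and do not reproduce the ticket format or the implementation described in Chapter 3.

```python
from dataclasses import dataclass
from typing import Optional, Tuple
from urllib.parse import parse_qs, urlsplit


@dataclass(frozen=True)
class Permission:
    path: str                                  # service level, e.g. "/sensors/temp"
    methods: frozenset                         # method level, e.g. {"GET"}
    query: Optional[Tuple[str, str]] = None    # parameter level: required (name, value)


def is_allowed(permissions, method, uri):
    """Return True if any permission covers the requested method and URI."""
    parts = urlsplit(uri)
    params = parse_qs(parts.query)
    for p in permissions:
        if p.path != parts.path or method not in p.methods:
            continue
        if p.query is None or p.query[1] in params.get(p.query[0], []):
            return True
    return False


granted = [
    Permission("/sensors/temp", frozenset({"GET"})),
    Permission("/actuators/valve", frozenset({"PUT"}), ("mode", "manual")),
]

read_ok = is_allowed(granted, "GET", "/sensors/temp")
write_denied = is_allowed(granted, "PUT", "/sensors/temp")
manual_ok = is_allowed(granted, "PUT", "/actuators/valve?mode=manual")
auto_denied = is_allowed(granted, "PUT", "/actuators/valve?mode=auto")
```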

6.1 Conclusions
The work presented in this thesis aims to advance the state of the art in Industrial IoT-WSNs,
and it represents a step forward in the implementation and expansion of IoT technology in
the industrial world. The approach that is presented in this thesis can be used as a guide
for creating new conﬁguration, security, access control, and management mechanisms and for
improving industrial technology.
This thesis answers the main research questions presented in the Introduction (Chapter 1):

1. Is it feasible to use IoT-SOA technology in WSANs for industrial applications?

[Plot of energy consumption against time.]

Figure 6.3: Energy consumption profile of a typical IoT device

The results of this thesis show that it is feasible with an SOA approach. This approach
requires more sophisticated techniques than conventional WSANs, which increases the
computational overhead. However, this complexity can reduce the average power consumption
and improve the data acquisition through the implementation of smart resources, i.e.,
services. Therefore, an IoT-SOA framework can be more efficient than today’s WSANs.

1.1. What are the benefits of adding IoT technology to Industrial WSANs?

The use of IoT technology maximizes interoperability, enables machine-to-machine
(M2M) communications, makes systems easier to upgrade, allows zero-configuration
networking, and increases the ease of maintaining and replacing system components.

2. How can access to exposed IoT nodes be protected and controlled while maintaining per-
formance?

The Authentication and Authorization solution presented in this thesis enables ﬁne-
grained access control with very low penalties in terms of performance and power con-
sumption. This solution, based on the use of tickets, can be adapted for use with standard
solutions such as RADIUS and DIAMETER.

3. How can zero-conﬁguration operation be achieved for an IoT node?

In this thesis, services such as bootstrapping, authentication, authorization, and
configuration are used to demonstrate how a node can receive a customized configuration from
scratch. Hence, an optimized and specific configuration is realized for each node. This
thesis also provides a route toward the creation of dynamic configurations to adapt the
behavior of each node to its current context.

6.2 Future work

The framework presented in this thesis demonstrates the feasibility of using IoT-SOA technol-
ogy in industrial applications. Therefore, the work presented herein can be used as a basis for
further research. There are three diﬀerent topics related to the Industrial IoT concept that
must be addressed in future work: eﬃciency, scalability and Quality of Service (QoS).

Efficiency has already been considered as part of this thesis, but there are many additional
aspects that could not be addressed because of time limitations, including issues that emerged
as a direct result of this research. From the charts presented in Chapter 4, it is easy to
recognize that IKEv2 is the most inefficient component of the IoT system, reaching levels of
consumption one hundred times higher than that of 500-byte encrypted communication. Hence, a
new, more efficient Internet Key Exchange mechanism is needed for IoT applications.
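The weight of IKEv2 can be made concrete with the energy values of Table 4.1. The sketch below sums the energy of all services at each CPU speed and computes the share attributable to the two IKEv2 steps. Treating the sum as the cost of one complete start-up sequence is an assumption made here for illustration.

```python
# Energy per service in mJ, copied from Table 4.1.
ENERGY_MJ = {
    96: {"IKE INIT": 419.9, "IKE AUTH": 821.8, "Bootstrapping": 3.8,
         "Configuration": 14.0, "Authentication": 37.3,
         "Authorization": 4.8, "Dev. Manager": 8.2},
    48: {"IKE INIT": 601.1, "IKE AUTH": 1791.2, "Bootstrapping": 3.6,
         "Configuration": 11.3, "Authentication": 36.9,
         "Authorization": 4.0, "Dev. Manager": 6.5},
}


def ike_share(energies):
    """Fraction of the summed service energy spent in the two IKEv2 steps."""
    ike = energies["IKE INIT"] + energies["IKE AUTH"]
    return ike / sum(energies.values())


share_96 = ike_share(ENERGY_MJ[96])
share_48 = ike_share(ENERGY_MJ[48])
```

Under this assumption, the key exchange accounts for well over 90% of the summed energy at both CPU speeds.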
Another area for improvement is the optimization of the wireless communications at the low-
level protocols, to synchronize the radio and sleep cycles of the microcontrollers and minimize
the time that each device is awake waiting for radio beacons.
The configuration process could be improved by allowing it to be performed completely
dynamically while adapting the behaviors of the sensors and actuators to their current contexts,
e.g., changing sampling rates, temporarily shutting off sensors or actuators, or increasing sleep
cycles to maximize battery life or enhance performance.
The most recent generation of microcontrollers includes specific hardware for processing
certificates; therefore, the use of micro-certificates instead of tickets could be an alternative to
enable better protection in access control.

Industrial process monitoring requires the use of a large number of nodes, for which scal-
ability is critical. During this thesis work, all tests and implementations were performed with
only a limited number of devices; therefore, the presented framework still needs to be tested
and proved for application in massive networks.

This thesis work is based on a Service Oriented Architecture, and one parameter that is
widely used in modern SOA frameworks is the Quality of Service (QoS). For commercial
applications or critical services, this feature is particularly important, e.g., to guarantee that
a ﬁrmware updating service has suﬃcient bandwidth or priority to be completed as soon as
possible or to prioritize alert services ahead of data services.
References
[1] H. Silverstein, “Ceasar, sosus, and submarines: Economic and institutional implications of
asw technologies,” in OCEANS ’78, Sept 1978, pp. 406–410.

[2] C. Y. Chong, S. Mori, E. Tse, and R. P. Wishner, “Distributed estimation in distributed
sensor networks,” in American Control Conference, 1982, June 1982, pp. 820–826.

[3] Cisco, “Cisco visual networking index (vni) complete forecast for 2015 to 2020.” [On-
line]. Available: https://newsroom.cisco.com/press-release-content?type=press-release&
articleId=1771211

[4] Gartner, “Gartner says 6.4 billion connected “things” will be in use in 2016, up 30
percent from 2015.” [Online]. Available: http://www.gartner.com/newsroom/id/3165317

[5] R. Khan, S. U. Khan, R. Zaheer, and S. Khan, “Future internet: The internet of things
architecture, possible applications and key challenges,” in Frontiers of Information Tech-
nology (FIT), 2012 10th International Conference on, Dec 2012, pp. 257–260.

[6] B. Liskov, “Report on workshop on research in experimental computer science,”
p. 49, 06/1992 1992. [Online]. Available: http://oai.dtic.mil/oai/oai?verb=getRecord&
metadataPrefix=html&identifier=ADA256874

[7] IEEE Internet of Things, “Towards a definition of the internet of things (iot),”
IEEE, Tech. Rep., May 2015. [Online]. Available: http://iot.ieee.org/images/files/pdf/
IEEE IoT Towards Definition Internet of Things Revision1 27MAY15.pdf

[8] R. Fielding and J. Reschke, “Hypertext Transfer Protocol (HTTP/1.1): Message Syntax
and Routing,” RFC 7230 (Proposed Standard), Internet Engineering Task Force, Jun.
2014. [Online]. Available: http://www.ietf.org/rfc/rfc7230.txt

[9] M. Belshe, R. Peon, and M. Thomson, “Hypertext Transfer Protocol Version 2
(HTTP/2),” RFC 7540 (Proposed Standard), Internet Engineering Task Force, May 2015.
[Online]. Available: http://www.ietf.org/rfc/rfc7540.txt

[10] P. Saint-Andre, “Extensible Messaging and Presence Protocol (XMPP): Core,” RFC 3920
(Proposed Standard), Internet Engineering Task Force, Oct. 2004, obsoleted by RFC
6120, updated by RFC 6122. [Online]. Available: http://www.ietf.org/rfc/rfc3920.txt

[11] A. Stanford-Clark and H. L. Truong, “Mqtt for sensor networks (mqtt-sn)
protocol specification,” IBM Corporation, Tech. Rep., Nov 2013. [Online]. Available:
http://mqtt.org/new/wp-content/uploads/2009/06/MQTT-SN spec v1.2.pdf


[12] I. Fette and A. Melnikov, “The WebSocket Protocol,” RFC 6455 (Proposed
Standard), Internet Engineering Task Force, Dec. 2011. [Online]. Available: http:
//www.ietf.org/rfc/rfc6455.txt

[13] Z. Shelby, K. Hartke, and C. Bormann, “The Constrained Application Protocol (CoAP),”
RFC 7252 (Proposed Standard), Internet Engineering Task Force, Jun. 2014. [Online].
Available: http://www.ietf.org/rfc/rfc7252.txt

[14] “CC3000 wifi chip from texas instruments - datasheet.” [Online]. Available: http:
//www.ti.com/lit/ds/symlink/cc3000.pdf

[15] “ESP8266 wifi chip from adafruit - datasheet.” [Online]. Available: https://cdn-shop.
adafruit.com/product-files/2471/0A-ESP8266 Datasheet EN v4.3.pdf

[16] G. Montenegro, N. Kushalnagar, J. Hui, and D. Culler, “Transmission of IPv6
Packets over IEEE 802.15.4 Networks,” RFC 4944 (Proposed Standard), Internet
Engineering Task Force, Sep. 2007, updated by RFCs 6282, 6775. [Online]. Available:
http://www.ietf.org/rfc/rfc4944.txt

[17] “TinyOS.” [Online]. Available: http://www.tinyos.net

[18] P. Levis, S. Madden, J. Polastre, R. Szewczyk, K. Whitehouse, A. Woo, D. Gay,
J. Hill, M. Welsh, E. Brewer, and D. Culler, “TinyOS: An Operating System for Sensor
Networks,” Tech. Rep. [Online]. Available: http://people.eecs.berkeley.edu/~culler/
papers/ai-tinyos.pdf

[19] “FreeRTOS OS.” [Online]. Available: http://www.freertos.org

[20] “Contiki OS.” [Online]. Available: http://www.contiki-os.org

[21] A. Dunkels, B. Gronvall, and T. Voigt, “Contiki - a lightweight and ﬂexible operating
system for tiny networked sensors,” in Local Computer Networks, 2004. 29th Annual IEEE
International Conference on, Nov 2004, pp. 455–462.

[22] “eLinux - Embedded Linux OS.” [Online]. Available: http://www.elinux.org

[23] “OpenWSN OS.” [Online]. Available: https://openwsn.atlassian.net/wiki/display/OW/
Home

[24] “RIOT OS.” [Online]. Available: https://www.riot-os.org

[25] M. Blagojevic, M. Nabi, M. Geilen, T. Basten, T. Hendriks, and M. Steine, “A probabilistic
acknowledgment mechanism for wireless sensor networks,” in Networking, Architecture and
Storage (NAS), 2011 6th IEEE International Conference on, July 2011, pp. 63–72.

[26] R. Gonzalez and M. Acosta, “Evaluating the impact of acknowledgment strategies on mes-
sage delivery rate in wireless sensor networks,” in 2010 IEEE Latin-American Conference
on Communications, Sept 2010, pp. 1–6.

[27] S. Moonesamy, “The ‘about’ URI Scheme,” RFC 6694 (Informational), Internet
Engineering Task Force, Aug. 2012. [Online]. Available: http://www.ietf.org/rfc/rfc6694.
txt

[28] Z. Shelby, “Constrained RESTful Environments (CoRE) Link Format,” RFC 6690
(Proposed Standard), Internet Engineering Task Force, Aug. 2012. [Online]. Available:
http://www.ietf.org/rfc/rfc6690.txt

[29] D. C. Bormann, S. Lemay, Z. Technologies, and H. Tschofenig, “A TCP and TLS
Transport for the Constrained Application Protocol (CoAP),” Internet Engineering Task
Force, Internet-Draft draft-ietf-core-coap-tcp-tls-02, Jun. 2016, work in Progress. [Online].
Available: https://tools.ietf.org/html/draft-ietf-core-coap-tcp-tls-02

[30] M. Nottingham and E. Hammer-Lahav, “Defining Well-Known Uniform Resource
Identifiers (URIs),” RFC 5785 (Proposed Standard), Internet Engineering Task Force,
Apr. 2010. [Online]. Available: http://www.ietf.org/rfc/rfc5785.txt

[31] T. Narten and H. Alvestrand, “Guidelines for Writing an IANA Considerations Section in
RFCs,” RFC 5226 (Best Current Practice), Internet Engineering Task Force, May 2008.
[Online]. Available: http://www.ietf.org/rfc/rfc5226.txt

[32] J. Jimenez, M. Koster, and H. Tschofenig, “IPSO Smart Objects,” IPSO Alliance,
Tech. Rep. [Online]. Available: http://www.ipso-alliance.org/wp-content/uploads/2016/
01/ipso-paper.pdf

[33] C. Hennebert and J. D. Santos, “Security protocols and privacy issues into 6lowpan stack:
A synthesis,” IEEE Internet of Things Journal, vol. 1, no. 5, pp. 384–398, Oct 2014.

[34] S. Kent and K. Seo, “Security Architecture for the Internet Protocol,” RFC 4301
(Proposed Standard), Internet Engineering Task Force, Dec. 2005, updated by RFCs
6040, 7619. [Online]. Available: http://www.ietf.org/rfc/rfc4301.txt

[35] P. Hoﬀman, “Cryptographic Suites for IPsec,” RFC 4308 (Proposed Standard), Internet
Engineering Task Force, Dec. 2005. [Online]. Available: http://www.ietf.org/rfc/rfc4308.
txt

[36] S. Frankel and S. Krishnan, “IP Security (IPsec) and Internet Key Exchange (IKE)
Document Roadmap,” RFC 6071 (Informational), Internet Engineering Task Force, Feb.
2011. [Online]. Available: http://www.ietf.org/rfc/rfc6071.txt

[37] S. Raza, D. Trabalza, and T. Voigt, “6lowpan compressed dtls for coap,” in 2012 IEEE
8th International Conference on Distributed Computing in Sensor Systems, May 2012, pp.
287–289.

[38] T. Fossati and H. Tschofenig, “Transport Layer Security (TLS) / Datagram Transport
Layer Security (DTLS) Proﬁles for the Internet of Things,” RFC 7925, Jul. 2016. [Online].
Available: https://rfc-editor.org/rfc/rfc7925.txt

[39] P. Puñal, J. Eliasson, and J. Delsing, “An authentication and access control framework
for coap-based internet of things,” in IECON 2014 - 40th Annual Conference of the IEEE
Industrial Electronics Society, Oct 2014, pp. 5293–5299.

[40] J. Delsing, Ed., Arrowhead Framework: IoT Automation, Devices, and Maintenance.
CRC Press, 12 2016. [Online]. Available: http://amazon.com/o/ASIN/1498756751/

[41] T. Bray, “The JavaScript Object Notation (JSON) Data Interchange Format,” RFC 7159
(Proposed Standard), Internet Engineering Task Force, Mar. 2014. [Online]. Available:
http://www.ietf.org/rfc/rfc7159.txt

[42] C. Bormann and P. Hoﬀman, “Concise Binary Object Representation (CBOR),” RFC
7049 (Proposed Standard), Internet Engineering Task Force, Oct. 2013. [Online].
Available: http://www.ietf.org/rfc/rfc7049.txt

[43] “Arrowhead project.” [Online]. Available: http://www.arrowhead.eu

[44] H. Derhamy, J. Eliasson, J. Delsing, P. Varga, and P. Puñal, Translation Error Handling
for Multi-Protocol SOA Systems, ser. I E E E International Conference on Emerging Tech-
nologies and Factory Automation. Proceedings. IEEE, 2015.

[45] “Libcoap 4.1.1.” [Online]. Available: https://github.com/obgm/libcoap

[46] “Copper 1.0.0.” [Online]. Available: https://github.com/mkovatsc/Copper

[47] “Erbium.” [Online]. Available: https://github.com/contiki-os/contiki/tree/master/apps/
er-coap

[48] “Californium.” [Online]. Available: https://github.com/eclipse/californium

[49] J. Schaad, “CBOR Object Signing and Encryption (COSE),” Internet Engineering
Task Force, Internet-Draft draft-ietf-cose-msg-17, Aug. 2016, work in Progress. [Online].
Available: https://tools.ietf.org/html/draft-ietf-cose-msg-17

[50] E. Wahlstroem, G. Selander, L. Seitz, H. Tschofenig, and S. Erdtman, “Authentication
and Authorization for Constrained Environments (ACE),” Internet Engineering Task
Force, Internet-Draft draft-ietf-ace-oauth-authz-02, Jun. 2016, work in Progress. [Online].
Available: https://tools.ietf.org/html/draft-ietf-ace-oauth-authz-02

[51] G. Selander, J. Mattsson, L. Seitz, and F. Palombini, “Object Security of
CoAP (OSCOAP),” Internet Engineering Task Force, Internet-Draft draft-selander-
ace-object-security-05, Jul. 2016, work in Progress. [Online]. Available: https:
//tools.ietf.org/html/draft-selander-ace-object-security-05

[52] H. Derhamy, J. Eliasson, J. Delsing, and P. Priller, A survey of commercial frameworks for
the Internet of Things, ser. I E E E International Conference on Emerging Technologies
and Factory Automation. Proceedings. IEEE, 2015.

[53] Open Mobile Alliance (OMA), “LWM2M OMA - Bootstrap Interface.” [Online].
Available: http://dev devtoolkit.openmobilealliance.org/IoT/LWM2M10/doc/TS/index.
html#!Documents/bootstrapinterface.htm

[54] Open Mobile Alliance (OMA), “Lightweight Machine to Machine (LWM2M)
v1.0.” [Online]. Available: http://technical.openmobilealliance.org/Technical/
technical-information/release-program/current-releases/oma-lightweightm2m-v1-0

[55] “Eclipse Foundation - Leshan LWM2M.” [Online]. Available: http://www.eclipse.org/
leshan/

[56] Intel, “Wakaama LWM2M.” [Online]. Available: https://projects.eclipse.org/projects/
technology.wakaama

[57] IPSO Alliance, “IPSO Challenge 2015.” [Online]. Available: http://challenge.ipso-alliance.
org/ipso-challenge-2015

[58] J. Delsing, K. Hyyppa, and T. Isaksson, “The ip-meter, design concept and example im-
plementation of an internet enabled power line quality meter,” in Instrumentation and
Measurement Technology Conference, 2000. IMTC 2000. Proceedings of the 17th IEEE,
vol. 2, 2000, pp. 657–660 vol.2.

[59] M. Sveda and R. Vrba, “An integrated framework for sensor-based embedded systems,” in
Engineering of Computer-Based Systems, 2002. Proceedings. Ninth Annual IEEE Interna-
tional Conference and Workshop on the, 2002, pp. 195–202.

[60] “Industrial Internet Consortium (IIC).” [Online]. Available: http://www.iiconsortium.org

[61] “IPSO Alliance.” [Online]. Available: http://www.ipso-alliance.org

[62] “Open Mobile Alliance (OMA).” [Online]. Available: http://openmobilealliance.org

[63] A. D. Rubertis, L. Mainetti, V. Mighali, L. Patrono, I. Sergi, M. L. Stefanizzi, and S. Pascali,
“Performance evaluation of end-to-end security protocols in an internet of things,”
in Software, Telecommunications and Computer Networks (SoftCOM), 2013 21st International
Conference on, Sept 2013, pp. 1–6.

[64] T. A. Alghamdi, A. Lasebae, and M. Aiash, “Security analysis of the constrained appli-
cation protocol in the internet of things,” in Second International Conference on Future
Generation Communication Technologies (FGCT 2013), Nov 2013, pp. 163–168.
Part II

Paper A
A Feasibility Study of SOA-enabled
Networked Rock Bolts

Authors:
Jens Eliasson, Pablo Puñal Pereira, Henrik Mäkitaavola, Jerker Delsing and Joakim
Nilsson

Reformatted version of paper originally published in:

Conference paper, IEEE ETFA, 2014.

© 2014 IEEE. Reprinted, with permission, from Jens Eliasson, Pablo Puñal Pereira,
Henrik Mäkitaavola, Jerker Delsing and Joakim Nilsson, A Feasibility Study of
SOA-enabled Networked Rock Bolts, IEEE ETFA, 2014.

A Feasibility Study of SOA-enabled Networked Rock
Bolts

Jens Eliasson, Pablo Puñal Pereira, Henrik Mäkitaavola, Jerker Delsing and Joakim
Nilsson

Abstract

The use of rock bolts is a widely used approach for increasing mine stability in the
mining industry. However, compared to the automation industry, where the use of
sensors and real-time monitoring of processes has evolved rapidly, the use of rock
bolts has not changed much during the last 100 years. What is missing are technologies
for keeping installed rock bolts under real-time, online monitoring. One problem is
that rock bolts can become damaged by seismic activity or movements within the rock
and thus lose their load-bearing capacity. If that happens, the outer shell of a tunnel’s
walls or ceiling can collapse, with disastrous results. Therefore, there is a clear need for
online, real-time monitoring solutions for strain, and thereby stress, as well as seismic
activity.
In this paper, the current state of the art in research on intelligent rock bolts is
presented. An intelligent rock bolt is the combination of a traditional rock bolt with an
Internet of Things device, i.e., a rock bolt with embedded sensors, actuators, processing
capabilities and wireless communication. In the proposed architecture, every rock bolt
has its own IPv6 address and can establish a wireless mesh network in an ad hoc manner.
By measuring strain and seismic activity and exposing the sensors in the form of services,
large gains in terms of safety and efficiency can be achieved. A number of mining-related
events can be detected, such as stress on the rock bolt, falling rocks and the
presence of mobile machinery. Since the network is based on standard
communication protocols such as IPv6, it is vital to add security mechanisms to prevent
eavesdropping on and tampering with data traffic.
By utilizing the real-time monitoring capabilities of a network of Internet-connected
intelligent rock bolts, it is possible to drastically improve the monitoring of mining activities
and thereby provide workers with a safer working environment.

1 Background and Related work

Mine activity monitoring is today mostly performed with geophones, which are still the most
sensitive devices for detecting earth movement [1]. In mines, geophones are now interconnected
and used to gather microseismic data, which are further analyzed to provide safety predictions
[2, 3].


The mining industry has over time initiated a number of smaller projects to test the
function of rock bolts. This has led to some functional rock bolt monitoring specifications
[4, 5]. Some of the most important are:

• Measure static and dynamic rock bolt loads of <300 kN.

• Dynamics to be captured are <100 Hz; thus, a sampling rate of 1 kHz will be sufficient.

• True load measured with an accuracy of 2 %.

• Not sensitive to uneven loading on the bolt plate.

• A cable-free system.

• Continuous load sampling over time (with the possibility to set sampling intervals).

• Lifetime without changing the power supply >12 months (using a battery).

There are several approaches to one-shot testing of rock bolts. Ultrasound is
one common approach, in which bolt load is measured through speed-of-sound measurements.
Several scientific papers and patents exist in this field; one example is [6].
Some suppliers of ultrasound measurement technology for bolt load measurements are:

• USM-3 by Norbar [7]

• Hevii - US bolt load technology [8].

• Boltscope-II by Hydratight [9]

This ultrasound technology has the potential to provide the most information on the
changes in the rock bolt. The technology is still rather young, and much development can
be expected in the future. The major drawback is the price tag. An attractive approach
for strain gauge sensing applied to rock bolt load measurements is the MMT prototype
from Hitec Corporation [10]. They exhibit a custom device drilled into the
head of the bolt. The major drawback is its sensitivity to non-axial loads. To our
understanding, the development has been halted.
The process automation industry, where sensors, actuators, distributed
control systems and other technologies are widely used, has responded well to the new
possibilities that networked embedded devices, e.g., the Internet of Things (IoT) and
Cyber-Physical Systems (CPS), can offer [11]. The use of IP-based networked sensor and
actuator devices with vertical integration into traditional industry systems is currently being
investigated in some of Europe’s largest automation R&D projects, such as
FP7 IMC-AESOP [12] and Artemis Arrowhead [13].
The COBS project [14] at Luleå University of Technology aims at developing smart
conveyor belt rollers for the mining industry and logistics. By equipping a conveyor belt
roller with a wireless sensor node and additional sensors, the roller is able to monitor
itself and thereby send alarms when, for example, a ball bearing is getting too warm,
which is an indication of ball bearing damage. The higher-level system is used to alert
operators of any anomalies or alarms, assist in scheduling maintenance, and reduce
costs through less unexpected downtime.

2 Architecture
This section outlines the core architecture of the intelligent rock bolt, including its sensing
and networking capabilities, its support for communication, and its security.

Intelligent rock bolt

The current proposed design of the intelligent rock bolt is composed of several individual
components. The base is a standard rock bolt, which is equipped with measurement
electronics. The core of the electronic system is the Mulle platform from Eistec AB [15].
The Mulle is a low-power sensor node designed for Internet of Things applications. The
current Mulle features a 16-bit microcontroller, analog and digital inputs and outputs, an
868 MHz IEEE 802.15.4 transceiver, several memories and power management circuits.
An interface board for the strain and vibration sensors, described in more detail in the
next section, is connected to the Mulle. The Mulle runs the Contiki operating
system from Dunkels et al. [16].

Electronics and sensors

The measurement system consists of a strain sensor and an accelerometer. The accelerom-
eter is mounted on a printed circuit board (PCB) while the strain sensor is external to
the PCB, i.e. mounted inside the rock bolt’s head. Both these sensors produce a voltage
which is sampled by two 24-bit analog-to-digital converters (ADCs). These ADCs are
mounted on the PCB which also hosts a connector that allows the ADCs to communicate
with the Mulle using a high-speed SPI port.
The measurement board hosts a high-density connector for interfacing the Mulle. It
also features some LEDs for development use, power supply, etc. Figure 1 shows the
circuit of the vibration sensor system with Mulle platform.
The two sensors, accelerometer and strain, have been chosen so that the rock
bolt can monitor the two most important factors for mine stability. Seismic activity will
cause vibrations in the rock, and forces lead to tensions in the rock which, when released,
can result in small earthquakes. These quakes can in the worst case result in the collapse of
tunnels, or even portions of the mine.

Internet of Things networking stack

Figure 1: Sensor node electronics

The current communication stack is based on previous work from several research projects.
Other research projects that have been developing the Mulle architecture are EU FP7
IMC-AESOP, I2Mine, and Artemis Arrowhead. The current version of the Mulle’s
communication stack is based on the IEEE 802.15.4 standard and uses IPv6 and RPL
over 6LoWPAN. Data is normally transmitted as SenML encoded in XML (with
optional EXI compression by the EXIP parser [17]) over CoAP. Figure 2 shows the Mulle’s
communication stack.
The software side of the strain and acceleration measurements was implemented as
CoAP [18] services. A CoAP service is easily accessible through a web browser that
supports it. This provides simplicity in monitoring and configuring the rock bolts, as it
can be done through a standard web interface over the Internet. CoAP is a protocol
designed to be used on resource-constrained, low-power electronic devices.

Figure 2: Rock bolt communication stack

Since Contiki, which is used on the Mulle platform and hence on the rock bolts, runs RPL
[19], it is possible to create mesh networks, i.e., networks with multi-hop support. The mesh
networking support has been experimentally verified on the rock bolts, and the performance
of RPL has been investigated by Potsch et al. in [20]. Time synchronization is
performed using the NTP protocol. Wireless re-programming of Mulle devices is handled
by a custom-written CoAP service. The Mulles are connected to existing networks (i.e.
Ethernet) using a BeagleBone-based gateway, also equipped with an IEEE 802.15.4 radio
transceiver. The gateway hosts several services, such as RPL, NTP, and a number of
CoAP services. The gateway is connected to its back-end system using an encrypted
VPN solution. This ensures that sensor data is transmitted from a Mulle to database
servers over encrypted channels only.
The use of an Ultra-Wide Band (UWB) chip from Decawave has also been investigated.
Preliminary results indicate that UWB is a viable solution for environments with severe
multi-path problems. This will be studied further as well. The use of UWB in
combination with distributed event detection and pattern recognition, as proposed in [21],
could provide one solution for performing detection and classification of mining-related
activities.

Measurement software
For the strain measurements, a CoAP service was created that can retrieve a strain
sample at any time. Also, a threshold value can be set so that the user of the service
is notified when a collected measurement has changed by a specified amount
since the threshold was set. This is realized through CoAP’s Observe mechanism.
Moreover, the sampling interval of the notifying service can be set through another CoAP
service.
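
As an illustration, the threshold logic behind such a notifying service could be sketched as follows. This is a minimal Python sketch and not the code running on the Mulle; the class name, the method names and the `notify` callback (standing in for a CoAP Observe notification) are hypothetical, and it is assumed that the reference value stays fixed until a new threshold is set.

```python
from dataclasses import dataclass
from typing import Callable


@dataclass
class StrainThresholdNotifier:
    """Notify an observer when strain differs from a reference by a set amount."""

    # Hypothetical hook; on a real node this would trigger an Observe notification.
    notify: Callable[[float], None]
    # No notifications are sent until a threshold has been set.
    threshold: float = float("inf")
    # Sample value captured at the moment the threshold was set.
    reference: float = 0.0

    def set_threshold(self, threshold: float, current_sample: float) -> None:
        """Store the threshold and remember the current sample as reference."""
        self.threshold = threshold
        self.reference = current_sample

    def on_sample(self, sample: float) -> bool:
        """Handle one new sample; return True if a notification was sent."""
        if abs(sample - self.reference) >= self.threshold:
            self.notify(sample)
            return True
        return False
```

A client would register as an observer once and then receive only the samples that cross the threshold, which keeps radio traffic low compared with polling.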
For the acceleration measurements, a CoAP service was created that instructs the
sensor to store a given number of acceleration samples in the internal flash memory of
the Mulle. When the logging is complete, the samples can be fetched through another
service. Acceleration measurements are done in this way because acceleration data must be
sampled at a much higher data rate than the available bandwidth of the wireless network.
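
The reasoning behind this log-then-fetch approach can be made concrete with a small calculation. The sketch below is illustrative only: the 24-bit sample width comes from the text, while the sampling rate (borrowed from the 1 kHz figure in the specification list), the radio bit rate and the usable share of that bit rate are assumptions and not measured values.

```python
def raw_data_rate_bps(sample_rate_hz: float, bits_per_sample: int, channels: int = 1) -> float:
    """Raw sensor data rate in bit/s, before any protocol overhead."""
    return sample_rate_hz * bits_per_sample * channels


def can_stream_live(raw_bps: float, link_bps: float, usable_fraction: float) -> bool:
    """True if the raw stream fits into the usable share of the radio link."""
    return raw_bps <= link_bps * usable_fraction


def fetch_time_s(n_samples: int, bits_per_sample: int, link_bps: float, usable_fraction: float) -> float:
    """Time in seconds needed to fetch a stored log over the radio link."""
    return n_samples * bits_per_sample / (link_bps * usable_fraction)


# The 24-bit sample width comes from the text; all other numbers are assumptions.
BITS_PER_SAMPLE = 24
ASSUMED_SAMPLE_RATE_HZ = 1000.0   # borrowed from the 1 kHz figure in the specification list
ASSUMED_LINK_BPS = 20_000.0       # hypothetical nominal radio bit rate
ASSUMED_USABLE_FRACTION = 0.5     # hypothetical share left after headers, ACKs and duty cycling

if __name__ == "__main__":
    raw = raw_data_rate_bps(ASSUMED_SAMPLE_RATE_HZ, BITS_PER_SAMPLE)
    print("raw rate [bit/s]:", raw)
    print("live streaming possible:",
          can_stream_live(raw, ASSUMED_LINK_BPS, ASSUMED_USABLE_FRACTION))
```

Under these assumed numbers the raw stream exceeds what the link can carry, so buffering to flash and fetching later is the natural design.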

Communication security
A high level of security usually means complex methods and algorithms, and therefore more
CPU time and higher energy consumption. For this reason, security design is a critical task
in low-power systems and networks. Today, one of the most widespread security solutions
over 6LoWPAN is IPsec, an extension of the IP protocol that adds security to IP
and higher layers. It was developed for the “new” IPv6 standard and was later adapted
to include IPv4 as well.
IPsec has two different protocols, Authentication Header (AH) and Encapsulating Security
Payload (ESP), to provide authentication, integrity
and confidentiality for communication [22]. IPsec can protect the complete IP datagram
(Tunnel mode [23]) or only the protocols of higher layers (Transport mode). In
Tunnel mode, the IP datagram is encapsulated completely inside a new IP datagram
that uses IPsec (the final destination IP address of the datagram could even be different).
In Transport mode, IPsec only manages the content of the IP datagram, adding the IPsec
header between the original IP header and the headers of higher layers, as shown in Figure 3.
To protect the integrity of IP datagrams, IPsec uses hash-based message
authentication codes (HMACs). To protect the
confidentiality of IP datagrams, IPsec uses standard symmetric cipher algorithms (in
our case AES-128, but it could work with any other cipher, such as AES-256). In order
to protect against Denial of Service (DoS) attacks based on replayed packets, IPsec uses
a sliding window. Each packet receives a sequence number and is accepted by the receiver
only if that number is inside this window or ahead of it. Any older packets are immediately
discarded. This is an efficient protection mechanism against attacks with message repetition,
especially when the attacker resends sniffed original packets.

Figure 3: IPsec encryption and authentication
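
The sliding-window check can be sketched as follows. This is a generic illustration of the anti-replay technique and not the compressed IPsec implementation used on the Mulle; the class name, the method name and the window size of 32 are assumptions. In a real IPsec stack the window would only be updated after the integrity check of the packet has succeeded.

```python
class AntiReplayWindow:
    """Sliding-window replay check over packet sequence numbers."""

    def __init__(self, size: int = 32) -> None:
        self.size = size      # number of sequence numbers tracked behind the newest one
        self.top = 0          # highest sequence number accepted so far
        self.bitmap = 0       # bit i set means sequence number (top - i) was already seen

    def accept(self, seq: int) -> bool:
        """Return True if the packet is new, False if it is too old or a replay."""
        if seq <= 0:
            return False
        if seq > self.top:
            # Packet is ahead of the window: slide the window forward.
            shift = seq - self.top
            mask = (1 << self.size) - 1
            self.bitmap = ((self.bitmap << shift) | 1) & mask
            self.top = seq
            return True
        offset = self.top - seq
        if offset >= self.size:
            return False      # older than the window: discard
        if self.bitmap & (1 << offset):
            return False      # already seen: replayed packet
        self.bitmap |= 1 << offset
        return True
```

A receiver would keep one such window per security association and drop every packet for which `accept` returns False.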
The current IPsec implementation, which is based on the compressed IPsec design
developed by Raza [24], is under development and does not support directional keys.
IPsec should use a different secret key for each direction of the communication
between a given client and server, but this implementation uses the same key for both
directions, which reduces the security level. One big step forward is the implementation of
Internet Key Exchange (IKE), which is currently work in progress. With IKE, IPsec could
change and choose the correct secret key for each communication. The use of DTLS
encryption for CoAP would further increase the communication security [25].

3 Performed experiments
This section presents the tests and experiments that have been performed and gives an
overview of the hardware and software setup of each test.

Test overview
In order to investigate the performance and feasibility of the rock bolt design, several tests
were performed. The first set of tests was performed indoors in a controlled laboratory
environment. Once it was confirmed that the sensing electronics, as well as the integration
between the electronics and the rock bolt, were functioning as planned, the next step
was taken by performing tests using four rock bolts in an active mine. The mine test
system comprised a total of four intelligent rock bolts, two Linux-based BeagleBone
devices, interface cables, and power supplies.

Figure 4: Intelligent rock bolt installed in mine

Laboratory test setup

In the initial laboratory test, the strain sensor was mounted on a device constructed to
simulate strain. This device was fastened to a desk, and different torques were applied
to the nut of the device to simulate the strain of a rock bolt. The accelerometer was
also tested in a laboratory setup in which vibrations were measured. All measurement
data were transmitted wirelessly using a CoAP service over a 6LoWPAN network and
stored to file for later processing and visualization. A Java implementation of CoAP,
Californium [26], was used to retrieve all measurements.

Mine installation
Figure 4 shows how an intelligent rock bolt is installed in a mine tunnel. The rock bolt
itself is around three meters long, and the head carries the Mulle and sensor interface board
inside the grey plastic box. The strain sensor is located inside the stainless steel head.
The two cables, one for power and one for data, are connected to the power supply and
the data logger, respectively. This installation is a prototype device and not of production
quality. In practice, the electronics must be better protected in order to
withstand the harsh environment inside an active mine, but for prototyping and testing
this approach was sufficient.

Performed tests
When all four rock bolts were installed and equipped with the electronics for measuring
strain and vibration, the two BeagleBone-based data loggers were time synchronized
using NTP over a 100 Mbit/s Ethernet cable. Each logger stored data from one pair
of rock bolts installed on the same tunnel wall. This procedure was performed during a
total of three days in order to collect as much data as possible.
Several diﬀerent experiments were conducted in the mine in order to collect as much
relevant data as possible. The performed experiments were:

Strain
The strain was recorded on all four rock bolts.

Tunnel wall vibration

A metal object was used as a hammer to hit the tunnel wall and the vibrations were
recorded.

Top hammer drill rig

The vibrations generated by a production top hammer drill rig some 30 meters away from
the rock bolts were recorded.

Falling rocks
A rock was dropped in order to simulate the event of rocks falling from a tunnel’s ceiling.

Vehicle detection
A car was driven past the rock bolts and the generated vibrations were recorded.

4 Results
This section presents results from the collected data from the laboratory experiments
performed in August 2013 as well as from the Kittilä mine experiments performed in
October 2013. All data processing and plots were performed using Matlab. For the
accelerometer, the Z-axis has been used which corresponds to vibrations along the length
of the rock bolt.
Note that a 24-bit ADC has been used, together with an accelerometer that can
measure static acceleration (i.e., the gravity component is visible in the signal). A
DC-blocking filter could be used to remove this offset. The accelerometer will see a
different offset depending on the angle at which the rock bolt is installed.
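
One common form of DC-blocking filter is the first-order recursion y[n] = x[n] - x[n-1] + r·y[n-1]. The sketch below shows it in Python for illustration only; the paper's processing was done in Matlab, and the function name and the pole value r = 0.995 are assumptions that would need tuning to the actual sampling rate.

```python
from typing import Iterable, List


def dc_block(samples: Iterable[float], r: float = 0.995) -> List[float]:
    """Remove a constant offset with y[n] = x[n] - x[n-1] + r * y[n-1].

    r (0 < r < 1) sets how slowly the filter tracks the offset; values
    close to 1 preserve more of the low-frequency content.
    """
    out: List[float] = []
    prev_x = 0.0
    prev_y = 0.0
    for i, x in enumerate(samples):
        if i == 0:
            # Start from the first sample so a large static offset
            # (e.g. gravity) does not create an initial step transient.
            prev_x = x
        y = x - prev_x + r * prev_y
        out.append(y)
        prev_x, prev_y = x, y
    return out
```

Because the filter only depends on differences between consecutive input samples, the installation angle of the rock bolt, which changes the static offset, does not affect the filtered output.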

Laboratory strain measurements

To test the linearity of the strain sensor, different torques were applied to the strain
simulation device. Four different boards and sensors were tested, labeled 2, 3, 4 and 5,
and the strain output as a function of applied torque was recorded. The measurements
were taken at torques of 0, 40, 50, 70 and 80 Nm. Ten measurements were taken for each
value of torque, and the mean and standard deviation of the measurements
were then plotted. The resulting plots are shown in Figure 5. They show that
the strain measurement sensors have good linearity properties.

[Plot: strain (×10^6) versus torque [Nm] for Board #2, Board #3, Board #4 and Board #5;
torque axis from 0 to 80 Nm, strain axis from 8.38 to 8.52]

Figure 5: Strain measurements for varying torque
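
Linearity of this kind can be quantified with a least-squares line fit and its coefficient of determination. The following is a minimal sketch under the assumption that the per-torque mean values are available as plain lists; the function name is hypothetical, and no measured values from the experiments are reproduced here.

```python
from statistics import mean
from typing import Sequence, Tuple


def linear_fit(x: Sequence[float], y: Sequence[float]) -> Tuple[float, float, float]:
    """Least-squares fit y = slope * x + intercept; returns (slope, intercept, r2)."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("x and y must have the same length (at least 2)")
    mx, my = mean(x), mean(y)
    sxx = sum((xi - mx) ** 2 for xi in x)
    if sxx == 0:
        raise ValueError("x values must not all be identical")
    sxy = sum((xi - mx) * (yi - my) for xi, yi in zip(x, y))
    slope = sxy / sxx
    intercept = my - slope * mx
    ss_res = sum((yi - (slope * xi + intercept)) ** 2 for xi, yi in zip(x, y))
    ss_tot = sum((yi - my) ** 2 for yi in y)
    # r2 close to 1 means the readings follow a straight line closely.
    r2 = 1.0 if ss_tot == 0 else 1.0 - ss_res / ss_tot
    return slope, intercept, r2
```

Applied to the mean strain output of one board at 0, 40, 50, 70 and 80 Nm, an r2 value close to 1 would support the visual impression of good linearity.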

Steel rod
In this first mine-based test, a rock bolt rod was used as a hammer to hit the wall near
one of the installed rock bolts. This was repeated eight times in order to get a better
understanding of the signal amplitudes that could be expected from a very
strong source of vibration in close proximity to a rock bolt. The vibration data collected
are shown in Fig. 6.

Figure 6: Steel rod hit on wall

The waveform clearly shows when the rod hits the tunnel wall and generates
a vibration pattern. Amplitudes of this level, or even higher, would probably also be
generated if a mobile machine drove too close to a wall and brushed against it. The
rock bolts can therefore be used for anomaly detection around vehicles.

Drill test
The second mine-based test was performed in order to investigate if a rock bolt’s vibration
sensor can be used to detect mining-related activities such as drilling. A mobile top
hammer production drill rig, located approximately 25-30 meters from the installed rock
bolts, was used as a vibration source.

Figure 7: Drilling detection

The signal in Figure 7 clearly shows, at 140 and 360 seconds, when the drilling
machine drills, takes a short pause to insert a new rock tool, and starts drilling again.
This indicates that a rock bolt can be used to detect drilling activity in close proximity,
and even to count how many holes have been drilled.

Figure 8: Vehicle detection

Vehicle detection
One important feature that can be used for localizing vehicles is the ability of a rock bolt
to monitor the presence of nearby vehicles. This can be used for fine-grained localization
of mobile machinery such as cars, trucks, etc. Figure 8 shows the raw and unfiltered
vibration signal from one rock bolt when a car was operated in the vicinity. At 265 seconds
into the signal, the car’s engine was turned off, which is clearly visible as a sharp drop
in signal amplitude. The plotted signal is the raw output from the sensor, without any
applied signal processing such as filtering. By applying filtering techniques, the presence
of a nearby vehicle could be detected [27].
How larger vehicles, such as loaders and trucks, will be observed is currently unknown.
However, previous work performed within the iRoad project indicates that heavier vehi-
cles generate higher amplitude levels, as shown by Hostettler et al. [28].

Falling rock detection

Rocks falling from a tunnel's ceiling are a clear indication of pending danger. When this
occurs, the tunnel could collapse, or larger and heavier rocks could fall,
which could result in damage to vehicles and machinery as well as injuries to workers.
In order to see if the rock bolts could detect falling rocks, a simple experiment was

Figure 9: Falling rocks detection

performed by dropping a loose rock weighing approximately 3-4 kg from a height of around two
meters onto the tunnel's floor, around 1.5 meters from the rock bolt. At 314,
323 and 332 seconds in the signal shown in Fig. 9, three spikes are clearly visible. This
indicates that an intelligent rock bolt can be used to detect falling rocks. When this
feature is combined with the wireless communication capabilities, this could be used for
a near real-time alarm system.
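
As a rough illustration of how such spikes might be flagged in a sampled vibration signal, the sketch below marks samples whose absolute amplitude exceeds a multiple of the median absolute amplitude. The function name, the threshold factor, the minimum gap and the synthetic signal are all assumptions made for illustration; none of them are taken from the rock bolt firmware or from the recorded data.

```python
from statistics import median

def detect_spikes(samples, factor=8.0, min_gap=50):
    """Return indices where |sample| exceeds factor * median(|samples|).

    Detections closer than min_gap samples to the previous one are merged,
    so a single impact is reported once. All parameters are illustrative.
    """
    magnitudes = [abs(s) for s in samples]
    threshold = factor * median(magnitudes)
    events = []
    for i, m in enumerate(magnitudes):
        if m > threshold and (not events or i - events[-1] >= min_gap):
            events.append(i)
    return events

# Synthetic signal: low-level background with three injected impacts.
signal = [0.01 * ((i * 7) % 5 - 2) for i in range(1000)]
for pos in (300, 500, 800):
    signal[pos] = 1.0
spikes = detect_spikes(signal)
```

A median-based threshold is used here only because it is not pulled upwards by the spikes themselves; a deployed detector would need to be tuned against real measurements.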

Strain test
A strain gauge sensor can be used to monitor stress in pillars, tunnel walls and ceilings.
Strain can be a good indication of how strong the forces affecting a volume of rock are.
The strain sensor is currently mounted at the rock bolt's head; however, this severely
limits the amount of strain that the sensor can detect, because the
shotcrete prevents the forces from propagating along the rock bolt.
The output from the strain sensor, shown in Figure 10, confirms this. For
better performance, the strain gauge sensor must be re-designed. This is
considered future work.

5 Future work
Some of the more prominent features that need more work are: efficient signal processing
of captured data, sufficiently low-power operation for sensing, processing and communication,
and integration with back-end mine monitoring systems. Performance of the used

Figure 10: Strain gauge sensor output

sensors also needs more testing, especially the strain gauge, which is challenging to observe
in a mine due to the very high time constants and slow change rates. The mounting
of the strain gauge also needs more investigation. Another key issue that needs more
research is how traditional Internet of Things protocols and technologies, which
were originally designed for very low data-rate transmission with no or low real-time
requirements, will behave when larger amounts of data must be streamed, i.e., from vibration
sensors, with high requirements on low-latency transmission. The impact of scalability
and security issues must also be investigated further. The approach to self-learning
methods for signal processing proposed in [29] would be interesting to evaluate
for rock bolt usage. A further issue to explore is how strain and/or stress information
and vibration data can be successfully integrated in today's monitoring systems.
In order to secure the communication and SOA model, the IPsec protocol must be
enhanced with a key exchange mechanism such as IKEv2 [30]. A system for fine-grained access
control, such as RADIUS, is also needed in order to allow or deny specific clients access to
services.

6 Conclusion
The use of rock bolts in the mining industry is a well-known approach for increasing
stability in, for example, tunnels, and thereby increasing safety for workers. However, what
has been missing is a method of keeping installed rock bolts under constant monitoring.
Compared to the process automation industry, where the use of sensors and SCADA
systems is common, rock bolt monitoring has seen little improvement.
This paper has presented a novel method for rock bolt monitoring, and the design of an
intelligent rock bolt architecture with on-board sensing, processing and communication
capabilities. The intelligent rock bolt, which comprises a standard rock bolt, sensors
and actuators, signal processing, data storage and wireless communication, can monitor
itself and send alarms when seismic activities are detected, or when different mining

activities are observed. Since security is highly important, the rock bolts have also
been equipped with a security framework designed to provide tamper-free and secure
communication. The rock bolt can detect, among others, the following
mining-related activities:
• Deviation of strain on rock bolts
• Drilling
• Usage of mining machinery
• Falling rocks
This paper has also presented concrete test results from a mine-based field test using a
low-cost intelligent rock bolt as the measurement device. Results from the tests indicate
that a traditional rock bolt can be equipped with sensors, and that the sensors are capable
of detecting mining-related activities.
Test results also show that successful integration between low-power electronics and a
standard rock bolt is feasible. When all results presented in this paper are summarized,
it is clear that intelligent rock bolts can be used within the mining industry to produce
a better and safer working environment.

7 Acknowledgment
The authors would like to express their gratitude toward Agnico Eagle’s Kittilä mine for
allowing us to perform our experiments there. We would especially like to thank Antti
Pyy and André van Wageningen for their assistance with the preparations and support
during the field tests. The authors would also like to thank Mikael Larsmark for his
contributions to the development of hardware and software for the intelligent rock bolt.
The authors would also like to thank Fredrik Sandin for fruitful discussions regarding
signal processing and data analysis.
We would also like to express our gratitude towards our partners within the I2Mine
and Arrowhead projects, and the European commission and Artemis for funding. We
would also like to thank Gluetech AB for assisting us with the strain gauge sensors and
Eistec AB for their support with the Mulle platform.

References
[1] C. E. Krohn, “Geophone ground coupling,” Journal of GeoPhysics, vol. 49, pp.
722–731, 1984.
[2] A. T. Kunnath and M. V. Ramesh, “Integrating geophone network to real-time
wireless sensor network system for landslide detection,” in Proc. First International
Conference on Sensor Device Technologies and Applications, IEEE. IEEE, 2010,
pp. 167–171.

[3] B. L. F. Daku and J. Salt, “Directional performance of an algorithm used to locate
microseismic events in underground mines,” in IECON 2011 - 37th Annual
Conference on IEEE Industrial Electronics Society, Nov 2011, pp. 2198–2201.
[4] G. Bäckblom, “Project plan migs wp3 monitoring of rock bolt load in underground
openings,” RTC, Tech. Rep., Dec. 2008.
[5] J. Delsing, “Migs wp3 monitoring of bolt load - review of sensor technology for bolt
load measurements,” EISLAB, Luleå University of Technology,, SE-971 87 Luleå,
Sweden, Tech. Rep., 2009.
[6] O. G. et.al., “Us patent 4,402,222, bolt load determining apparatus,” Tech. Rep.,
Sept. 1983.
[7] [Online]. Available: http://www.norbar.com/
[8] [Online]. Available: http://www.heviitech.com/Hevii\ UT.html
[9] [Online]. Available: http://www.hydratight.com/en/products/ultrasonics/
boltscope\-ii
[10] [Online]. Available: http://www.globalspec.com/Supplier/CustomProductDetail/
HITEC?Comp=10&QID=13910091&ExhibitId=42329
[11] S. Karnouskos, O. Baecker, L. de Souza, and P. Spiess, “Integration of soa-ready
networked embedded devices in enterprise systems via a cross-layered web service
infrastructure,” in Emerging Technologies and Factory Automation, 2007. ETFA.
IEEE Conference on, Sept 2007, pp. 293–300.
[12] “IMC-AESOP - Architecture for Service-Oriented Process Monitoring and Control,”
Feb. 2013. [Online]. Available: http://www.imc-aesop.eu
[13] “Arrowhead - Enable collaborative automation by networked embedded devices.”
Feb. 2013. [Online]. Available: http://www.arrowhead.eu/
[14] J. Eliasson, R. Kyusakov, and P.-E. Martinsson, “An Internet of Things approach
for intelligent monitoring of conveyor belt rollers,” in International Conference on
Condition Monitoring and Machinery Failure Prevention Technologies - CM2013,
June 2013.
[15] “Eistec AB,” Feb. 2013. [Online]. Available: http://www.eistec.se/
[16] A. Dunkels, B. Gronvall, and T. Voigt, “Contiki - a lightweight and flexible operating
system for tiny networked sensors,” in Local Computer Networks, 2004. 29th Annual
IEEE International Conference on, Nov 2004, pp. 455–462.
[17] R. Kyusakov, H. Makitaavola, J. Delsing, and J. Eliasson, “Efficient XML interchange
in factory automation systems,” in IECON 2011 - 37th Annual Conference on IEEE
Industrial Electronics Society, Nov 2011, pp. 4478–4483.

[18] C. Bormann, A. Castellani, and Z. Shelby, “CoAP: An application protocol for
billions of tiny internet nodes,” Internet Computing, IEEE, vol. 16, no. 2, pp. 62–67,
March 2012.
[19] J. Tripathi, J. De Oliveira, and J. P. Vasseur, “A performance evaluation study of
RPL: Routing protocol for low power and lossy networks,” in Information Sciences
and Systems (CISS), 2010 44th Annual Conference on, March 2010, pp. 1–6.
[20] T. Potsch, K. Kuladinithi, M. Becker, P. Trenkamp, and C. Goerg, “Performance
evaluation of CoAP using RPL and LPL in TinyOS,” in New Technologies, Mobility and
Security (NTMS), 2012 5th International Conference on, May 2012, pp. 1–5.
[21] M. Baqer and A. Khan, “Energy-efficient pattern recognition approach for wireless
sensor networks,” in Intelligent Sensors, Sensor Networks and Information, 2007.
ISSNIP 2007. 3rd International Conference on, Dec 2007, pp. 509–514.
[22] V. Manral, “Cryptographic Algorithm Implementation Requirements for Encapsulating
Security Payload (ESP) and Authentication Header (AH),” RFC 4835
(Proposed Standard), Internet Engineering Task Force, Apr. 2007. [Online].
Available: http://www.ietf.org/rfc/rfc4835.txt
[23] S. Kent, “IP Encapsulating Security Payload (ESP),” RFC 4303 (Proposed
Standard), Internet Engineering Task Force, Dec. 2005. [Online]. Available:
http://www.ietf.org/rfc/rfc4303.txt
[24] S. Raza, S. Duquennoy, T. Chung, D. Yazar, T. Voigt, and U. Roedig, “Securing
communication in 6LoWPAN with compressed IPsec,” in Distributed Computing in
Sensor Systems and Workshops (DCOSS), 2011 International Conference on, June
2011, pp. 1–8.
[25] S. Raza, D. Trabalza, and T. Voigt, “6LoWPAN compressed DTLS for CoAP,” in
Distributed Computing in Sensor Systems (DCOSS), 2012 IEEE 8th International
Conference on, May 2012, pp. 287–289.
[26] M. Kovatsch, S. Mayer, and B. Ostermaier, “Moving application logic from the
firmware to the cloud: Towards the thin server architecture for the internet of
things,” in Innovative Mobile and Internet Services in Ubiquitous Computing (IMIS),
2012 Sixth International Conference on, July 2012, pp. 751–756.
[27] R. Hostettler, W. Birk, and M. Nordenvaad, “Feasibility of road vibrations-based
vehicle property sensing,” Intelligent Transport Systems, IET, vol. 4, no. 4, pp.
356–364, December 2010.
[28] R. Hostettler, W. Birk, and M. Nordenvaad, “Extended Kalman filter for vehicle
tracking using road surface vibration measurements,” in Decision and Control (CDC),
2012 IEEE 51st Annual Conference on, Dec 2012, pp. 5643–5648.

[29] S. del Campo, K. Albertsson, J. Nilsson, J. Eliasson, and F. Sandin, “FPGA
prototype of machine learning analog-to-feature converter for event-based succinct
representation of signals,” in Machine Learning for Signal Processing (MLSP),
2013 IEEE International Workshop on, Sept 2013, pp. 1–6. [Online]. Available:
http://pure.ltu.se/portal/files/43648751/mlsp2013.pdf
[30] C. Kaufman, P. Hoffman, Y. Nir, and P. Eronen, “Internet Key Exchange
Protocol Version 2 (IKEv2),” RFC 5996 (Proposed Standard), Internet Engineering
Task Force, Sep. 2010, updated by RFCs 5998, 6989. [Online]. Available:
http://www.ietf.org/rfc/rfc5996.txt
Paper B
EXIP: A Framework for Embedded
Web Development

Authors:
Rumen Kyusakov, Pablo Puñal Pereira, Jens Eliasson and Jerker Delsing

Reformatted version of paper accepted for publication in:

Journal paper, ACM Transactions on the Web, 2014.

© 2014 IEEE. Reprinted, with permissions, from Rumen Kyusakov, Pablo Puñal Pereira,
Jens Eliasson and Jerker Delsing, EXIP: A Framework for Embedded Web Development,
ACM Transactions on the Web, 2014.

EXIP: A Framework for Embedded Web
Development

Rumen Kyusakov, Pablo Puñal Pereira, Jens Eliasson and Jerker Delsing

Abstract

Developing and deploying Web applications on networked embedded devices is often
seen as a way to reduce the development cost and time to market for new target platforms.
However, the size of the messages and the processing requirements of today's Web
protocols, such as HTTP and XML, are challenging for the most resource-constrained
class of devices that could also benefit from Web connectivity.
New Web protocols using binary representations have been proposed for addressing
this issue. Constrained Application Protocol (CoAP) reduces the bandwidth and pro-
cessing requirements compared to HTTP while preserving the core concepts of the Web
architecture. Similarly, Efficient XML Interchange (EXI) format has been standardized
for reducing the size and processing time for XML structured information. Nevertheless,
the adoption of these technologies is lagging behind due to lack of support from web
browsers and current Web development toolkits.
Motivated by these problems, this article presents the design and implementation
techniques for the EXIP framework for embedded Web development. The framework
consists of a highly efficient EXI processor, a tool for EXI data binding based on tem-
plates, and a CoAP/EXI/XHTML Web page engine. A prototype implementation of the
EXI processor is herein presented and evaluated. It can be applied to Web browsers or
thin server platforms using XHTML and Web services for supporting human-machine
interactions in the Internet of Things.
This article contains four major results: (1) theoretical and practical evaluation of
the use of binary protocols for embedded Web programming; (2) a novel method for
generation of EXI grammars based on XML Schema definitions; (3) an algorithm for
grammar concatenation that produces normalized EXI grammars directly, and hence
reduces the number of iterations during grammar generation; (4) an algorithm for efficient
representation of possible deviations from the XML schema.
Categories and Subject Descriptors: E.4 [Coding and information theory]: Data
compaction and compression; H.3.5 [Online Information Services]: Web-based
services

This work is supported by the EU FP7 Project IMC-AESOP and ARTEMIS Innovation Pilot Project
Arrowhead.
Authors' addresses: R. Kyusakov, P. Puñal, J. Eliasson, and J. Delsing are with the Department of
Computer Science, Electrical and Space Engineering, Luleå University of Technology, Luleå;


General Terms: Performance, Design, Algorithms, Standardization

Additional Key Words and Phrases: Information Exchange, EXI, XML, Data For-
mats, CoAP, Data Processing, XHTML, Embedded Systems, Internet of Things, Web of
Things

1 Introduction
Web technologies are rapidly expanding to networked embedded devices with studies
showing that in 2013 there were more Web-connected gadgets than people in the U.S.¹
This process is expected to accelerate due to the increased IPv6 adoption rate and the
availability of small-sized, cheap, off-the-shelf hardware that is powerful enough to exe-
cute full-featured network stacks. Already now, the number of TCP/IP connected sensor
and actuator devices using low-power wireless technologies or even power-line communi-
cation is huge. The application areas cover home automation [1], energy management,
and industrial process monitoring and control [2].
With the increase in the number of devices, the requirements on their interfaces are
also higher. Consumers are demanding “smart” gadgets that are easy and intuitive
to deploy, configure, interact with, and integrate with other devices and systems. An
example from the home automation domain is a smart thermostat that can communicate
with the user’s smart phone to display the current temperature in the house along with
energy costs as well as control settings. It is becoming more common to equip the
traditionally simple sensor and actuator devices with additional diagnostics, logging,
and security capabilities. This phenomenon leads to developing more complex embedded
applications, which are often required to support Web connectivity for human-machine
interfacing. As the code base grows, so do the product cost and time-to-market for
new devices. The development and support for different hardware platforms becomes
especially challenging, hence the need for a common development platform based on
established and globally adopted standards. Web development has proved successful
in leveraging a set of global standards to unify the development of front-end
tools and applications over a large number of desktop and mobile platforms. In addition,
ICT research, as argued by [3], suggests that embedded computing will also benefit from
Web development platforms.
The trade-off between Web and native applications has been a turning point for
development strategies in the mobile market. As discussed by [4], Web applications are
cheaper to build, deploy, and maintain, but are often lagging behind in performance and
user experience when compared to the native apps. This gap is narrowing, thanks to
HTML5 and new Web toolkits such as Argos [5] which provides direct access to devices’
capabilities from JavaScript code. While the app stores made the management of native
applications much easier and user-friendly, their main drawback remains - supporting
¹ According to data from research firm NPD Group.

different platforms often requires substantial rebuild of the code base that needs to be
kept up-to-date with new versions of the different operating systems. As Charland et
al. conclude, one size does not fit all, and there are use cases when it is better to use
one or the other approach. While there are a number of differences between the smart
phone and the embedded systems segments, it is possible to draw some similarities and
list a number of applications where building Web applications is more beneficial even
for resource-constrained hosts when compared to developing proprietary solutions. The
simplified use case presented in Section 5, which demonstrates a human-machine interface
with a sensor platform, provides an example of such an application. In this scenario, the
user interface is implemented as a dynamic Web application based on CoAP/EXI/XHTML
and using the EXIP framework.
The approach of using standard binary protocols for enabling Web connectivity for
constrained hosts differs from the most common methods described in the literature. The
state-of-the-art solutions to the problems of embedded Web development (e.g., memory,
network, and processing constraints) can be classified into two groups. The methods
in the first group rely on powerful gateway devices that translate the standard Web
protocols to some lightweight messaging framework, and vice versa. An example of this
approach is the work by [6], which describes a gateway architecture for providing Web
connectivity to highly resource-constrained nodes. The methods in the second group
focus on implementing efficient, stripped-down versions of the standard text-based
Web protocols. High-impact research results based on this method are the techniques for
implementing an efficient HTTP server for embedded devices presented by [7] and [8], as
well as the small-footprint XML Web service implementation by [9].
Using text-based protocols that rely on simple character encodings such as ASCII was
an important requirement in the early days of distributed computing systems. At that
time, the ability to debug the interactions between the systems by hand, without special
tools, was crucial to accelerating the adoption of the protocols. Nowadays, practically
all text editors and development tools support UTF-8 character encoding. The tools
also parse the XML documents before printing them to the screen to support syntax
highlighting. Proper tool support opens up new possibilities for efficient representation
of the information on the wire. The new binary encoding schemes are transparent for
the user - if, in any case, the XML documents are parsed before printing them, then
it is better to use faster, binary encoding which is easier to process than text-based
representation. However, implementing highly optimized binary coding schemes is much
more challenging than processing text-based streams. Even more challenging is the use
of such binary processors on resource-constrained embedded devices where the memory
footprint and CPU usage are crucial. As an example, a common way to compress the
size of an XML document is by indexing frequently used tags and value items. Instead of
encoding each occurrence in the stream, the repeated information items are represented
by their index. Using more extensive indexing increases the compression, but also makes
the memory footprint required to store the indexed information larger. Providing efficient
methods to build and store the indexes is just one example of optimization that is needed
for running binary encoding schemes on embedded hosts.
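
The indexing idea described above can be sketched in a few lines. This is a generic illustration of replacing repeated tags with table indexes, not the EXI string table algorithm or the EXIP implementation; the function names and the sample tag list are assumptions made for the example.

```python
from math import ceil, log2

def index_tokens(tokens):
    """Replace repeated tokens with their table index.

    First occurrence: emitted as a ("new", token) literal and added to the table.
    Later occurrences: emitted as ("ref", index) pointing into the table.
    """
    table = {}
    encoded = []
    for tok in tokens:
        if tok in table:
            encoded.append(("ref", table[tok]))
        else:
            table[tok] = len(table)
            encoded.append(("new", tok))
    return encoded, table

def table_bytes(table):
    """Approximate memory needed to keep the indexed strings (UTF-8 bytes only)."""
    return sum(len(t.encode("utf-8")) for t in table)

# Hypothetical tag sequence from a small sensor document.
tags = ["temperature", "unit", "temperature", "unit", "temperature", "humidity"]
encoded, table = index_tokens(tags)
# Bits needed to address any entry of the final table.
index_bits = max(1, ceil(log2(len(table))))
```

Each repeated tag costs only `index_bits` bits on the wire, while `table_bytes(table)` gives a lower bound on the memory the index occupies, which is the trade-off the paragraph above refers to.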

In this work, we present design and implementation strategies for running an Efficient
XML Interchange processor on embedded devices for enabling Web connectivity through
a RESTful interface that is based on the Constrained Application Protocol. The RESTful
interface can be used for human-machine interactions with Internet of Things hosts as
well as for implementing embedded distributed systems based on the Service Oriented
Architecture, as discussed by [10].
Unlike XML, the EXI specification mandates the use of schema-specific parsing [11]
when the EXI document is encoded with schema knowledge, i.e., using schema mode. In
order to address all possible use cases, the presented EXI processor supports both the
schema and schema-less modes of operation. This is achieved by using a dynamic state
machine abstraction that can evolve through the addition of new states and state transitions.
The main benefit of using static state machines, as in the EIGEN [12] and libEXI [13]
libraries, is the small footprint and hence the ability to implement highly optimized,
dedicated EXI processors. In order to efficiently support a static mode of operation,
in other words strict schema processing with no deviations, the EXIP library needs to
be configured to strip the code responsible for evolving the state machines. This can be
done easily at compile time thanks to EXIP's modular architecture.
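
The difference between an evolving and a static state machine can be illustrated with a heavily simplified sketch. The class below is hypothetical and is not part of the EXIP API; it only models one grammar state whose list of productions may grow when an unknown element is seen, loosely mirroring how a schema-less processor learns new productions.

```python
class GrammarState:
    """One state of a simplified grammar: an ordered list of productions.

    A production is matched by position, so the position acts as its event code.
    The last production is always the wildcard "SE(*)".
    """

    def __init__(self, known=(), evolving=True):
        self.productions = [f"SE({name})" for name in known] + ["SE(*)"]
        # evolving=False only disables learning in this sketch; a strict
        # schema processor would reject unknown elements instead.
        self.evolving = evolving

    def start_element(self, name):
        """Return (event_code, matched_production) for a start-element event."""
        specific = f"SE({name})"
        if specific in self.productions:
            return self.productions.index(specific), specific
        code = self.productions.index("SE(*)")
        if self.evolving:
            # Learn the new element so the next occurrence gets a direct match.
            self.productions.insert(0, specific)
        return code, "SE(*)"

dynamic = GrammarState()
first = dynamic.start_element("sensor")    # matched through the wildcard
second = dynamic.start_element("sensor")   # matched through the learned production

static = GrammarState(known=["sensor"], evolving=False)
unknown = static.start_element("actuator")
```

In this sketch the static variant never modifies its production list, so the insertion branch is dead code for it; that is the kind of code a compile-time configuration could strip.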
One important component of EXI implementations supporting schema-enabled pro-
cessing is the automatic generation of the state machines based on XML schema language
definitions. These definitions are used to construct a set of formal grammars that describe
a particular XML language which is then recognized by the generated state machines.
EXIP includes an optimized and lightweight grammar generation utility that can be exe-
cuted efficiently at run time. This allows it to support dynamic XML schema negotiations
even on embedded hosts. The main contributions of this article are the grammar gener-
ation algorithms that are the core of the high performance of this utility. To the best of
our knowledge, all other EXI implementations use an external library for processing the
XML Schema definitions that are used for the grammar extraction. A commonly used
external XML Schema library is Apache Xerces. However, its usage for embedded Web
development is limited to static compile-time generation of the EXI state machines.
A prominent research work that is based on the approach of compile-time generation
of the state machines is presented by [14]. The authors show that the use of EXI for
embedded Web service development brings substantial benefits in hardware utilization
(network, CPU, RAM and programming memory). Moreover, their work includes the
design of a Web service code generator based on Simple Object Access Protocol (SOAP)
and the HTTP/EXI/SOAP protocol stack. Promising future research work, as stated by
the authors of that study, is to add support for CoAP RESTful Web service interface to
the proposed generator. As such, the EXIP framework described herein is extending and
further specifying the suggested CoAP RESTful Web service generator.
EXI is not the only possible data format that can meet the requirements of the
embedded Web programming, but it has been shown to provide the highest efficiency
compared to rival binary XML solutions [15]. Lightweight text formats such as JSON
and comma-separated values (CSV), or binary encoding schemes (ASN.1, BSON, Protocol
Buffers, Thrift, etc.), are also capable of very efficiently representing structured

information. However, the lack of formally defined mapping between these technolo-
gies and the XML Information Set [16] makes them unable to guarantee interoperability
with existing Web technologies and protocols such as XHTML, Scalable Vector Graphics
(SVG), Extensible Messaging and Presence Protocol (XMPP), and RSS feeds to name a
few.
The initial goal of the EXIP library was only to provide efficient implementation of
an EXI processor for embedded systems. Since the initial version of the prototype EXI
processor, the EXIP library was used in a number of research projects and prototypes
as in [17] and [18]. Based on the recurring need of higher processing efficiency and
Web integration, the scope of the EXIP project has now extended, and new processing
algorithms are employed. In addition to the grammar generation algorithms that are part
of the EXI processor prototype implementation, this work defines the overall architecture
of the EXIP Web development toolkit. The architecture consists of three main modules:
the EXI processor library, EXI data binding, and the CoAP/EXI/XHTML Web page
engine. Their functionality, required properties, and overall design in the context of
embedded Web development are discussed in Section 2. Detailed descriptions of each of
these modules and the associated research questions that are investigated are presented
in Sections 3, 4, and 5, respectively.

2 Background
Optimizing the hardware utilization by the Web protocols is a key requirement for their
application on embedded platforms. Very often the connected devices have limited mem-
ory (both RAM and programming memory), and use low-cost CPUs. If the device is
battery powered, the communication overhead is a main contributor to the power con-
sumption that needs to be carefully modeled in order to guarantee the intended up-time
periods. Simulation tools such as PowerTOSSIM [19] can be employed to highlight areas
of the protocol implementations that are mostly responsible for draining the battery.
Alongside radio duty cycling and CPU sleep modes, reducing the number of
packets sent and received is another way of cutting the power consumption, especially in
wireless applications.
W3C performed an extensive evaluation of the EXI format [20],[15] that shows sub-
stantial improvements in compactness compared to text encoding as well as other XML
binary formats. Additionally, EXI has superior processing performance compared to plain
XML. Both the compactness and processing efficiency depend heavily on the structure of
the encoded documents and the options used for processing. For example, the use of XML
schema information during encoding and decoding can cut the size of small documents
more than 50 %, as the element and attribute qualified names are encoded as indexes
instead of strings. This allows for substantial reduction of the number of packets required
for communication of structured information over the network, and thereby minimizes
the power consumption. Existing Web technologies that are formally described using
XML schema language such as XHTML, for example, can then be efficiently represented
for use in embedded applications.
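
To make the link between document size and packet count concrete, the following sketch computes how many link-layer frames a message needs. The usable payload per frame and the document sizes are hypothetical values chosen for illustration; they are not measurements from this work.

```python
from math import ceil

def frames_needed(message_bytes, payload_per_frame):
    """Number of link-layer frames needed to carry a message of the given size."""
    if message_bytes == 0:
        return 0
    return ceil(message_bytes / payload_per_frame)

PAYLOAD = 80               # assumed usable bytes per frame, for illustration only
xml_size = 400             # hypothetical size of a small XML document in bytes
exi_size = xml_size // 2   # a 50 % size reduction, the lower end of the range above

xml_frames = frames_needed(xml_size, PAYLOAD)
exi_frames = frames_needed(exi_size, PAYLOAD)
```

Under these assumptions the text document needs five frames and the compact encoding three, so the radio is active for fewer transmissions per message.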

[Figure 1 diagram. Columns: client side, network layer, server side. Rows: user tools and
applications, technologies, development tools. The network layer column lists the
application (CoAP, MQTT-S), presentation (EXI, BSON, ASN.1, Protobuf), transport (UDP),
network (IPv6, RPL, 6LoWPAN) and PHY/MAC (IEEE 802.15.4, ZigBee, ISA100.11a, G3-PLC)
layers.]

Figure 1: Overview of the tools and technologies for embedded Web development that
are based on standard binary protocols. The tools that are the main focus of this work
are marked with red ellipses.

Compaction and processing improvements of CoAP compared to HTTP are also sig-
nificant, as reported by [21]. Moreover, the asynchronous design of the CoAP protocol
makes it much more suitable for event-driven interactions. Publish/subscribe protocols
are often preferred in embedded systems, as they provide better hardware and network
utilization compared to polling schemes that are used by HTTP, for example.
Figure 1 provides an overview of the state of the art of embedded Web development
along with a high level architectural view of the use of binary protocols for network com-
munication and information exchange. The suggested components of the architecture are
grouped depending on their role - client-side, networking layer, and server-side execution;
and their application domain - user tools and applications, technologies/protocols/spec-
ifications, and development tools. The goal of this categorization is to show how the
work presented in this article relates to the current technologies and applications, and to
further motivate the need for this research.
As shown in Figure 1, client-side user applications of the embedded Web include
browsers, graphical Web clients (HMI devices), embedded Web services, and proxy de-
vices translating the binary Web protocols to their text-based counterparts. The tech-
nologies to implement these client-side user applications are CoAP client, EXI parser,
lightweight client-side scripting engine, and EXI/XHTML/CSS rendering engine. The
concrete development tools that can be used for implementing these technologies are the
EXIP parser library, which is the primary objective of this work, EXI/XHTML to DOM
translator, and CoAP libraries such as libcoap [21], Erbium [22] and Californium [23].
Similarly, the networking layer shows different wired and wireless network stacks
and protocols grouped according to the OSI model, along with development tools used for
debugging.
The server side is represented by resource-constrained embedded devices that
conform to the thin server architecture suggested by [23]. The server technologies
include CoAP server, EXI serializer, and EXI/XHTML page engine. The proposed
server-side development tools are the EXI data binder and the CoAP/EXI/XHTML Web page
engine, which are described in detail in Sections 4 and 5 of this work. Other server-side
tools for embedded Web development are, again, the CoAP libraries libcoap, Erbium, and
Californium.

EXI
The EXI data format significantly reduces the size of XML when stored on disk or transferred
over the network and also speeds up the parsing and serialization. According to
[15], the compression level varies from 1 % of the original size for large and sparse documents
with compression and schema options enabled to 95 % for schema-less encoding
of very small and dense documents. Nevertheless, the EXI format has a few drawbacks that
are inherited from XML and must be taken into account in the discussions that follow
in this article. XML notation and semantics are perceived as complex both for humans
to understand and for machines to process. This complexity stems from the design goal of the
format to be flexible and easily extendable for application in a variety of use cases. This
flexibility creates a lot of special cases and exceptions that must be specifically handled
with if-then-else statements during serialization and parsing. While EXI is very efficient
in removing the redundancy in the XML syntax, it does not simplify the processing -
it merely speeds it up. Besides, EXI adds another level of flexibility by introducing en-
coding options that can be used to influence the level of compression, processing speed
and RAM usage during parsing and serialization. Providing support for all possible EXI
options requires a large and complex code base that can hardly fit into the program
memory of a highly resource-constrained embedded device. Therefore, the application of
EXI on such platforms often requires defining a profile of the EXI specification which re-
stricts the supported EXI options to particular values and predefines the XML Schema.
Different EXI profiles and how they are supported by EXIP are further discussed in
Section 3.
Selecting the values for the EXI options is often a trade-off between memory usage,
processing speed and level of compression (for example, when setting the values of the
valuePartitionCapacity, valueMaxLength and compression options). Furthermore, as these
parameters heavily depend on the structure of the documents and even on the schema
design (as shown by [20], [15]), it is difficult to predict the level of efficiency when
applying EXI to a particular set of XML documents without performing an extensive
empirical study.

EXI theoretical foundations

The goal of this section is to provide the necessary background information for supporting
the discussions on the EXI processor architecture and algorithms for embedded processing
that follow without going into details of the inner workings of the EXI specifications.
For an in-depth overview of the EXI format, the reader is advised to refer to the W3C
specification [24] and white paper [25].
An EXI stream is a sequence of events that describe the content of the XML docu-
ment. These events are analogous to the streaming XML events and denote the start of
an element or attribute, value items, closing tags and so on. For achieving higher com-
pactness, the events are represented by a simplified Huffman coding [26] scheme. The
occurrence of each event in the EXI stream is controlled and described by a set of formal
grammars. The EXI specification very broadly identifies the formal grammars used as
being in restricted Greibach normal form [27]. The next paragraph provides a more
concrete classification of the EXI grammars, which supports the theoretical fitness of
the grammar generation algorithms discussed later.
Unlike Greibach grammars, the EXI grammars have at most one non-terminal symbol
on the right-hand side of the grammar productions. Therefore, all EXI grammar rules
are in one of the following two forms: 1) Z → aY or 2) Z → a, where Z and Y are
intermediate (non-terminal) symbols and a is a terminal symbol. As all grammar rules
are in one of these two forms, the EXI grammars are also regular and, in particular,
right-linear grammars, as they require exactly one terminal on the right-hand side and
at most one non-terminal, which is at the end of the grammar rule. The regular grammars
are a strict subset of the context-free grammars according to the Chomsky hierarchy, and as
every context-free grammar can be represented in Greibach normal form [27], they are
also a subset of the Greibach grammars.
Identifying the EXI grammars as regular grammars provides much more insight into
their properties. For example, context-free grammars define a very broad class of languages
and are equivalent to pushdown automata (PDA), while regular grammars are equivalent
to nondeterministic finite automata (NFA). Moreover, the EXI grammars are simple
grammars, also known as s-grammars [28], as each pair Z → a... appears only once in each
EXI grammar. Based on this constraint, the EXI grammars are also unambiguous and support
linear-time parsing by a deterministic finite automaton (DFA).
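
As an illustration of the s-grammar property and of the resulting linear-time parsing, the following Python sketch stores a grammar as a list of productions, checks that no (non-terminal, terminal) pair repeats, and then runs the equivalent DFA with one table lookup per event. The grammar, the non-terminal names (Note0, Note1, Note2) and the event labels are hypothetical and are not taken from EXIP or from the EXI specification.

```python
from collections import Counter

# Each production is (lhs, terminal, rhs); rhs is a non-terminal name, or None
# for a production of the form Z -> a. All names below are hypothetical.
GRAMMAR = [
    ("Note0", "SE(subject)", "Note1"),
    ("Note0", "SE(body)", "Note2"),
    ("Note1", "SE(body)", "Note2"),
    ("Note2", "EE", None),
]


def is_s_grammar(productions):
    """True if no (left-hand side, terminal) pair occurs more than once."""
    pairs = Counter((lhs, terminal) for lhs, terminal, _ in productions)
    return all(count == 1 for count in pairs.values())


def to_dfa(productions):
    """Build the DFA transition table {(state, terminal): next_state}."""
    if not is_s_grammar(productions):
        raise ValueError("duplicate (non-terminal, terminal) pair")
    return {(lhs, terminal): rhs for lhs, terminal, rhs in productions}


def accepts(productions, start, events):
    """Run the DFA over the events with one table lookup per event."""
    table = to_dfa(productions)
    state = start
    for event in events:
        if state is None or (state, event) not in table:
            return False
        state = table[(state, event)]
    # Accept only if the last production used had the form Z -> a.
    return state is None
```

Because every (state, event) pair selects at most one production, the parser never backtracks, which is the reason the parsing time is linear in the number of events.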
The process of converting a set of XML Schema definitions to EXI grammars includes
four steps:
1. Create a set of proto-grammars that describe the content model according to the
schema. The EXI proto-grammars are strictly context-free grammars that are
neither regular nor in Greibach normal form as they allow unit productions: Z →
Y where both Z and Y are intermediate (non-terminal) symbols.
2. Normalize the proto-grammars to EXI grammars. The normalization includes
simplification of the proto-grammars by removal of the unit productions. This
creates a regular grammar that can be ambiguous, in other words, lacking a unique
leftmost derivation tree for every input. In this case, a second simplification is
performed in which the ambiguous regular grammars are transformed to unambiguous
s-grammars.
3. Assign event codes to grammar productions.
4. Extend the EXI grammar with additional productions that describe the possible
deviations from the XML Schema.
Section 3 describes an extension to the algorithm for creating proto-grammars from
schema definitions [step (1)] that guarantees that the resulting grammars are regular
s-grammars. This allows for avoiding the normalization of the proto-grammars as a
separate second-step process.
Section 3 describes a modified version of the algorithm for augmenting the EXI gram-
mars for handling schema deviations [step (4)]. The new version of the algorithm allows
the removal of redundant grammar productions that are otherwise required by the ap-
proach described in the EXI specification.

Related work for XML grammars

The formal grammars used in the EXI specification express the constraints defined in
the XML Information Set [16] and are not specific to EXI format itself. As such, the
formal models and theoretical results developed for XML are also valid for EXI. There
are two main theoretical models for studying the properties of XML languages and XML
schema languages. The first model treats XML instances as strings and schema languages
as formal languages that define particular sets of strings representing the possible XML
instances that are valid according to a certain schema. This model is based on context-
free (word) grammars and their more restricted forms such as parenthesis and balanced
grammars as presented by [29].
In the second model, the XML instances are treated as trees and the schema languages
as formal languages defining sets of trees representing the valid instances according to a
certain schema [30], [31]. The nested structure of XML forms ordered unranked trees,
i.e. trees whose nodes are allowed to have any number of ordered child nodes. The theoretical
foundation of this model is regular tree grammars, which can be seen as a generalization
of regular word grammars. The tree model is appropriate when studying the expressive
power of different XML schema languages as shown by [32]. In this work, Murata et al.
present a formal classification and comparison between DTD, W3C XML Schema, and
RELAX NG based on the regular tree grammar theory.
Context-free word languages and regular tree languages are closely related. For exam-
ple, it is proven that the set of derivation trees for a language defined by a context-free
word grammar forms a regular tree language [33]. In addition, Brüggemann-Klein et
al. show that tree grammars, and even more generally hedge grammars, are effectively
identical to balanced grammars and that balanced languages are identical to regular
tree languages, modulo encoding [34]. These results demonstrate that the two models
are equally expressive and can be used interchangeably when studying or characterizing
languages based on the XML Information Set.
The discussions in this paper follow the first model, because the EXI specification
defines the XML content with a set of regular word grammars as already presented in
Section 2. For that reason, all grammars in this work are assumed to be word grammars
even if not explicitly stated.
Instead of defining the terminal alphabet in terms of ASCII or UTF-8 characters,
which is commonly used in word grammars, the EXI grammars use XML events (start
element, attribute definition, end element etc.) as terminal symbols. This provides a
high-level description of the XML content model without affecting the theoretical results
developed for regular grammars. As the XML Information Set defines a context-free language
parsed by a pushdown automaton, a single regular grammar (a single DFA) is, in general,
unable to represent (parse) the content of a whole XML document. Using a single
regular grammar (or a single DFA) for describing (parsing) the whole content of an
XML document is possible when certain restrictions on the document structure are met by
the XML/EXI instances. For example, this approach is used for efficient processing of SOAP
Web services that are ordered XML documents with a predefined schema [35]. A less
restrictive form of schema-specific XML parsing that uses an extended version of a PDA is
presented by [11]. Unlike these approaches, the EXI specification defines the parsing and
serialization of XML Information Set documents based on a stack of regular grammars.
Each regular grammar in the stack describes the content of a particular XML element.
The stack of grammars is used to model the nesting of elements (e.g. parsing a nested
element equals adding its regular grammar to the stack), similarly to the role of the stack
in the PDA.
To illustrate how the grammar stack is used during processing in EXI, it is convenient
to represent the XML Information Set in terms of extended context-free grammars
(ECFG), which describe exactly the context-free languages and are the basis for the DTD
schema language [36]. In an extended context-free grammar, each right-hand side of a
production consists of a regular expression, which is in turn equivalent to a regular grammar
or finite automaton. Consider the example XML instance and its corresponding ECFG
shown in Table 1:

Table 1: Extended context-free grammar for a sample XML instance where element
<notebook> can have zero or more <note> elements with optional <subject> and
mandatory <body>. The following operators are used in the regular expressions in the
ECFG: . denotes concatenation, * is the Kleene star operator (zero or more occurrences),
? denotes zero or one occurrence, and [ ] matches a single character from the set specified
within the brackets. The non-terminal symbols are in uppercase letters.

Sample XML:

<notebook>
  <note>
    <subject>Sample</subject>
    <body>XML Instance</body>
  </note>
</notebook>

Corresponding ECFG:

NOTEBOOK → <notebook>.(NOTE)*.</notebook>
NOTE → <note>.(SUBJECT)?.BODY.</note>
SUBJECT → <subject>.[UTF-8 characters]*.</subject>
BODY → <body>.[UTF-8 characters]*.</body>

The set of regular grammars used during the processing of EXI documents corresponds
to the set of regular expressions in the ECFG, which describe the content of all possible
elements. At every step, the EXI processor uses the regular grammar on top of the
grammar stack to process the content of the current element. Starting a nested element
pushes its grammar onto the stack, and closing an element pops its grammar from
the stack. In this way, parsing the XML document shown in Table 1 involves the following
steps: (1) the content of the <notebook> element is parsed according to the regular grammar
for that element, which is initially the only grammar in the stack; (2) the start of the nested
<note> element requires pushing its regular grammar onto the stack and parsing its content
according to that grammar; (3) on the start of the nested <subject> element, its grammar is
pushed onto the stack and used for parsing; (4) when all the content of the <subject> element
is parsed and there are no more nested elements at this level, its grammar is popped from
the stack and processing continues according to the <note> grammar that is now on
the top of the stack; (...) the same procedure is repeated for the rest of the elements in
this example.
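
The push and pop behaviour described above can be sketched in a few lines of Python. The per-element grammars below are hand-written DFA tables for the Table 1 example; the table layout, the state numbers and the event labels (SE(...), CH, EE) are assumptions made for this illustration and do not reflect the internal data structures of EXIP. The event list starts after the start of <notebook>, so that its grammar is initially the only one on the stack.

```python
# Hypothetical per-element regular grammars written as DFA tables:
# {element: {state: {terminal: next_state}}}. State 0 is the start state and
# the "EE" terminal ends the element (next state None).
GRAMMARS = {
    "notebook": {0: {"SE(note)": 0, "EE": None}},
    "note": {
        0: {"SE(subject)": 1, "SE(body)": 2},
        1: {"SE(body)": 2},
        2: {"EE": None},
    },
    "subject": {0: {"CH": 1, "EE": None}, 1: {"EE": None}},
    "body": {0: {"CH": 1, "EE": None}, 1: {"EE": None}},
}


def parse(root, events):
    """Process the events with a stack of grammars; return the stack depth
    observed after each event."""
    stack = [[root, 0]]  # each entry: [element name, current DFA state]
    depths = []
    for event in events:
        if not stack:
            raise ValueError("event after the root element was closed")
        name, state = stack[-1]
        transitions = GRAMMARS[name][state]
        if event not in transitions:
            raise ValueError(f"unexpected {event} in <{name}>")
        stack[-1][1] = transitions[event]
        if event == "EE":
            stack.pop()  # closing an element pops its grammar
        elif event.startswith("SE("):
            stack.append([event[3:-1], 0])  # nested element: push its grammar
        depths.append(len(stack))
    if stack:
        raise ValueError("unclosed element")
    return depths


# Events for the XML instance of Table 1, after the start of <notebook>.
EVENTS = ["SE(note)", "SE(subject)", "CH", "EE",
          "SE(body)", "CH", "EE", "EE", "EE"]
```

Each grammar on the stack is a plain DFA, so the stack alone carries the nesting information, which mirrors the role of the stack in a PDA.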
Unlike DTD which defines a local language, the language defined by the set of regular
grammars in EXI is a single-type language that corresponds to the expressive power of
W3C XML Schema [32]. This essentially means that two or more elements sharing the
same name but having different types are evaluated using different regular grammars that
match their type. This differs from DTD where the name of an element uniquely identifies
its content model (or, equivalently, the regular expression or the regular grammar of its
content).

CoAP
The Constrained Application Protocol [37] is specially designed for use with resource-
constrained hosts over low-bandwidth network links. CoAP functionality resembles the
HTTP request/response interaction model, and is based on the Representational State
Transfer (REST) architecture of the Web [38]. CoAP also supports well established
concepts of the Web such as URIs and Internet media types. This allows for transpar-
ent translation between CoAP and HTTP traffic while enabling Web interactions with
embedded systems.
CoAP fulfills the requirements of the embedded domain such as providing support for
asynchronous message exchange, multicast capabilities, lightweight discovery mechanism,
very low overhead, and implementation simplicity. This is possible by using UDP as a
transport protocol with optional reliable unicast support and Datagram Transport Layer
Security (DTLS) instead of TCP and TLS. The use of UDP enables the implementation
of a lightweight CoAP publish-subscribe mechanism [39] supporting dynamic content
exchange between embedded servers and Web clients. The built-in asynchronous exchange
of events encoded with EXI provides features similar to the AJAX framework, but with
much lower cost in terms of network bandwidth and hardware requirements for the hosts.
Application areas that would greatly benefit from an open and standard way to con-
nect embedded hosts to the Web include various Internet of Things and machine-to-
machine (M2M) applications such as home automation and energy management.

3 EXI Processor Design and Implementation

Deploying EXI-based RESTful Web services on resource-constrained hosts requires a
modular implementation of the EXI processor library that can support different compile-
time configurations depending on the application scenario. For example, some target
platforms can make use of hash tables for fast lookups in the string tables, while others
have too little RAM for that. In other cases, certain EXI options (e.g., compression,
random access, etc.) are not allowed, and hence the code for processing them can be
pruned from the library.
In this section, we present the modular design of the EXIP library [40] that enables
compile-time profiling of the code base. As shown in Figure 2, by using fine-grained
components that have low interdependencies, it is possible to define different profiles of
the library that support a variety of use cases. Such profiles can be application-specific
(e.g., full-featured, most-restricted, etc.), or defined as part of different communication
standards - EXI Profile for limiting usage of dynamic memory [41], Vehicle to grid com-
munication interface (ISO 15118), or other energy management standards [42] such as
Smart Energy Profile 2.0 [43], and OpenADR, for example.
The encapsulation of the components’ source code is done with the standard mechanisms
available in the C programming language: splitting the code into different header
and source files, hiding the implementation in static functions, strictly avoiding the
use of global variables and, where needed, using conditional C preprocessor macros. This
enables the implementation of a simple and easy-to-maintain Makefile build system which
can track the dependencies between the components. With this build system in place,
the developers can cherry-pick only the components that are needed during compile time,
which allows using the EXI Processor library for different application profiles or contexts.

Figure 2: EXIP modular architecture and application profiles

Problem Formulation
The first step in supporting the requirements of EXI-based embedded Web programming
is to provide an efficient Application Programming Interface (API) to encode and
decode EXI streams. Already established XML APIs such as SAX, DOM, and StAX are
widely used in Java processors, but are shown to provide less than optimal efficiency for
resource-constrained devices [44]. Other requirements of the EXI processor implementation
include a small footprint and an easy-to-use code base that executes quickly, and
consumes as little RAM as possible while being portable across a wide range of embedded
platforms. Although the main goal of the EXIP library directly follows from these require-
ments, detailed description and evaluation of the degree to which these requirements are
met is out of the scope of this paper. The reason for excluding these discussions is the low
research value of the implementation technicalities that are involved in writing efficient
and portable C code, a subject which is better presented by the EXIP library developers’
documentation (available at http://exip.sourceforge.net/). Instead, this section is focused
solely on the grammar generation functionality that is an essential part of a number of use
cases connected to dynamic/runtime exchange of schema information. The need for such
runtime negotiation of the document structure is evident in supporting versioning of the
schema documents and implementing generic Web services such as information logging and
archiving, data visualization of uncategorized information, dynamic Web service
composition, and peer-to-peer services.

A concrete example where the XML Schema documents are processed during runtime to
generate EXI grammars is the specification draft for using EXI over Extensible Messaging
and Presence Protocol (XMPP) [45].
The dynamic processing of XML schema information can also be employed in cases
where no schema information is available to describe a particular set of XML documents.
In such cases the XML schema can be inferred from the set of available XML examples,
and used to enable more compact EXI encoding. Both the schema inference and the
generation of EXI grammars can be done at runtime. Example approaches for schema
inference include learning of deterministic regular expressions [46], as well as learning
chain regular expressions, in the case of the Trang open source software library [47].

Efficient EXI Grammar Generation

The standard way of generating EXI grammars from XML Schema is to rely on a generic
XML Schema parser/validator such as the Apache Xerces library. The role of the XML
Schema parser is to load the schema definitions into appropriate structures in the memory.
These structures are then converted to EXI grammars based on the algorithms specified
in the EXI specification. The EXIP library takes a different approach by including a
dedicated EXI grammar generator without external dependencies on schema parsers,
which uses a modified version of the algorithms described in the EXI specification.
Many embedded targets use EXI because XML processing is too heavy to support. In
such cases, the dynamic generation of the EXI grammars cannot be achieved in a standard
way, as it requires processing text-based XML schema definitions. One possible solution
is to use a proprietary encoding for the EXI grammars, which is against the principles of
the Web, and will still require some loading code that expands the program memory
footprint.
The dedicated EXI grammar generator solves this problem by using two simple ideas.
First, the XML Schema document is itself an XML document that can be represented in
binary using EXI, thus reducing its size and improving the loading time. Second, once
represented in EXI, the XML Schema document can be parsed by the EXI parser itself
without the need of an external library for that; in other words, the EXI decoder code is
reused to extract the XML schema definitions.

EXI Grammar Concatenation and Normalization

The EXI specification defines an algorithm for building a set of context-free grammars
that directly correspond to the definitions in the W3C XML Schema specification. These
grammars are called proto-grammars as they are an intermediate representation which is
only used during EXI grammar generation. The process of building proto-grammars is
roughly as follows:

1. A set of simple proto-grammars is defined that describes the content model for each
atomic XML schema definition (attributes, simple types, element terms, wildcard
terms).

2. The proto-grammars for composite schema definitions are built by using the
proto-grammars of their sub-components. For example, the <sequence> compositor
corresponds to the concatenation of the proto-grammars of its child elements, and the
<choice> compositor corresponds to the union of the proto-grammars of its children.

The next step in the process of building EXI grammars is to normalize the proto-
grammars such that all unit productions (Z → Y where both Z and Y are intermediate
symbols) are removed and there are no ambiguities in the grammars. This essentially
converts the proto-grammars to EXI grammars that are then used for processing EXI
documents conforming to a schema.
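
To make the removal of unit productions concrete, the following Python sketch applies the textbook closure approach: every unit production Z → Y is replaced by the productions reachable from Y. This is an illustration under stated assumptions and not necessarily the exact procedure of the EXI specification; the grammar layout and the non-terminal names (A0, A1, B0, B1) are hypothetical.

```python
# A proto-grammar maps each non-terminal to a list of (terminal, target)
# pairs. A terminal of None marks a unit production Z -> Y; a target of None
# marks a production without a right-hand side non-terminal (Z -> EE).
PROTO = {
    "A0": [("SE(a)", "A1")],
    "A1": [(None, "B0")],                    # unit production
    "B0": [("SE(b)", "B1"), (None, "B1")],   # element b is optional
    "B1": [("EE", None)],
}


def remove_unit_productions(grammar):
    """Return a grammar in which every Z -> Y has been replaced by the
    non-unit productions reachable from Y through unit productions."""
    result = {}
    for lhs in grammar:
        seen, pending, productions = {lhs}, [lhs], []
        while pending:
            current = pending.pop()
            for terminal, target in grammar[current]:
                if terminal is None:
                    # Follow the unit production instead of copying it.
                    if target not in seen:
                        seen.add(target)
                        pending.append(target)
                elif (terminal, target) not in productions:
                    productions.append((terminal, target))
        result[lhs] = productions
    return result
```

In this example A1 ends up with the productions SE(b) → B1 and EE, so the grammar is regular. Non-terminals that become unreachable (here B0) are left in place by the sketch; a real implementation would likely prune them.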
The review of the algorithm for creating EXI proto-grammars from XML Schema
definitions in section 8.5.4.1 EXI Proto-Grammars of the EXI specification leads to the
conclusion that the only way for creating proto-grammars that contain unit productions,
and hence are not regular, is as an output of the grammar concatenation operator (see
8.5.4.1.1 Grammar Concatenation Operator of the specification). However, all atomic
grammars used as an input to the concatenation operator are regular and from the closure
property of the regular languages under concatenation [48] we know that the resulting
output grammar can also be presented in a regular form.
This section defines an extended grammar concatenation operator that produces reg-
ular EXI grammars, thereby removing the need for additional normalization of the gram-
mars by removal of unit productions. The extended operator depends on the following
recursive definition:

DEFINITION: Weak equality of grammar productions

The grammar production A : Z_1 → a_1 Y_1 and the grammar production B : Z_2 → a_2 Y_2
are weakly equivalent if:

1. a_1 ≡ a_2 and Y_1 ≡ Y_2
OR

2. a_1 ≡ a_2. Let the sets of productions in the EXI grammar that have Y_1 and Y_2
as a left-hand side be denoted as {Y_1} and {Y_2} respectively. The two sets have
the same cardinality, and each production P ∈ {Y_1} is weakly equivalent to a
production in {Y_2}.
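
The recursive definition above can be sketched directly in Python. Two points in the sketch are assumptions that the definition does not spell out: a production without a right-hand side non-terminal (Z → EE) is represented with a target of None, and a pair of non-terminals that is already being compared further up the recursion is treated as equivalent so that grammars with loops terminate. The grammar layout and all names are hypothetical.

```python
def weakly_equivalent(grammar, prod_a, prod_b, path=frozenset()):
    """Check weak equivalence of two productions given as (terminal, target).

    grammar maps each non-terminal to its list of (terminal, target) pairs.
    path holds the pairs of non-terminals currently being compared.
    """
    (a1, y1), (a2, y2) = prod_a, prod_b
    if a1 != a2:
        return False
    if y1 == y2:              # case 1 of the definition
        return True
    if y1 is None or y2 is None:
        return False
    if (y1, y2) in path:      # assumption: cycle guard for looping grammars
        return True
    rules1, rules2 = grammar[y1], grammar[y2]
    if len(rules1) != len(rules2):   # case 2: same cardinality ...
        return False
    path = path | {(y1, y2)}
    return all(                      # ... and pairwise weak equivalence
        any(weakly_equivalent(grammar, p, q, path) for q in rules2)
        for p in rules1
    )


GRAMMAR = {
    "L1": [("SE(b)", "L2"), ("EE", None)],
    "L2": [("EE", None)],
    "R1": [("SE(b)", "R2"), ("EE", None)],
    "R2": [("EE", None)],
    "R3": [("SE(c)", "R2")],
}
LOOPS = {
    "X": [("SE(a)", "X"), ("EE", None)],
    "Y": [("SE(a)", "Y"), ("EE", None)],
}
```

With these grammars, the productions SE(a) → L1 and SE(a) → R1 are weakly equivalent because L1 and R1 have structurally identical continuations, while SE(a) → L1 and SE(a) → R3 are not.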

The grammar concatenation operator defined below is very similar to the one in the
EXI specification in the sense that it creates a new grammar given two input grammars.
The new grammar accepts any sequence of symbols accepted by the left operand followed by
any sequence of symbols accepted by the right operand of the concatenation operator. The
main difference is that the operator defined here produces regular EXI grammars, given
that its operands are also regular grammars.

DEFINITION: Extended grammar concatenation operator

Given two EXI grammars L(N_l, T, S_l, P_l) and R(N_r, T, S_r, P_r), where N_l and N_r are
finite sets of non-terminals, T is the set of terminal symbols representing the EXI events,
S_l ∈ N_l and S_r ∈ N_r are both designated initial symbols, and P_l and P_r are the sets of
grammar productions in L and R respectively. All grammar productions in P_l and P_r are in
one of the following two forms: Z → aY where a ∈ T and a ≠ EE, or Z → EE where EE ∈ T
is the terminating end element EXI event.
The result of applying the grammar concatenation operator to L and R, L ⊕ R, is
a new grammar C(N_l ∪ N_r, T, S_l, P_c) where the set of productions P_c is defined as
follows: each production l ∈ P_l, where l ≠ Z → EE for every Z ∈ N_l, is part of P_c; each
production r ∈ P_r, where r ≠ S_r → aY for every a ∈ T and Y ∈ N_r, is part of P_c.
For each production e_l ∈ P_l, where e_l ≡ Z → EE for every Z ∈ N_l, the following set
of productions is also part of P_c: the set {Z → aY} where a production s_r of the form
S_r → aY exists in P_r, and s_r is not weakly equivalent to any production in P_l that
has Z as a left-hand side non-terminal symbol. There are no other productions in P_c
besides those defined with these rules.
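
A minimal Python sketch of this operator is given below, under several assumptions. The definition is read so that all productions of L other than Z → EE are kept, and all productions of R other than those of the form S_r → aY are kept. The non-terminal names of L and R are assumed to be disjoint. In addition, and this is not spelled out in the definition, when R can end immediately (S_r → EE) the sketch copies that EE production to every Z that had Z → EE in L, so that the concatenation still accepts inputs where R is empty. The grammar layout and all names are hypothetical.

```python
EE = ("EE", None)  # a production Z -> EE, stored as (terminal, target)


def weakly_equivalent(prods, p, q, path=frozenset()):
    """Weak equivalence of productions; the path set is a cycle guard."""
    (a1, y1), (a2, y2) = p, q
    if a1 != a2:
        return False
    if y1 == y2 or (y1, y2) in path:
        return True
    if y1 is None or y2 is None or len(prods[y1]) != len(prods[y2]):
        return False
    path = path | {(y1, y2)}
    return all(any(weakly_equivalent(prods, x, y, path) for y in prods[y2])
               for x in prods[y1])


def concatenate(left, right):
    """Concatenate two grammars given as (start symbol, productions dict)."""
    (s_l, p_l), (s_r, p_r) = left, right
    merged = {**p_l, **p_r}  # assumes disjoint non-terminal names
    result = {}
    for z, prods in p_l.items():
        own = [p for p in prods if p != EE]   # productions of L except Z -> EE
        added = []
        if EE in prods:
            # Z could end L here, so R may start here: attach the start
            # productions of R unless an equivalent one already exists for Z.
            for s in p_r[s_r]:
                if not any(weakly_equivalent(merged, s, p) for p in own):
                    added.append(s)
        result[z] = own + added
    for z, prods in p_r.items():
        # Productions S_r -> aY were attached above; the rest of R is kept.
        result[z] = [p for p in prods if z != s_r or p == EE]
    return s_l, result


# L accepts a single element <a>; R accepts an optional element <b>.
LEFT = ("L0", {"L0": [("SE(a)", "L1")], "L1": [EE]})
RIGHT = ("R0", {"R0": [("SE(b)", "R1"), EE], "R1": [EE]})
```

Every production in the result carries a terminal symbol, so no separate step for removing unit productions is needed for this example.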

When the extended concatenation operator is used for XML Schema sequence definitions,
the resulting regular grammar might contain productions with duplicate terminal
symbols, i.e. the result can be an ambiguous regular grammar. In this case, the algorithm
in section 8.5.4.2.2 Eliminating Duplicate Terminal Symbols of the EXI specification
should be further applied to the resulting concatenated EXI grammar. It is worth noting
that these cases are extremely rare and can only occur when optional element particles
are allowed to repeat more than once. An example content model that contains duplicate
terminal symbols and leads to the creation of an ambiguous regular grammar is the following:
<sequence maxOccurs="2">
  <element name="a" maxOccurs="3"/>
  <element name="b" minOccurs="0"/>
</sequence>

Efficient Representation of Schema Deviations

The EXI specification defines an algorithm that augments the EXI grammars with
additional grammar productions which are used to handle possible deviations from the XML
schema. Such deviations are often used to add extensions to a particular protocol or
handle cases that require additional information in the XML documents. Furthermore,
certain XML events that are not explicitly declared in the schema may also occur in the in-
stance documents without making them invalid (e.g. comments, processing-instructions,
type casts using type attribute from http://www.w3.org/2001/XMLSchema-instance
namespace).
One constraint that must be followed when adding productions to the normalized EXI
grammars is that addition of productions allowing attribute deviations must only occur
before the element content - otherwise the grammars describe a document which is not
well formed. The algorithm as described in the EXI specification (see 8.5.4.4.1 Adding
Productions when Strict is False [24]) depends on a set of redundant productions in the
normalized EXI grammars in order to fulfill this requirement. The redundant productions
are a copy of the productions describing the possible states for starting the content of
an XML element that has wildcard attributes or a mixed-content model. An example of
such redundant productions is the EXI grammar describing element fragments (see 8.5.3
Schema-informed Element Fragment Grammar [24]).
The algorithm described in this section augments the EXI grammars for accepting
schema deviations without having a dependency on redundant productions in the input
EXI grammar. The algorithm is presented by highlighting only the modifications and
differences with comparison to the algorithm in the EXI specification. An example of
applying the modified algorithm is given in Appendix A.
The algorithm depends on the definition of a content non-terminal symbol, and an
index called content index for each input EXI grammar. The assignment of content
index and content to a non-terminal symbol is identical to the process defined in the
EXI specification, and a prose description of it is given below:

DEFINITION: content non-terminal symbol

The content non-terminal symbol is the symbol that indicates that all attributes
(AT terminal symbols) are already encoded. The content non-terminal symbol represents
all the states for starting the encoding of the content of a particular XML element.

DEFINITION: content index

Assign index numbers to all non-terminal symbols such that the designated initial
symbol of the EXI grammar has index 0 and all other indexes are larger than 0. The
index of the content non-terminal symbol, in other words, the content index, is then
the smallest index that is larger than the indexes of all non-terminal symbols that are
used as a left-hand side in grammar productions with AT terminals.
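
The content index can be computed with a single pass over the grammar, as in the following Python sketch. It assumes that the non-terminals are stored in a list whose positions are the index numbers (position 0 is the designated initial symbol) and that attribute events are labelled with an "AT(" prefix; both conventions and the two example grammars are hypothetical.

```python
# Each grammar is a list of non-terminals; entry i holds the productions of
# the non-terminal with index i as (terminal, target index or None) pairs.
WITH_ATTRIBUTES = [
    [("AT(id)", 1)],                      # index 0: required attribute id
    [("AT(lang)", 2), ("SE(body)", 3)],   # index 1: optional attribute lang
    [("SE(body)", 3)],                    # index 2: all attributes encoded
    [("EE", None)],                       # index 3
]
NO_ATTRIBUTES = [
    [("SE(body)", 1)],
    [("EE", None)],
]


def content_index(grammar):
    """Smallest index larger than the index of every non-terminal that is
    the left-hand side of a production with an AT terminal."""
    at_indexes = [
        index
        for index, productions in enumerate(grammar)
        if any(terminal.startswith("AT(") for terminal, _ in productions)
    ]
    # Without AT productions the initial symbol itself starts the content.
    return max(at_indexes) + 1 if at_indexes else 0
```

For the first grammar the content index is 2, and for the grammar without attributes it is 0, which is the case singled out in the augmentation procedure that follows.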

DEFINITION: Grammar augmentation for schema deviations

Create a copy of all grammar productions that have the content non-terminal on the
left-hand side if and only if there are AT productions that have the content non-terminal
symbol on their right-hand side or the content index is 0. The copy of the content
non-terminal symbol, content2, if available, is inserted just before content, i.e. it has
an index of (content index - 1). When the content index is 0, this means that
content2 is now the entry non-terminal symbol of the grammar. After the copying,
there should be no productions with the content2 non-terminal on the left-hand side that
have content2 on their right-hand side; instead, they should have only content. All
AT productions that have the content non-terminal symbol on their right-hand side are
changed to point towards content2 instead.
Apply the procedure in 8.5.4.4.1 Adding Productions when Strict is False
[24] with the following modifications to the algorithm:

• The designated initial symbol of the EXI grammar is changed to content2 when
content index is 0.

• Replace each occurrence of content with content2 and vice versa, that is, each
occurrence of content2 with content.

• If there is no content2 non-terminal, then do not perform the procedure for it and
assume the content2 index is smaller than the content index, but larger than
the indexes of all non-terminals that are used in AT productions.

Performance Evaluation
The goal of this section is to evaluate the performance of the dedicated EXI grammar
generator implemented as part of the EXIP library. The grammar generator accepts
EXI-encoded XML Schema definitions as input, and uses the extended grammar concatenation
operator and the algorithm for efficient representation of schema deviations. The
measurements in this section are indicative and aim to compare the execution time and
memory usage of grammar generation on real-world data. As the core contribution of
this work is in the grammar generation utility, this section does not evaluate the overall
EXI processing performance. Measurements of the EXI parsing speed are included only
to the extent needed to put the grammar generation evaluation in context.

Description of the test setup A set of 5 XML schema documents were used for
decoding 15 instances (XML examples that conform to the schema; 3 instances per each
schema document) by 3 different EXI processors. Decoding in this experiment refers to
converting a binary EXI file to its text-based XML representation. The EXI processors
are EXIficient v0.9.1 Java [49], OpenEXI v1.0238.0 Java [50], and EXIP v0.5.3 C [51].
At the time of writing this article - June 2014, there is one more open source EXI parser
- WS4D-uEXI3 . WS4D-uEXI is written in C and is designed for constrained embedded
devices. It is not included in this comparison as it uses EXIficient library for building
the EXI grammars at compile time and therefore does not support runtime grammar
generation [52]. Moreover, WS4D-uEXI implements a subset of the EXI specification
and its current version (SVN r2) is unable to decode some of the EXI instances in this
evaluation due to missing features.
The evaluation uses the following XML schema documents: netconf.xsd⁴, SenML.xsd⁵,
sep.xsd⁶, OPC-UA-Types.xsd⁷, and XMLSchema.xsd⁸. All of them were accessed from
the local hard drive, including the imported XML schema files, so there were no
dependencies on the network performance.
³ http://code.google.com/p/ws4d-uexi/
⁴ Network Configuration Protocol: https://www.iana.org/assignments/xml-registry/schema/netconf.xsd
⁵ Sensor Markup Language: http://tools.ietf.org/html/draft-jennings-senml-10
⁶ SEP2: http://www.zigbee.org/Standards/ZigBeeSmartEnergy/SmartEnergyProfile2.aspx
⁷ OPC-UA: http://opcfoundation.org/UA/2008/02/Types.xsd
⁸ Schema for XML Schema: http://www.w3.org/2001/XMLSchema

Figure 3: Grammar generation execution times for each XML Schema test case. The
averaged times per XML schema are given on the logarithmic Y axis for each of the
tested EXI processors - EXIﬁcient (leftmost column, forward slash hatching), OpenEXI
(middle column, backslash hatching) and EXIP (rightmost column, grid hatching). Each
bar in the chart represents the execution times when explicit optimizations are applied
(lighter colored part of the bar) and when no optimizations are applied.

The tests were executed on a desktop PC (Intel(R) Core(TM)2 Duo CPU E8400 @
3.00GHz, 4GB RAM @ 1067 MHz) running 32-bit Linux Ubuntu 13.10. The version
of the Java Virtual Machine (JVM) used for running EXIficient and OpenEXI is Java
HotSpot(TM) Server VM 1.7, and the C compiler used for EXIP is GCC 4.8.1.
Two distinct measurements of the execution time were performed for each EXI pro-
cessor: (1) the time it takes for loading an XML Schema and converting it to EXI
grammars, and (2) the time it takes to generate the EXI grammars as well as decode
a sample XML instance. The time was measured using System.nanoTime() in Java
and clock_gettime() in C; in other words, we measured wall-clock time, which can vary
depending on the external load of the system. In order to get comparable results, the
tests were executed ensuring similar conditions on the system load, and taking the mean
value of 300 measurements. Moreover, the mean value is calculated for two distinct runs
of the test framework - one with optimizations and one without applying optimizations.
In the unoptimized case the Java processors run on a "cold" JVM, i.e. the code is
executed for the first time on the VM and hence the classes for grammar generation and
instance decoding are loaded at runtime. Also, the "cold" JVM has a smaller chance of
applying run-time optimizations such as Just-In-Time (JIT) compilation. Conversely,
the optimized case uses a "warmed-up" JVM where the tests are run 5 times on the JVM
before the measurements are taken. The EXIP processor is compiled with the -O0 flag for
the unoptimized case and with -O3 for the optimized run.⁹

Figure 4: Grammar generation and instance decoding execution times for each XML
Schema test case. The averaged times per XML schema are given on the logarithmic Y
axis for each of the tested EXI processors - EXIficient (leftmost column, forward slash
hatching), OpenEXI (middle column, backslash hatching) and EXIP (rightmost column,
grid hatching). Each bar in the chart represents the execution times when explicit
optimizations are applied (lighter colored part of the bar) and when no optimizations are
applied.
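
The measurement procedure described above (optional warm-up runs followed by the mean
of repeated wall-clock measurements) can be sketched as follows. This is only an
illustration in Python, not the test framework used in the evaluation: the function
measure_ms, its parameters, and the dummy workload are hypothetical names introduced
here, and Python has no JIT warm-up effect comparable to the JVM.

```python
import time
from statistics import mean


def measure_ms(task, repetitions=300, warmup_runs=0):
    """Return the mean wall-clock time of task() in milliseconds."""
    for _ in range(warmup_runs):
        task()  # warm-up executions are not measured
    samples = []
    for _ in range(repetitions):
        start = time.perf_counter_ns()  # monotonic wall-clock timer
        task()
        samples.append((time.perf_counter_ns() - start) / 1e6)
    return mean(samples)


def dummy_grammar_generation():
    # Hypothetical stand-in workload; a real test would load a schema here.
    return sum(i * i for i in range(1000))


cold = measure_ms(dummy_grammar_generation, warmup_runs=0)
warm = measure_ms(dummy_grammar_generation, warmup_runs=5)
print(f"cold: {cold:.3f} ms, warm: {warm:.3f} ms")
```

Because wall-clock time depends on the external system load, averaging many repetitions
under similar load conditions is what makes the numbers comparable.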

Figure 3 and Figure 4 show the averaged execution times for each XML schema test
case with optimizations enabled and disabled. In Figure 3 the times are for grammar
generation only, while Figure 4 shows the execution times for both grammar generation
and instance decoding. In both charts, the execution times on the Y axis are shown
on a logarithmic scale to improve readability.

Table 2: Averaged execution times (ms) for all XML Schema test cases

                 Optimized                      Unoptimized
EXI Processor    Grammar   Grammar+Instance     Grammar   Grammar+Instance
EXIficient       150.3     168.7                586.5     651.4
OpenEXI           98.8     106.9                639.3     676.2
EXIP              10.5      11.3                 14.7      15.8

⁹ The automated test framework for configuring and executing the evaluation is available as
open source at http://github.com/kjussakov/exip-eval

The execution times for grammar generation and instance decoding, averaged over all
test cases, are given in Table 2. As shown in the table, EXIP generates the
grammars about 9 times faster than OpenEXI and 14 times faster than EXIficient when
compile time optimizations for the C code and run-time JVM optimizations for the Java
code are enabled. This cannot be attributed solely to the performance difference between
native code and Java byte code execution, where on average Java programs are somewhere
between 50 % faster and 4 times slower than their C counterparts¹⁰.
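
The speed-up factors quoted above follow directly from the optimized grammar generation
column of Table 2. A minimal worked check in Python (the names times_ms and speedup are
introduced here only for illustration):

```python
# Optimized grammar generation times from Table 2, in milliseconds.
times_ms = {"EXIficient": 150.3, "OpenEXI": 98.8, "EXIP": 10.5}


def speedup(baseline, candidate="EXIP"):
    """How many times faster the candidate is than the baseline."""
    return times_ms[baseline] / times_ms[candidate]


for name in ("OpenEXI", "EXIficient"):
    print(f"EXIP vs {name}: {speedup(name):.1f}x")
# prints:
# EXIP vs OpenEXI: 9.4x
# EXIP vs EXIficient: 14.3x
```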
The superior performance of EXIP grammar generation is mainly due to the use of an
EXI-specific XML Schema parser that accepts EXI encoded XML Schema definitions, as
opposed to a general-purpose XML Schema parser. By using the extended grammar
concatenation operator (see Section 3), EXIP performs one less iteration over
the set of all grammar rules, which has noticeable benefits mainly for large XML Schemas
such as SEP2 (sep.xsd). The grammar augmentation algorithm presented in Section 3
has no effect on processing efficiency, but slightly improves memory usage. Code opti-
mizations, in terms of avoiding unnecessary loops and selecting appropriate searching and
sorting algorithms (for example the use of a hash table for mapping element definitions
to their globally defined types instead of iteration), have impact on the performance as
well but are harder to quantify.
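
The hash table optimization mentioned above can be illustrated with a small Python
sketch. It is not taken from the EXIP sources (which are written in C); the type names
and the two lookup functions are hypothetical and only contrast a linear scan with a
hashed lookup when resolving an element's globally defined type.

```python
from typing import Optional

# Hypothetical global type definitions: (type name, grammar placeholder) pairs.
global_types = [
    ("xs:string", "string-grammar"),
    ("xs:int", "int-grammar"),
    ("tempType", "temp-grammar"),
]


def find_type_linear(name: str) -> Optional[str]:
    """Resolve a type by iterating over all definitions: O(n) per lookup."""
    for type_name, grammar in global_types:
        if type_name == name:
            return grammar
    return None


# Build the index once; each later lookup takes O(1) time on average.
type_index = dict(global_types)


def find_type_hashed(name: str) -> Optional[str]:
    """Resolve a type through the hash table."""
    return type_index.get(name)


print(find_type_linear("tempType"), find_type_hashed("tempType"))
```

With many element definitions referring to global types, replacing the per-element scan
with the indexed lookup removes a nested loop from grammar generation.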

Memory usage
This section provides some insight into the memory consumption of EXIP, and EXI in
general, as memory is often a bottleneck in embedded system applications. Section 2
already discussed that the dynamic memory usage for EXI processing can be controlled
by some of the parameters defined in the EXI header. This is done by adjusting the
extent of the content indexing used to detect and reduce redundancy in the data which
also affects the compactness and processing speed. However, the mechanisms provided in
the EXI specification cannot guarantee bounded run-time memory usage when deviations
from the XML schema are present. For that purpose, an extension to these mechanisms
is developed in a complementary specification called EXI Profile for limiting usage of
dynamic memory [41]. A subset of this profile is supported by EXIP, but its impact on
the memory consumption is not evaluated in this section, as the tests presented here are
restricted to a schema-valid instance of the SenML standard. Table 3 shows the size and
memory usage during encoding and decoding for a sample instance document borrowed
from the SenML specification¹¹.
The size and memory consumption are given for different encoding options. The
platform used for testing is a Raspberry Pi embedded computer with an ARM-based system
on chip that includes a 700 MHz processor and 512 MB of RAM. The memory usage presented
in Table 3 covers only the dynamic memory (heap) usage for statically compiled
EXI grammars and is measured using DHAT (dynamic heap analysis tool), which is part
of the Valgrind code profiling suite.
¹⁰ Source: http://benchmarksgame.alioth.debian.org/u32/java.php
¹¹ Available at: http://tools.ietf.org/html/draft-jennings-senml-10#section-7

Table 3: Size of a SenML instance for different encoding modes and memory usage for
EXIP and the light-weight XML parser library MiniXML on a Raspberry Pi system. The
rows are ordered by document size.

                                     Size      RAM/heap usage (kB)
                                               EXIP                 MiniXML
Encoding mode                        (bytes)   Encoding  Decoding   Encoding  Decoding
Plain XML                            387       -         -          1.36      1.55
EXI Schema-less byte aligned         248       7.95      8.26       -         -
EXI Schema-less no value indexing    237       6.93      6.79       -         -
EXI Schema-less default options      200       7.90      8.26       -         -
EXI Schema mode no value indexing    137       1.93      2.27       -         -
EXI Schema mode default options      100       2.87      2.21       -         -
EXI Schema mode strict                98       2.89      2.23       -         -

An interesting observation is that although the document is relatively small, turning
off the indexing of repeating values (i.e. setting the valuePartitionCapacity parameter to 0)
substantially inflates the size of the resulting EXI representation. This is due to the
high redundancy in the attribute values, which has a profound effect even in schema mode
encoding. This simple example shows the high variation of compression and dynamic
memory usage depending on the content of the documents and the encoding options in
use.
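
To make the effect of value indexing concrete, the document sizes reported in Table 3
can be compared directly. The following Python lines are only a worked calculation on
the published numbers; the helper names relative_size and inflation are introduced here
for illustration.

```python
# Document sizes in bytes, taken from Table 3.
sizes = {
    "Plain XML": 387,
    "EXI Schema-less no value indexing": 237,
    "EXI Schema-less default options": 200,
    "EXI Schema mode no value indexing": 137,
    "EXI Schema mode default options": 100,
}


def relative_size(mode):
    """Size of an encoding as a fraction of the plain XML size."""
    return sizes[mode] / sizes["Plain XML"]


def inflation(without_indexing, with_indexing):
    """Relative size increase caused by disabling value indexing."""
    return sizes[without_indexing] / sizes[with_indexing] - 1


print(f"{relative_size('EXI Schema mode default options'):.0%} of plain XML")
print(f"schema mode: +{inflation('EXI Schema mode no value indexing', 'EXI Schema mode default options'):.1%}")
print(f"schema-less: +{inflation('EXI Schema-less no value indexing', 'EXI Schema-less default options'):.1%}")
```

For this sample, disabling value indexing makes the schema mode encoding 37 % larger
(137 vs. 100 bytes) and the schema-less encoding 18.5 % larger (237 vs. 200 bytes).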
The compile-time allocated RAM used by the EXIP library (calculated as the sum of the
.rodata, .data and .bss sections in the Executable and Linking Format (ELF)) is 23 kB
(of which 8 kB are EXI grammar definitions used in the schema mode cases), while the
light-weight XML parser MiniXML v2.8 requires only 3 kB. The EXIP SenML parser uses 79 kB
of programming memory while MiniXML uses only 16 kB. Additionally, as shown in Table
3, MiniXML is more efficient in the use of dynamic memory compared to EXIP. These
results indicate that EXI processing, and the EXIP library in particular, requires more RAM
compared to highly optimized XML processing. The main reason for this is the use of
content indexing and grammar information during EXI processing. Further optimizations
of the RAM usage in EXIP are possible both for the size of the content index as well
as the in-memory grammar representation. It should also be noted that schema-based
EXI processing implicitly performs partial schema validation while MiniXML is a non-
validating parser.
Enabling run-time EXI grammar generation from the SenML schema additionally
requires 57 kB of dynamic memory and 37 kB of programming memory. These memory
requirements show that the run-time grammar generation module fits easily in embedded
devices such as Raspberry Pi but is too heavy for the most constrained platforms. As an
example, the popular Stellaris LM4F120H5QR 32-bit ARM Cortex-M4F microcontroller
(80 MHz CPU frequency, 256 KB flash and 32 KB SRAM) does not have enough RAM for
supporting run-time EXI grammar generation. Nevertheless, by using static grammars
the EXIP library is capable of running on such platforms with averaged total RAM usage
of about 20 kB¹² and 60 kB of programming memory for the SenML sample instance.
¹² The RAM usage in schema mode is 20 kB (1 kB stack size + 2.5 kB heap + 16.5 kB .data and .bss).
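
The feasibility argument above is a simple memory budget check, which can be written out
as a short Python calculation. The figures are the ones reported in this section; the
variable names are introduced here for illustration, and kB and KB are treated as the
same unit for this rough estimate.

```python
# Stellaris LM4F120H5QR resources as stated in the text.
SRAM_KB = 32
FLASH_KB = 256

# Static-grammar (schema mode) RAM usage: stack + heap + .data and .bss.
static_ram_kb = 1 + 2.5 + 16.5
static_flash_kb = 60

# Additional cost of run-time grammar generation from the SenML schema.
runtime_extra_ram_kb = 57
runtime_extra_flash_kb = 37

fits_static = static_ram_kb <= SRAM_KB and static_flash_kb <= FLASH_KB
fits_runtime_ram = static_ram_kb + runtime_extra_ram_kb <= SRAM_KB
fits_runtime_flash = static_flash_kb + runtime_extra_flash_kb <= FLASH_KB

print(f"static grammars fit: {fits_static}")
print(f"run-time generation fits in SRAM: {fits_runtime_ram}")
print(f"run-time generation fits in flash: {fits_runtime_flash}")
```

Under these numbers the limiting resource is SRAM: programming memory would suffice, but
57 kB of additional dynamic memory alone exceeds the 32 KB available.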

4 EXI data binding

The information contained in an XML/EXI document is often loaded into memory
for further processing and mapped to a hierarchy of data structures or objects that are
maintained by the applications. For example, a status report by a device can include
various hierarchical information such as network status (which in turn contains parameters
like RTT, signal strength, connected peers etc.) or resource utilization (storage space,
battery level etc.) that is mapped to a corresponding hierarchy of programming objects.
The process of generating an XML/EXI document from a hierarchy of objects and vice
versa is known as XML/EXI data binding. The process of building objects from an
XML/EXI input document is called unmarshalling and the reverse - the generation of
XML/EXI output document from objects, is called marshalling. The unmarshalling
is implemented as a software module that connects to the parser API, and generates
memory structures that correspond to the structure and content of the XML document.
The marshalling is implemented as a module that transforms a set of objects in the
memory to a sequence of calls to the serialization API.
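
As an illustration of marshalling and unmarshalling, the following Python sketch maps
the status report example to objects and back. It uses plain XML through the standard
library module xml.etree.ElementTree as a stand-in for an EXI parser and serializer, and
all class, element, and function names are hypothetical.

```python
import xml.etree.ElementTree as ET
from dataclasses import dataclass


@dataclass
class NetworkStatus:
    rtt_ms: int
    signal_strength: int


@dataclass
class StatusReport:
    network: NetworkStatus
    battery_level: int


def marshal(report: StatusReport) -> str:
    """Objects -> document: a sequence of calls to the serialization API."""
    root = ET.Element("status")
    net = ET.SubElement(root, "network")
    ET.SubElement(net, "rtt").text = str(report.network.rtt_ms)
    ET.SubElement(net, "signal").text = str(report.network.signal_strength)
    ET.SubElement(root, "battery").text = str(report.battery_level)
    return ET.tostring(root, encoding="unicode")


def unmarshal(document: str) -> StatusReport:
    """Document -> objects: build memory structures from the parsed input."""
    root = ET.fromstring(document)
    net = root.find("network")
    return StatusReport(
        network=NetworkStatus(int(net.findtext("rtt")), int(net.findtext("signal"))),
        battery_level=int(root.findtext("battery")),
    )


report = StatusReport(NetworkStatus(rtt_ms=42, signal_strength=-67), battery_level=88)
print(marshal(report))
print(unmarshal(marshal(report)) == report)
```

Hand-written code of this kind has to be updated whenever the document structure
changes, which is the maintenance burden that generated bindings aim to remove.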
The XML/EXI data binding code can be complex to write and maintain manually.
For that reason, it is often automatically generated. There are two main approaches
when generating the code and keeping it in sync with the XML/EXI documents - direct,
and indirect mapping. In direct mapping, the source code is generated based on XML
schema definitions or vice versa - the XML schema can be built based on the existing
source code definitions. When no schema information is available or needed, the XML
tree can be directly mapped to a memory representation, as in the case of the Document
Object Model (DOM). The data binding frameworks that are based on direct mapping of
the XML Information Set and the memory representation, are widely adopted in desktop
and enterprise applications - examples include DOM, JAXB, XMLBeans, and others [53].
Their main advantage is that it is very easy to build and maintain the XML-to-source
code mapping. An example of a pure XML direct mapping framework for embedded
systems development is the gSOAP toolkit [54]. A similar approach, but applied to
EXI and targeted at highly resource-constrained embedded devices is the automatic EXI
Processor generation reported by [55].
The indirect mapping is a more flexible approach that allows discarding the unnec-
essary XML structures or reusing existing objects in the memory by defining a layer of
indirection between the XML Information Set and the memory representation. Example
libraries in this category include Castor and JiBX [56] - both only available in Java, and
targeted at server/desktop applications. A comparison between the two approaches, i.e.
direct and indirect mapping, along with performance measurements, is presented by
Sosnoski in an IBM developerWorks article on data binding tools for Java/XML [57].
The EXI binding presented in this section falls into the category of indirect mapping,
and it is targeted at embedded systems development. Its design is based on the following
requirements:

¹² (continued) The RAM usage in schema-less mode is 19 kB (1 kB stack size + 7.5 kB heap + 10.5 kB .data and .bss).

• The mapping rules should have intuitive syntax and semantics.

• The binding definitions should be independent of the programming language in
use - the same binding definition should work for programs written in C, Java,
Python, and so on.

• The EXI binding should be eﬃcient to use on embedded platforms.

• The mapping layer should allow for loading the binding deﬁnitions and building
the objects in memory dynamically at run-time.

To optimally fulfill these requirements, we propose template-based binding definitions
that are written in XML and converted to EXI before being used for code generation
or loading at runtime. The binding templates are very similar to other template-based
frameworks for dynamic content delivery, such as JavaServer Pages (JSP) technology.
An in-depth overview of template-based code generation is presented by [58], where the
authors describe the theoretical foundations of template systems and include a comparison
with other code generation techniques. The proposed EXI template framework is a
heterogeneous code generator that follows the model-view-controller design pattern as
suggested by [58].
Figure 5 compares this approach with a commonly used method for defining such
bindings. As depicted, the mapping between dynamic EXI
content and programming constructs is done using the special character @ and a semicolon
notation. As such, the definitions are intuitive to write as well as simple to process by the
loading code. As with other template-based approaches, these special characters
must be escaped when used in static content. As an example, the value for a static
attribute email within an EXI binding definition should be defined as example@@com
to escape the special character that indicates the beginning of dynamic content mapping.
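
The escaping rule can be illustrated with a short Python sketch. The exact template
syntax is defined by Figure 5 and is not reproduced in the text, so the sketch assumes
that a dynamic mapping is written as @name; and that @@ denotes a literal @ character;
the function expand_template and the context dictionary are hypothetical.

```python
def expand_template(value: str, context: dict) -> str:
    """Expand '@name;' mappings and unescape '@@' in a template value."""
    out = []
    i = 0
    while i < len(value):
        if value[i] != "@":
            out.append(value[i])  # ordinary static character
            i += 1
        elif value.startswith("@@", i):
            out.append("@")  # escaped special character
            i += 2
        else:
            end = value.find(";", i)
            if end == -1:
                raise ValueError("unterminated dynamic mapping")
            out.append(str(context[value[i + 1:end]]))  # dynamic content
            i = end + 1
    return "".join(out)


print(expand_template("example@@com", {}))       # prints example@com
print(expand_template("@temp; C", {"temp": 21}))  # prints 21 C
```

A single left-to-right pass is enough because the special character always announces
either an escape or the start of a mapping, which keeps the loading code simple.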

5 CoAP/EXI/XHTML Web page engine

This section presents a prototype implementation of a dynamic Web interface for an
embedded sensor platform based on CoAP/EXI/XHTML technologies. The implemen-
tation is developed using the EXIP framework, and consists of an experimental Java
browser running on a laptop PC that connects to a wireless sensor device (Mulle version
3.2 [59]) over Bluetooth. The laptop user can navigate to the device Web interface using
mDNS/DNS-SD or CoAP built-in discovery capabilities - multicast service discovery [37],
or CoRE Resource Directory [60]. In our simplified test setup, the network address of
the sensor device is predefined, so the discovery process was not implemented.
The EXI encoded XHTML page is dynamically generated on the sensor platform on
a CoAP GET request, and it contains an iframe tag with a link to an external observable
CoAP resource:
...
<p>Current temperature is:</p>

Figure 5: Comparison between typical binding deﬁnitions and the EXIP templates

[Sequence diagram. Left: Debian Linux laptop PC running the CoAP/EXI prototype Java
browser (flyingsaucer v8, OpenEXI v1.0238.0, Californium v0.13.1). Right: wireless sensor
platform (16-bit MCU, 47 kB RAM) running the CoAP/EXI/XHTML engine (libcoap v4.0.1,
EXIP v0.5.1). Protocol labels: Bluetooth v2.0, UDP, CoAP v13. Messages over time:
CoAP GET /exip, answered by GET response + ACK with EXI/XHTML payload;
CoAP GET Observe /temp, answered by GET response + ACK with plain text 18 (displayed
as 18º C); after new sensor readings, a GET response notification with plain text 21
(dynamically updated content, displayed as 21º C), followed by an ACK notification.]

Figure 6: CoAP/EXI/XHTML dynamic Web interface demonstration

This prototype demonstrates how the newly emerging binary Web protocols can be
employed to enable a dynamic Web interface for highly resource-constrained embedded
devices. The Web interface can be used in a wide range of mobile applications, as sug-
gested by [61]. The approach of using the iframe tag with CoAP Observe enables very
lightweight event-based content delivery that is suitable for low-power radio communi-
cations such as IEEE 802.15.4 (6LoWPAN, ZigBee), Z-Wave, or Bluetooth low energy.
Example application domains for the EXIP framework include, but are not limited to:
industrial process monitoring and control, eHealth and elderly care, wearable electronics,
home automation, and energy management.
Data visualization technologies based on XML encoding such as SVG¹⁴ and X3D¹⁵ can
be readily included in the CoAP/EXI/XHTML engine to efficiently represent graphical
indicators (e.g., battery level, signal strength) and visualize measurements and configu-
ration parameters. An evaluation of EXI encoding for SVG in rich media applications for
embedded systems presented by [62] shows that EXI significantly increases the efficiency
of the SVG format. Also shown in this work is an approach using the EXI header option
datatypeRepresentationMap to further optimize the compression of graphics formats for
embedded web applications.
¹⁴ Scalable Vector Graphics (SVG): http://www.w3.org/TR/SVG11/
¹⁵ X3D Specification for 3D Graphics: www.web3d.org/x3d

The presented CoAP/EXI/XHTML Java browser always tries to subscribe to the
iframe CoAP links - if the resource is not observable, the subscription is not established.
When the resource is observable but should be treated statically for display in the browser
(for example representing a snapshot of dynamic data), the embedded server should reject
the subscription request by the browser. This approach can be too limited in certain
scenarios, in which case different ways to indicate whether the browser should subscribe
to changes on the iframe resource can be employed - adding an extra boolean argument
observe to the iframe tag as an XHTML schema deviation, or requesting the resource
description in CoRE Link Format before sending the subscription request. Similarly, in
more complex scenarios the use of plain text encoding for the iframe resource might be
too limiting. In such cases a structured format such as EXI can be used instead of plain
text. The data format (including parameters and schema if available)
for a particular iframe can be specified as an XHTML schema deviation or read from the
CoRE Link Format, as suggested for the observe use case.
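
The subscription behaviour described above can be mimicked with a small in-memory
simulation. The Python sketch below does not use a CoAP library and does not model
CoAP messages; the Resource class and its methods are hypothetical and only show the
logic of a browser that always requests observation and a server that may decline it.

```python
class Resource:
    """In-memory stand-in for an embedded server resource."""

    def __init__(self, value, observable=True):
        self.value = value
        self.observable = observable
        self._observers = []

    def get(self, observe=False, callback=None):
        """Return (value, subscribed); non-observable resources decline."""
        subscribed = bool(observe and self.observable and callback)
        if subscribed:
            self._observers.append(callback)
        return self.value, subscribed

    def update(self, new_value):
        """Store a new reading and notify every subscriber."""
        self.value = new_value
        for notify in self._observers:
            notify(new_value)


shown = []  # what the simulated browser displays in the iframe over time
temp = Resource("18")
value, subscribed = temp.get(observe=True, callback=shown.append)
shown.append(value)
temp.update("21")
print(shown)  # prints ['18', '21']

snapshot = Resource("18", observable=False)
print(snapshot.get(observe=True, callback=shown.append))  # prints ('18', False)
```

In this simulation the server side alone decides whether a subscription is
established, which corresponds to the default behaviour of the prototype; an explicit
observe argument on the iframe tag would move that decision to the page author.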

Implementation details The information provided hereafter gives more insight into
the actual implementation, and is useful for reproducing the test setup. The Mulle
sensor platform has a 16-bit Renesas M16C/62P microcontroller and Mitsumi Bluetooth
2.0 wireless module. The application runs on bare metal, in other words, without an OS,
on top of a port of lwIP TCP/IP stack and a libcoap v4.0.1 library. The EXI/XHTML
generation is done in schema-less mode using EXIP v0.5.1.
The laptop PC is running Debian Linux, and is equipped with a USB Bluetooth
2.0 adapter. Debian packages bluez-compat v4.99-2, bridge-utils v1.5-6, and isc-dhcp-
server were installed and conﬁgured on the system to enable TCP/IP communication
over Bluetooth.
The size of the EXI/XHTML Web page is 239 bytes, and it is generated directly in
binary (EXI) form without an intermediate plain XML representation. If converted to text
XHTML, the size is 427 bytes. The temperature notifications are in plain text, and account
for 14 bytes of CoAP packet size (UDP payload) in total, assuming 2 bytes for the plain text
temperature value.

6 Conclusions
The newly emerging transport and data representation protocols based on binary encod-
ing - CoAP and EXI - provide an eﬃcient way to connect embedded systems to the Web
across scenarios as diverse as mobile computing, home automation, and smart grid. As
the translations between CoAP ⇔ HTTP and EXI ⇔ XML are well defined, the
integration of these binary protocols into the existing Web infrastructure is standardized and
conforms to the well-established programming interfaces. For example, EXI processors
often provide the same API as XML processors, and CoAP/HTTP proxies are simple to
deploy and are transparent for the Web users.
The work presented in this paper shows that the use of the CoAP/EXI stack and the
EXIP Web development toolkit enables reuse of the existing pool of Web technologies
and developers' skills, even on very resource-constrained embedded platforms. The
development process, and especially the integration with existing systems, is much faster
and easier to maintain compared to the use of handcrafted communication protocols.
Moreover, the presented EXI processor design, and the EXI grammar generation algo-
rithms in particular, provide superior processing performance compared to the methods
described in the EXI specification with order-of-magnitude speed-up in some of the test
cases. This could enable exchange patterns supporting dynamic XML schema negotia-
tions even for embedded hosts. The use cases for such an approach include support for
schema versioning, generic Web services, and runtime service composition.
Finally, the presented prototype of dynamic Web interface for sensor platforms demon-
strates the possibility to use event-based Web content delivery with a very low overhead
in terms of network bandwidth and processing power. The development of the Web
interface or Web service exchange can be automated by using the template-based EXI
data binding. As the data binding creates indirect mapping between the EXI document
and the programming constructs, the memory structures and programming objects can
be reused when generating or decoding the EXI streams.
Possible extensions of this work include in-depth memory consumption evaluation and
trade-off analysis as well as developing a formal specification of the EXIP data binding,
and implementing prototypes in C and Java to evaluate the proposed approach against
existing XML data binding frameworks. Providing support for light-weight client-side
scripting as part of the CoAP/EXI/XHTML embedded Web programming is also an
interesting and important topic for future investigation. It is also worth analyzing the
application of CoAP/EXI, and the EXIP framework in particular, for mobile platforms
and even for desktop applications that are not resource-constrained. Lowering the net-
work traffic and CPU cycles for Web content delivery on mobile phones and tablet PCs
could potentially increase the battery life for these devices, lower the networking cost for
both operators and users, and even lead to energy savings if applied on a global scale.

APPENDIX A: Grammar Augmentation Algorithm Example

This appendix gives an example of how the augmentation procedure is applied to the
wildcard XML schema type anyType, which is the base type definition for all other XML
schema types. A minimal (without redundant productions) EXI grammar that describes the
content model according to the process of creating proto-grammars is:

anyType-0:
    AT(*) anyType-0
    SE(*) anyType-1
    EE
    CH anyType-1