0% found this document useful (0 votes)
70 views11 pages

Necessity of Big Data and Analytics For Good E-Governance: September 2017

Uploaded by

Arun
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
70 views11 pages

Necessity of Big Data and Analytics For Good E-Governance: September 2017

Uploaded by

Arun
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 11

See discussions, stats, and author profiles for this publication at: https://www.researchgate.

net/publication/319477823

Necessity of Big Data and Analytics for Good e-governance

Article · September 2017


DOI: 10.14257/ijgdc.2017.10.8.02

CITATIONS READS

8 817

4 authors, including:

Bhavya Patel Sudipta Roy

1 PUBLICATION   8 CITATIONS   
Washington University in St. Louis
67 PUBLICATIONS   416 CITATIONS   
SEE PROFILE
SEE PROFILE

Debnath Bhattacharyya
Vignan's Institute of Information Technology
243 PUBLICATIONS   1,324 CITATIONS   

SEE PROFILE

Some of the authors of this publication are also working on these related projects:

A review on bio medical image processing View project

Bigdata analytics View project

All content following this page was uploaded by Sudipta Roy on 17 September 2017.

The user has requested enhancement of the downloaded file.


International Journal of Grid and Distributed Computing
Vol. 10, No. 8 (2017), pp.11-20
http://dx.doi.org/10.14257/ijgdc.2017.10.8.02

Necessity of Big Data and Analytics for Good e-governance

1 2 3 4
Bhavya Patel , Sudipta Roy , Debnath Bhattacharyya and Tai-Hoon Kim
1,2
Department of Computer Science and Engineering, U.V.Patel College of
Engineering, Ganpat University, Kherva, Mehsana-3840112, Gujarat, India
3
Department of Computer Science and Engineering, Vignan’s Institute of
Information Technology, Visakhapatnam 530049, AP, India
4
Department of Convergence Security, Sungshin Women's University, 249-1,
Dongseon-dong 3-ga, Seoul, 136-742, Korea
bhavya.bda1407@ict.gnu.ac.in, sudiptaroy01@yahoo.com, debnathb@gmail.com,
taihoonn@daum.net

Abstract
Big Data is the most trending topic in the computer science field. This big data
analysis is booming field in present days and used to get the insight in the particular
domain. Big data analytics is used to solve many problems and get the predations of many
upcoming problems. Big Data refers to the huge dataset which is not managed and
handle by old traditional methods. Big Data plays a very important role in effective
management of e-governance. In this paper we collect the kishan call center (KCC) data
as the data is in semi structured format i.e in JavaScript Object Notation (JSON) format
so we used mongoDB to store. For analysis purpose before using the data we had
converted it into the structured format. After converting into the structured format we
created charts for better understanding. This paper focuses on how the big data and
analysis is helpful for good e-governance. This big data is useful to get the insight and
predictive analysis that gives good e-governance.

Keywords: Big Data, Data Analytics, E-Governance, Public Sector, Structured Data, Unstructured
Data

1. Introduction
Congratulations! New technologies contribute a lot in increasing the data day by day.
Social media and the audio and video data and other unstructured data contributed a lot in
the data growth. When generated data crosses the limit of GBs it falls into the big data
category. As per definition of Big Data, data must at least follow the 3 V‟s Volume,
Velocity, and Variety. Approximately of 2.5 quintillions of data are generated by users per
day [1] and is huge data to manage from the management point of view. So Big Data is the
technology to manage, store and handle this huge amount of data having various variety
generating very rapidly daily. Analyzing this huge data is also challenging and can be solved
by various tools like Hadoop, MapReduce, and Pig. Importance of e-governance is very
important for the developing countries like India which lead to the creation of large data set
within country. Indian government is also planning to implement the concept of smart cities
which in future will contribute large chunk of data. Analyzing these huge data in turns give
the result which is helpful for prediction of future projects. The source to collect the data are
social media, device or sensor generated data, internal and external data, surveys done by
government and other private company and many other methods. For e.g.: from social media
like twitter, survey data done by government and private companies like during census
counting or data collected by private company for exit poll. Following Figure 1 shows the
basic overview of data sources analysis of big data.

ISSN: 2005-4262 IJGDC


Copyright ⓒ 2017 SERSC Australia
International Journal of Grid and Distributed Computing
Vol. 10, No. 8 (2017)

Data sources

Social media Device/sensor Survey data

Figure 1. Different Data Sources

Any government entity or organization that uses the information and communication
technologies to operate their business and services is known as e-government [2]. E-
governance is the collection of services of the government provided to the people [2]. In
India national e-governance plan NeGP is implemented by Department of Electronics and
Information Technology (DeitY) and Department of Administrative Reforms and Public
Grievances (DARPG)[2]. The vision of NeGP is “Make all Public Services accessible to
the common man in his locality, through common service delivery outlets and ensure
efficiency, transparency and reliability of such services at affordable costs to realize the
basic needs of the common man” [3]. For better governance it is very important know the
trend, pattern and the sentiment of the public for the policies or schemes or rules they
have formed. Generally, the government follows the objectives for e-governance [4]: i)
To build an informed security, ii) Increase government and citizen interaction, iii)
Encourage citizen participation, iv) Bring transparency in governing process, v) Make
government accountable, vi) Reduce cost of governance and vii) Reduce the reaction time
of government.
There are typical four types of model for e-governance and they are: i)
G2G(Government to Government), ii) G2C(Government to Citizens), iii)
G2B(Government to Business), and iv) G2E(Government to Employee).
In this paper we have discussed the benefits of the big data and analytics in the e-
governance. Using the analytics generates the reports and this report is analyzed to take
action on the new reforms or scheme or plans. There are methods like data mining to get
the insight from the data which is helpful for betterment of e-governance.

2. Big Data Analytics for E-Governance


Government of India (GoI) uses the information technology to deliver the information
in the field of health care in rural areas. Also government is delivering the information
related to the government scheme to the public of rural area. Indian government has
started this project with an aim and objective of removing of duplicate entries from the
government records and to provide high quality low error database, generates the reports
automatically or regular basis. Use of data electronically for the future purpose and also
some data is electronically available to the people. The traditional e-governance model is
based on the customer satisfaction, process the information and e-governance as tool for
government. Figure2 is the e-governance working framework using traditional model [5].

12 Copyright ⓒ 2017 SERSC Australia


International Journal of Grid and Distributed Computing
Vol. 10, No. 8 (2017)

Figure 2. e-governance Framework using Traditional Model [5]

Matthias Finger in his paper[6] suggest the new model for e-governance and compare
the traditional method customer satisfaction , process the information and e-governance as
tool for government with the newly proposed method. Following Figure 4 is the suggested
new model by National e-Governance Plan (NeGP) in network for information and
computer technology (NICT).

Figure 3. NICT Model in e-governance

As per this NICT model there is need of policy making, regulation and operations. Before
taking the huge decision one should analyzed the previous data and upon the result one should
take the decision. To get better understanding of behavior of data one should use big data
analytics. If the decision making is done big data analytics will surely result into better
decision. Indian government uses big data and analytics for implementing various policies

Copyright ⓒ 2017 SERSC Australia 13


International Journal of Grid and Distributed Computing
Vol. 10, No. 8 (2017)

and to make change and reforms in existing system. The biggest contributor of big data in
India is the „Aam ADHmi ka AdhikAR‟ as „AADHAR‟ scheme Unique Identification
Authority of India (UIDAI) which give the unique. 100 crore Unique ID number has been
generated as on 4thApril 2016 [7]. This AADHAR number are used and linked with bank
account, Liquefied Petroleum Gas (LPG) connection, „Mahatma Gandhi National Rural
Employment Gurantee Act (MANREGA)‟ job cards, „Indian railway catering and tourism
corporation limited and passport (IRCTC)‟, and passport. Now this data is huge and is link
with many other schemes to maintain we need big data and to get the insight we need the big
data and analytics technology. Similarly, the digital India initiative of an India government
aims at ensuring the government services are made available to citizens electronically by
reducing paper work which in turns make the process fast. There are many other sources from
which data is generated in huge amount and which follows 3‟Vs of Big data. Following
diagram shown in Figure 4 suggest the main sources of data in India.

Figure 4. Main Sources of Data in India

We get the benefits from analytics like predictive policing, evolving new market,
increase operational frequency, and better consumer services.

3. Storing of e-governance Data


Government and its organizations are the biggest producers of data in any country.
This is the big question that how to store the different type of data. In e-governance there
are many different types of format of data. It is difficult to store it in the RDBMS as it
need predefined database schema. As the data structure of government data is always
changing. To solve this problem one should create different database structure of every
format of the data coming from the public. This is very hard to handle and maintain the
database, also it proves very expensive. When the data increase in huge amount to
petabytes it is very difficult to retrieve the data and also it takes very long time to give the
result. So it is not feasible to store the data in the RDBMS. But the transaction data must
be stored in RDBMS as it provides ACID properties. Another way to solve the storage
problem of data is to store the data in Not Only Structure Query Language (NOSQL)
Database. In NOSQL there are main 4 types of data model, they are (i) document-
oriented, (ii) columnar database, (iii) key-value database, and (iv) graph database. The
reason to select the NOSQL database is that NOSQL provides flexible schemas.
Document-oriented database is best for storing the semi structured data like JavaScript
Object Notation (JSON) format file. It uses the whole document structure to store the data.
It doesn‟t need any predefined schema and one can add new attributes when required. We
have given an example of format that can be store in NOSQL databases is shown in
Figure 5 below.

14 Copyright ⓒ 2017 SERSC Australia


International Journal of Grid and Distributed Computing
Vol. 10, No. 8 (2017)

Figure 5. Document Database (couchbase)

NOSQL documents database have no need of joins as it stores the data in single document
and is shown in Figure 5. As most the data in government are in semi structured data so
document-oriented database is the best suited for the e-governance. For e.g.: census data will
be stored and maintained well in document oriented model of NOSQL. Also the health record
of public in hospital is also well maintained in NoSQL document oriented model. One
example of document database has been shown in Figure 6 below within box.

{ _id: "abc_123",
aadhar: "1234 45678 9090
9090", data: {
Health: {
[ {
Date: "10/08/2011",
Physically_handicapped:
"NO" Any_Allergy: "Tonsils"
},
Medicine: {Date: "09/28/2011",
Amount: "12"
Quantity: "5"
frequency_per_day:"1" },
]
},
}
formdef: “fs_forms_medication_1”,
metadata: {
changed-by: “abc_123”,

Figure 6. Example of document database

The other one which is also very helpful is graph database. It is used to find the
relations between the entities and use those relations for getting more insight in the data.
For example, if father or mother is having some allergy than there may be the chances of
having the same allergy to his child. This we can plot the using the graph database by
relating child and father/mother entity by parent child relation. We can take early
precaution in this case and may help to cure that allergy and it is shown in Figure 7.

Copyright ⓒ 2017 SERSC Australia 15


International Journal of Grid and Distributed Computing
Vol. 10, No. 8 (2017)

Figure 7. Graph Database showing Parents Child Relation along with


the Allergy Parents have

4. Different Methods for Big Data Analytics


This section of paper discussed about the different approaches and methods for big
data analytics. There several methods out of that most commonly highly used are: i) data
mining, ii) text analytics, and iii) machine learning. Data mining is the technique or the
process of analyzing and managing the data from the different perspective and to get the
useful insight by summarizing. In short data mining is the process to correlates the fields
in large datasets. Text analysis is a technique to analyzing the specific text or finding
some pattern from huge data sets of file. Text analysis can be done on social network
data, web data, and documents. Machine Learning-is a technique which guide the
machine to learn to do analysis by using various algorithm. It focuses on the dynamic
programs that change as per the new data. It helps in performing the analysis by learning
the data analysis algorithm. In machine learning it learns more and more and become
more accurate by experiences. The machine learning algorithm applied in the mind
reading wizard [8, 9, 10, 11, and 12] which neither take any input nor read. It will
communicate by the users thought. It is one of good example of machine learning. There
are many more methods like sentiment analysis, regression analysis, classification tree
analysis, Association rule learning, genetic algorithms, and Social network analysis.

5. Result and Discussion


Here one use case related to e-governance has been shown in Figure below. This use
case is related to the farmers asking query to the kishan call center (KCC). This data is
taken from the website „data.gov.in‟. The data is in semi structure format that is in JSON
format. So we stored this data in NOSQL database using MongoDB. Here is the
screenshot of our MongoDB data has been shown in Figure 8 below.

16 Copyright ⓒ 2017 SERSC Australia


International Journal of Grid and Distributed Computing
Vol. 10, No. 8 (2017)

Figure 8. KCC Data in JSON Format Stored in MongoDB

This data contain only information queries from Puri and Shambalpur districts of
Odisha, India. Now we had done the analysis of this data and get the insight. We get the
insight that from which region the numbers of queries are more, also in which season they
the more queries. Using the insight get from this data or from the analysis the government
can plan which type of campaign they have to start. Also we analyze for which crop the
queries are more and from this analysis government start program which create awareness
among the farmers related to the method of cultivation of that crop and what other
measures should be taken. Here are some screenshot that we get from the analysis of data.

Figure 9. Number of Queries Related to the Season from KCC Data

Kharif crop are basically a summer crops which are cultivated during rainy season. From
above Figure 9 we get to know that the farmers are having problems or queries or lack of
knowledge related to kharif crop so government can setup their camp especially for kharif
crops. Rabi crops are crops sown in winter and harvested in spring. Zaid or Jayad crops are
crops which are irrigated and harvested in between of kahrif and rabi crop season.

Copyright ⓒ 2017 SERSC Australia 17


International Journal of Grid and Distributed Computing
Vol. 10, No. 8 (2017)

Figure 10. Analysis of KCC Data

From the Figure 10 we acquire the knowledge that the farmers from Puri and
Shabmhalpur district are more related to vegetables farming. Also we get to know that the
farmers are having the less knowledge or more queries related to vegetables crops. So
government can plan some public programs to teach new methods for the good cultivation
of vegetables crops. Know we know that the farmers from Puri and Shambhalpur district
are facing the problems in vegetables crop cultivation, we can dig deeper in the data and
get more precise insight by getting the knowledge that in which particular vegetable crop
the farmers are facing problems.

Figure 11. Percentage of Queries of Particular Vegetables Crop

Figure 11 shows a dig deeper in the data and to get the insight that in the vegetable crop the
highest of the queries are related to the “Brinjal” crop and second place is taken by
“Bhindi(Lady’s Finger) and Onion”. Using this insight government can plan to setup the

18 Copyright ⓒ 2017 SERSC Australia


International Journal of Grid and Distributed Computing
Vol. 10, No. 8 (2017)

training camp for the cultivation of vegetables crops having more focus on the Brijanl,
Onion and Bhindi(Lady‟s Finger). From this government came to know that in particular
season in particular district which type of crop cultivation is in higher trend among the
farmers.

6. Conclusion
If big data can be used in right way with some good analytics approach for e-
governance domain it will surely improve the methods of work to be done. It will give
rise to the new ideas and speed up the procedure. New reforms or scheme can be analyzed
before implementing which make more chances of successful implementation of the
scheme. Big data analytics in e-governance can give rise for effective services to the
citizens. Implementing big data and analytics in e-governance can provide more effective
idea. Although there are some challenges but research work are going in this field. The
main advantage to big data and analytics in e-governance is to implement the schemes or
services which really the public wants. Also government can efficiently setup their camps
for different purpose we in turn beneficial to the public and create the awareness in public
related to schemes or programs

References
[1] C. Dobre and F. Xhafa, “Intelligent services for big data Science”, Future generation computer systems,
Elsevier, Volume 37, (2014), pp 267-281.
[2] Fatima-Zahra Benjelloun, Ayoub Ait Lahcen, Samir Belfkih, “An Overview of Big Data Opportunities,
Applications and Tools,” Intelligent Systems and Computer Vision (ISCV), IEEE, pp. 1-6, March (2015).
[3] http://research.omicsgroup.org/index.php/National_e-Governance_Plan#Vision (accessed on:- 2/4/2017 )
[4] http://indiaegovernance.blogspot.in/2008/03/objectives-of-e-governance.html #Vision(accessed on :-
2/4/2017)
[5] Mahesh Kumar Sharma, Kunwar Singh Vaisla “Towards Cloud Supported E-Governance Services
Delivery Model,” 2014 Fourth International Conference on Communication Systems and Network
Technologies (CSNT), IEEE, pp. 537-539, (2014).
[6] Matthias Finger, GaellePecoud, “From e-Government to e-Governance? Towards a Model of e-
Governance,” 3rd European Conference on e-Government Switzerland, pp. 52-60, (2016).
[7] http://pib.nic.in/newsite/PrintRelease.aspx?relid=138555#Vision(accessed on :- 3/4/2017)
[8] http://www.csml.ucl.ac.uk/projects/pascal-outreach-drupal-6.20/?q=node/29#Vision(accessed on :-
3/4/2017)
[9] JagdevBhogal, Imran Choksi, “Handling Big Data using NoSQL,” 2015 29th International Conference on
Advanced Information Networking and Applications Workshops, IEEE, pp. 393-398, (2015).
[10] Rhoda C. Joseph, Norman A. Johnson, “Big Data and Transformational Government,” IT Professional
IEEE, Volume 15, Issue 6, pp.1520-9202, (2013).
[11] A. Cardenas, P. Manadhata, S. Rajan, "Big data analytics for security”, IEEE Security & Privacy, vol.
11, no. 6, pp. 74-76, Nov (2013).
[12] Rajagopalan M.R, Solaimurugan Vellaipandiyan, “Big Data Framework for National e-Governance Plan”,
2013 Eleventh International Conference on ICT and Knowledge Engineering, IEEE, pp.1-5, Nov (2013).

Copyright ⓒ 2017 SERSC Australia 19


International Journal of Grid and Distributed Computing
Vol. 10, No. 8 (2017)

20 Copyright ⓒ 2017 SERSC Australia

View publication stats

You might also like

pFad - Phonifier reborn

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.


Alternative Proxies:

Alternative Proxy

pFad Proxy

pFad v3 Proxy

pFad v4 Proxy