Cloud Applications-Edited-2
Cloud Applications-Edited-2
6
Processing pipelines
Indexing large datasets created by web crawler engines.
Data mining - searching large collections of records to locate
items of interests.
Image processing .
Image conversion, e.g., enlarge an image or create thumbnails.
Compress or encrypt images.
Video transcoding from one video format to another, e.g., from AVI
to MPEG.
Document processing.
Convert large collections of documents from one format to
another, e.g., from Word to PDF.
Encrypt documents.
Use Optical Character Recognition to produce digital images
of documents.
Cloud Computing: Theory and Practice
7
Batch processing applications
Generation of daily, weekly, monthly, and annual activity reports for
retail, manufacturing, other economical sectors.
C B
a b c
B A B
A A
XOR XOR C OR
C B C
d e f
B B
A A
AND XOR D AND DIS D
C C
g h
B
B
A X
XOR
A C
AND 2/ E C
3
i j
Master instance
1 1 7
Map
Segment 1
instance 1 Local disk
Reduce
Segment 12 Map instance 1
Segment instance 2 Local disk Shared
Map Reduce storage
Segment 3 instance 3 instance 2
Local disk
Shared
storage
Reduce
3 4 5 instance R 6
Map
Segment M instance M Local disk
All these activities require “big” data storage and systems capable
to deliver abundant computing cycles.
Computing clouds are able to provide such resources and support
collaborative environments.
Cloud Computing: Theory and Practice.
25
Online data discovery
Phases of data discovery in large scientific data sets:
recognition of the information problem.
generation of search queries using one or more search engines.
evaluation of the search results.
evaluation of the web documents.
comparing information from different sources.
Large scientific data sets:
biomedical and genomic data from the National Center for
Biotechnology Information (NCBI).
astrophysics data from NASA.
atmospheric data from the National Oceanic and Atmospheric
Administration (NOAA) and the National Center for
Atmospheric Research (NCAR).
31
Source: Cloud Computing: Theory and Practice. Chapter 4
by Dan C. Marinhu
32