Multimedia Systems Development: Dr. Omar Masmali
Multimedia Systems Development: Dr. Omar Masmali
• A consumer entertainment vendor: interactive TV with hundreds of digital channels available, or a cable
• A Computer Science (CS) student: applications that use multiple modalities, including text, images,
• Graphics, visualization, artificial intelligence, computer vision, data compression, graph theory,
• It’s the applications that use multiple modalities, including text, images, drawings
• Television: the new medium for the 20th century, established video as a
commonly available medium and has since changed the world of mass
communications.
Multimedia in the New World
• 2010 - Netflix migrated its infrastructure to the Amazon’s cloud computing
platform. - Microsoft introduced Kinect, a horizontal bar with full-body 3D motion
capture, facial recognition and voice recognition capabilities, for its game console
Xbox 360.
• 2012 - HTML5 subsumes the previous version, HTML4. it is able to run on low
powered devices such as smartphones and tablets.
Multimedia in the New World
• 2013 - Twitter offered Vine, a mobile app that enables its users to create and post short
video clips. - Sony released its PlayStation 4 a video game console, which is to be
integrated with Gaikai, a cloud-based gaming service that offers streaming video game
content. - 4K resolution TV started to be available in the markets.
• 2015 YouTube launched support for publishing and viewing 360- degree videos, with
playback on its website and its Android mobile apps. - AlphaGo, a computer program that
plays the board game Go, became the first program to beat a human professional player.
Its core technology Deep Learning attracted significant attention and have seen success
in multimedia content understanding and generation.
Multimedia in the New World
• 2018 The world’s first 16K Ultra High Definition (UHD) short video film, Prairie
Wind, was created. 5G cellular systems started deployment, providing enhanced
mobile broadband and ultra low latency access. The WiFi 6 (802.11ax) standard
was released, offering theoretical maximum throughput of 1 Gbps.
• 2020 Due to the outbreak of corona virus (COVID-19) around the world,
work/study from home became a norm in early 2020. Multimedia-empowered
online meeting and teaching tools, e.g., Zoom, Google Class, and Microsoft
Teams, saw booming use during this period.
Multimedia Systems Development
Dr. Omar Masmali
Multimedia
Characteristics of a Multimedia Systems
• Integrated
• All multimedia components (audio, video, text, graphics) used in the system
must be somehow integrated.
• Interactivity
• Level 1: Interactivity strictly on information delivery. Users select the time at which
the presentation starts, the order, the speed and the form of the presentation itself.
• Level 2: Users can modify or enrich the content of the information, and this
modification is recorded.
• Level 3: Actual processing of users input and the computer generate genuine result
based on the users input.
Characteristics of a Multimedia Systems
• Digitally Represented
• Heading / Title
• Paragraph
• Navigation
• Text as graphics
• ….
Text Usages
• Bullet / list
• Heading / Title
• Paragraph
• Navigation
• Text as graphics
• ….
Text Usages
• Bullet / list
• Heading / Title
• Paragraph
• Navigation
• Text as graphics
• ….
Font
• A design for a set of characters.
• The font size is the distance from the top of the “Ascender Height”
to the bottom of the “Descender Height" in letters.
Font Size
Font Size Example
Text
8 point
12 point Text
24 point Text
48 point
Text
96 point
Text
Font Style
• Refers to whether text is bold, italicized, underlined, or any combination of the
three
• The term font style refers to the particular style of textual characters.
• With the help of decorative fonts any informal passage can become more reader-
friendly: it will quickly capture attention of the readers and make a text easier to
perceive, unusual, and fascinating.
Paragraph Alignment
• Centered
• Justified.
Multimedia Systems Development
Dr. Omar Masmali
Image
Categories of Digital Graphics
• 1. Bitmap
• 2. Vector
• 3. Meta
• 4. Animated
1- Bitmaps
• They store this information in a grid of points, or pixels, which has a fixed width
and height.
• They can store various ranges of colors according to the image type.
1-Bit Images
• Images consist of pixels (picture elements in digital images).
• A 1-bit image (also called binary image) consists of on and off bits only and thus is
the simplest type of image.
• Each pixel is stored as a single bit (0 or 1)
• It is also sometimes called a 1-bit monochrome (called Lena image by scientists)
image since it contains no color.
• 1-bit images can be satisfactory for pictures containing only simple graphics and
text.
• fax machines use 1-bit data, so in fact 1-bit images are still important.
Monochrome 1-bit image
• To get such images, special cameras that view more than just 3 colors (RGB) are
used.
• Such images are called multispectral (more than three colors) or hyperspectral
(224 colors for satellite imaging).
Graphics File Formats
• There are many standard formats for saving bitmaps in files.
• Bitmap (BMP)
• Graphics Interchange Format (GIF)
• Joint Photographic Experts Group (JPEG)
• Exchangeable Image File (Exif)
• Portable Network Graphics (PNG)
• Tag Image File Format (TIFF)
• .......
• https://developer.mozilla.org/en-US/docs/Web/Media/Formats/Image_types
2- Vector images
• Vector images are completely computer generated.
• Vectors are used to create graphics such as interface elements (banners, buttons)
text, line art and detailed drawings (plans, maps).
• Effects can be added to vector graphics to add realism, however, they need to be
converted to bitmaps in order to do this.
Raster graphics
• raster graphic is a mechanism that represents a two-dimensional image as a
rectangular matrix or grid of square pixels, viewable via a computer display, paper,
or other display medium.
• A raster is technically characterized by the width and height of the image in pixels
and by the number of bits per pixel.
• Raster images are stored in image files with varying dissemination, production,
generation, and acquisition formats.
Raster graphics
3- Meta Graphics
• Meta graphics can be termed as hybrid graphics as they are a combination of
bitmap and vector graphics.
• They aren’t as widely used as bitmaps and vectors, and aren’t as widely supported.
• Effects generated by bitmaps can be added and bitmaps themselves can also be
animated.
Three Dimensional (3D) Drawing and
Rendering
• 3 Dimensional refers to objects that are rendered visually on paper, film or on screen
in three planes representing width, height and depth (X, Y and Z).
• The 3D object that a user creates is called a model and it can be simple as well as
complex.
Multimedia Systems Development
Dr. Omar Masmali
Digital Image Processing (1)
Digital Image
• Image Acquisition:
• Image Enhancement:
• It is among the simplest and most appealing areas of digital image processing.
The idea behind this is to bring out details that are obscured or simply to
• Image Restoration:
• It is an area that is been gaining importance because of the use of digital images
• Color image processing deals with basically color models and their
• Multiresolution Processing:
• These are the foundation for representing image in various degrees of resolution.
Steps of Digital Image Processing
• Compression:
• It deals with techniques reducing the storage required to save an image, or the
• 1. Lossless Compression
• 2. Lossy Compression
Steps of Digital Image Processing
• Morphological Processing:
• It deals with tools for extracting image components that are useful in the
• It always follows the output of the segmentation step that is, raw pixel data,
• Recognition:
• It is the process that assigns label to an object based on its descriptors. It is the
• 6 to 7 million cones in the human eye can be divided into three principal sensing
• 65%: red
• 33%: green
• The characteristics generally used to distinguish one color from another are:
perceived by an observer.
• Saturation: relative purity or the amount of white light mixed with its hue.
Color Characteristics
• The characteristics generally used to distinguish one color from another are:
• The characteristics generally used to distinguish one color from another are:
perceived by an observer.
Color Characteristics
• The characteristics generally used to distinguish one color from another are:
• Saturation: relative purity or the amount of white light mixed with its hue.
Image Computerized Processes
The inputs to the process are generally images but outputs are attributes
analysis, and performing the cognitive functions normally associated with vision.
Image Computerized Processes
Sampling and Quantization
• To create a digital image, we need to convert the continuous sensed data into
• An image may be continuous with respect to the x and y coordinates and also in
amplitude. To convert it into digital form we have to sample the function in both
• Assume that an image f(x,y) is sampled so that the resulting digital image has M
• The values of the coordinates (x,y) now become discrete quantities thus the value
• Find bits required to store a 4*4 digital image if we are using 64 different gray
levels
Digital Image Storage Required (Examples)
• Find bits required to store a 4*4 digital image if we are using 64 different gray
levels
• M = 32
• L = 23 = 8
• Size = ??
Digital Image Storage Required (Examples)
• N = 32
• M = 32
• L = 23 = 8
• Image acquisition
• e.g., light levels, sensor temperature, etc.
• Transmission
• e.g., lightning or other atmospheric disturbance in wireless network
Noise Models
• White noise
• The Fourier spectrum of noise is constant
Noise Models
• Gaussian noise
• Electronic circuit noise, sensor noise due to poor illumination and/or
high temperature
• Rayleigh noise
• Range imaging
repeated pattern.
Characteristic of Sound Waves
• Digitizing
• The process of converting an analog signal to a digital one.
Digital Audio
• Digital audio data is the representation of sound, stored in the form of samples
point.
• Quality of digital recording depends on the sampling rate, that is, the number of
• The higher the sampling rate, the more the measurements are taken (better quality).
• The lower the sampling rate, the lesser the measurements are taken (low quality).
Digital Audio
Audio File Formats
• AUDIO DIGITAL
• WINDOWS *.WAV
• MAC *.AIFF
• UNIX *.AU
• REALAUDIO *.RA
• MPEG3 *.MP3
Advantages & Disadvantages of Using Audio
• Advantages
• Ensure important information is noticed.
• Add interest.
• Can communicate more directly than other media.
• Disadvantages
• Easily overused.
• Requires special equipment for quality production.
• Not as memorable as visual media.
Multimedia Systems Development
Dr. Omar Masmali
Natural Language Processing
(NLP)
What is NLP?
• Long-lasting.
Disadvantages of Video
BNC RGB
HDMI Firewire
Broadcast Video Standards
• Video File Size = Frame size * Frame rate * Color depth * Duration
• Color depth:
• B&W video = 1 byte
• Color video = 3 bytes
Calculate Video File Size
• Calculate the file size for a video with 320 x 240 pixels, color video, 30
fps, and length 15 seconds ?
Calculate Video File Size
• Calculate the file size for a video with 320 x 240 pixels, color video, 30
fps, and length 15 seconds ?
• Video file size = Frame size * Frame rate * Color depth * Duration
• Calculate the file size for a video with 320 x 240 pixels, B&W video, 30
fps, and length 15 seconds ?
Calculate Video File Size
• Calculate the file size for a video with 320 x 240 pixels, B&W video, 30
fps, and length 15 seconds ?
• These programs can substantially reduce the size of video files, which
means that more video can fit on a single CD and that the speed of
transferring video from a CD to the computer can be increased.
Video Compression
Digital Video File Formats
• Quicktime (.mov)
• Object Landmark Detection: What are the key points for the object in the photograph?
• Object Recognition: What objects are in this photograph and where are they?
Computer Vision Tasks
Computer Vision Tasks
What makes face recognition hard?
Expression
What makes face recognition hard?
Lighting
What makes face recognition hard?
Occlusion
What makes face recognition hard?
Viewpoint
Examples of CV
• Google
Examples of CV
Examples of CV
• P-frame (Predicted picture) holds only the changes in the image from the
previous frame.
• Video data are arguably more difficult for content generation and
sharing, given their large size, high bandwidth, and long playback
duration.
User-Generated Content Sharing
• YouTube: the most significant and successful video sharing website
• Highly globalized:
• 80 languages
• 85% of YouTube traffic comes from outside of the US.
User-Generated Content Sharing
User-Generated Content Sharing
Online Social Networking
social relations.
• Facebook: -
a daily basis.
Online Social Networking
• Twitter: -
the Vine service, which enables mobile users to create and post
• A robotic agent replaces cameras and infrared range finders for the sensors,
and various motors and actuators for effectors.
Types of Agents
• Learning Agents
Agent and Environment
Simple Reflex Agents
• They choose actions only based
on the current percept.
• They are rational only if a
correct decision is made only on
the basis of current precept.
• Their environment is completely
observable.
Model Based Reflex Agents
• They use a model of the world
to choose their actions.
• Model − knowledge about “how
the things happen in the world”.
• Updating the state requires the
information about −
• How the world evolves.
• How the agent’s actions affect the
world.
Goal Based Agents
• They choose their actions in
order to achieve goals.
Performance Element:
Performs task, selects external actions
Critic:
Monitors results of performance, provides feedback to
learning element
Problem Generator:
Actively suggests experiments, generates examples to test
Performance Standard:
Method / standard of measuring performance
Types of Artificial Intelligence
• Reactive Machines perform basic operations. This level of A.I. is the simplest. These types react
to some input with some output. There is no learning that occurs.
• Limited Memory refer to an A.I.’s ability to store previous data and/or predictions, using that data
to make better predictions.
• Theory of Mind level AI will be able to better understand the entities it is interacting with by
discerning their needs, emotions, beliefs, and thought processes.
• Self-aware This is the final stage of AI development which currently exists only hypothetically.
Types of Artificial Intelligence
Multimedia Systems Development
Dr. Omar Masmali
Machine Learning
Artificial Intelligence
7
Supervised Learning
• Learning a discrete function: Classification
• Boolean classification:
• Each example is classified as true(positive) or false(negative).
• Learning a continuous function: Regression
10
Tid Attrib1 Attrib2 Attrib3 Class Learning
1 Yes Large 125K No
algorithm
2 No Medium 100K No
3 No Small 70K No
6 No Medium 60K No
Training Set
Apply
Tid Attrib1 Attrib2 Attrib3 Class Model
11 No Small 55K ?
15 No Large 67K ?
10
Test Set
11
• Example to show the Different between Classification and Regression
Some of ML Algorithms
Deep Learning
decisions.