0% found this document useful (0 votes)
53 views283 pages

Multimedia Systems Development: Dr. Omar Masmali

Multimedia systems integrate different media types like text, images, video and audio and are computer controlled, with information represented digitally; they are characterized by interactivity, with users able to modify content, and maintaining temporal relationships between media during playback is a key challenge. The document discusses text as a media type used in multimedia, covering text elements, usages, fonts, and styles.

Uploaded by

abdullah.saadp
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
53 views283 pages

Multimedia Systems Development: Dr. Omar Masmali

Multimedia systems integrate different media types like text, images, video and audio and are computer controlled, with information represented digitally; they are characterized by interactivity, with users able to modify content, and maintaining temporal relationships between media during playback is a key challenge. The document discusses text as a media type used in multimedia, covering text elements, usages, fonts, and styles.

Uploaded by

abdullah.saadp
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 283

Multimedia Systems Development

Dr. Omar Masmali


Introduction
What is Multimedia?
• There are a quite different, or even opposing, viewpoints.

• A consumer entertainment vendor: interactive TV with hundreds of digital channels available, or a cable

TV-like service delivered over a high-speed Internet connection; a smartphone.

• A Computer Science (CS) student: applications that use multiple modalities, including text, images,

drawings (graphics), animation, video, sound including speech, and interactivity.

• Graphics, visualization, artificial intelligence, computer vision, data compression, graph theory,

networking, database systems all have contributions to make in multimedia.


What is Multimedia?
• Derived from the word “Multi” and “Media”.

• It’s the applications that use multiple modalities, including text, images, drawings

(graphics), animation, video, sound including speech, and interactivity.

• Multimedia means that computer information can be represented through text,

audio, graphics, images, video, and animation in addition to traditional media.


History of Multimedia
• Newspaper: perhaps the first mass communication medium, uses text, graphics,
and images.

• Wireless radio transmission: Guglielmo Marconi, Italy, in 1895.

• Television: the new medium for the 20th century, established video as a
commonly available medium and has since changed the world of mass
communications.
Multimedia in the New World
• 2010 - Netflix migrated its infrastructure to the Amazon’s cloud computing
platform. - Microsoft introduced Kinect, a horizontal bar with full-body 3D motion
capture, facial recognition and voice recognition capabilities, for its game console
Xbox 360.

• 2012 - HTML5 subsumes the previous version, HTML4. it is able to run on low
powered devices such as smartphones and tablets.
Multimedia in the New World
• 2013 - Twitter offered Vine, a mobile app that enables its users to create and post short
video clips. - Sony released its PlayStation 4 a video game console, which is to be
integrated with Gaikai, a cloud-based gaming service that offers streaming video game
content. - 4K resolution TV started to be available in the markets.

• 2015 YouTube launched support for publishing and viewing 360- degree videos, with
playback on its website and its Android mobile apps. - AlphaGo, a computer program that
plays the board game Go, became the first program to beat a human professional player.
Its core technology Deep Learning attracted significant attention and have seen success
in multimedia content understanding and generation.
Multimedia in the New World
• 2018 The world’s first 16K Ultra High Definition (UHD) short video film, Prairie
Wind, was created. 5G cellular systems started deployment, providing enhanced
mobile broadband and ultra low latency access. The WiFi 6 (802.11ax) standard
was released, offering theoretical maximum throughput of 1 Gbps.

• 2020 Due to the outbreak of corona virus (COVID-19) around the world,
work/study from home became a norm in early 2020. Multimedia-empowered
online meeting and teaching tools, e.g., Zoom, Google Class, and Microsoft
Teams, saw booming use during this period.
Multimedia Systems Development
Dr. Omar Masmali
Multimedia
Characteristics of a Multimedia Systems

• Multimedia systems has four basic characteristics:

1. Multimedia systems must be computer controlled.

2. Multimedia systems are integrated.

3. The interface to the final presentation of media is usually interactive.

4. The information they handle must be represented digitally.


Characteristics of a Multimedia Systems
• Computer Controlled
• Producing the content of the information – e.g. by using the authoring tools, image
editor, sound and video editor
• Storing the information – providing large and shared capacity for multimedia
information.
• Transmitting the information – through the network.
• Presenting the information to the end user – make direct use of computer peripheral
such as display device (monitor) or sound generator (speaker).
Characteristics of a Multimedia Systems

• Integrated
• All multimedia components (audio, video, text, graphics) used in the system
must be somehow integrated.

• Every device, such as microphone and camera is connected to and controlled


by a single computer.

• A single type of digital storage is used for all media type.


Characteristics of a Multimedia Systems

• Interactivity
• Level 1: Interactivity strictly on information delivery. Users select the time at which
the presentation starts, the order, the speed and the form of the presentation itself.

• Level 2: Users can modify or enrich the content of the information, and this
modification is recorded.

• Level 3: Actual processing of users input and the computer generate genuine result
based on the users input.
Characteristics of a Multimedia Systems

• Digitally Represented

• Digitization: process involved in transforming an analog signal to digital signal.


Challenges for Multimedia Systems

• The key issues multimedia systems need to deal with are:

• How to represent and store temporal information.

• How to strictly maintain the temporal relationships on play back/retrieval.

• What process are involved in the above.


Desirable Features for a Multimedia System
• Very High Processing Power
• Multimedia Capable File System
• Data Representations/File Formats that support multimedia
• Efficient and High I/O
• Special Operating System
• Storage and Memory
• Network Support
• Software Tools
Components of a Multimedia System
• Capture Devices
• Video Camera
• Video Recorder
• Audio Microphone
• Keyboards
• Graphics tablets
• 3D input devices
• Tactile sensors
• VR devices
• Digitizing/Sampling Hardware
Components of a Multimedia System
• Storage Devices
• Hard disk drive
• Zip drive
• Compact Disc
• Digital Versatile Disc (DVD)
• Blu ray Disc
Components of a Multimedia System
• Communication Networks
• Ethernet
• Token Ring
• Fiber Distributed Data Interface (FDDI)
• Asynchronous Transfer Mode
• Intranet
• Internet
Components of a Multimedia System
• Computer Systems
• Desktop computer
• Processor
• RAM
• Display card
• Sound card
• Capture card
Components of a Multimedia System
• Display Devices
• High resolution monitor
• High quality speakers
• Color printer
• Projector
Multimedia Research Topics and Projects
• Multimedia processing and coding: multimedia content analysis, content-based
multimedia retrieval, multimedia security, audio/image/video processing, compression,
etc.

• Multimedia system support and networking: network protocols, Internet, operating


systems, servers and clients, quality of service (QoS), and databases.

• Multimedia tools, end-systems and applications: hypermedia systems, user interfaces,


authoring systems, multi-modal interaction and integration — web-everywhere devices,
multimedia education including Computer Supported Collaborative Learning, and design
and applications of virtual environments.
HyperMedia

• HyperMedia: can include other media, e.g., text, graphics,


images, and especially the continuous media, sound and
video.
• The World Wide Web (WWW) — the best example of a
hypermedia application.
Multimedia Systems Development
Dr. Omar Masmali
Text
Definition of Text
• Text is words and symbols in any form, spoken or written, are the most
common system of communication.
• Text is used in most Multimedia applications.
• With multimedia technology, text can be combined with other media in a
powerful and meaningful way to present information and express moods.
• Text is the easiest to manipulate
Text Elements

• Text elements can be categories into:


• Alphabets characters: A – Z
• Number: 0 – 9
• Special characters: . , ; : ‘ “
• Symbols: @ # $ & *
Text Usages
• Bullet / list

• Heading / Title

• Paragraph

• Navigation

• Text as graphics

• ….
Text Usages
• Bullet / list

• Heading / Title

• Paragraph

• Navigation

• Text as graphics

• ….
Text Usages
• Bullet / list

• Heading / Title

• Paragraph

• Navigation

• Text as graphics

• ….
Font
• A design for a set of characters.

• A collection of characters of a single size and style belonging


to a particular typeface family.

• There is some basic consistency of look that makes the


individual characters, regardless of size and style variations,
part of the same family.
Font Size
• The size of a font, typically represented in points (pt).

• The font size is the distance from the top of the “Ascender Height”
to the bottom of the “Descender Height" in letters.
Font Size
Font Size Example

Text
8 point

12 point Text

24 point Text
48 point
Text
96 point

Text
Font Style
• Refers to whether text is bold, italicized, underlined, or any combination of the
three

• The term font style refers to the particular style of textual characters.

• Styles are usually standard.


Font Style
• Use bold fonts for emphasis, to highlight important points.
Font Style
• It is useful to readers to have titles, proper names, or key terms in a manual
within a block of copy highlighted with bold for ease in scanning.
Font Style
• Use italics to emphasize small amounts of text within a block of text.
• Avoid long passages in italics.
• It is harder to read than normal roman faces.
• Underlines are appropriate for section headings and some bibliographical
notations.
• In web browsers, default settings typically distinguish hyperlinks by
underlining them.
Font Categories

• Fonts can be characterized as


• Serif
• Sans Serif
• Decorative
Font Categories
• Used to decorate, embellish, and beautify a text.

• With the help of decorative fonts any informal passage can become more reader-
friendly: it will quickly capture attention of the readers and make a text easier to
perceive, unusual, and fascinating.
Paragraph Alignment

• The arrangement of text relative to a margin.

• Four types of alignment:


• Flush left
• Flush right

• Centered

• Justified.
Multimedia Systems Development
Dr. Omar Masmali
Image
Categories of Digital Graphics

• 1. Bitmap

• 2. Vector

• 3. Meta

• 4. Animated
1- Bitmaps

• Bitmaps are maps of binary color information.

• They store this information in a grid of points, or pixels, which has a fixed width

and height.

• They can store various ranges of colors according to the image type.
1-Bit Images
• Images consist of pixels (picture elements in digital images).
• A 1-bit image (also called binary image) consists of on and off bits only and thus is
the simplest type of image.
• Each pixel is stored as a single bit (0 or 1)
• It is also sometimes called a 1-bit monochrome (called Lena image by scientists)
image since it contains no color.
• 1-bit images can be satisfactory for pictures containing only simple graphics and
text.
• fax machines use 1-bit data, so in fact 1-bit images are still important.
Monochrome 1-bit image

• A 640×480 monochrome image requires 38.4 kB of storage


8-Bit Gray-Level Images
• 8-bit image is one for which each pixel has a gray value between 0 and 255.

• Each pixel is represented by a single byte.

• The entire image can be thought of as a two-dimensional array of pixel values


referred to as a bitmap.

• Image resolution refers to the number of pixels in a digital image (higher


resolution always yields better quality but increases size)
Grayscale image of Lena

• 640×480 grayscale image requires 300kB of storage


24-Bit Color Images
• In a color 24-bit image, each pixel is represented by three bytes, usually
representing RGB.
• Since each value is in the range 0–255, this format supports 256×256×256, or a
total of 16,777,216, possible combined colors; which increases storage size.
• a 640 × 480 24-bit color image would require 921.6 kB of storage. (without any
compression applied)
• Compression is used to decrease the image size by simply grouping pixels
effectively.
24-bit color image
Higher Bit-Depth Images
• In some fields such as medicine (security cameras, satellite imaging) more
accurate images are required to see the patient’s liver, for example.

• To get such images, special cameras that view more than just 3 colors (RGB) are
used.

• Such images are called multispectral (more than three colors) or hyperspectral
(224 colors for satellite imaging).
Graphics File Formats
• There are many standard formats for saving bitmaps in files.
• Bitmap (BMP)
• Graphics Interchange Format (GIF)
• Joint Photographic Experts Group (JPEG)
• Exchangeable Image File (Exif)
• Portable Network Graphics (PNG)
• Tag Image File Format (TIFF)
• .......
• https://developer.mozilla.org/en-US/docs/Web/Media/Formats/Image_types
2- Vector images
• Vector images are completely computer generated.

• They are known as object-oriented graphics as they consist of objects such as


shapes.

• Vectors are used to create graphics such as interface elements (banners, buttons)
text, line art and detailed drawings (plans, maps).

• Effects can be added to vector graphics to add realism, however, they need to be
converted to bitmaps in order to do this.
Raster graphics
• raster graphic is a mechanism that represents a two-dimensional image as a
rectangular matrix or grid of square pixels, viewable via a computer display, paper,
or other display medium.

• A raster is technically characterized by the width and height of the image in pixels
and by the number of bits per pixel.

• Raster images are stored in image files with varying dissemination, production,
generation, and acquisition formats.
Raster graphics
3- Meta Graphics
• Meta graphics can be termed as hybrid graphics as they are a combination of
bitmap and vector graphics.

• They aren’t as widely used as bitmaps and vectors, and aren’t as widely supported.

• An example of a meta graphic would be a map consisting of a photo showing an


aerial view of a town, where the landmarks are highlighted using vector text and
graphics.
4- Animated Graphics
• Animated graphics are ‘moving graphics’ that consist of at least more than one
graphic.

• Vector graphics are mainly the basis of animations.

• Think of cartoons such as the Simpsons and Family Guy.

• Effects generated by bitmaps can be added and bitmaps themselves can also be
animated.
Three Dimensional (3D) Drawing and
Rendering
• 3 Dimensional refers to objects that are rendered visually on paper, film or on screen
in three planes representing width, height and depth (X, Y and Z).

• The 3D object that a user creates is called a model and it can be simple as well as
complex.
Multimedia Systems Development
Dr. Omar Masmali
Digital Image Processing (1)
Digital Image

• An image defined as a two- dimensional function, f(x,y)


where x and y are spatial (plane) coordinates, and the
amplitude of f at any pair of coordinates (x, y) is called the
intensity or gray level of the image at that point.

• image is a projection of a 3- D scene into a 2D projection


plane.
Digital Image

• A digital image is composed of a finite number of elements,


each of which has a particular location and value.

• These elements are called picture elements, image elements,


pels, and pixels.

• Pixel is the term used most widely to denote the elements of


a digital image.
Digital Image
Digital Image
Digital Image Processing

• The field of digital image processing refers to processing


digital images by means of a digital computer
Digital Image Processing
• Images based on radiation from the Electromagnetic (EM) spectrum are the most
familiar, especially images in the X-ray and visual bands of the spectrum.

• Electromagnetic waves can be conceptualized as propagating sinusoidal waves of


varying wavelengths, or they can be thought of as a stream of massless particles,
each traveling in a wavelike pattern and moving at the speed of light. Each
massless particle contains a certain amount (or bundle) of energy. Each bundle of
energy is called a photon
Applications of Digital Image
• Remote sensing via satellites and other space crafts
Applications of Digital Image
• Image transmission and storage for business applications
Applications of Digital Image
• Medical processing
Applications of Digital Image
• RADAR (Radio Detection and Ranging)
Applications of Digital Image
• SONAR (Sound Navigation and Ranging)
Applications of Digital Image
• Robotics and automated inspection of industrial parts
Steps of Digital Image Processing
Steps of Digital Image Processing

• Image Acquisition:

• It could be as simple as being given an image that is already in digital form.

Generally, the image acquisition stage involves processing such scaling.


Steps of Digital Image Processing

• Image Enhancement:

• It is among the simplest and most appealing areas of digital image processing.

The idea behind this is to bring out details that are obscured or simply to

highlight certain features of interest in image.

• Image enhancement is a very subjective area of image processing.


Steps of Digital Image Processing

• Image Restoration:

• It deals with improving the appearance of an image.

• It is an objective approach, in the sense that restoration techniques tend to be

based on mathematical or probabilistic models of image processing.

• Enhancementis based on human subjective preferences regarding what

constitutes a “good” enhancement result.


Steps of Digital Image Processing

• Color Image Processing:

• It is an area that is been gaining importance because of the use of digital images

over the internet.

• Color image processing deals with basically color models and their

implementation in image processing applications.


Steps of Digital Image Processing

• Multiresolution Processing:

• These are the foundation for representing image in various degrees of resolution.
Steps of Digital Image Processing

• Compression:

• It deals with techniques reducing the storage required to save an image, or the

bandwidth required to transmit it over the network.

• It has two major approaches

• 1. Lossless Compression

• 2. Lossy Compression
Steps of Digital Image Processing

• Morphological Processing:

• It deals with tools for extracting image components that are useful in the

representation and description of shape and boundary of objects.

• It is majorly used in automated inspection applications.


Steps of Digital Image Processing

• Representation and Description:

• It always follows the output of the segmentation step that is, raw pixel data,

constituting either the boundary of an image or points in the region itself.


Steps of Digital Image Processing

• Recognition:

• It is the process that assigns label to an object based on its descriptors. It is the

last step of image processing which use artificial intelligence of software.


Multimedia Systems Development
Dr. Omar Masmali
Digital Image Processing (2)
Color Fundamentals

• 6 to 7 million cones in the human eye can be divided into three principal sensing

categories, corresponding roughly to red, green, and blue.

• 65%: red

• 33%: green

• 2%: blue (blue cones are the most sensitive)


Color Characteristics

• The characteristics generally used to distinguish one color from another are:

• Brightness: the achromatic notion of intensity.

• Hue: dominant wavelength in a mixture of light waves, represents dominant color as

perceived by an observer.

• Saturation: relative purity or the amount of white light mixed with its hue.
Color Characteristics

• The characteristics generally used to distinguish one color from another are:

• Brightness: the achromatic notion of intensity.


Color Characteristics

• The characteristics generally used to distinguish one color from another are:

• Hue: dominant wavelength in a mixture of light waves, represents dominant color as

perceived by an observer.
Color Characteristics

• The characteristics generally used to distinguish one color from another are:

• Saturation: relative purity or the amount of white light mixed with its hue.
Image Computerized Processes

• There are three types of computerized processes in the processing of image

• Low level Process

• Mid-level Image Processing

• High level Processing


Image Computerized Processes

• Low level Process:

• These involve primitive operations such as image processing to reduce noise,

contrast enhancement and image sharpening. These kinds of processes are

characterized by fact the both inputs and output are images.


Image Computerized Processes

• Mid-level Image Processing:

• It involves tasks like segmentation, description of those objects to reduce them to

a form suitable for computer processing, and classification of individual objects.

The inputs to the process are generally images but outputs are attributes

extracted from images.


Image Computerized Processes

• High level Processing:

• It involves “making sense” of an ensemble of recognized objects, as in image

analysis, and performing the cognitive functions normally associated with vision.
Image Computerized Processes
Sampling and Quantization

• To create a digital image, we need to convert the continuous sensed data into

digital from. This involves two processes – sampling and quantization.

• An image may be continuous with respect to the x and y coordinates and also in

amplitude. To convert it into digital form we have to sample the function in both

coordinates and in amplitudes.


Sampling and Quantization

• Digitalizing the coordinate values is called Sampling.

• Digitalizing the amplitude values is called Quantization.


Representing Digital Images

• The result of sampling and quantization is matrix of real numbers.

• Assume that an image f(x,y) is sampled so that the resulting digital image has M

rows and N Columns.

• The values of the coordinates (x,y) now become discrete quantities thus the value

of the coordinates at origin become (x,y) = (0,0)


Representing Digital Images
Relationship between Pixels
• A pixel p at coordinates (x,y) has four horizontal and vertical
neighbors whose coordinates are given by:

(x+1,y), (x-1, y), (x, y+1), (x,y-1)

• This set of pixels, called the 4-neighbors or p, is denoted by N4(p)


Relationship between Pixels
Relationship between Pixels
• The four diagonal neighbors of p have coordinates and are denoted
by ND(p).

(x+1, y+1), (x+1, y-1), (x-1, y+1), (x-1, y-1)


Relationship between Pixels
Relationship between Pixels
Digital Image Storage Required
Digital Image Storage Required (Examples)

• Find bits required to store a 4*4 digital image if we are using 64 different gray

levels
Digital Image Storage Required (Examples)
• Find bits required to store a 4*4 digital image if we are using 64 different gray

levels

• Resolution (number of pixels) = 4 * 4 = 16

• Number of bits on each pixel = 6 ( or )

• So, 4*4*6 = 96 bits


Digital Image Storage Required (Examples)
• N = 32

• M = 32

• L = 23 = 8

• Resolution (number of pixels) = ??

• Number of bits on each pixel = ??

• Size = ??
Digital Image Storage Required (Examples)
• N = 32

• M = 32

• L = 23 = 8

• Resolution (number of pixels) = 1024

• Number of bits on each pixel = 3

• Size = 1024 * 3 = 3072


Image Degradation
• An image that has been degraded by using a prior knowledge of the
degradation phenomenon (Noise).
Noise Sources
• The principal sources of noise in digital images arise during

• Image acquisition
• e.g., light levels, sensor temperature, etc.
• Transmission
• e.g., lightning or other atmospheric disturbance in wireless network
Noise Models
• White noise
• The Fourier spectrum of noise is constant
Noise Models
• Gaussian noise
• Electronic circuit noise, sensor noise due to poor illumination and/or
high temperature

• Rayleigh noise
• Range imaging

• Erlang (gamma) noise


• Laser imaging
Noise Models
• (b) Gaussian noise
• (c) Rayleigh noise
• (d) Range imaging
Multimedia Systems Development
Dr. Omar Masmali
Audio
What is Sound?

• Sound is a wave phenomenon like light, but is macroscopic


and involves molecules of air being compressed and
expanded under the action of some physical device.
What is Sound?

• For example, a speaker in an audio system vibrates back and


forth and produces a longitudinal pressure wave that we
perceive as sound.

• Without air there is no sound—for example, in space.


What is Sound?
• Sound comprises the spoken word, voices, music and even
noise.
• It is a complex relationship involving:
• a vibrating object (sound source)
• a transmission medium (usually air)
• a receiver (ear) and;
• a preceptor (brain).
Why Sound is Important in Multimedia?

• To reinforce message or theme

• To set the mood

• To catch the interest of the audience

• To alert the audience

• To include narration: effective for training and educational


application.
How it’s Work
The Power of Sound

• Sound is measured in dB (decibel)

• Sound waves are known as waveforms.

• A pleasant sound has a regular wave pattern. The pattern is

repeated over and over.

• But the waves of noise are irregular. They do not have a

repeated pattern.
Characteristic of Sound Waves

• Sound is described in terms of two characteristics:

• Frequency (or pitch)

• Amplitude (or loudness)


Frequency

• Frequency is a measure of how many vibrations occur in one second.


This is measured in Hertz (abbreviation Hz) and directly corresponds
to the pitch of a sound.

• People can hear from 20 Hz to 20,000 Hz (20 kHz)


• Sounds below 20 Hz are infrasonic
• Sounds above 20 kHz are ultrasonic.
Frequency
Amplitude

• Amplitude is the maximum displacement of a wave from an equilibrium position.

• The louder a sound, the more energy it has.

• This means loud sounds have a large amplitude.


Amplitude
Characteristic of Sound Waves
Analogue to Digital Audio
• Analogue audio
• The name for an electronic signal that carries its information of sound as
continuous fluctuating voltage value.
• non digital tape or audio tape recording of sound.

• Digitizing
• The process of converting an analog signal to a digital one.
Digital Audio

• Digital audio data is the representation of sound, stored in the form of samples

point.

• Quality of digital recording depends on the sampling rate, that is, the number of

samples point taken per second (Hz)

• The higher the sampling rate, the more the measurements are taken (better quality).

• The lower the sampling rate, the lesser the measurements are taken (low quality).
Digital Audio
Audio File Formats
• AUDIO DIGITAL
• WINDOWS *.WAV

• MAC *.AIFF

• UNIX *.AU

• REALAUDIO *.RA

• MPEG3 *.MP3
Advantages & Disadvantages of Using Audio
• Advantages
• Ensure important information is noticed.
• Add interest.
• Can communicate more directly than other media.

• Disadvantages
• Easily overused.
• Requires special equipment for quality production.
• Not as memorable as visual media.
Multimedia Systems Development
Dr. Omar Masmali
Natural Language Processing
(NLP)
What is NLP?

• Natural Language Processing, or NLP for short, is broadly


defined as the ability of a computer program to understand
human language as it is spoken and written

• It is the automatic manipulation of natural language, like


speech and text, by software.
Motivation for NLP
• Understand language analysis & generation
• Communication
• Language is a window to the mind
• Data is in linguistic form
• Data can be in Structured (table form), Semi structured (XML
form), Unstructured (sentence form).
Types of NLP
Natural Language Understanding (NLU)

• It helps the machine to understand the data.

• It is used to interpret data to understand the meaning of


data to be processed accordingly.

• It solves it by understanding the context, semantic, syntax,


intent, and sentiment of the text.
Natural Language Understanding (NLU)

• There are three linguistic levels to understand language.


• Syntax: It understands sentences and phrases. It checks the
grammar and syntax of the text.
• Semantic: It checks the meaning of the text.
• Pragmatic: It understands context to know what the text is trying
to achieve.
Natural Language Generation (NLG)

• NLG is a process to produce meaningful sentences in Natural


Language.

• It explains the structured data in a manner that is easy to


understand for humans with a high speed of thousands of
pages per second.
NLP Levels
Applications of (NLP)
• Search Autocorrect and Autocomplete
Applications of (NLP)
• Language Translator
Applications of (NLP)
• Voice Assistants
Applications of (NLP)
• Email Filters
Applications of (NLP)
• Chatbots
Multimedia Systems Development
Dr. Omar Masmali
Video
What is Video?

• Video is the technology of electronically capturing,


recording, processing, storing, transmitting, and
reconstructing a sequence of still images representing scenes
in motion.

• Video makes use of all of the elements of multimedia,


bringing your products and services alive, but at a high cost.
Analogue and Digital Video

• Analogue video is essentially a product of the television


industry and therefore conforms to television standards.

• Digital video is a product of the computing industry and


therefore conforms to digital data standards.
Advantages of Digital Video

• It can be easily edited.

• The video is stored as a standard computer file.

• Software motion video does not require specialized


hardware for playback.

• Long-lasting.
Disadvantages of Video

• Requires large storage capacity devices.

• Copies can be made illegally.

• Need fast computer system for playback and capture.

• Video is very hardware-intensive (require the highest


performance demand on your computer).
Some Video Cables
RCA Coaxial
(F-connector)

BNC RGB

HDMI Firewire
Broadcast Video Standards

• National Television System Committee (NTSC)

• Phase Alternating Line (PAL)

• System Electronique Pour Couleur Avec Memoire (SECAM)

• Advanced Television Systems Committee (ATSC)

• Digital Television (DTV)


File Size Considerations

• Digitized video files can be extremely large.

• A single second of high-quality color video that takes up only one-


quarter of a computer screen can be as large as 1 MB.

• Several elements determine the file size:


• Frame Size
• Frame Rate
• Color Depth
• The length of the video
Frame Size
Frame Rate

• Frame rate (expressed in frames per second or FPS) is the frequency


(rate) at which consecutive images (frames) are captured or
displayed.
Frame Rate

• Frame rate greatly impacts the style and viewing experience of a


video.
Calculate Video File Size

• Video File Size = Frame size * Frame rate * Color depth * Duration

• Color depth:
• B&W video = 1 byte
• Color video = 3 bytes
Calculate Video File Size

• Calculate the file size for a video with 320 x 240 pixels, color video, 30
fps, and length 15 seconds ?
Calculate Video File Size

• Calculate the file size for a video with 320 x 240 pixels, color video, 30
fps, and length 15 seconds ?

• Video file size = Frame size * Frame rate * Color depth * Duration

• Video file size = 320 x 240 x 3 x 30 x 15 = 103,680,000 bytes


Calculate Video File Size

• Calculate the file size for a video with 320 x 240 pixels, B&W video, 30
fps, and length 15 seconds ?
Calculate Video File Size

• Calculate the file size for a video with 320 x 240 pixels, B&W video, 30
fps, and length 15 seconds ?

• Video file size = 320 x 240 x 30 x 15 = 34,560,000 bytes


Video Compression

• Because of the large sizes associated with video files, video


compression/decompression programs, known as codecs, have been
developed.

• These programs can substantially reduce the size of video files, which
means that more video can fit on a single CD and that the speed of
transferring video from a CD to the computer can be increased.
Video Compression
Digital Video File Formats

• Motion Pictures Expert Group (.mpg)

• Quicktime (.mov)

• Audio Video Interleaved(.avi)

• Windows Media Video (.wmv)

• Adobe Flash video (.flv).


Multimedia Systems Development
Dr. Omar Masmali
Computer Vision
Computer Vision

• Computer vision is a field of artificial intelligence (AI) that enables


computers and systems to derive meaningful information from digital
images, videos and other visual inputs and take actions or make
recommendations based on that information.
How does computer vision work

• Computer vision needs lots of data.

• It analyzes data over and over until it understands distinctions and


ultimately recognizes images.

• For example, to train a computer to recognize an aircraft, it needs to


be fed vast quantities of planes images and planes-related items to
learn the differences and recognize the airplane.
How does computer vision work
Computer Vision
Computer Vision Tasks
• Object Classification: What broad category of object is in this photograph?

• Object Identification: Which type of a given object is in this photograph?

• Object Verification: Is the object in the photograph?

• Object Detection: Where are the objects in the photograph?

• Object Landmark Detection: What are the key points for the object in the photograph?

• Object Segmentation: What pixels belong to the object in the image?

• Object Recognition: What objects are in this photograph and where are they?
Computer Vision Tasks
Computer Vision Tasks
What makes face recognition hard?
Expression
What makes face recognition hard?

Lighting
What makes face recognition hard?
Occlusion
What makes face recognition hard?
Viewpoint
Examples of CV

• Google
Examples of CV
Examples of CV

• You only look once (YOLO)


Examples of CV
Computer Vision in Healthcare
• Cancer Detection

Computer vision is being successfully applied for breast and skin
cancer detection.
• With image recognition, doctors can identify anomalies by comparing
cancerous and non-cancerous cells in images.
• With automated cancer detection, doctors can diagnose cancer faster
from an MRI scan.
Computer Vision in Transportation 1
• Self-driving cars
• Computer vision is widely used in self-driving cars. It is used to detect
and classify objects (e.g., road signs or traffic lights), create 3D maps
or motion estimation, and plays a key role in making autonomous
vehicles a reality.
Computer Vision in Transportation 2
• Road Condition Monitoring
• Computer vision has also been applied for monitoring the road
infrastructure condition by accessing the variations in concrete and
tar.
Computer Vision in Manufacturing
• Defect Detection
• Total production control is usually impossible. With computer vision,
we can detect defects such as cracks in metals, paint defects, bad
prints, etc., in sizes smaller than 0.05mm.
Computer Vision in Agriculture 1
• Crop Monitoring
• With computer vision systems, real-time crop monitoring and
identification of any crop variation due to any disease or deficiency of
nutrition can be made.
Computer Vision in Agriculture 2
• Automatic Weeding
• An automatic weeding machine is an intelligent project enabled with
AI and computer vision that removes unwanted plants or weeds
around the crops. Traditionally weeding methods require human
labour, which is costly and inefficient compared to automatic weeding
systems.
Computer Vision in Retail 1
• Self-checkout
• Self-checkout enables the customers to complete their transactions
from a retailer without the need for human staff, and this becomes
possible with computer vision. Self-checkouts are now helping
retailers in avoiding long queues and manage customers.
Computer Vision in Retail 2
• Automatic replenishment
• Automatic replenishment with computer vision systems captures the
image data and performs a complete inventory scan to track the
shelves item at regular intervals.
Multimedia Systems Development
Dr. Omar Masmali
Multimedia Data
Compression
Introduction

• Compression: the process of coding that will effectively


reduce the total number of bits needed to represent certain
information.
• Lossless - the compression and • Lossy compression eliminates
decompression processes those bytes which are considered
induce no information loss. as not-noticable.
Compression ratio
Image Compression Standards
• Recent years have seen an explosion in the availability of digital
images, because of the increase in numbers of digital imaging devices
such as smart phones, webcams, digital cameras, and scanners.
• The need to efficiently process and store images in digital form has
motivated the development of many image compression standards
for various applications and needs.
• In general, standards have greater longevity than particular programs
or devices and therefore warrant careful study
Image Compression Standards
• Some current standards are:
• JPEG
• JPEG2000 standard
• JPEG-LS Standard
• JBIG Standard
• JBIG2 Standard
Video Compression
• Three types of pictures (or frames) are used in video compression
• I-frame (Intra-coded picture), a complete image, like a JPG or BMP image file

• P-frame (Predicted picture) holds only the changes in the image from the
previous frame.

• B-frame (Bidirectional predicted picture) saves even more space by using


differences between the current frame and both the preceding and following
frames.
Video Compression
Multimedia Systems Development
Dr. Omar Masmali
Social Media Sharing
Social Media Services
• Social media, a group of Internet-based applications, allow the creation and
exchange of user generated contents.
• Key factors to the success of the new generation of social media:
- Collective intelligence
- Rich connections and activities

• Two important social media services:


- User-generated content sharing
- Online social networking.
User-Generated Content Sharing

• UGC is used for a wide range of applications with different types of


media (e.g., text, music, picture, and video).

• Video data are arguably more difficult for content generation and
sharing, given their large size, high bandwidth, and long playback
duration.
User-Generated Content Sharing
• YouTube: the most significant and successful video sharing website

• Fast growth (established in 2005):


• By 2020: – Over 2 billion unique users visit YouTube per month, and
• Over 70% views are on mobile

• Highly globalized:
• 80 languages
• 85% of YouTube traffic comes from outside of the US.
User-Generated Content Sharing
User-Generated Content Sharing
Online Social Networking

• OSN provides an Internet-based platform to connect people with

social relations.

• Both Facebook and Twitter support the sharing and propagation of

such media objects as pictures, music, and video among friends.


Online Social Networking

• Facebook: -

• 2.5 billion active users.

• 1.66 billion million of them log on

a daily basis.
Online Social Networking

• Twitter: -

• A representative of microblog, recently has also begun offering

the Vine service, which enables mobile users to create and post

video clips up to six seconds long.

• 330 million monthly active users.

• 145 million daily active users..


Online Social Networking
Mobile Video Clip Sharing

• New generation video sharing services that use smart mobile

terminals to instantly capture and share ultra-short video clips

(usually of several seconds).

• Representatives: Twitter’s Vine, Instagram, Snapchat, TikTok.


Multimedia Systems Development
Dr. Omar Masmali
Artificial Intelligence
Artificial Intelligence

• It is the science and engineering of making intelligent machines,


especially intelligent computer programs.

• It is related to the similar task of using computers to understand human


intelligence.

• Artificial intelligence leverages computers and machines to mimic the


problem-solving and decision-making capabilities of the human mind.
Agent and Environment

• An agent is anything that can perceive its environment through sensors


and acts upon that environment through actuators (effectors).
• A human agent has sensory organs such as eyes, ears, nose, tongue and skin
parallel to the sensors, and other organs such as hands, legs, mouth, for
actuators.

• A robotic agent replaces cameras and infrared range finders for the sensors,
and various motors and actuators for effectors.
Types of Agents

• Simple Reflex Agents

• Model Based Reflex Agents.

• Goal Based Agents.

• Utility Based Agents.

• Learning Agents
Agent and Environment
Simple Reflex Agents
• They choose actions only based
on the current percept.
• They are rational only if a
correct decision is made only on
the basis of current precept.
• Their environment is completely
observable.
Model Based Reflex Agents
• They use a model of the world
to choose their actions.
• Model − knowledge about “how
the things happen in the world”.
• Updating the state requires the
information about −
• How the world evolves.
• How the agent’s actions affect the
world.
Goal Based Agents
• They choose their actions in
order to achieve goals.

• Goal − It is the description of


desirable situations.
Utility Based Agents
• An agent that acts based not
only on what the goal is, but the
best way to reach that goal
Learning Agents
Learning Element:
Adds knowledge, makes improvement to system

Performance Element:
Performs task, selects external actions

Critic:
Monitors results of performance, provides feedback to
learning element

Problem Generator:
Actively suggests experiments, generates examples to test

Performance Standard:
Method / standard of measuring performance
Types of Artificial Intelligence

• Reactive Machines perform basic operations. This level of A.I. is the simplest. These types react
to some input with some output. There is no learning that occurs.

• Limited Memory refer to an A.I.’s ability to store previous data and/or predictions, using that data
to make better predictions.

• Theory of Mind level AI will be able to better understand the entities it is interacting with by
discerning their needs, emotions, beliefs, and thought processes.

• Self-aware This is the final stage of AI development which currently exists only hypothetically.
Types of Artificial Intelligence
Multimedia Systems Development
Dr. Omar Masmali
Machine Learning
Artificial Intelligence

• It is the science and engineering of making intelligent machines,


especially intelligent computer programs.

• It is related to the similar task of using computers to understand human


intelligence.

• Artificial intelligence leverages computers and machines to mimic the


problem-solving and decision-making capabilities of the human mind.
Machine learning

• Machine learning (ML) is a type of artificial intelligence (AI)

• Allows software applications to become more accurate at predicting


outcomes without being explicitly programmed to do so.

• The study of computer algorithms that can improve automatically


through experience and by the use of data.
Machine Learning Areas
• Supervised Learning: Data and corresponding labels are given (Features and Labels)

• Unsupervised Learning: Only data is given, no labels provided

• Semi-supervised Learning: Some (if not all) labels are present

• Reinforcement Learning: An agent interacting with the world makes observations,


takes actions, and is rewarded or punished; it should learn to choose actions in such
a way as to obtain a lot of reward

7
Supervised Learning
• Learning a discrete function: Classification
• Boolean classification:
• Each example is classified as true(positive) or false(negative).
• Learning a continuous function: Regression

10
Tid Attrib1 Attrib2 Attrib3 Class Learning
1 Yes Large 125K No
algorithm
2 No Medium 100K No

3 No Small 70K No

4 Yes Medium 120K No


Induction
5 No Large 95K Yes

6 No Medium 60K No

7 Yes Large 220K No Learn


8 No Small 85K Yes Model
9 No Medium 75K No

10 No Small 90K Yes


Model
10

Training Set
Apply
Tid Attrib1 Attrib2 Attrib3 Class Model
11 No Small 55K ?

12 Yes Medium 80K ?

13 Yes Large 110K ? Deduction


14 No Small 95K ?

15 No Large 67K ?
10

Test Set
11
• Example to show the Different between Classification and Regression
Some of ML Algorithms
Deep Learning

• Deep Learning is a subfield of

machine learning concerned with

algorithms inspired by the structure

and function of the brain called

artificial neural networks.


Artificial Neural Networks (ANN)

• A neural network is composed of many simple neurons (processing


units) which are hooked together to generate output based on inputs
Artificial Neural Networks (ANN)
Examples
Data Science
• Data science is the domain of study that deals with

vast volumes of data using modern tools and

techniques to find unseen patterns, derive

meaningful information, and make business

decisions.

• Data science uses complex machine learning

algorithms to build predictive models.

You might also like

pFad - Phonifier reborn

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.


Alternative Proxies:

Alternative Proxy

pFad Proxy

pFad v3 Proxy

pFad v4 Proxy